In the health domain, the move of generating big data is opening new methodologies in detection as well as prediction of various diseases and disorders. The first phase of the present chapter has provided insights into the role of big data analytics in the detection of one such neuro-disorder, that is, autism spectrum disorder (ASD). The data lake concept has provided a direction to resolve the issue by providing a common platform for storing tremendous amount of data in all formats (structured, unstructured, or raw). However, if the entire data have potential value, the data lakes need to be strategically designed as otherwise it can lead to data swamps. Therefore, in the second phase, data lake based on Hadoop architecture and Apache Spark engine has been provided for the analysis of the health data. The proposed system has resolved the data storage issue, management, and analytics on a single platform. Hence, the novelty of the chapter is that it is pointing towards the faster exploration as well as management of data so that the timely generation of hypothesis can help in analyzing ASD.