Automated Deployment of Data Lake
Abstract: A Data Lake is a central location that can store all your structured and unstructured data, no matter the source or format. Automated deployment for data lake solution is an automated reference implementation that deploys a highly available, cost-effective data lake architecture on the AWS Cloud along with a user-friendly console for searching and requesting datasets. The solution automatically configures the core AWS services necessary to easily tag, search, share, transform, analyse, and govern specific subsets of data across a company or with other external users. The solution deploys a console that users can access to search and browse available datasets for their business needs. Keywords: Data Lake, Cloud Computing, Aws, Ec2, S3, Athena, Glue, Cloud formation.