As a part of your application architecture requirements, the company has requested the ability to run analytics against all the combined log files from the Elastic Load Balancer. Which services would you use together to collect logs and process log file analysis in an AWS environment?
- Amazon DynamoDB to store the logs and EC2 to run custom log analysis scripts
- Amazon EC2 for storing and processing the log files
- Amazon S3 for storing the ELB log files and EC2 for processing the log files analysis
- Amazon S3 for storing ELB log files and Amazon EMR for processing the log files analysis
This question is not that complicated, even if you don’t understand the options. If you see “collection of logs and processing of logs”, directly think of AWS EMR. Amazon EMR provides a managed Hadoop framework that makes it easy, fast, and cost-effective to process a vast amount of data across dynamically scalable Amazon EC2 instances. You can also run other popular distributed frameworks such as Apache Spark, HBase, Presto, and Flink in Amazon EMR, and interact with data in other AWS data stores such as Amazon S3 and Amazon DynamoD. Amazon EMR securely and reliably handles a broad set of big data use cases, including log analysis, web indexing, data transformations (ETL), machine learning, financial analysis, scientific simulation, and bioinformatics.