Which solution will MOST speed up the Athena query performance?

A data engineer needs Amazon Athena queries to finish faster. The data engineer notices that all the files the Athena queries use are currently stored in uncompressed .csv format. The data engineer also notices that users perform most queries by selecting a specific column. Which solution will MOST speed up...

January 27, 2025

Which solution will meet this requirement with the LEAST operational effort?

A data engineer must use AWS services to ingest a dataset into an Amazon S3 data lake. The data engineer profiles the dataset and discovers that the dataset contains personally identifiable information (PII). The data engineer must implement a solution to profile the dataset and obfuscate the PII. Which solution...

January 24, 2025

How should the data engineer invoke the Lambda function to write load statuses to the DynamoDB table?

A company loads transaction data for each day into Amazon Redshift tables at the end of each day. The company wants to have the ability to track which tables have been loaded and which tables still need to be loaded. A data engineer wants to store the load statuses of...

January 19, 2025

Which solution will run the Glue jobs in the MOST cost-effective way?

A data engineer needs to schedule a workflow that runs a set of AWS Glue jobs every day. The data engineer does not require the Glue jobs to run or finish at a specific time. Which solution will run the Glue jobs in the MOST cost-effective way? A. Choose the...

January 18, 2025