Which solution will meet these requirements with the LEAST operational overhead?

A data engineer must manage the ingestion of real-time streaming data into AWS. The data engineer wants to perform real-time analytics on the incoming streaming data by using time-based aggregations over a window of up to 30 minutes. The data engineer needs a solution that is highly fault tolerant. Which...

March 16, 2025 No Comments READ MORE +

Which Step Functions state should the data engineer use to meet these requirements?

A data engineer needs to use AWS Step Functions to design an orchestration workflow. The workflow must parallel process a large collection of data files and apply a specific transformation to each file. Which Step Functions state should the data engineer use to meet these requirements?A . Parallel stateB ....

March 14, 2025 No Comments READ MORE +

Which solution will meet these requirements with the LEAST latency?

A company needs to partition the Amazon S3 storage that the company uses for a data lake. The partitioning will use a path of the S3 object keys in the following format: s3://bucket/prefix/year=2023/month=01/day=01. A data engineer must ensure that the AWS Glue Data Catalog synchronizes with the S3 storage when...

March 11, 2025 No Comments READ MORE +

Which solution will meet these requirements with the LEAST operational overhead?

A company wants to implement real-time analytics capabilities. The company wants to use Amazon Kinesis Data Streams and Amazon Redshift to ingest and process streaming data at the rate of several gigabytes per second. The company wants to derive near real-time insights by using existing business intelligence (BI) and analytics...

March 11, 2025 No Comments READ MORE +

Which solution will meet these requirements?

A company uses Amazon Athena for one-time queries against data that is in Amazon S3. The company has several use cases. The company must implement permission controls to separate query processes and access to query history among users, teams, and applications that are in the same AWS account. Which solution...

March 9, 2025 No Comments READ MORE +

Which solution will meet this requirement?

A data engineer is configuring an AWS Glue job to read data from an Amazon S3 bucket. The data engineer has set up the necessary AWS Glue connection details and an associated IAM role. However, when the data engineer attempts to run the AWS Glue job, the data engineer receives...

March 1, 2025 No Comments READ MORE +

Which solution will meet this requirement?

A data engineer maintains custom Python scripts that perform a data formatting process that many AWS Lambda functions use. When the data engineer needs to modify the Python scripts, the data engineer must manually update all the Lambda functions. The data engineer requires a less manual way to update the...

February 28, 2025 No Comments READ MORE +

Which combination of steps should the data engineering team take to meet this requirement with the LEAST operational overhead?

A company has five offices in different AWS Regions. Each office has its own human resources (HR) department that uses a unique IAM role. The company stores employee records in a data lake that is based on Amazon S3 storage. A data engineering team needs to limit access to the...

February 23, 2025 No Comments READ MORE +

Which extract, transform, and load (ETL) service will meet these requirements?

A company is migrating on-premises workloads to AWS. The company wants to reduce overall operational overhead. The company also wants to explore serverless options. The company's current workloads use Apache Pig, Apache Oozie, Apache Spark, Apache Hbase, and Apache Flink. The on-premises workloads process petabytes of data in seconds. The...

February 18, 2025 No Comments READ MORE +

Which solution will meet these requirements with the LEAST operational overhead?

A financial services company stores financial data in Amazon Redshift. A data engineer wants to run real-time queries on the financial data to support a web-based trading application. The data engineer wants to run the queries from within the trading application. Which solution will meet these requirements with the LEAST...

February 13, 2025 No Comments READ MORE +