Which statement describes the results returned by this query?

A table named user_ltv is being used to create a view that will be used by data analysts on various teams. Users in the workspace are configured into groups, which are used for setting up data access using ACLs. The user_ltv table has the following schema: email STRING, age INT,...

March 26, 2025 No Comments READ MORE +

Which solution meets these requirements?

An upstream system is emitting change data capture (CDC) logs that are being written to a cloud object storage directory. Each record in the log indicates the change type (insert, update, or delete) and the values for each field after the change. The source table has a primary key identified...

March 16, 2025 No Comments READ MORE +

Which statement describes Delta Lake Auto Compaction?

Which statement describes Delta Lake Auto Compaction?A . An asynchronous job runs after the write completes to detect if files could be further compacted; if yes, an optimize job is executed toward a default of 1 GB.B . Before a Jobs cluster terminates, optimize is executed on all tables modified...

March 15, 2025 No Comments READ MORE +

A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device.

A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device. Streaming DataFrame df has the following...

March 7, 2025 No Comments READ MORE +

Given a job with at least one wide transformation, which of the following cluster configurations will result in maximum performance?

Each configuration below is identical to the extent that each cluster has 400 GB total of RAM, 160 total cores and only one Executor per VM. Given a job with at least one wide transformation, which of the following cluster configurations will result in maximum performance?A . • Total VMs;...

March 6, 2025 No Comments READ MORE +

Which statement explains the cause of this failure?

The downstream consumers of a Delta Lake table have been complaining about data quality issues impacting performance in their applications. Specifically, they have complained that invalid latitude and longitude values in the activity_details table have been breaking their ability to use other geolocation processes. A junior engineer has written the...

March 5, 2025 No Comments READ MORE +

Which approach will allow this developer to review the current logic for this notebook?

A junior developer complains that the code in their notebook isn't producing the correct results in the development environment. A shared screenshot reveals that while they're using a notebook versioned with Databricks Repos, they're using a personal branch that contains old logic. The desired branch named dev-2.3.9 is not available...

March 5, 2025 No Comments READ MORE +

Which statement describes how the Delta engine identifies which files to load?

A Delta table of weather records is partitioned by date and has the below schema: date DATE, device_id INT, temp FLOAT, latitude FLOAT, longitude FLOAT To find all the records from within the Arctic Circle, you execute a query with the below filter: latitude > 66.3 Which statement describes how...

March 3, 2025 No Comments READ MORE +

Which statement regarding stream-static joins and static Delta tables is correct?

Which statement regarding stream-static joins and static Delta tables is correct?A . Each microbatch of a stream-static join will use the most recent version of the static Delta table as of each microbatch.B . Each microbatch of a stream-static join will use the most recent version of the static Delta...

February 28, 2025 No Comments READ MORE +

Assuming that all data governance considerations are accounted for, which statement accurately informs this decision?

A small company based in the United States has recently contracted a consulting firm in India to implement several new data engineering pipelines to power artificial intelligence applications. All the company's data is stored in regional cloud storage in the United States. The workspace administrator at the company is uncertain...

February 22, 2025 No Comments READ MORE +