Which of the following approaches can enable the data engineering team to be notified if the ELT job has not been run in an hour?

A data engineer is using a Databricks SQL query to monitor the performance of an ELT job. The ELT job is triggered by a specific number of input records being ready to process. The Databricks SQL query returns the number of minutes since the job’s most recent runtime. Which of...

February 13, 2023
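
The check described in the question above can be sketched in Python. This is only an illustration under stated assumptions: the query is assumed to return a single number of minutes, and "has not been run in an hour" is read as 60 minutes or more since the last run. The function name `should_alert` and the parameter `threshold_minutes` are hypothetical and are not part of Databricks SQL.

```python
def should_alert(minutes_since_last_run: float, threshold_minutes: float = 60) -> bool:
    """Return True when the most recent run is at least `threshold_minutes` old.

    Hypothetical helper: the 60-minute default mirrors the "one hour" in the
    question; the comparison (>=) is an assumption about how to read it.
    """
    return minutes_since_last_run >= threshold_minutes


# Example: a job that last ran 75 minutes ago should trigger a notification,
# while one that ran 10 minutes ago should not.
stale = should_alert(75)
fresh = should_alert(10)
```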

What is the purpose of a gold layer in Multi-hop architecture?

A. Optimizes ETL throughput and analytic query performance
B. Eliminates duplicate records
C. Preserves grain of original data, without any aggregations
D. Data quality checks and schema enforcement
E. Powers ML applications, reporting, dashboards and ad hoc reports

Answer: E...

February 13, 2023

Which of the following SQL commands can be used to insert, update, or delete rows based on a condition that checks whether a row exists?

A. MERGE INTO table_name
B. COPY INTO table_name
C. UPDATE table_name
D. INSERT INTO OVERWRITE table_name
E. INSERT IF EXISTS table_name

Answer: A

Explanation:...

February 12, 2023
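
To make the insert, update, or delete behaviour behind the answer above concrete, here is a minimal Python sketch that imitates the semantics of a merge on in-memory rows. It is not Databricks code and does not call any Delta Lake API. The function `merge_into`, the `is_deleted` flag, and the sample rows are all hypothetical, and the rule that flagged rows are deleted when matched and skipped when unmatched is an assumption chosen for the illustration.

```python
def merge_into(target, source, key, delete_flag="is_deleted"):
    """Imitate merge semantics on lists of dict rows (illustrative only).

    - matched row and source row flagged     -> delete from target
    - matched row and source row not flagged -> update target row
    - unmatched row and not flagged          -> insert into target
    """
    merged = {row[key]: dict(row) for row in target}
    for row in source:
        k = row[key]
        if k in merged:
            if row.get(delete_flag):
                del merged[k]          # matched + flagged: delete
            else:
                merged[k].update(row)  # matched: update in place
        elif not row.get(delete_flag):
            merged[k] = dict(row)      # not matched: insert
    return list(merged.values())


# Hypothetical sample data.
target_rows = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
source_rows = [
    {"id": 2, "v": "B"},             # exists in target: updated
    {"id": 3, "v": "c"},             # new key: inserted
    {"id": 1, "is_deleted": True},   # exists and flagged: deleted
]
result = merge_into(target_rows, source_rows, key="id")
```

A single pass over the source decides per row whether to insert, update, or delete, which is the property that distinguishes a merge from the plain update or insert options listed in the question.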

You have been asked to build a data pipeline and have noticed that it is a very large-scale ETL with many data dependencies. Which of the following tools can be used to address this problem?

A. AUTO LOADER
B. JOBS and TASKS
C. SQL Endpoints...

February 12, 2023

You notice that a job cluster takes 6 to 8 minutes to start, which delays your job from finishing on time. What steps can you take to reduce the cluster startup time?

A. Set up a second job ahead of the first job to start the cluster, so...

February 12, 2023

What is the main difference between the silver layer and the gold layer in medallion architecture?

A. Silver may contain aggregated data
B. Gold may contain aggregated data
C. Data quality checks are applied in gold
D. Silver is a copy of bronze data
E. Gold is a copy of...

February 12, 2023

What is the expected behavior when a batch of data containing data that violates these constraints is processed?

A dataset has been defined using Delta Live Tables and includes an expectations clause:

CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION FAIL

What is the expected behavior when a batch of data containing data that violates these constraints is processed?

A. Records that violate the expectation are added to...

February 12, 2023

If you create a database sample_db with the statement CREATE DATABASE sample_db, what will be the default location of the database in DBFS?

A. Default location, DBFS:/user/
B. Default location, /user/db/
C. Default Storage account
D. Statement fails “Unable to create database without location”
E. Default Location, dbfs:/user/hive/warehouse

Answer: E...

February 12, 2023
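
As a small illustration of the answer above, the sketch below builds the path a database would typically get under the default warehouse root. It assumes the Hive metastore convention of placing each database in a `<name>.db` directory under `dbfs:/user/hive/warehouse`. The function `default_db_location` is hypothetical, and a workspace configured with a different warehouse root would give a different result.

```python
# Default warehouse root named in the answer above.
DEFAULT_WAREHOUSE_ROOT = "dbfs:/user/hive/warehouse"


def default_db_location(db_name: str, root: str = DEFAULT_WAREHOUSE_ROOT) -> str:
    """Return the assumed default directory for a database created without LOCATION.

    Assumption: the metastore appends "<db_name>.db" to the warehouse root.
    """
    return f"{root.rstrip('/')}/{db_name}.db"


location = default_db_location("sample_db")
```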