How VACCUM and OPTIMIZE commands can be used to manage the DELTA lake?

How VACCUM and OPTIMIZE commands can be used to manage the DELTA lake?A . VACCUM command can be used to compact small parquet files, and the OP-TIMZE command can be used to delete parquet files that are marked for dele-tion/unused. B. VACCUM command can be used to delete empty/blank parquet...

March 7, 2023 No Comments READ MORE +

Which of the following commands can be used to run one notebook from another notebook?

Which of the following commands can be used to run one notebook from another notebook?A . notebook.utils.run("full notebook path") B. execute.utils.run("full notebook path") C. dbutils.notebook.run("full notebook path") D. only job clusters can run notebook E. spark.notebook.run("full notebook path")View AnswerAnswer: C Explanation: The answer is dbutils.notebook.run(" full notebook path ") Here...

March 7, 2023 No Comments READ MORE +

What steps need to be taken to set up a DELTA LIVE PIPELINE as a job using the workspace UI?

What steps need to be taken to set up a DELTA LIVE PIPELINE as a job using the workspace UI?A . DELTA LIVE TABLES do not support job cluster B. Select Workflows UI and Delta live tables tab, under task type select Delta live tables pipeline and select the notebook...

March 7, 2023 No Comments READ MORE +

You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?

You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?A . AUTO LOADER B. JOBS and TASKS C. SQL Endpoints...

March 7, 2023 No Comments READ MORE +

Which of the following SQL command can be used to insert or update or delete rows based on a condition to check if a row(s) exists?

Which of the following SQL command can be used to insert or update or delete rows based on a condition to check if a row(s) exists?A . MERGE INTO table_name B. COPY INTO table_name C. UPDATE table_name D. INSERT INTO OVERWRITE table_name E. INSERT IF EXISTS table_nameView AnswerAnswer: A Explanation:...

March 7, 2023 No Comments READ MORE +

Which of the following developer operations in the CI/CD can only be implemented through a GIT provider when using Databricks Repos.

Which of the following developer operations in the CI/CD can only be implemented through a GIT provider when using Databricks Repos.A . Trigger Databricks Repos pull API to update the latest version B. Commit and push code C. Create and edit code D. Create a new branch E. Pull request...

March 7, 2023 No Comments READ MORE +

Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?

Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?A . Databricks Repos can facilitate the pull request, review, and approval process before merging branches B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch C....

March 7, 2023 No Comments READ MORE +

Which of the following approaches can enable the data engineering team to be notified if the ELT job has not been run in an hour?

A data engineer is using a Databricks SQL query to monitor the performance of an ELT job. The ELT job is triggered by a specific number of input records being ready to process. The Databricks SQL query returns the number of minutes since the job’s most recent runtime. Which of...

March 7, 2023 No Comments READ MORE +