table(table_name))
A. format, checkpointlocation, schemalocation, overwrite
B. cloudfiles.format, checkpointlocation, cloudfiles.schemalocation, overwrite
C. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema
D. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, overwrite
E. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, append
Answer: C
Explanation: The answer is cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema. Here is the end to end syntax of streaming ELT, below link contains complete...
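The four options named in answer C can be sketched as an Auto Loader stream in PySpark. This is a minimal sketch, not the original question's code: the function names, paths, and the default "json" format are hypothetical, and it assumes a Databricks runtime where the "cloudFiles" source is available. It ends with toTable, whereas the fragment above ends with table(table_name).

```python
def autoloader_options(file_format, schema_location):
    """Reader-side options for Auto Loader (hypothetical helper)."""
    return {
        "cloudFiles.format": file_format,              # format of the incoming files
        "cloudFiles.schemaLocation": schema_location,  # where the inferred schema is tracked
    }


def start_ingest(spark, source_path, table_name, checkpoint_location,
                 schema_location, file_format="json"):
    """Start a stream from source_path into table_name (assumes Databricks)."""
    reader = spark.readStream.format("cloudFiles")
    for key, value in autoloader_options(file_format, schema_location).items():
        reader = reader.option(key, value)
    return (
        reader.load(source_path)
        .writeStream
        .option("checkpointLocation", checkpoint_location)  # stream progress for recovery
        .option("mergeSchema", "true")                      # let the target schema evolve
        .toTable(table_name)
    )
```

The two cloudFiles options sit on the reader, while checkpointLocation and mergeSchema sit on the writer, which matches the order in which they appear in answer C.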
What is the main difference between the silver layer and the gold layer in medallion architecture?
A. Silver may contain aggregated data
B. Gold may contain aggregated data
C. Data quality checks are applied in gold
D. Silver is a copy of bronze data
E. Gold is a copy of...
Which of the following techniques does Structured Streaming use to ensure recovery from failures during stream processing?
A. Checkpointing and watermarking
B. Write-ahead logging and watermarking
C. Checkpointing and write-ahead logging
D. Delta time travel
E. The stream will fail over to available nodes in the cluster
F. Checkpointing and Idempotent...
What steps need to be taken to set up a Delta Live Tables pipeline as a job using the workspace UI?
A. Delta Live Tables do not support job clusters
B. Select the Workflows UI and the Delta Live Tables tab, under task type select Delta Live Tables pipeline and select the notebook...
A data engineer is using a Databricks SQL query to monitor the performance of an ELT job. The ELT job is triggered by a specific number of input records being ready to process. The Databricks SQL query returns the number of minutes since the job’s most recent runtime. Which of the following approaches can enable the data engineering team to be notified if the ELT job has not been run in an hour?
Which of the following SQL commands can be used to insert, update, or delete rows based on a condition that checks whether the row(s) exist?
A. MERGE INTO table_name
B. COPY INTO table_name
C. UPDATE table_name
D. INSERT INTO OVERWRITE table_name
E. INSERT IF EXISTS table_name
Answer: A
Explanation:...
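To illustrate how a single MERGE INTO statement can cover insert, update, and delete, here is a minimal sketch that builds such a statement as a string. The helper name, the table and key names, and the is_deleted flag column are hypothetical and do not come from the question; the clause forms are those of Delta Lake SQL on Databricks, where the result could be passed to spark.sql.

```python
def build_merge_sql(target, source, key, delete_flag="is_deleted"):
    """Build a Delta Lake MERGE statement (hypothetical helper and column names)."""
    return "\n".join([
        f"MERGE INTO {target} AS t",
        f"USING {source} AS s",
        f"ON t.{key} = s.{key}",
        # Row exists and the source marks it deleted: remove it.
        f"WHEN MATCHED AND s.{delete_flag} = true THEN DELETE",
        # Row exists otherwise: overwrite it with the source values.
        "WHEN MATCHED THEN UPDATE SET *",
        # Row does not exist in the target: insert it.
        "WHEN NOT MATCHED THEN INSERT *",
    ])


statement = build_merge_sql("customers", "customer_updates", "customer_id")
```

The ON clause is the existence check the question refers to: each WHEN branch fires depending on whether a source row finds a matching target row.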
as total_sales from sales
Answer: C
Explanation: The answer is
You notice that a job cluster takes 6 to 8 minutes to start, which delays your job from finishing on time. What steps can you take to reduce the cluster startup time?
A. Set up a second job ahead of the first job to start the cluster, so...