Data science team members are using a single cluster to perform data analysis. Although the cluster size was chosen to handle multiple users and auto-scaling was enabled, the team has found that queries are still running slowly. What is the suggested fix?

A. Set up multiple clusters so each team member...

March 17, 2023

Which of the following techniques does Structured Streaming use to ensure recovery from failures during stream processing?

A. Checkpointing and watermarking
B. Write-ahead logging and watermarking
C. Checkpointing and write-ahead logging
D. Delta time travel
E. The stream will fail over to available nodes in the cluster
F. Checkpointing and Idempotent...

March 17, 2023

Which of the following SQL commands can be used to insert, update, or delete rows based on a condition that checks whether a matching row exists?

A. MERGE INTO table_name
B. COPY INTO table_name
C. UPDATE table_name
D. INSERT INTO OVERWRITE table_name
E. INSERT IF EXISTS table_name

Answer: A
Explanation:...
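
To make the MERGE INTO behaviour concrete, here is a minimal plain-Python model of its matched / not-matched rules. It is only a sketch of the semantics, not Databricks code: the `merge_into` helper, the `is_deleted` column, and the sample rows are all hypothetical, and the SQL in the comment shows the kind of Delta Lake statement being imitated.

```python
# Statement being modelled (table and column names are hypothetical):
#   MERGE INTO target t USING source s ON t.id = s.id
#   WHEN MATCHED AND s.is_deleted = true THEN DELETE
#   WHEN MATCHED THEN UPDATE SET *
#   WHEN NOT MATCHED THEN INSERT *


def merge_into(target, source):
    """Return a new table (dict keyed by id) after applying MERGE-style rules."""
    result = dict(target)  # copy so the original target is left untouched
    for key, row in source.items():
        if key in result:
            if row.get("is_deleted"):
                del result[key]      # WHEN MATCHED AND is_deleted THEN DELETE
            else:
                result[key] = row    # WHEN MATCHED THEN UPDATE SET *
        else:
            result[key] = row        # WHEN NOT MATCHED THEN INSERT *
    return result


target = {
    1: {"name": "alice", "is_deleted": False},
    2: {"name": "bob", "is_deleted": False},
}
source = {
    1: {"name": "alicia", "is_deleted": False},  # matched: update
    2: {"name": "bob", "is_deleted": True},      # matched and flagged: delete
    3: {"name": "carol", "is_deleted": False},   # not matched: insert
}

merged = merge_into(target, source)
print(sorted(merged))
```

A single pass over the source decides per row whether to delete, update, or insert, which is why one MERGE statement can replace separate INSERT, UPDATE, and DELETE statements.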

March 17, 2023

Which of the following approaches can enable the data engineering team to be notified if the ELT job has not been run in an hour?

A data engineer is using a Databricks SQL query to monitor the performance of an ELT job. The ELT job is triggered by a specific number of input records being ready to process. The Databricks SQL query returns the number of minutes since the job’s most recent runtime. Which of...

March 17, 2023

You notice that a job cluster takes 6 to 8 minutes to start, which prevents your job from finishing on time. What steps can you take to reduce the cluster startup time?

A. Set up a second job ahead of the first job to start the cluster, so...

March 17, 2023

Which of the following SQL statements can be used to query a table while eliminating duplicate rows from the query results?

A. SELECT DISTINCT * FROM table_name
B. SELECT DISTINCT * FROM table_name HAVING COUNT(*) > 1
C. SELECT DISTINCT_ROWS (*) FROM table_name
D. SELECT * FROM table_name GROUP BY...
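
As a rough illustration of duplicate elimination, the sketch below runs `SELECT DISTINCT *` against an in-memory SQLite database from Python. SQLite is only a stand-in here, assumed to behave like Databricks SQL for this simple query; the columns and rows are made up for the example.

```python
import sqlite3

# In-memory database standing in for a Databricks SQL table (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table_name (id INTEGER, city TEXT)")
conn.executemany(
    "INSERT INTO table_name VALUES (?, ?)",
    [(1, "Oslo"), (1, "Oslo"), (2, "Lima")],  # the first two rows are exact duplicates
)

# Plain SELECT returns every stored row, duplicates included.
all_rows = conn.execute("SELECT * FROM table_name").fetchall()

# DISTINCT collapses rows whose column values are all identical.
distinct_rows = conn.execute(
    "SELECT DISTINCT * FROM table_name ORDER BY id"
).fetchall()

print(len(all_rows), len(distinct_rows))
```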

March 17, 2023

Which of the following SQL statements can be used to update a transactions table to set a flag on the table from Y to N?

A. MODIFY transactions SET active_flag = 'N' WHERE active_flag = 'Y'
B. MERGE transactions SET active_flag = 'N' WHERE active_flag = 'Y'
C. UPDATE transactions...
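
For reference, a conditional flag change with a standard `UPDATE ... SET ... WHERE` statement might look like the sketch below. It uses an in-memory SQLite database as a stand-in for Databricks SQL (an assumption), reuses the `active_flag` column name from the options above, and invents the `id` column and sample rows.

```python
import sqlite3

# In-memory database standing in for the transactions table (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE transactions (id INTEGER, active_flag TEXT)")
conn.executemany(
    "INSERT INTO transactions VALUES (?, ?)",
    [(1, "Y"), (2, "N"), (3, "Y")],
)

# Flip only the rows whose flag is currently 'Y'.
cursor = conn.execute(
    "UPDATE transactions SET active_flag = 'N' WHERE active_flag = 'Y'"
)
updated = cursor.rowcount  # number of rows the UPDATE touched

flags = [
    row[0]
    for row in conn.execute("SELECT active_flag FROM transactions ORDER BY id")
]
print(updated, flags)
```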

March 17, 2023

Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?

A. Databricks Repos can facilitate the pull request, review, and approval process before merging branches
B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch
C. ...

March 16, 2023

table(table_name))

A. format, checkpointlocation, schemalocation, overwrite
B. cloudfiles.format, checkpointlocation, cloudfiles.schemalocation, overwrite
C. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema
D. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, overwrite
E. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, append

Answer: C
Explanation: The answer is cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema. Here is the end-to-end syntax of streaming ELT; the link below contains complete...
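
The four options named in the answer could be wired together roughly as in the sketch below. This is a hedged illustration rather than the original question's code, which is truncated above: the function name, its parameters, and the default `json` file format are hypothetical, and the `spark` session is passed in by the caller so that nothing runs until the function is called on a Databricks cluster.

```python
def start_autoloader_stream(spark, source_path, schema_path,
                            checkpoint_path, table_name, file_format="json"):
    """Start an Auto Loader stream that ingests files into a table.

    All arguments are placeholders supplied by the caller (assumption).
    """
    return (
        spark.readStream
        .format("cloudFiles")                              # Auto Loader source
        .option("cloudFiles.format", file_format)          # format of the incoming files
        .option("cloudFiles.schemaLocation", schema_path)  # where the inferred schema is tracked
        .load(source_path)
        .writeStream
        .option("checkpointLocation", checkpoint_path)     # stream progress, used for recovery
        .option("mergeSchema", "true")                     # let the target schema evolve
        # The question's snippet ends in .table(table_name); open-source
        # Spark 3.1+ also provides toTable(table_name).
        .table(table_name)
    )
```

On a cluster this would be called with the notebook's existing `spark` session and real storage paths; the checkpoint location is what lets the stream resume where it left off after a failure.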

March 16, 2023