You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?
You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?A . AUTO LOADER B. JOBS and TASKS C. SQL Endpoints...
Which of the following technique can be used to implement fine-grained access control to rows and columns of the Delta table based on the user's access?
Which of the following technique can be used to implement fine-grained access control to rows and columns of the Delta table based on the user's access?A . Use Unity catalog to grant access to rows and columns B. Row and column access control lists C. Use dynamic view functions D....
Once a cluster is deleted, below additional actions need to performed by the administrator
Once a cluster is deleted, below additional actions need to performed by the administratorA . Remove virtual machines but storage and networking are automatically dropped B. Drop storage disks but Virtual machines and networking are automatically dropped C. Remove networking but Virtual machines and storage disks are automatically dropped D....
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?A . Databricks Repos can facilitate the pull request, review, and approval process before merging branches B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch C....
Data engineering team has provided 10 queries and asked Data Analyst team to build a dashboard and refresh the data every day at 8 AM, identify the best approach to set up data refresh for this dashaboard?
Data engineering team has provided 10 queries and asked Data Analyst team to build a dashboard and refresh the data every day at 8 AM, identify the best approach to set up data refresh for this dashaboard?A . Each query requires a separate task and setup 10 tasks under a...
Which of the following SQL statements can be used to update a transactions table, to set a flag on the table from Y to N
Which of the following SQL statements can be used to update a transactions table, to set a flag on the table from Y to NA . MODIFY transactions SET active_flag = 'N' WHERE active_flag = 'Y' B. MERGE transactions SET active_flag = 'N' WHERE active_flag = 'Y' C. UPDATE transactions...
What is the main difference between the silver layer and the gold layer in medallion architecture?
What is the main difference between the silver layer and the gold layer in medallion architecture?A . Silver may contain aggregated data B. Gold may contain aggregated data C. Data quality checks are applied in gold D. Silver is a copy of bronze data E. God is a copy of...
If you create a database sample_db with the statement CREATE DATABASE sample_db what will be the default location of the database in DBFS?
If you create a database sample_db with the statement CREATE DATABASE sample_db what will be the default location of the database in DBFS?A . Default location, DBFS:/user/ B. Default location, /user/db/ C. Default Storage account D. Statement fails “Unable to create database without location” E. Default Location, dbfs:/user/hive/warehouseView AnswerAnswer: E...
You are noticing job cluster is taking 6 to 8 mins to start which is delaying your job to finish on time, what steps you can take to reduce the amount of time cluster startup time
You are noticing job cluster is taking 6 to 8 mins to start which is delaying your job to finish on time, what steps you can take to reduce the amount of time cluster startup timeA . Setup a second job ahead of first job to start the cluster, so...
You have written a notebook to generate a summary data set for reporting, Notebook was scheduled using the job cluster, but you realized it takes 8 minutes to start the cluster, what feature can be used to start the cluster in a timely fashion so your job can run immediatley?
You have written a notebook to generate a summary data set for reporting, Notebook was scheduled using the job cluster, but you realized it takes 8 minutes to start the cluster, what feature can be used to start the cluster in a timely fashion so your job can run immediatley?A...