Which of the following is an advantage of using Databricks Repos over the Databricks Notebooks versioning?

A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or version their project using Databricks Repos.

Which of the following is an advantage of using Databricks Repos over the Databricks Notebooks versioning?
A. Databricks Repos automatically saves development progress
B. Databricks Repos supports the use of multiple branches
C. Databricks Repos allows users to revert to previous versions of a notebook
D. Databricks Repos provides the ability to comment on specific changes
E. Databricks Repos is wholly housed within the Databricks Lakehouse Platform

Answer: B

Explanation:

Databricks Repos is a visual Git client and API in Databricks that supports common Git operations such as cloning, committing, pushing, pulling, and branch management. The built-in Databricks Notebooks versioning keeps an automatically saved, linear revision history for each notebook, so users can already save progress and restore earlier versions without Repos. That rules out options A and C. Option D describes review comments, which are made in the Git provider's pull requests rather than in Repos itself, and option E is incorrect because Repos syncs with a remote repository hosted by an external Git provider. What the built-in versioning does not support is multiple branches for development work, which is the advantage of using Databricks Repos. With Databricks Repos, users can create and manage branches for different features, experiments, or bug fixes, and merge, rebase, or resolve conflicts between them. Databricks recommends that each developer work on their own feature branch and follow code development best practices using Git for version control, collaboration, and CI/CD.
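To make the branch support concrete, here is a minimal sketch of how a branch checkout could be requested through the Repos REST API, assuming the `PATCH /api/2.0/repos/{repo_id}` endpoint with a `branch` field in the JSON body. The workspace host, token, repo ID, and branch name are placeholders, and `build_checkout_request` is a hypothetical helper, not part of any Databricks library. The request is only built, not sent.

```python
import json
import urllib.request


def build_checkout_request(host: str, token: str, repo_id: int, branch: str) -> urllib.request.Request:
    """Build (but do not send) a request that asks a repo to check out `branch`.

    Assumes the Repos API endpoint PATCH /api/2.0/repos/{repo_id}.
    """
    url = f"{host.rstrip('/')}/api/2.0/repos/{repo_id}"
    body = json.dumps({"branch": branch}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="PATCH",
    )


# Placeholder values, for illustration only.
req = build_checkout_request(
    "https://example.cloud.databricks.com", "dapi-EXAMPLE", 123, "feature/etl-fix"
)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the built request to `urllib.request.urlopen` against a real workspace would perform the checkout. The built-in notebook version history has no equivalent operation, since it tracks a single line of revisions.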

Reference: Git integration with Databricks Repos – Azure Databricks | Microsoft Learn, Git version control for notebooks (legacy) | Databricks on AWS, Databricks Repos Is Now Generally Available – New ‘Files’ Feature in …, Databricks Repos – What it is and how we can use it | Adatis.
