Which of the following describes the relationship between Bronze tables and raw data?

A. Bronze tables contain less data than raw data files.
B. Bronze tables contain more truthful data than raw data.
C. Bronze tables contain aggregates while raw data is unaggregated.
D. Bronze tables contain a less refined view of data than raw data.
E. Bronze tables contain raw data with a schema applied.

Answer: E

Explanation:

Bronze tables are the first layer of a medallion architecture, a data design pattern used to organize data in a lakehouse. They hold raw data ingested from various sources, such as RDBMS tables, JSON files, and IoT streams. The table structures in this layer mirror the source system structures “as-is”, plus any additional metadata columns that capture details such as the load date/time or process ID.

Typically, the only transformation applied to raw data in this layer is applying a schema, which defines the column names and data types of the table. The schema can be inferred from the data source or specified explicitly. With a schema in place, the data can be queried and analyzed with SQL and other structured query tools. Bronze tables do not drop, correct, or aggregate records (aggregation belongs to later layers), which rules out options A through D. Therefore, option E is the correct answer.
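To make the idea of "raw data with a schema applied" concrete, here is a minimal sketch in plain Python. It is only an illustration of the concept, not how a lakehouse engine such as Spark or Delta Lake implements ingestion. The schema, the column names (`device_id`, `temperature`, `reading_ts`), the metadata columns (`_load_ts`, `_source_file`), and the helper `to_bronze_row` are all hypothetical and chosen for this example.

```python
import json
from datetime import datetime, timezone

# Hypothetical schema: column name -> Python type used to cast the raw value.
BRONZE_SCHEMA = {"device_id": str, "temperature": float, "reading_ts": str}


def to_bronze_row(raw_line, schema, load_ts, source_file):
    """Apply the schema to one raw JSON line, with no filtering or cleaning."""
    record = json.loads(raw_line)
    row = {}
    for column, cast in schema.items():
        value = record.get(column)
        # Values missing from the source stay None rather than being repaired.
        row[column] = cast(value) if value is not None else None
    # Metadata columns describing the load, added alongside the source columns.
    row["_load_ts"] = load_ts
    row["_source_file"] = source_file
    return row


# Raw input as it might arrive from a source system (note the duplicate record
# and the inconsistent typing of "temperature").
raw_lines = [
    '{"device_id": "a1", "temperature": "21.5", "reading_ts": "2024-01-01T00:00:00Z"}',
    '{"device_id": "b2", "temperature": 19, "reading_ts": "2024-01-01T00:05:00Z"}',
    '{"device_id": "a1", "temperature": "21.5", "reading_ts": "2024-01-01T00:00:00Z"}',
]

load_ts = datetime.now(timezone.utc).isoformat()
bronze_rows = [
    to_bronze_row(line, BRONZE_SCHEMA, load_ts, "readings.json")
    for line in raw_lines
]
```

Every raw record, including the duplicate, ends up in `bronze_rows`; only the column names and types are enforced. Deduplication, cleaning, and aggregation would happen in later layers.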

Reference: What is a Medallion Architecture?; Raw Data Ingestion into Delta Lake Bronze tables using Azure Synapse Mapping Data Flow; Apache Spark + Delta Lake concepts; Delta Lake Architecture & Azure Databricks Workspace.
