What is a characteristic of Pig?

A. Performs real-time reads and writes in HDFS
B. Uses HiveQL to translate SQL queries into MapReduce jobs
C. Data warehouse infrastructure that manages jobs in the cluster
D. Alternative language to Java programming for MapReduce
Answer: D

September 19, 2024

What enables Pravega to rapidly ingest streaming data into durable storage?

A. Apache Spark
B. Append-only logs
C. Relational database schema
D. MongoDB
Answer: B
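To illustrate why an append-only log suits fast ingestion, here is a minimal sketch in Python. It is not Pravega's implementation or API: the `AppendOnlyLog` class, the length-prefixed framing, and the `events.log` file name are all hypothetical choices made for this example. The point it shows is that every write lands at the current end of the file, so existing bytes are never rewritten and no seek or lookup is needed before a write.

```python
import os
import struct
import tempfile


class AppendOnlyLog:
    """Hypothetical minimal append-only log (illustration only, not Pravega's API)."""

    def __init__(self, path):
        self.path = path
        # "ab" opens for appending: writes always go to the end of the file.
        self._file = open(path, "ab")
        # Track the end offset ourselves so append() can report record positions.
        self._end = os.path.getsize(path)

    def append(self, record: bytes) -> int:
        """Append one length-prefixed record and return the offset where it starts."""
        offset = self._end
        frame = struct.pack(">I", len(record)) + record  # 4-byte big-endian length
        self._file.write(frame)
        self._file.flush()
        os.fsync(self._file.fileno())  # ask the OS to push the bytes to durable storage
        self._end += len(frame)
        return offset

    def read_all(self):
        """Replay the log from the start, returning records in write order."""
        records = []
        with open(self.path, "rb") as f:
            while True:
                header = f.read(4)
                if len(header) < 4:
                    break
                (length,) = struct.unpack(">I", header)
                records.append(f.read(length))
        return records

    def close(self):
        self._file.close()


def demo():
    """Append three sample events and replay them."""
    with tempfile.TemporaryDirectory() as tmp:
        log = AppendOnlyLog(os.path.join(tmp, "events.log"))
        offsets = [log.append(e) for e in (b"sensor=1", b"sensor=2", b"sensor=3")]
        records = log.read_all()
        log.close()
    return offsets, records
```

Calling `demo()` returns the start offset of each record together with the replayed records, which come back in the order they were written.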

September 17, 2024

Understand the current and future states of data governance and identify remaining gaps

Answer: B

September 17, 2024

What are three programming languages supported by Apache Spark?

A. Python, R, and Scala
B. PL/SQL, R, and Scala
C. Python, R, and C++
D. Python, C, and Scala
Answer: A

September 17, 2024

What is a difference between Data Governance and Master Data Management (MDM)?

A. Data governance is an informal system of decision making. MDM is a formal system of decision making.
B. Data governance implementation involves people, policies, rules, and metrics. MDM is done automatically with no interference.
C. Data governance...

September 16, 2024

Which schema type should be used for this requirement?

You are designing a database for an e-commerce data warehouse. The data is normalized in 1NF format and will be used to create data marts for different departments inside the organization. Which schema type should be used for this requirement?
A. Star
B. Columnar
C. Highly Normalized
D. Graph
Answer:...

September 16, 2024

What is the primary purpose of Apache Kafka in a data processing architecture?

A. Storing historical data
B. Running machine learning algorithms
C. Processing real-time data streams
D. Running complex SQL queries
Answer: C

September 14, 2024

Match each data type with its corresponding Redis data structure.


September 12, 2024

Which tools can be used for ELT?

In the ELT process, data is transformed after being loaded into the target system. Which tools can be used for ELT? (Select all that apply)
A. Apache Hadoop
B. Apache Spark
C. Talend
D. Microsoft SSIS
E. Amazon Redshift
Answer: BCD
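The load-then-transform order described in the question can be sketched with Python's built-in `sqlite3` module. This is only an illustration under stated assumptions: an in-memory SQLite database stands in for the target system, and the table names `raw_orders` and `orders_by_country`, the column layout, and the sample rows are hypothetical. None of the tools listed in the options is used here.

```python
import sqlite3


def run_elt(rows):
    """Load raw rows into the target first, then transform them there with SQL."""
    conn = sqlite3.connect(":memory:")  # stand-in for the target system (assumption)

    # Load: raw values land in the target untouched, all stored as text.
    conn.execute("CREATE TABLE raw_orders (order_id TEXT, amount TEXT, country TEXT)")
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", rows)

    # Transform: cleaning and aggregation run inside the target, after loading.
    conn.execute(
        """
        CREATE TABLE orders_by_country AS
        SELECT UPPER(TRIM(country)) AS country,
               SUM(CAST(amount AS REAL)) AS total_amount
        FROM raw_orders
        GROUP BY UPPER(TRIM(country))
        """
    )

    result = conn.execute(
        "SELECT country, total_amount FROM orders_by_country ORDER BY country"
    ).fetchall()
    conn.close()
    return result


# Hypothetical sample data with inconsistent country formatting.
sample_rows = [("1", "10.5", " us"), ("2", "4.5", "US "), ("3", "7", "de")]
print(run_elt(sample_rows))  # [('DE', 7.0), ('US', 15.0)]
```

In an ETL pipeline the trimming, casting, and aggregation would instead happen before the insert, so the target would only ever receive the cleaned rows.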

September 12, 2024

How is replication implemented in Redis?

A. Client-server communication protocol is used, whereby every client replicates data from the server
B. Data is shared among many computers to enable fault tolerance and data accessibility
C. Meta information is generated based on the client-server communication protocol
D. The developer copies and...

September 12, 2024