Which approach meets the requirement for a centralized metadata layer?

A company has several teams of analysts, and each team has its own cluster. The teams need to run SQL queries using Hive, Spark SQL, and Presto on Amazon EMR. The company needs a centralized metadata layer that exposes the Amazon S3 objects as tables to the analysts.

Which approach meets the requirement for a centralized metadata layer?
A. EMRFS consistent view with a common Amazon DynamoDB table
B. Bootstrap action to change the Hive Metastore to an Amazon RDS database
C. S3DistCp with the outputManifest option to generate RDS DDL
D. Naming scheme support with automatic partition discovery from Amazon S3

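Answer: B

Explanation: An external Hive metastore in Amazon RDS can be shared by every cluster, so Hive, Spark SQL, and Presto all see the same table definitions over the S3 data. EMRFS consistent view (A) only tracks S3 object consistency in a DynamoDB table; it does not store table metadata.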
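
For reference, the shared-metastore approach in option B comes down to pointing each cluster's Hive metastore at one external database. On Amazon EMR this is commonly expressed through the `hive-site` configuration classification. The sketch below only builds that configuration as JSON; the helper name `shared_metastore_config`, the host name, the database name, and the credentials are hypothetical placeholders, and the driver class assumes a MySQL-compatible RDS database.

```python
import json


def shared_metastore_config(jdbc_url, user, password):
    """Build a hive-site classification that points Hive at an external database.

    Hypothetical helper for illustration; the property keys are the standard
    Hive JDO connection settings.
    """
    return [
        {
            "Classification": "hive-site",
            "Properties": {
                "javax.jdo.option.ConnectionURL": jdbc_url,
                # Assumes a MySQL-compatible database (MariaDB JDBC driver).
                "javax.jdo.option.ConnectionDriverName": "org.mariadb.jdbc.Driver",
                "javax.jdo.option.ConnectionUserName": user,
                "javax.jdo.option.ConnectionPassword": password,
            },
        }
    ]


# Placeholder endpoint and credentials; replace with real values.
config = shared_metastore_config(
    "jdbc:mysql://metastore.example.internal:3306/hive?createDatabaseIfNotExist=true",
    "hive_user",
    "change-me",
)
print(json.dumps(config, indent=2))
```

If every team's cluster is launched with the same configuration, all of them would read table definitions from the same database, which is what makes the metadata layer centralized.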
