Hortonworks Apache Hadoop Developer Hadoop 2.0 Certification exam for Pig and Hive Developer Online Training

Question #1

Which one of the following statements describes a Pig bag. tuple, and map, respectively?

A . Unordered collection of maps, ordered collection of tuples, ordered set of key/value pairs
B . Unordered collection of tuples, ordered set of fields, set of key value pairs
C . Ordered set of fields, ordered collection of tuples, ordered collection of maps
D . Ordered collection of maps, ordered collection of bags, and unordered set of key/value pairs

Question #2

You want to run Hadoop jobs on your development workstation for testing before you submit them to your production cluster.

Which mode of operation in Hadoop allows you to most closely simulate a production cluster while using a single machine?

A . Run all the nodes in your production cluster as virtual machines on your development workstation.
B . Run the hadoop command with the Cjt local and the Cfs file:///options.
C . Run the DataNode, TaskTracker, NameNode and JobTracker daemons on a single machine.
D . Run simldooop, the Apache open-source software for simulating Hadoop clusters.

Reveal Solution Hide Solution

Question #3

Which HDFS command uploads a local file X into an existing HDFS directory Y?

A . hadoop scp X Y
B . hadoop fs -localPut X Y
C . hadoop fs-put X Y
D . hadoop fs -get X Y

Reveal Solution Hide Solution

Question #4

In Hadoop 2.0, which TWO of the following processes work together to provide automatic failover of the NameNode? Choose 2 answers

A . ZKFailoverController
B . ZooKeeper
C . QuorumManager
D . JournalNode

Reveal Solution Hide Solution

Question #5

To use a lava user-defined function (UDF) with Pig what must you do?

A . Define an alias to shorten the function name
B . Pass arguments to the constructor of UDFs implementation class
C . Register the JAR file containing the UDF
D . Put the JAR file into the user's home folder in HDFS

Reveal Solution Hide Solution

Question #6

When is the earliest point at which the reduce method of a given Reducer can be called?

A . As soon as at least one mapper has finished processing its input split.
B . As soon as a mapper has emitted at least one record.
C . Not until all mappers have finished processing all records.
D . It depends on the InputFormat used for the job.

Reveal Solution Hide Solution

Question #7

Which one of the following statements describes the relationship between the ResourceManager and the ApplicationMaster?

A . The ApplicationMaster requests resources from the ResourceManager
B . The ApplicationMaster starts a single instance of the ResourceManager
C . The ResourceManager monitors and restarts any failed Containers of the ApplicationMaster
D . The ApplicationMaster starts an instance of the ResourceManager within each Container

Reveal Solution Hide Solution

Question #8

Which HDFS command copies an HDFS file named foo to the local filesystem as localFoo?

A . hadoop fs -get foo LocalFoo
B . hadoop -cp foo LocalFoo
C . hadoop fs -Is foo
D . hadoop fs -put foo LocalFoo

Reveal Solution Hide Solution

Question #9

You need to perform statistical analysis in your MapReduce job and would like to call methods in the Apache Commons Math library, which is distributed as a 1.3 megabyte Java archive (JAR) file.

Which is the best way to make this library available to your MapReducer job at runtime?

A . Have your system administrator copy the JAR to all nodes in the cluster and set its location in the HADOOP_CLASSPATH environment variable before you submit your job.
B . Have your system administrator place the JAR file on a Web server accessible to all cluster nodes and then set the HTTP_JAR_URL environment variable to its location.
C . When submitting the job on the command line, specify the Clibjars option followed by the JAR file path.
D . Package your code and the Apache Commands Math library into a zip file named JobJar.zip

Reveal Solution Hide Solution

Question #9

You need to perform statistical analysis in your MapReduce job and would like to call methods in the Apache Commons Math library, which is distributed as a 1.3 megabyte Java archive (JAR) file.

Which is the best way to make this library available to your MapReducer job at runtime?

A . Have your system administrator copy the JAR to all nodes in the cluster and set its location in the HADOOP_CLASSPATH environment variable before you submit your job.
B . Have your system administrator place the JAR file on a Web server accessible to all cluster nodes and then set the HTTP_JAR_URL environment variable to its location.
C . When submitting the job on the command line, specify the Clibjars option followed by the JAR file path.
D . Package your code and the Apache Commands Math library into a zip file named JobJar.zip

Reveal Solution Hide Solution

Hortonworks Apache Hadoop Developer Hadoop 2.0 Certification exam for Pig and Hive Developer Online Training

Hortonworks Apache Hadoop Developer Hadoop 2.0 Certification exam for Pig and Hive Developer Online Training

Hortonworks Apache Hadoop Developer Online Training

The questions for Apache Hadoop Developer were last updated at Apr 08,2025.

Latest Apache Hadoop Developer Dumps Valid Version with 108 Q&As