IBM C2090-101 IBM Big Data Engineer Online Training
IBM C2090-101 Online Training
The questions for C2090-101 were last updated at Dec 26,2024.
- Exam Code: C2090-101
- Exam Name: IBM Big Data Engineer
- Certification Provider: IBM
- Latest update: Dec 26,2024
Which of the following is not a capability of Pig?
- A . Low-latency queries
- B . Schemas are optional
- C . Nested relational data model
- D . A high level abstraction on top of MapReduce
Given a file named readme.txt, which command will copy the readme.txt file to the <user> directory on the HDFS?
- A . hadoop fs Ccp readme.txt hdfs://test.ibm.com:9000/<user>
- B . hadoop fs Ccp hdfs://test.ibm.com:9000/<user> readme.txt
- C . hadoop fs Cput readme.txt hdfs://test.ibm.com:9000/<user>
- D . hadoop fs Cput hdfs://test.ibm.com:9000/<user> readme.text
Which of the following is the most effective method for improving query performance on large Hive tables?
- A . Indexing
- B . Bucketing
- C . Partitioning
- D . De-normalizing data
Which one of the following is NOT provided by the SerDe interface?
- A . SerDe interface has to be built using C or C++ language
- B . Allows SQL-style queries across data that is often not appropriate for a relational database
- C . Serializer takes a Java object that Big SQL has been working with, and turns it into a format that BigSQL can write to HDFS
- D . Deserializer interface takes a string or binary representation of a record, and translates it into a Java object that Big SQL can manipulate
Which of the following are capabilities of the Apache Spark project?
- A . Large scale machine learning
- B . Large scale graph processing
- C . Live data stream processing
- D . All of the above
Which of the following Big SQL statements is valid?
- A . CREATE TABLE t1 WITH CS;
- B . WITH t1 AS (…)
(SELECT * FROM t1 WITH RR USE AND KEEP SHARE LOCKS)
UNION ALL
(SELECT * FROM t1 WITH UR); - C . SELECT deptno, deptname, mgrno FROM t1
WHERE admrdept =‘A00’
FOR READ ONLY WITH RS USE AND KEEP EXCLUSIVE LOCKS - D . ALTER TABLE t1 ALTER COLUMN deptname SET DATA TYPE VARCHAR(100) USE AND KEEP UPDATE LOCKS
Which of the following techniques is NOT employed by Big SQL to improve performance?
- A . Query Optimization
- B . Predicate Push down
- C . Compression efficiency
- D . Load data into DB2 and return the data
When embedding SPSS models within InfoSphere Streams, what SPSS product must be installed on the same machine with InfoSphere Streams?
- A . SPSS Modeler
- B . SPSS Solution Publisher
- C . SPSS Accelerator for InfoSphere Streams
- D . None, the SPSS software runs remotely to the Streams machine
Which of the following statements regarding Sqoop is TRUE? (Choose two.)
- A . All columns in a table must be imported
- B . Sqoop bypasses MapReduce for enhanced performance
- C . Each row from a source table is represented as a separate record in HDFS
- D . When using a password file, the file containing the password must reside in HDFS
- E . Multiple options files can be specified when invoking Sqoop from the command line
Use of Bulk Load in HBase for loading large volume of data will result in which of the following?
- A . It will use less CPU but will use more network resource
- B . It will use less network resource but more CPU
- C . It will behave same way as using HBase API for loading large volume of data
- D . None of the above