Big Data Analytics | Hadoop Introduction, Ecosystem MCQs: This section contains the Multiple-Choice Questions & Answers on Big Data Analytics - Hadoop Introduction, Ecosystem with explanations.

questions

Title
_____ is a platform for developing data flows for the extraction, transformation, and loading (ETL) of huge datasets, as well as for data analysis

_____ is a platform for developing data flows for the extraction, transformation, and loading (ETL) of huge datasets, as well as for data analysis.


  1. Spark
  2. HBase
  3. Hive
  4. Pig
discuss
In contrast to relational databases, Hive is a query engine that supports the elements of SQL that are specifically designed for querying data

In contrast to relational databases, Hive is a query engine that supports the elements of SQL that are specifically designed for querying data.


  1. True
  2. False
discuss
Custom extensions built in the ____ programming language are also supported by Hive

Custom extensions built in the ____ programming language are also supported by Hive.


  1. Java
  2. C#
  3. C
  4. C++
discuss
Amongst which of the following is / are correct,

Amongst which of the following is / are correct,


  1. Hive is a relational database that supports SQL queries.
  2. Pig is a relational database that supports SQL queries.
  3. Both A and B
  4. None of the mentioned above
discuss
In order to analyze all of this Big Data, Hive is a tool that has been developed

In order to analyze all of this Big Data, Hive is a tool that has been developed.


  1. True
  2. False
discuss
____ general-purpose model and runtime framework for distributed data analytics

____ general-purpose model and runtime framework for distributed data analytics.


  1. Mapreduce
  2. Spark
  3. Hive
  4. All of the mentioned above
discuss
Scalability is prioritized over latency in jobs such as _____

Scalability is prioritized over latency in jobs such as _____.


  1. HBase
  2. HDFS
  3. Hive
  4. Mapreduce
discuss
______ node serves as the Slave and is responsible for carrying out the Tasks that have been assigned to it by the JobTracker

______ node serves as the Slave and is responsible for carrying out the Tasks that have been assigned to it by the JobTracker.


  1. TaskReduce
  2. Mapreduce
  3. TaskTracker
  4. JobTracker
discuss
Apache Hive is data storage and ______ that stores and organizes data for study and querying

Apache Hive is data storage and ______ that stores and organizes data for study and querying.


  1. Querying tool
  2. Mapper
  3. MapReduce
  4. All of the mentioned above
discuss
The MapReduce framework is responsible for processing one or more pieces of data and producing the output results as ______

The MapReduce framework is responsible for processing one or more pieces of data and producing the output results as ______.


  1. Maptask
  2. Task execution
  3. Mapper
  4. All of the mentioned above
discuss
Apache Hive is a data ______ infrastructure that is built on top of the Hadoop platform

Apache Hive is a data ______ infrastructure that is built on top of the Hadoop platform.


  1. Warehouse
  2. Map
  3. Reduce
  4. None of the mentioned above
discuss
The Hadoop framework is built in Java, which means that MapReduce applications do not need to be written in _____

The Hadoop framework is built in Java, which means that MapReduce applications do not need to be written in _____.


  1. C#
  2. C
  3. Java
  4. None of the mentioned above
discuss
_____ maps input key/value pairs to a set of intermediate key/value pairs

_____ maps input key/value pairs to a set of intermediate key/value pairs.


  1. Reducer
  2. Mapper
  3. File system
  4. All of these
discuss
HQL is a query language that is used to construct the custom map-reduce framework in Hive, which is written in ______

HQL is a query language that is used to construct the custom map-reduce framework in Hive, which is written in ______.


  1. Java
  2. PHP
  3. C#
  4. None of the mentioned above
discuss
The _______ is the default partitioned in Hadoop, and it offers a method called getPartition that allows us to partition data

The _______ is the default partitioned in Hadoop, and it offers a method called getPartition that allows us to partition data.


  1. HashPartitioner
  2. Map function
  3. Reduce function
  4. All of the mentioned above
discuss
______ is best described as a programming model that is used to construct Hadoop-based applications that can be scaled up and down

______ is best described as a programming model that is used to construct Hadoop-based applications that can be scaled up and down.


  1. Oozie
  2. Zookepper
  3. MapReduce
  4. All of the mentioned above
discuss
Amongst which of the following is/are the Hive function Meta commands

Amongst which of the following is/are the Hive function Meta commands.


  1. Show functions
  2. Describe function
  3. Both A and B
  4. None of the mentioned above
discuss
_____ is a shell utility that can be used to run Hive queries in either interactive or batch mode, depending on the situation

_____ is a shell utility that can be used to run Hive queries in either interactive or batch mode, depending on the situation.


  1. $HIVE_HOME/bin/hive
  2. $HIVE/bin/
  3. $HIVE_HOME/hive
  4. All of the mentioned above
discuss
The _____ tool has the capability of listing all of the possible database schemas

The _____ tool has the capability of listing all of the possible database schemas.


  1. sqoop-list-databases
  2. Hbase-list
  3. hive schema
  4. sqoop-list-columns
discuss
Amongst which of the following is/are true with reference to User-defined Functions of Hive

Amongst which of the following is/are true with reference to User-defined Functions of Hive.


  1. function that fetches one or more columns from a row as arguments
  2. It returns a single value
  3. Both A and B
  4. None of the mentioned above
discuss
total MCQs: 30

MCQs

30

Views

118

Best Answers

299

Points

5