This set of MCQs helps students learn about HDFS (Hadoop Distributed File System), the primary data storage system used by Hadoop applications. HDFS uses a NameNode and DataNode architecture to implement a distributed file system that provides high-performance access to data across highly scalable Hadoop clusters.
A ________ serves as the master and there is only one NameNode per cluster.
Which of the following scenarios may not be a good fit for HDFS?
________ NameNode is used when the Primary NameNode goes down.
________ is the slave/worker node and holds the user data in the form of Data Blocks.
HDFS provides a command-line interface called __________ that is used to interact with HDFS.
HDFS is implemented in the _____________ programming language.
For YARN, the ___________ Manager UI provides host and port information.
For ________, the HBase Master UI provides information about the HBase Master uptime.
________ is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks.
The JobTracker pushes work out to available _______ nodes in the cluster, striving to keep the work as close to the data as possible.
On a tasktracker, the map task passes the split to the createRecordReader() method on InputFormat to obtain a _________ for that split.
HDFS works in a __________ fashion.
Master-worker
Master-slave
Worker/slave
All of the mentioned
The daemons associated with the MapReduce phase are ________ and task-trackers.
Job-tracker
Map-tracker
Reduce-tracker
All of the mentioned
The InputFormat class calls the ________ function, computes splits for each file, and then sends them to the jobtracker.
Puts
Gets
getSplits
All of the mentioned
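As a refresher for the MapReduce questions above, the following is a minimal Python sketch of the programming model: the input is cut into splits, each split is mapped independently, the intermediate pairs are grouped by key, and each group is reduced. It is not Hadoop code. The function names (get_splits, map_task, shuffle, reduce_task, word_count) and the line-based split rule are assumptions made for illustration only; Hadoop's own InputFormat computes splits from file and block sizes.

```python
from collections import defaultdict


def get_splits(lines, split_size):
    """Hypothetical helper: cut the input into chunks of at most split_size lines."""
    return [lines[i:i + split_size] for i in range(0, len(lines), split_size)]


def map_task(split):
    """Map step: emit a (word, 1) pair for every word in one split."""
    return [(word.lower(), 1) for line in split for word in line.split()]


def shuffle(mapped_outputs):
    """Group the values from all map outputs by key."""
    groups = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_task(key, values):
    """Reduce step: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines, split_size=2):
    """Run the whole pipeline in a single process (no cluster involved)."""
    splits = get_splits(lines, split_size)
    # Each map task depends only on its own split, so they could run in parallel.
    mapped = [map_task(split) for split in splits]
    grouped = shuffle(mapped)
    return dict(reduce_task(key, values) for key, values in grouped.items())
```

Calling word_count(["a b", "b c", "a"]) processes two splits and returns the combined count for each word.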