HDFS – Hadoop Distributed File System

This set of MCQs helps students to learn about HDFS – Hadoop Distributed File System, which is the primary data storage system used by Hadoop applications. HDFS employs a NameNode and DataNode architecture to implement a distributed file system that provides high-performance access to data across highly scalable Hadoop clusters.

Start Quiz

A ________ serves as the master and there is only one NameNode per cluster.

Data Node NameNode Data block Replication

Which of the following scenario may not be a good fit for HDFS?

HDFS is not suitable for scenarios requiring multiple/simultaneous writes to the same file HDFS is suitable for storing data related to applications requiring low latency data access HDFS is suitable for storing data related to applications requiring low latency data access None of the mentioned

________ NameNode is used when the Primary NameNode goes down.

Rack Data Secondary None of the mentioned

________ is the slave/worker node and holds the user data in the form of Data Blocks.

DataNode NameNode Data block Replication

HDFS provides a command line interface called __________ used to interact with HDFS.

“HDFS Shell” “FS Shell” “DFS Shell” None of the mentioned

HDFS is implemented in _____________ programming language.

C++ Java Scala None of the mentioned

For YARN, the ___________ Manager UI provides host and port information.

Data Node NameNode Resource Replication

For ________ the HBase Master UI provides information about the HBase Master uptime.

HBase Oozie Kafka All of the mentioned

________ is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks.

Hive MapReduce Pig Lucene

The JobTracker pushes work out to available _______ nodes in the cluster, striving to keep the work as close to the data as possible.

DataNodes TaskTracker ActionNodes All of the mentioned

On a tasktracker, the map task passes the split to the createRecordReader() method on InputFormat to obtain a _________ for that split.

InputReader RecordReader OutputReader None of the mentioned

HDFS works in a __________ fashion.

Master-worker

Master-slave

Worker/slave

All of the mentioned

The daemons associated with the MapReduce phase are ________ and task-trackers.

Job-tracker

Map-tracker

Reduce-tracker

All of the mentioned

InputFormat class calls the ________ function and computes splits for each file and then sends them to the jobtracker.

Puts

Gets

GetSplits

All of the mentioned

Quiz/Test Summary
Title: HDFS – Hadoop Distributed File System
Questions: 14
Contributed by:
Steve