HDFS – Hadoop Distributed File System

This set of MCQs helps students to learn about HDFS – Hadoop Distributed File System, which is the primary data storage system used by Hadoop applications. HDFS employs a NameNode and DataNode architecture to implement a distributed file system that provides high-performance access to data across highly scalable Hadoop clusters.
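
As a rough illustration of the NameNode and DataNode split described above, here is a toy in-memory sketch. It is not the Hadoop API: the class names `NameNode` and `DataNode`, the 4-byte block size, and the round-robin replica placement are all assumptions made for illustration (real HDFS uses much larger blocks and rack-aware placement).

```python
from dataclasses import dataclass, field


@dataclass
class DataNode:
    """Toy worker node: stores raw block contents keyed by block id."""
    name: str
    blocks: dict = field(default_factory=dict)


class NameNode:
    """Toy master node: keeps only metadata (path -> block ids -> DataNode names)."""

    def __init__(self, datanodes, block_size=4, replication=2):
        self.datanodes = datanodes
        self.block_size = block_size    # illustrative value, far smaller than real HDFS blocks
        self.replication = replication
        self.block_map = {}             # path -> list of (block_id, [datanode names])

    def write(self, path, data):
        entries = []
        for offset in range(0, len(data), self.block_size):
            index = len(entries)
            block_id = f"{path}#{index}"
            chunk = data[offset:offset + self.block_size]
            # Assumed round-robin placement, chosen only to keep the sketch short.
            targets = [self.datanodes[(index + r) % len(self.datanodes)]
                       for r in range(self.replication)]
            for dn in targets:
                dn.blocks[block_id] = chunk
            entries.append((block_id, [dn.name for dn in targets]))
        self.block_map[path] = entries

    def read(self, path):
        by_name = {dn.name: dn for dn in self.datanodes}
        # Fetch each block from the first replica recorded in the metadata.
        return b"".join(by_name[names[0]].blocks[block_id]
                        for block_id, names in self.block_map[path])
```

In this sketch the master holds no file contents at all, only the mapping from each file to its blocks and their locations, while the workers hold the block data itself.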

A ________ serves as the master and there is only one NameNode per cluster.

DataNode
NameNode
Data block
Replication

Which of the following scenarios may not be a good fit for HDFS?

HDFS is not suitable for scenarios requiring multiple/simultaneous writes to the same file
HDFS is suitable for storing data related to applications requiring low latency data access
None of the mentioned

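The ________ NameNode keeps periodic checkpoints of the namespace, which can be used for recovery when the Primary NameNode goes down (it is not an automatic failover node).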

Rack
Data
Secondary
None of the mentioned

________ is the slave/worker node and holds the user data in the form of Data Blocks.

DataNode
NameNode
Data block
Replication

HDFS provides a command line interface called __________ used to interact with HDFS.

“HDFS Shell”
“FS Shell”
“DFS Shell”
None of the mentioned

HDFS is implemented in the _____________ programming language.

C++
Java
Scala
None of the mentioned

For YARN, the ___________ Manager UI provides host and port information.

DataNode
NameNode
Resource
Replication

For ________, the HBase Master UI provides information about the HBase Master uptime.

HBase
Oozie
Kafka
All of the mentioned