This set of MCQs helps students to learn about Hadoop I/O that form the building blocks for developing distributed systems, such as serialization frameworks and on-disk data structures and a transform for reading data from any source or writing data to any sink that implements Hadoop's Input Format or Output Format accordingly.
Apache Hadoop ___________ provides a persistent data structure for binary key-value pairs.
How many formats of SequenceFile are present in Hadoop I/O?
Which of the following format is more compression-aggressive?
The __________ is a directory that contains two SequenceFile.
The ______ file is populated with the key and a LongWritable that contains the starting byte position of the record.
The _________ as just the value field append(value) and the key is a LongWritable that contains the record number, count + 1.
____________ data file takes is based on avro serialization framework which was primarily created for hadoop.
The _________ codec from Google provides modest compression ratios.
Which of the following compression is similar to Snappy compression?
Which of the following supports splittable compression?
Which of the following is the slowest compression technique?
Gzip (short for GNU zip) generates compressed files that have a _________ extension.
Which of the following is based on the DEFLATE algorithm?
__________ typically compresses files to within 10% to 15% of the best available techniques.
Hadoop I/O Hadoop comes with a set of ________ for data I/O.
Methods
Commands
Classes
None of the mentioned