Basics of Hadoop

This set of MCQs helps students to learn about basics of Hadoop which is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models and it is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

Start Quiz

IBM and ________ have announced a major initiative to use Hadoop to support university courses in distributed computer programming.

Google Latitude Android (operating system) Google Variations Google

What license is Hadoop distributed under?

Apache License 2.0 Mozilla Public License Shareware Commercial

Sun also has the Hadoop Live CD ________ project, which allows running a fully functional Hadoop cluster using a live CD.

OpenOffice.org OpenSolaris GNU Linux

Which of the following genres does Hadoop produce?

Distributed file system JAX-RS Java Message Service Relational Database Management System

What was Hadoop written in?

Java (software platform) Perl Java (programming language) Lua (programming language)

Which of the following platforms does Hadoop run on?

Bare metal Debian Cross-platform Unix-like

Hadoop achieves reliability by replicating the data across multiple hosts and hence does not require ________ storage on hosts.

RAID Standard RAID levels ZFS Operating system

Above the file systems comes the ________ engine, which consists of one Job Tracker, to which client applications submit MapReduce jobs.

MapReduce Google Functional programming Facebook

The Hadoop list includes the HBase database, the Apache Mahout ________ system, and matrix operations.

Machine learning Pattern recognition Statistical classification Artificial intelligence

According to analysts, for what can traditional IT systems provide a foundation when they’re integrated with big data technologies like Hadoop?

Big data management and data mining Data warehousing and business intelligence Management of Hadoop clusters Collecting and storing unstructured data

What was Hadoop named after?

Creator Doug Cutting’s favorite circus act Cutting’s high school rock band The toy elephant of Cutting’s son A sound Cutting’s laptop made during Hadoop development

__________ can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of data.

MapReduce Mahout Oozie All of the mentioned

__________ has the world’s largest Hadoop cluster.

Apple Datamatics Facebook None of the mentioned

All of the following accurately describe Hadoop, EXCEPT ____________.

Open-source

Real-time

Java-based

Distributed computing approach

As companies move past the experimental phase with Hadoop, many cite the need for additional capabilities, including _______________.

Improved data storage and information retrieval

Improved extract, transform and load features for data integration

Improved data warehousing functionality

Improved security, workload management, and SQL support

Quiz/Test Summary
Title: Basics of Hadoop
Questions: 15
Contributed by:
Steve