This set of MCQs covers the basics of Hadoop, an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
IBM and ________ have announced a major initiative to use Hadoop to support university courses in distributed computer programming.
What license is Hadoop distributed under?
Sun also has the Hadoop Live CD ________ project, which allows running a fully functional Hadoop cluster using a live CD.
Which of the following genres does Hadoop produce?
What language was Hadoop written in?
Which of the following platforms does Hadoop run on?
Hadoop achieves reliability by replicating the data across multiple hosts and hence does not require ________ storage on hosts.
Above the file systems comes the ________ engine, which consists of one JobTracker, to which client applications submit MapReduce jobs.
The Hadoop list includes the HBase database, the Apache Mahout ________ system, and matrix operations.
According to analysts, what can traditional IT systems provide a foundation for when they are integrated with big data technologies like Hadoop?
What was Hadoop named after?
__________ can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of data.
__________ has the world’s largest Hadoop cluster.
All of the following accurately describe Hadoop, EXCEPT ____________.
Open-source
Real-time
Java-based
Distributed computing approach
As companies move past the experimental phase with Hadoop, many cite the need for additional capabilities, including _______________.
Improved data storage and information retrieval
Improved extract, transform and load features for data integration
Improved data warehousing functionality
Improved security, workload management, and SQL support