This set of MCQs helps students learn about Apache Spark, Flume, Lucene, Hama, HCatalog, Mahout, Drill, Crunch and Thrift in the Hadoop ecosystem.
Spark runs on top of ___________, a cluster manager that provides efficient resource isolation across distributed applications.
Which of the following can be used to launch Spark jobs inside MapReduce?
Which of the following languages is not supported by Spark?
Spark is packaged with higher level libraries, including support for _________ queries.
Spark is engineered from the bottom up for performance, running ___________ faster than Hadoop by exploiting in-memory computing and other optimizations.
Spark includes a collection of over ________ operators for transforming data and familiar data frame APIs for manipulating semi-structured data.
All file access uses Java’s __________ APIs, which give Lucene stronger index safety.
The new ____________ type enables indexing and searching of date ranges, particularly multi-valued ones.
SolrJ now has first-class support for the __________ API.
____________ Collection API allows for even distribution of custom replica properties.
Heap usage during IndexWriter merging is also much lower with the new _________.
Users can easily run Spark on top of Amazon’s __________.
Infosphere
EC2
EMR
None of the mentioned
PostingsFormat now uses a __________ API when writing postings, just like doc values.
Push
Pull
Read
All of the mentioned