Bigdata MCQ QA Part2
Bigdata MCQ QA Part2
6) Combiner increases the amount of work to be done by the reducer by reducing the network
traffic.
a) True
b) False
12) What happens if a local variable exists with the same name as the global variable you want
to access?
a) Global variable is shadowed
17) In order to change settings within Hive session for dynamic partitioning we should set the
parameter: Hive.exec.dynamic.partition.mode=strict
a) True
b) False
20) Is it possible to delete a kafka topic when the broker is down?(multiple options)
a) Yes, we can delete it
b) Deletion will be recorded by zookeeper
c) No, topics are default category feed
d) No, topics cannot be deleted when the broker is unavailable
21) Why is Apache spark considered as an integrated solution for processing on all lambda
architecture?
a) It contains Spark SQL for SQL and structured data processing
b) It contains Spark Streaming that enables scalable, high-throughput, fault-tolerant
stream processing of live data structures
c) All the above
d) It contains Spark Core that includes high-level API and an optimized engine that
supports general execution graphs
22) Each version of data within a cell adds a versioning information through a
a) Version Value
b) Keyvalue
c) KeyNo
d) VersionNo
24) The underlying data is not deleted from HDFS when an HIVE external table is dropped
a) True
25) What does the following code print?
Val mammals=List(“Lion”,”Dolphin”,”Whale”)
Val mammalsRdd = sc.parallelize(mammals)
Val mammalsLengthRdd= mammalsRdd.map{(m:String)=>
m.length
}
mammalsLengthRdd.collect().foreach(println)
a) 16
b) 4,7,5
c) Lion,Dolphin,Whale
d) None
28) {0:f},{1:2f},{2:05.2f}.format(1.23456,1.23456,1.23456)
Output?
a) ‘1.234560,1.22345,1.23’
b) Error
c) ‘1.234560,1.23460,01.23’
d) No output
30) How would the data received from a GPS satellite and the Web be classified as?
a) Structured
b) Unstructured
c) Both structured and unstructured
d) Semi structured
33) Mapper and Reducer implementations can use the _____ to report progress or just indicate
that they are alive.
a) Reporter
b) Partitioner
c) OutputCollector
d) All the above
34) ______ is the utility which allows users to create and run jobs with any executable as the
mapper and/or the Reducer.
a) Hadoop Streaming
b) Hadoop Strdata
c) None
d) Hadoop Stream
37) Mapreduce code can be written in various languages other than java
a) True
b) False
43) To list tables with prefix ‘page’ in Hive, we use the syntax:
a) SHOW TABLES ‘page.*’
44) Can multiple clients write into an HDFS file concurrently?
a) True
b) False
45) Node manager runs services on the node to check its health and report the same to
resource manager
a) True
b) False
46) Thrift server in hive doesn’t allow external clients to interact with Hive over a network
a) True
b) False
47) HIVE also support custom extentions written in
a) Python
b) Scala
c) Ruby
d) Java
48) Once compilation and optimization completes, the executer executes the task
a) True
b) False
49) Client node load the data on the hadoop cluster
a) True
b) False
50) Which of the following is a platform for constructing data flows for extract, transform and
load(ETL) processing and analysis of large datasets
a) Oozie
b) Pig latin
c) Hive
d) Sqoop
51) Map operator trees are executed on mapper
a) True
b) False
52) _____ manager service feature monitors dozens of service health and performance metrics
about the services and role instances running on your cluster
a) Google
b) Amazon
c) None
d) Cloudera
53) Name node is monitored and upgraded in a ____ transition
a) Secure mode
b) Service mode
c) Safe mode
d) Boot mode
54) Check pointing is a feature for any non stateful transformation
a) True
b) False
55) Which are the three major parallel computing platforms
a) Network,cloud,multitenancy
b) Iaas, Paas,Saas
c) Clusters or grids, MPP, HPC
d) Database, sql, network
56) Hive is designed mainly for
a) None
b) OLAP,OLTP
c) OLTP
d) OLAP
57) Hive shell can run in both non-interactive mode and interactive mode
a) True
b) False
58) Which of the following jobs are optimized for scalability but not latency
a) Hive
b) Hadoop
c) Oozie
d) Pig
59) Which of the following is not an output format in hadoop
a) KepInputFormat
b) SequenceFileInputFormat
c) ByteInputFormat
d) TextInputFormat
60) Choose the correct statement:
a) Action operation evaluates and returns a new value
b) Transformations return a single value
c) An action function is called on a RDD object, all the data processing queries are
computed at that time and the result value is returned in new RDD
61) Hadoop cluster establishes the connection to the client using HTTP protocol
a) True
b) False
62) HAdoop is a framework that works with a variety of related tools. Common cohorts include
a) Mapreduce, Hummer, Iguama
b) Mapreduce, Heron and trumpet
c) Mapreduce, Mysql and google apps
d) Mapreduce, Hive, and Hbase
63) Data locality feature in Hadoop means ______
a) Relocate the data from one node to another
b) Distribute the data across multiple nodes
c) Store the same data across multiple nodes
d) Co-locate the data with the computing nodes