Spark Preliminaries MCQs Solution | TCS Fresco Play | Fresco Play
Make an effort to understand these solutions and apply them to your Hands-On difficulties. (It is not advisable that copy and paste these solutions).
All Question of the MCQs Present Below for Ease Use Ctrl + F with the question name to find the Question. All the Best!
1. Which of the following is NOT a characteristic shared by Hadoop and Spark?
Both have their own file system --correct
2. Apache spark has which of the following capabilities?
All the options --correct
3. Which of the following application types can Spark run in addition to batch-processing jobs?
All the options --correct
4. What kind of data can be handled by Spark ?
All the options --correct
5. What year was Apache Spark made an open source technology?
2010 --correct
6. Spark can store its data in?
All the options --correct
7. The transformation which produces one output value for each input value and the operation which produces an arbitrary number values for each input value.
map(),flatmap() --correct
8. Choose correct statement
Execution starts with the call of Action --correct
9. Choose correct statement about RDD
RDD is a distributed data structure --correct
10. Identify correct transformation
All the options --correct
11. We can edit the data of RDD like conversion to uppercase
False --correct
12. Which action returns all the elements of the dataset as an array.
collect() --correct
13. RDD is
All the options --correct
14. An instance of the Spark SQL execution engine that integrates with data stored in Hive:
HiveContext --correct
15. Spark supports loading data from Hbase.
True --correct
16. Benefits of using appropriate file formats in Spark
All the options --correct
17. Which of the following file formats are supported by Spark ?
All the options --correct
18. Spark can integrate with which of the following data storage systems?
All the options --correct
19. Which of the following Scala statement would be most appropriate to load the data (sfpd.txt) into an RDD? Assume that SparkContext is available as the variable “sc” and SQLContext as the variable “sqlContext.”
val sfpd=sc.loadFile(“/path to file/sfpd.txt”)
val sfpd=sc.loadText(“/path to file/sfpd.txt”)
val sfpd=sc.textFile(“/path to file/sfpd.txt”)
val sfpd=sqlContext.loadText(“/path to file/sfpd.txt”)
20. Which of the following is true of running a Spark application on Hadoop YARN?
There are two deploy modes that can be used to launch Spark applications on YARN – client mode and cluster mode --correct
21. Which is responsible for task scheduling and memory management ?
Spark Core --correct
22. Which tells spark how and where to access a cluster
Spark Context --correct
23. To launch a Spark application in any one of the four modes(local, standalone, MESOS or YARN) use
./bin/spark-submit --correct
24. Types of operations that can be performed on RDDs
Action and Map --correct
25. Which is the default Storage level in Spark ?
MEMORY_ONLY --correct
26. Which of the following is true of caching the RDD ?
All the options --correct
27. RDDs can also be unpersisted to remove RDD from a permanent storage like memory and/or disk.
True--correct
28. By default Spark uses which algorithm to remove old and unused RDD to release more memory.
Least Recently Used (LRU) --correct
29. Which is not a Storage level in Spark ?
HEAPANDDISK --correct
Post a Comment