Mock Quiz Hub

Mock Quiz Hub

Recent Updates
Added: OS Mid 1 Quiz
Added: OS Mid 2 Quiz
Added: OS Lab 1 Quiz
Time: 00:00

Quiz

Navigate through questions using the controls below

0%
Question 1 of 60 Quiz ID: q1
What are the three main characteristics that differentiate Big Data from data handled by earlier generation databases?
Question 2 of 60 Quiz ID: q2
Which of the following was mentioned as an early source of Big Data?
Question 3 of 60 Quiz ID: q3
What are transaction processing systems for Big Data often willing to sacrifice in exchange for very high scalability?
Question 4 of 60 Quiz ID: q4
Which of the following is NOT mentioned as a Big Data storage system?
Question 5 of 60 Quiz ID: q5
What is a key characteristic of distributed file systems according to the lecture?
Question 6 of 60 Quiz ID: q6
What scale example is given for highly scalable distributed file systems?
Question 7 of 60 Quiz ID: q7
How do distributed file systems handle hardware failure?
Question 8 of 60 Quiz ID: q8
Which of the following are examples of distributed file systems mentioned in the lecture?
Question 9 of 60 Quiz ID: q9
In Hadoop File System Architecture, what is the typical block size?
Question 10 of 60 Quiz ID: q10
What role does the NameNode play in HDFS?
Question 11 of 60 Quiz ID: q11
What does a DataNode do in HDFS?
Question 12 of 60 Quiz ID: q12
What is the data coherency model used by HDFS?
Question 13 of 60 Quiz ID: q13
What is a limitation of distributed file systems mentioned in the lecture?
Question 14 of 60 Quiz ID: q14
What is sharding in the context of Big Data?
Question 15 of 60 Quiz ID: q15
In the sharding example given, how might records be distributed?
Question 16 of 60 Quiz ID: q16
What is a positive aspect of sharding mentioned in the lecture?
Question 17 of 60 Quiz ID: q17
What is a major drawback of sharding?
Question 18 of 60 Quiz ID: q18
When were parallel databases originally developed?
Question 19 of 60 Quiz ID: q19
What scale were parallel databases originally designed for?
Question 20 of 60 Quiz ID: q20
How do parallel databases typically handle query failure due to machine failure?
Question 21 of 60 Quiz ID: q21
How do Map-reduce systems handle failures compared to parallel databases?
Question 22 of 60 Quiz ID: q22
Why is availability essential for parallel/distributed databases?
Question 23 of 60 Quiz ID: q23
What does consistency mean in the context of replicated data?
Question 24 of 60 Quiz ID: q24
In the majority protocol example given, if there are 3 replicas, how many replicas must reads/writes access?
Question 25 of 60 Quiz ID: q25
What does Brewer's CAP 'Theorem' state about network partitions?
Question 26 of 60 Quiz ID: q26
What does the MapReduce paradigm provide?
Question 27 of 60 Quiz ID: q27
What does MapReduce abstract from the programmer?
Question 28 of 60 Quiz ID: q28
What functions must the programmer provide in MapReduce?
Question 29 of 60 Quiz ID: q29
What scale of machines do very large MapReduce implementations run on?
Question 30 of 60 Quiz ID: q30
In the word count example, what does each worker do in the map phase?
Question 31 of 60 Quiz ID: q31
Given the input 'One a penny, two a penny, hot cross buns.', what would be one of the (word, count) pairs output by the map function?
Question 32 of 60 Quiz ID: q32
In the word count example, what is the final output for the word 'penny'?
Question 33 of 60 Quiz ID: q33
In the MapReduce word count pseudo-code, what does the emit function do in the map phase?
Question 34 of 60 Quiz ID: q34
What is the first attribute of the emit function called in MapReduce?
Question 35 of 60 Quiz ID: q35
What operation is effectively performed on the reduce key in MapReduce?
Question 36 of 60 Quiz ID: q36
Which companies are mentioned as widely using MapReduce for parallel processing?
Question 37 of 60 Quiz ID: q37
Which of the following is mentioned as an example use of MapReduce?
Question 38 of 60 Quiz ID: q38
What is an advantage of MapReduce over traditional SQL databases?
Question 39 of 60 Quiz ID: q39
What is a disadvantage of MapReduce compared to SQL?
Question 40 of 60 Quiz ID: q40
What do current generation execution engines natively support?
Question 41 of 60 Quiz ID: q41
Which execution engines are mentioned as examples in the lecture?
Question 42 of 60 Quiz ID: q42
What does Apache Tez provide according to the lecture?
Question 43 of 60 Quiz ID: q43
What does RDD stand for in Spark?
Question 44 of 60 Quiz ID: q44
How are RDDs computed in Spark?
Question 45 of 60 Quiz ID: q45
In which programming languages can Spark programs be written?
Question 46 of 60 Quiz ID: q46
What does streaming data refer to?
Question 47 of 60 Quiz ID: q47
Which of the following is NOT mentioned as an example of streaming data applications?
Question 48 of 60 Quiz ID: q48
What is windowing in the context of streaming data?
Question 49 of 60 Quiz ID: q49
What are the two bases on which windows may be created in streaming systems?
Question 50 of 60 Quiz ID: q50
What are punctuations used for in stream processing?
Question 51 of 60 Quiz ID: q51
What is a characteristic of continuous queries?
Question 52 of 60 Quiz ID: q52
What is a potential problem with continuous queries?
Question 53 of 60 Quiz ID: q53
What does CEP stand for in the context of stream processing?
Question 54 of 60 Quiz ID: q54
What characterizes many stream processing systems mentioned in the lecture?
Question 55 of 60 Quiz ID: q55
What is the lambda architecture in stream processing?
Question 56 of 60 Quiz ID: q56
What is a disadvantage of lambda architecture?
Question 57 of 60 Quiz ID: q57
What type of window doesn't overlap in streaming systems?
Question 58 of 60 Quiz ID: q58
What do publish-subscribe systems provide?
Question 59 of 60 Quiz ID: q59
Which parallel pub-sub system is mentioned as popular for managing streaming data?
Question 60 of 60 Quiz ID: q60
How can graphs be modelled as relations according to the lecture?

Quiz Summary

Review your answers before submitting

60
Total Questions
0
Answered
60
Remaining
00:00
Time Spent