TYCS - SEM6 - Data Science
TYCS - SEM6 - Data Science
2. Which of the following is the most important language for Data Science?
a) Java b) Ruby c) R d) Basic
17. Which of the following step is performed by data scientist after acquiring the data?
a) Data Cleansing b) Data Integration c) Data Replication d) Deletion
18. Which of the following package is used for reading excel data?
a) xlsx b) xlsc c) read.sheet d)VB
a) Collected for the first time b) Collected for the second time
26. The use of tabular data and graphs and charts makes it __________ to understand the
concept of bar charts and histograms.
28. Computer programs are written in a high level programming language; however, the
human-readable version of a program is called ………….
a) cache b) Instruction set c) source code d) word size
31. The stem and leaf displaying technique is used to present data in
36. NoSQL databases is used mainly for handling large volumes of ______________ data.
38. Amazon web services falls into which of the following cloud-computing category?
39. The _______ is a symbolic representation of facts or ideas from which information can
potentially be extracted.
40. Data mining is used to refer ______ stage in knowledge discovery in database.
42. ________analysis divides data into groups that are meaningful, useful, or both.
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
d) image
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
45. E-R model uses this symbol to represent weak entity set?
50. _____________ can be used for batch processing of data and aggregation operations.
a) Hive b) MapReduce c) Oozie d) PASCAL