0% found this document useful (0 votes)
298 views

TYCS - SEM6 - Data Science

The document contains a 50 question multiple choice quiz about data science topics. The questions cover subjects like data analysis, data visualization, databases, programming languages, cloud computing, and data modeling. The questions are intended to test knowledge of concepts important to the field of data science.

Uploaded by

TF TECH
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
298 views

TYCS - SEM6 - Data Science

The document contains a 50 question multiple choice quiz about data science topics. The questions cover subjects like data analysis, data visualization, databases, programming languages, cloud computing, and data modeling. The questions are intended to test knowledge of concepts important to the field of data science.

Uploaded by

TF TECH
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Department of CS/IT

TYCS – SEM 6 - DATA SCIENCE


MCQ Model Questions
Choose the correct option:
1. Which of the following would be more appropriate to be replaced with question mark in
the following figure?

a) Data Analysis b) Data Science c) Descriptive Analytics d) Commerce

2. Which of the following is the most important language for Data Science?
a) Java b) Ruby c) R d) Basic

3. Which of the following is the common goal of statistical modelling?


a) Inference b) Summarizing c) Subsetting d) script

4. -----------------------Shows all individual data points.


a) Box-plot b) Scatter Plot c) Line plot d) Pie chart

5. Xquery is a functional query language used to retrieve information stored in -----------


format.
a) HTML b) XML c) UML d) Jscript

6. Xpath specification has ------------------------ types of nodes


a) Four b) Five c) Six d) Seven

7. Data Visualization is also on element of the broader ------------------------


a) deliver presentation architecture b) data presentation architecture
c) dataset presentation architecture c) data process architecture
8. Which method shows hierarchical data in a nested format?
a) tree maps b) Scatter Plots c) Population pyramids d) Area Charts

9. Which of the following is most basic and commonly used techniques?


a) line charts b) Scatter plots c) Population pyramids d) Area charts

10. Which of the following is not a part of data science process?


a) discovery b) model planning c) communication building d) operationalize

11. In Xquery ___________ symbol preceded before the variable name.


a) @ b) $ c) # d) *

12. MongoDB support cross platform and is written in _____________ language.


a) C++ b) R c) Java d) Python

13. MongoDB is ___________ database.


a) SQL b) NoSQL c) RDBMS d) DBMS

14. Ridge Regression is when data suffers from ___________


a) Collinearity b) Multicollinearity c) Does not suffer d) Regression

15. Bayesian information Criterion (BIC) is related to _______________


a) Ridge regression b) AIC c) Cross validation d) Lasso Regression

16. Joins are used for combining _____________ product.


a) Vector b) Cartesian c) Scalar d) Euler

17. Which of the following step is performed by data scientist after acquiring the data?
a) Data Cleansing b) Data Integration c) Data Replication d) Deletion
18. Which of the following package is used for reading excel data?
a) xlsx b) xlsc c) read.sheet d)VB

19. Which of the following is another name for raw data?

a) destination data b) eggy data c) secondary d) Machine Learning

20. Arranging the customers names in ascending order is an example of

a) process b) information processing c) process d) information

21. Organisation, distribution and manipulation of information is classified as

a) data manipulating b) process selection

c) information extraction d) information processing

22. Quantitative data deals with _________________

a) numbers and things b) Characteristics c) images d) sketches

23. Qualitative data deals with _____________________

a) Characteristics b) numbers c) things d) price

24. Example for discrete data ______________________

a) The number of children b) height of children

c) weight of children d) behaviour of children

25. Primary data is __________________________________

a) Collected for the first time b) Collected for the second time

c) Not original data d) statistical operations have been performed.

26. The use of tabular data and graphs and charts makes it __________ to understand the
concept of bar charts and histograms.

a) easy b) difficult c) boring d) confusing


27. This language was developed by Dennis Ritchie of Bell Laboratories in order to
implement the operating system UNIX.
a) C b) C++ c) Java d) LISP

28. Computer programs are written in a high level programming language; however, the
human-readable version of a program is called ………….
a) cache b) Instruction set c) source code d) word size

29. Query language comes under:

a) Third generation b) Fourth generation c) Fifth generation d) First Generation

30. Bitmapped file formats can be most useful for ____________

a) Plots that may need to be resized

b) Plots that require animation or interactivity

c) Plots that are not scaled to a specific resolution

d) Scatterplots with many many points

31. The stem and leaf displaying technique is used to present data in

a) descriptive data analysis b) exploratory data analysis

c) nominal data analysis d) ordinal data analysis

32. Example for semi structured data__________________

a) XML data b) Relational data c) media logs d) word

33. Example for Unstructured data ________________

a) media logs b) XML data c) Relational data d) Oracle

34. Which of the following is not a NoSQL database?

a) SQL Server b) MongoDB c) Cassandra d) C


35. Which of the following is a NoSQL Database Type?

a) SQL b) Document Database c) JSON d) C++

36. NoSQL databases is used mainly for handling large volumes of ______________ data.

a) unstructured b) structured c) semi-structured d) images

37. The government and non government publications are considered as

a) external secondary data sources b) internal secondary data sources

c) external primary data sources d) internal primary data sources

38. Amazon web services falls into which of the following cloud-computing category?

a) Platform as a Service b) Software as a Service

c) Infrastructure as a Service d) Back-end as a Service

39. The _______ is a symbolic representation of facts or ideas from which information can
potentially be extracted.

a) knowledge b) data c) algorithm d) program

40. Data mining is used to refer ______ stage in knowledge discovery in database.

a) Selection b) retrieving c) discovery d) coding

41. A collection of interesting and useful patterns in database is called _______.

a) knowledge b) information c) data d) algorithm

42. ________analysis divides data into groups that are meaningful, useful, or both.

a) cluster b) text c) multimedia d) link


43. Data dictionary is _____________________

a) Large collection of data mostly stored in a computer system

b) The removal of noise errors and incorrect input from a database

c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.

d) image

44. Data cleaning is

a) Large collection of data mostly stored in a computer system

b) The removal of noise errors and incorrect input from a database

c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.

d) Decision support systems

45. E-R model uses this symbol to represent weak entity set?

a) Dotted rectangle b) Diamond c) Doubly outlined rectangle d) Square

46. Relational Algebra is

a) Data Definition Language b) Meta Language

c) Procedural query Language d) BASIC

47. What is a relationship called when it is maintained between two entities?

a) Unary b) Binary c) Ternary d)Quaternary

48. The RDBMS terminology for a row is

a) Tuple b) Relation c) Attribute d) Degree

49. CouchDB is ____________________

a) Document-oriented DBMS b) Relational DBMS


c) Compiler d) Interpreter

50. _____________ can be used for batch processing of data and aggregation operations.
a) Hive b) MapReduce c) Oozie d) PASCAL

You might also like