AI Class X Data Science
AI Class X Data Science
Students are requested to copy these questions and answers (1-24) in their AI Notebook.
1. Mann is learning AI and knew the fact that AI can be categorized into three broad domains. List these
domains.
Ans. Three domains are:
i. Data Science
ii. Computer Vision
iii. Natural Language Processing
2. Define Data Science.
Ans.
a. It is a concept of unifying statistics, data analysis, machine learning, and their related methods to
understand and analyze actual phenomena with data.
b. It employs techniques and theories drawn from many fields within the context of Mathematics,
Statistics, Computer Science, and Information Science.
3. List any six applications of Data science.
Ans.
a. Fraud and Risk Detection
b. Genetics and Genomics
c. Internet Search
d. Targeted Advertising
e. Website recommendations
f. Airline Route Planning
4. Name any five search engines.
Ans.
a. Google
b. Yahoo
c. Bing
d. Ask
e. AOL
5. Name any four platforms that use Data Science to promote their products according to the user’s
interests.
Ans.
a. Amazon
b. Twitter
c. Google Play
d. Netflix
e. LinkedIn
f. IMDB
6. How Data Science is used to identify the strategic improvements what airline companies can do to
improve their business?
Ans. The airline companies can:
i. Predict the flight delay
ii. Decide which class of airplanes to buy
iii. Whether to directly land at the destination or take a halt in between
iv. Effectively drive customer loyalty programs
7. The AI-based project requires data collection. Name the two purposes for data collection.
Ans.
a. Data for testing
b. Data for training
8. Write some examples for datasets for the following: Banks, movie theatre
Ans.
a. Banks – Databases of loans issued, account holders, locker owners, employee registrations, bank
visitors, etc.
b. Movie Theatre – Movie details, tickets sold offline and online, refreshment purchases, etc.
9. List any four sources of data from where data can be collected offline.
Ans.
a. Sensors
b. Surveys
c. Interviews
d. Observations
10. List any three sources of online data collection.
Ans.
a. Open Source Government portals
b. Reliable websites like kaggle
c. World organization’s open source statistical websites
11. Write any four points to keep in mind while accessing any data from any data sources.
Ans.
a. Data that is available for public usage only should be taken up.
b. Personal datasets should only be used with the consent of the owner.
c. One should never breach someone’s privacy to collect data.
d. Data should only be taken form reliable sources as the data collected from random sources can
be wrong or unusable.
e. Reliable sources of data ensure the authenticity of data which helps in proper training of the AI
model.
12. The tabular data format is a very suitable format for data science. State some commonly used tabular
formats for the same.
Ans.
a. CSV
b. Spreadsheet
c. SQL
13. Expand the following: CSV, ODS, SQL, DBMS
Ans.
a. CSV – Comma Separated Value
b. ODS – Open Document Spreadsheet
c. SQL – Structured Query Language
d. DBMS – Database Management System
14. Explain CSV format in short.
Ans.
a. It is a simple file format used to store tabular data.
b. Each line of this file is a data record and each record consists of one or more fields that are
separated by commas.
c. Since the values of records are separated by a comma, hence they are known as CSV files.
d. These types of formats are opened and managed by spreadsheet software, Documentation
software, and text editors.
15. Discuss the spreadsheet format for the tabular dataset.
Ans.
a. A Spreadsheet is a piece of paper or a computer program that is used for accounting and
recording data using rows and columns into which information can be entered.
b. Microsoft Excel is a program that helps in creating spreadsheets.
16. Describe the SQL format in short.
Ans.
a. SQL is a programming language also known as Structured Query Language.
b. It is a domain-specific language used in programming and is designed for managing data held in
different kinds of DBMS (Database Management System).
c. It is particularly useful in handling structured data.
17. Name any two examples of Data science?
Ans. (Any two out of the following)
Price Comparison Websites/ Website Recommendations/ Fraud and Risk detection/ Internet search/
personalized healthcare recommendations / Optimizing Traffic routes in real-time / image tagging.
18. Why do we need to collect data?
Ans. Data to a machine is similar to food for human being to function. The world of Artificial Intelligence
revolves around Data. Every company whether small or big is collecting data from as many sources as
possible. Data is called the New Gold today. It is through data collection that a business or management has
the quality information they need to make informed decisions from further analysis, study, and research.
Data collection allows them to stay on top of trends, provide answers to problems, and analyze new insights
to great effect.
19. What is data mining? Explain with example.
Ans. Data mining is the process of analyzing large data sets and extracting the useful information from
it. Data mining is used by companies to turn raw data into useful information. It is an interdisciplinary
subfield of computer science and statistics with an overall goal to extract information OR Data mining is
an automatic or semi-automatic technical process that analyses large amounts of scattered information
to make sense of it and turn it into knowledge. It looks for anomalies, patterns or correlations among
millions of records to predict results, as indicated by the SAS institute, a world leader in business
analytics.
Example: Price Comparison websites- They collect data about a product from different sites and then
analyze trends out of it and show up the most appropriate results. Data mining is also known as
Knowledge Discovery in Data (KDD).
20. What do you understand by Data Privacy? Discuss with some examples.
Ans. The world of Artificial Intelligence revolves around Data. Proper and ethical handling of own data or
user data is called data privacy. It is all about the rights of individuals with respect to their personal
information. Data privacy or information privacy is a branch of data security concerned with the proper
handling of data – consent, notice, and regulatory obligations. More specifically, practical data privacy
concerns often revolve around: Whether or how data is shared with third parties
For example:
A physical boundary, such as a locked front door, helps prevent others from entering a building
without explicit permission in the form of a key to unlock the door or a person inside opening the door.
A social boundary, such as a members-only club, only allows members to access and use club
resources.
An informational boundary, such as a non-disclosure agreement, restricts what information can be
disclosed to others. Privacy of information is extremely important in this digital age where everything is
interconnected and can be accessed and used easily. The possibilities of our private information being
extremely vulnerable are very real, which is why we require data privacy.
21. Why do apps collect data in our phone?
Ans. One of the major sources of data for many major companies is the device which all of us have in
our hands all the time: Smartphones. Smartphones have nowadays become an integral part of our lives.
Most of us use smartphones more than we interact with people around us. For the facilities that
smartphones provide us, Apps need a lot of data which is collected from the user like details about your
face, browsing history, or your geographic location, contact list etc. This data is collected with user’s
consent which he/she gives at the time of installing an app by clicking on “yes” or “allow” options which
clearly means that we ourselves are giving permissions to the Apps. Permissions by themselves are
harmless and even useful to provide users a good mobile experience. This data is collected to provide us
with a lot of facilities and features which have made our lives easier. Another reason to collect the data
is to provide us with customized recommendations and notifications according to our choices. One more
reason to collect the data is to make their app more accurate and efficient.
22. Write some applications of Data Science
Ans. Some real-life applications of Data Science:
a) Internet Search: Data science helps search engines like Google analyze vast amounts of data to provide
the most relevant search results.
b) Digital Advertisements: Data science models optimize targeted ads by analyzing user behavior,
maximizing click-through rates, and enhancing ROI.
c) Fraud and Risk Detection: Financial institutions use data science to detect fraudulent activities and
assess risks by analyzing historical transaction data.
d) Website Recommendations: Data science powers recommendation engines that suggest content or
products based on user preferences and browsing history.
e) Speech Recognition: Data science algorithms enable systems like Siri or Google Assistant to convert
spoken language into text for processing.
f) Image Recognition: Data science is used in applications such as facial recognition and object detection
by analyzing visual data from images.
g) Website Recommendations: Algorithms suggest personalized content or products to users by analyzing
browsing patterns and preferences.
h) Speech Recognition: Machine learning models process and convert spoken language into text, enhancing
virtual assistants' functionalities.
i) Image Recognition: Data science enables the identification and classification of objects in images, used
in areas like surveillance and medical imaging.
j) Virtual Reality: Data science enhances VR by analyzing user interactions, enabling immersive
experiences and personalized virtual environments.
23. How does Data Science support the AI Project Cycle?
Ans. Data Science supports the AI Project Cycle by providing tools and techniques for data collection,
analysis, and modelling, which are crucial in building AI models and deriving actionable insights.
24. Explain the role of Data Science in each stage of the AI Project Cycle with an example.
Ans. Data Science plays a key role in every stage of the AI Project Cycle:
Problem Scoping: Data Science helps define the problem using historical data.
Data Acquisition: Data Science involves gathering relevant data from various sources.
Data Exploration: Using Data Science tools, the collected data is cleaned and analyzed to uncover
insights.
Modelling: Machine learning algorithms in Data Science are used to build AI models.
Evaluation: Data Science helps evaluate the accuracy of the model by comparing predictions with actual
outcomes.
Example: In a project to predict student performance, Data Science helps gather exam scores, clean the
data, build a predictive model, and evaluate its accuracy.
3. Justify the statement – “Digital ads have been able to get a much higher CTR (Call-Through Rate) than
traditional advertisements.”
Ans.
a. Nowadays people search for any product online on e-commerce platforms.
b. Based on their search data science displays them on various websites to the digital billboards at
the airports.
c. They can be targeted based on the users’ past behaviour.
4. State any two reasons why the airline industry across the world bears heavy losses.
Ans.
a. Except for a few airline service providers, companies are struggling to maintain their occupancy
ratio and operating profits.
b. High rises in prices
c. Need to offer heavy discounts to customers
5. Write the factors that would affect the quantity of food wastage in social gathering functions or food
prepared in hotels.
Ans.
a. Total number of relatives/customers
b. Quantity of dishes prepared per day/functions
c. Real dish consumption
d. Unconsumed dish quantity per day/functions
e. Price of dish
f. Quantity of dishes for next day
6. State the use of the system map tool.
Ans.
a. The system map tool is used to figure out the relation of elements with the project’s goal.
b. In other words, a system map is effectively a list of components used in a problem statement.
c. The positive arrows determine a direct relationship of elements while the negative ones show an
inverse relationship of elements
7. What are the three domains of AI?
Ans.
● Data Science/ Big Data
● Computer Vision
● Natural Language Processing (NLP)