Chapter No.4 Exercise Solution (Computer)
4
Exercise
(Short Questions)
1. Define data analytics and data science. Are they similar or different? Give a reason.
Answer:
• Data Analytics: The process of analyzing raw data to find patterns and trends for making
decisions.
• Data Science: A broader field that involves data collection, cleaning, analysis, and using
algorithms to extract insights.
• Difference: They are related but not the same. Data analytics focuses on analyzing existing
data to support specific decisions, while data science is broader and also uses advanced tools
like machine learning for deeper analysis.
2. Can you relate how data science is helpful in solving business problems?
Answer:
Data science helps businesses by:
4. Compare machine learning and deep learning in the context of formal and informal
education.
Answer:
• Machine Learning: A subset of AI that focuses on algorithms learning from data. It is used in
education for adaptive learning tools, grading systems, and personalized learning plans.
• Deep Learning: A more advanced form of machine learning using neural networks, often
applied to complex tasks like speech recognition, automated teaching tools, and virtual tutors.
• Example: Informal learning platforms use deep learning for voice-based assistants.
5. What is meant by sources of data? Give three sources of data excluding those
mentioned in the book.
Answer:
Sources of data are the places from which data is collected. Examples:
2. Sensors and IoT Devices: Data from smart devices like temperature sensors.
• Database: An organized collection of data stored electronically. It can store, manage, and
query large amounts of data efficiently. Examples: MySQL, Oracle.
• Dataset: A specific collection of data usually in tabular form for analysis. Example: CSV files
containing student marks.
Key Difference: A database can store many datasets, whereas a dataset is a smaller, specific
collection of data.
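The distinction can be sketched in Python. This is a minimal illustration, not taken from the textbook: the standard-library sqlite3 module stands in for a database system such as MySQL, and the table name, column names, and marks are hypothetical.

```python
import csv
import io
import sqlite3

# A dataset: one specific table of values, here as CSV text (hypothetical marks).
csv_text = "name,marks\nAli,78\nSara,91\nBilal,64\n"
dataset = list(csv.DictReader(io.StringIO(csv_text)))

# A database: an organized store that can hold many tables and answer queries.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student_marks (name TEXT, marks INTEGER)")
conn.executemany(
    "INSERT INTO student_marks VALUES (?, ?)",
    [(row["name"], int(row["marks"])) for row in dataset],
)

# Query the database for students scoring above 70.
high_scorers = [
    name
    for (name,) in conn.execute(
        "SELECT name FROM student_marks WHERE marks > 70 ORDER BY name"
    )
]
print(high_scorers)
conn.close()
```

The CSV rows are the dataset; the database is the system that stores them and answers questions about them.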
7. Argue about the trends, outliers, and distribution of values in a dataset. Describe.
Answer:
• Trends: Patterns or tendencies observed in data over time (e.g., increasing sales).
• Outliers: Data points significantly different from other values, which can impact analysis
(e.g., 1000 in a dataset of 10, 15, 20).
• Distribution: How data values are spread, shown using graphs like histograms or box plots.
Understanding these helps in making accurate decisions and identifying data anomalies.
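The effect of an outlier can be checked with a short Python sketch using the example values above. The rule used here (a modified z-score based on the median absolute deviation, with a cutoff of 3.5) is one common rule of thumb chosen for illustration; the function name find_outliers is hypothetical.

```python
import statistics

values = [10, 15, 20, 1000]  # example values from the answer above

def find_outliers(data, threshold=3.5):
    """Flag values whose modified z-score (based on the median) exceeds threshold."""
    median = statistics.median(data)
    # Median absolute deviation: typical distance of a value from the median.
    mad = statistics.median([abs(x - median) for x in data])
    if mad == 0:
        return []  # all values are (nearly) identical, nothing to flag
    return [x for x in data if 0.6745 * abs(x - median) / mad > threshold]

outliers = find_outliers(values)
print("outliers:", outliers)
# The outlier pulls the mean far away from the median.
print("mean:", statistics.mean(values), "median:", statistics.median(values))
```

Comparing the mean with the median shows how a single extreme value can distort a summary of the data.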
9. Express big data in your own words. Explain three Vs of big data with reference to email
data.
Answer:
Big Data: Large, complex datasets that cannot be managed with traditional tools.
3. Variety: Different types of email data like text, attachments, and images.
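As a rough sketch, the three Vs (volume, velocity, variety) can be measured on a small mailbox in Python. The email records below are invented for illustration, and the variable names are assumptions; real big data would be far too large to handle this way.

```python
from collections import Counter
from datetime import datetime

# Hypothetical email records: (time received, size in KB, content type).
emails = [
    (datetime(2024, 1, 1, 9, 0), 12, "text"),
    (datetime(2024, 1, 1, 9, 20), 850, "attachment"),
    (datetime(2024, 1, 1, 9, 40), 300, "image"),
    (datetime(2024, 1, 1, 10, 0), 8, "text"),
]

# Volume: how much data there is in total.
volume_kb = sum(size for _, size, _ in emails)

# Velocity: how fast the data arrives (emails per hour over the observed span).
span_hours = (emails[-1][0] - emails[0][0]).total_seconds() / 3600
velocity_per_hour = len(emails) / span_hours

# Variety: how many different kinds of data are present.
variety = Counter(kind for _, _, kind in emails)

print(volume_kb, velocity_per_hour, dict(variety))
```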
(Long Questions)
1. Data Collection:
o Gathering raw data from various sources like surveys, sensors, social media, and
databases.
Conclusion:
Data science plays a key role in industries like healthcare, finance, e-commerce, and technology by
analyzing data to make informed decisions.
2. Develop your own thinking on the various data types used in data science.
Answer:
Data science deals with multiple types of data that help solve problems effectively. These data types
are classified as follows:
1. Structured Data:
o Example: Customer information in an Excel sheet with names, emails, and phone
numbers.
2. Unstructured Data:
o Example: Text files, images, videos, audio files, and social media posts.
3. Semi-Structured Data:
o Data that does not follow a rigid structure but uses tags or markers to identify
elements.
4. Categorical Data:
o Data that represents categories or labels.
5. Numerical Data:
o Data that can be measured or counted. It is divided into:
• Structured data can often be used in machine learning models directly, whereas unstructured
data (like text and images) must first be preprocessed into a usable form.
Conclusion:
Understanding data types is essential for selecting the right tools and methods for analysis, making it
a fundamental concept in data science.
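The data types above can be illustrated with a small Python sketch. All records, field names, and the classify helper are hypothetical and serve only to show how each type looks in practice.

```python
import json
import re

# Structured: every record has the same fixed fields.
structured = [
    {"name": "Ayesha", "city": "Lahore", "age": 29},
    {"name": "Omar", "city": "Karachi", "age": 34},
]

# Semi-structured: JSON keys act as tags, but records may differ in shape.
semi_structured = json.loads(
    '[{"name": "Ayesha", "tags": ["regular"]}, {"name": "Omar"}]'
)

# Unstructured: free text with no predefined fields; it needs preprocessing.
unstructured = "Great service, will order again!"
tokens = re.findall(r"[a-z]+", unstructured.lower())

def classify(value):
    """Label a single value as numerical or categorical."""
    if isinstance(value, (int, float)) and not isinstance(value, bool):
        return "numerical"
    return "categorical"

column_types = {key: classify(structured[0][key]) for key in ("city", "age")}
print(column_types)
print(tokens)
```

Here the text only becomes usable for analysis after it is split into tokens, which is a simple form of the preprocessing mentioned above.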
3. Compare how big data is applicable to various fields of life. Illustrate your answer with
suitable examples.
Answer:
Big Data refers to large, complex datasets that are difficult to manage using traditional tools. Its
applications span multiple fields, providing meaningful insights for improving processes and
decision-making.
1. Healthcare:
o Big data is used for analyzing patient records, predicting disease outbreaks, and
improving treatment plans.
o Example: Hospitals use big data to identify patterns in diseases like COVID-19 and
suggest preventive measures.
2. Finance:
o It helps detect fraud, assess risks, and provide personalized banking services.
o Example: Banks use big data analytics to detect unusual patterns in transactions and
prevent fraud.
3. E-commerce:
o Big data enables businesses to understand customer preferences and recommend
products.
o Example: Amazon uses big data to suggest products based on purchase history.
4. Education:
o Example: Online platforms like Coursera use big data to recommend courses to
learners.
5. Transportation:
o Big data optimizes routes, reduces fuel costs, and manages traffic.
o Example: Uber uses big data to provide accurate trip durations and optimize driver
allocation.
6. Entertainment:
Conclusion:
Big data has transformed various sectors by improving efficiency, reducing costs, and enabling
smarter decision-making.
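The idea of recommending products from purchase history can be sketched in Python. This is a toy illustration under stated assumptions: the customers, products, and the recommend function are hypothetical, and the simple counting approach shown is not how any particular company's system works.

```python
from collections import Counter

# Hypothetical purchase histories: customer -> set of products bought.
purchases = {
    "c1": {"laptop", "mouse", "bag"},
    "c2": {"laptop", "mouse"},
    "c3": {"laptop", "keyboard"},
    "c4": {"phone", "case"},
}

def recommend(customer, histories, top_n=2):
    """Suggest products bought by customers who share a purchase with `customer`."""
    owned = histories[customer]
    scores = Counter()
    for other, items in histories.items():
        if other == customer or not (owned & items):
            continue  # skip the customer and anyone with nothing in common
        scores.update(items - owned)  # count products the customer lacks
    # Sort by count (highest first), then by name for a stable order.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [product for product, _ in ranked[:top_n]]

print(recommend("c3", purchases))
```

Real systems apply the same idea to millions of customers, which is where big data tools become necessary.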
1. Better Decision-Making:
o Big data provides insights that help organizations make informed decisions.
2. Increased Efficiency:
o Managing large datasets requires significant storage capacity and advanced tools.
3. Data Quality:
o Raw data often contains errors or missing values, requiring cleaning before analysis.
o There is a shortage of experts who can analyze and process big data efficiently.
Conclusion:
While big data offers numerous benefits, addressing its challenges is crucial to fully harness its
potential.
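The data quality challenge can be illustrated with a short Python sketch of cleaning. The raw values and the function names clean and fill_missing are hypothetical, and filling gaps with the median is only one simple strategy among many.

```python
import statistics

# Hypothetical raw records with a missing value and an invalid entry.
raw_marks = ["78", "", "91", "abc", "64"]

def clean(values):
    """Convert valid entries to int; mark missing or invalid ones as None."""
    cleaned = []
    for value in values:
        value = value.strip()
        cleaned.append(int(value) if value.isdigit() else None)
    return cleaned

def fill_missing(values):
    """Replace None with the median of the valid values."""
    valid = [v for v in values if v is not None]
    median = statistics.median(valid)
    return [median if v is None else v for v in values]

cleaned = clean(raw_marks)
print(cleaned)
print(fill_missing(cleaned))
```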
5. Design a case study about how data science and big data have revolutionized the field of
healthcare.
Answer:
Background:
The healthcare industry generates vast amounts of data daily, including patient records, test results,
and treatment plans. Data science and big data technologies help analyze this data to improve
patient outcomes.
o Machine learning models analyze data to predict diseases before they occur.
2. Personalized Treatment:
3. Epidemic Control:
o Example: During COVID-19, big data tracked infection rates and spread patterns.
4. Operational Efficiency:
o Hospitals use big data to optimize resource allocation, reduce waiting times, and
improve operations.
Conclusion:
Big data and data science have revolutionized healthcare by enabling disease prevention,
personalized treatment, and operational efficiency. These technologies have the potential to save
lives and transform the healthcare industry.