0% found this document useful (0 votes)
37 views

Machine learning unit 1

The document provides an overview of machine learning, its definitions, types, and applications across various sectors such as healthcare, finance, and transportation. It explains the need for machine learning due to the increasing complexity of tasks and data, and categorizes machine learning into supervised, unsupervised, semi-supervised, and reinforcement learning. Additionally, it highlights specific use cases and algorithms associated with each type of machine learning, emphasizing its significance in modern technology.

Uploaded by

7667adi149
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
37 views

Machine learning unit 1

The document provides an overview of machine learning, its definitions, types, and applications across various sectors such as healthcare, finance, and transportation. It explains the need for machine learning due to the increasing complexity of tasks and data, and categorizes machine learning into supervised, unsupervised, semi-supervised, and reinforcement learning. Additionally, it highlights specific use cases and algorithms associated with each type of machine learning, emphasizing its significance in modern technology.

Uploaded by

7667adi149
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 39

The Dalai Lama Institute for Higher Education, Bangalore

Affiliated to Bangalore University

Computer Application Department

VI Semester
MACHINE LEARNING
Course Code: CA-C27T
Human learning
• Continuous learning in lifespan
• Intelligence vs memory
• Learning from experience
• Types of learning
• 1. learning under Expert Guidance
• 2. Learning guided by knowledge gained from experts
• 3. self learning
Machine learning
Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the
use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.
(IBM)
Arthur Samuel first used the term "machine learning" in 1959. It could be summarized as follows:
Dfn: Without being explicitly programmed, machine learning enables a machine to
automatically learn from data, improve performance from experiences, and predict things.
Machine learning algorithms create a mathematical model that, without being explicitly programmed,
aids in making predictions or decisions with the assistance of sample historical data, or training data.
For the purpose of developing predictive models, machine learning brings together statistics and
computer science.

https://www.javatpoint.com/machine-learning
Need for Machine Learning(Why used ML )

• The demand for machine learning is steadily rising. Because it is able to perform tasks that are too
complex for a person to directly implement, machine learning is required. Humans are constrained
by our inability to manually access vast amounts of data; as a result, we require computer
systems, which is where machine learning comes in to simplify our lives.
• By providing them with a large amount of data and allowing them to automatically explore the
data, build models, and predict the required output, we can train machine learning algorithms. The
cost function can be used to determine the amount of data and the machine learning algorithm's
performance. We can save both time and money by using machine learning.
• Many of today's leading companies, such as Facebook, Google, and Uber, make machine
learning a central part of their operations. Machine learning has become a significant competitive
differentiator for many companies.
• Following are some key points which show the importance of Machine Learning:
• Rapid increment in the production of data
• Solving complex problems, which are difficult for a human
• Decision making in various sector including finance
• Finding hidden patterns and extracting useful information from data.
Use cases of Machine Learning Technology
Some important applications in which machine learning is widely used
are given below:

Healthcare: Machine Learning is widely used in the healthcare industry. It helps healthcare
researchers to analyze data points and suggest outcomes. Natural language processing helped to give accurate insights
for better results of patients. Further, machine learning has improved the treatment methods by analyzing external
data on patients' conditions in terms of X-ray, Ultrasound, CT-scan, etc. NLP, medical imaging, and genetic
information are key areas of machine learning that improve the diagnosis, detection, and prediction system in the
healthcare sector.

Automation: This is one of the significant applications of machine learning that helps to make the
system automated. It helps machines to perform repetitive tasks without human intervention. As a machine learning
engineer and data scientist, you have the responsibilities to solve any given task multiple times with no errors.
However, this is not practically possible for humans. Hence machine learning has developed various models to
automate the process, having the capability of performing iterative tasks in lesser time.

Banking and Finance: Machine Learning is a subset of AI that uses statistical models
to make accurate predictions. In the banking and finance sector, machine learning helped in many ways, such as fraud
detection, portfolio management, risk management, chatbots, document analysis, high-frequency trading, mortgage
underwriting, AML detection, anomaly detection, risk credit score detection, KYC processing, etc. Hence, machine
learning is widely applied in the banking and finance sector to reduce error as well as time.
Transportation and Traffic Prediction: This is one of the most common applications
of Machine Learning that is widely used by all individuals in their daily routine. It helps to ensure highly
secured routes, generate accurate ETAs, predict vehicle breakdown, Driving Prescriptive Analytics, etc.
Although machine learning has solved transportation problems, it still requires more improvement.
Statistical machine learning algorithms helps to build a smart transportation system. Further, deep
Learning explored the complex interactions of roads, highways, traffic, environmental elements, crashes,
etc. Hence, machine learning technology has improved daily traffic management as well as a collection of
traffic data to predict insights of routes and traffic.
Image Recognition: It is one of the most common applications of machine learning which is
used to detect the image over the internet. Further, various social media sites such as Facebook uses
image recognition for tagging the images to your Facebook friends with its feature named auto friend
tagging suggestion.
Further, now a day's, almost all mobile devices come with exciting face detection features. Using this
feature, you can secure your mobile data with face unlocking, so if anyone tries to access your mobile
device, they cannot open without face recognition.

Speech Recognition: Speech recognition is one of the biggest achievements of machine


learning applications. It enables users to search content without writing text or, in other words, 'search by
voice'. It can search content/products on YouTube, Google, Amazon, etc. platforms by your voice. This
technology is referred to as speech recognition.
It is a process of converting voice instructions into the text; hence it is also known as 'Speech to text' or
'Computer speech recognition. Some important examples of speech recognitions are Google assistant,
Siri, Cortana, Alexa, etc.
Product Recommendation: It is one of the biggest achievements made by machine
learning which helps various e-commerce and entertainment companies like Flipkart,
Amazon, Netflix, etc., to digitally advertise their products over the internet. When anyone
searches for any product, they start getting an advertisement for the same product while
internet surfing on the same browser.
This is possible by machine learning algorithms that work on users' interests or past
experience and accordingly recommend them for products. For e.g., when we search for a
laptop on the Amazon platform, then it also gets started with so many other laptops having
the same categories and criteria. Similarly, when we use Netflix, we find some
recommendations for entertainment series, movies, etc. Hence, this is also possible by
machine learning algorithms.
Virtual Personal Assistance: This feature helps us in many ways, such as searching
content using voice instruction, calling a number using voice, searching contact in your
mobile, playing music, opening an email, Scheduling an appointment, etc. Now a day, you all
have seen advertising like "Alexa! Play the Music" this is also done with the help of
machine learning. Google Assistant, Alexa, Cortana, Siri, etc., are a few common applications
of machine learning. These virtual personal assistants record our voice instructions, send
them over to the server on a cloud, decode it using ML algorithms and act accordingly.
Email Spam and Malware detection & Filtering: Machine learning also helps us for
filtering emails in different categories such as spam, important, general, etc. In this way,
users can easily identify whether the email is useful or spam. This is also possible by
machine learning algorithms such as Multi-Layer Perceptron, Decision tree, and Naïve
Bayes classifier. Content filter, header filter, rules-based filter, permission filter, general
blacklist filter, etc., are some important spam filters used by Google.
Self-driving cars: This is one of the most exciting applications of machine learning. Machine
learning plays a vital role in the manufacturing of self-driving cars. It uses an unsupervised
learning method to train car models to detect people and objects while driving. Tata and Tesla
are the most popular car manufacturing companies working on self-driving cars. Hence, it is a
big revolution in a technological era which is also done with the help of machine learning.
Credit card fraud detection: Credit card frauds have become very easy targets for online
hackers. As the culture of online/digital payments is increasing, the risk of credit/debit cards is
parallel increasing. Machine Learning also helps developers to detect and analyze frauds in
online transactions. It develops a novel fraud detection method for Streaming Transaction Data,
with an objective to analyze the past transaction details of the customers and extract the
behavioral patterns. Further, cardholders are clustered into various categories with their
transaction amount so that the behavioral pattern of the groups can be extracted respectively.
Hence, credit card fraud detection is a novel approach using Aggregation Strategy and
Feedback Mechanism of machine learning.
Stock Marketing and Trading: Machine learning also helps in the stock marketing and
trading sector, where it uses historical trends or past experience for predicting the market risk.
As share marketing is another name of marketing risk, machine learning reduces it to some
extent and predicts data against marketing risk. Machine learning's long short-term neural
memory network is used for the prediction of stock market trends.
Language Translation: The use of Machine learning can be seen in language translation. It
uses the sequence-to-sequence learning algorithms for translating one language into other.
Further, it also uses images recognition techniques to identify the text from one language to
other. Similarly, Google's GNMT (Google Neural Machine Translation) provides this feature,
which is a Neural Machine Learning that translates the text into our familiar language, and it is
TYPES OF MACHINE LEARNING
• Machine learning is a subset of AI, which enables the machine to
automatically learn from data, improve performance from past
experiences, and make predictions.
• Based on the methods and way of learning, machine learning is divided into mainly
four types, which are:
1.Supervised Machine Learning
2.Unsupervised Machine Learning
3.Semi-Supervised Machine Learning
4.Reinforcement Learning
• 1. Supervised Machine Learning
Supervised Machine learning is based on supervision. It means in the supervised learning technique, we train the
machines using the "labelled" dataset, and based on the training, the machine predicts the output. Here, the
labelled data specifies that some of the inputs are already mapped to the output. More preciously, we can say;
first, we train the machine with the input and corresponding output, and then we ask the machine to predict the
output using the test dataset.
Suppose we have an input dataset of cats and dog images. So, first, we will provide the training to the machine
to understand the images, such as the shape & size of the tail of cat and dog, Shape of eyes, colour,
height (dogs are taller, cats are smaller), etc. After completion of training, we input the picture of a cat and
ask the machine to identify the object and predict the output. Now, the machine is well trained, so it will check all
the features of the object, such as height, shape, colour, eyes, ears, tail, etc., and find that it's a cat. So, it will
put it in the Cat category. This is the process of how the machine identifies the objects in Supervised Learning.
Uses of Supervised machine learning
Risk Assessment, Fraud Detection, Spam filtering, etc.
Categories of Supervised Machine
Learning
• Classification
• Regression
• a) Classification
• Classification algorithms are used to solve the
classification problems in which the output variable is
categorical, such as "Yes" or No, Male or Female,
Red or Blue, etc. The classification algorithms predict
the categories present in the dataset. Some real-world
examples of classification algorithms are Spam
Detection, Email filtering, etc.
• Some popular classification algorithms are given below:
• Random Forest Algorithm
• Decision Tree Algorithm
• Logistic Regression Algorithm
• Support Vector Machine Algorithm
• b) Regression
• Regression algorithms are used to solve regression problems in
which there is a linear relationship between input and output
variables. These are used to predict continuous output variables,
such as market trends, weather prediction, etc.
• Some popular Regression algorithms are given below:
• Simple Linear Regression Algorithm
• Multivariate Regression Algorithm
• Decision Tree Algorithm
• Lasso Regression
Advantages and Disadvantages of Supervised Learning
• Advantages:
• Well-suited for tasks with labeled data.
• Can make accurate predictions once trained.
• Can handle both regression and classification problems.
• Disadvantages:
• These algorithms are not able to solve complex tasks.
• It may predict the wrong output if the test data is different from the
training data.
• It requires lots of computational time to train the algorithm.
• Applications of Supervised Learning
• Some common applications of Supervised Learning are given below:
• Image Segmentation:
Supervised Learning algorithms are used in image segmentation. In this
process, image classification is performed on different image data with pre-
defined labels.
• Medical Diagnosis:
Supervised algorithms are also used in the medical field for diagnosis purposes.
It is done by using medical images and past labelled data with labels for disease
conditions. With such a process, the machine can identify a disease for the new
patients.
• Fraud Detection - Supervised Learning classification algorithms are used for
identifying fraud transactions, fraud customers, etc. It is done by using historic
data to identify the patterns that can lead to possible fraud.
• Spam detection - In spam detection & filtering, classification algorithms are
used. These algorithms classify an email as spam or not spam. The spam emails
are sent to the spam folder.
• Speech Recognition - Supervised learning algorithms are also used in speech
recognition. The algorithm is trained with voice data, and various identifications
can be done using the same, such as voice-activated passwords, voice
• 2. Unsupervised Machine Learning
• Unsupervised ML is different from the Supervised
learning technique; as its name suggests, there is no
need for supervision. It means, in unsupervised machine
learning, the machine is trained using the unlabeled
dataset, and the machine predicts the output without any
supervision.
• The main aim of the unsupervised learning
algorithm is to group or categories the unsorted
dataset according to the similarities, patterns, and
differences. Machines are instructed to find the hidden
patterns from the input dataset.
• Categories of Unsupervised Machine Learning
• Unsupervised Learning can be further classified into two types, which are
given below:
• Clustering
• Association
• 1) Clustering
• The clustering technique is used when we want to find the inherent groups
from the data. It is a way to group the objects into a cluster such that the
objects with the most similarities remain in one group and have fewer or no
similarities with the objects of other groups. An example of the clustering
algorithm is grouping the customers by their purchasing behavior.
• Some of the popular clustering algorithms are given below:
• K-Means Clustering algorithm
• Mean-shift algorithm
• DBSCAN Algorithm
• Principal Component Analysis
• Independent Component Analysis
• 2) Association
• Association rule learning is an unsupervised learning technique, which finds
interesting relations among variables within a large dataset. The main aim of
this learning algorithm is to find the dependency of one data item on another
data item and map those variables accordingly so that it can generate
maximum profit. This algorithm is mainly applied in Market Basket analysis
• Advantages:
• Can discover hidden patterns and structures in data without labeled examples.
• Useful for tasks like clustering, anomaly detection, and dimensionality reduction.
• Can potentially uncover valuable insights from large datasets.
• Disadvantages:
• Harder to evaluate the performance of the model since there are no ground
truth labels.
• Results may be less interpretable compared to supervised learning.
• May require more computational resources and time to process large
datasets.
• Applications of Unsupervised Learning
• Network Analysis: Unsupervised learning is used for
identifying plagiarism and copyright in document network
analysis of text data for scholarly articles.
• Recommendation Systems: Recommendation systems
widely use unsupervised learning techniques for building
recommendation applications for different web
applications and e-commerce websites.
• Anomaly Detection: Anomaly detection is a popular
application of unsupervised learning, which can identify
unusual data points within the dataset. It is used to
discover fraudulent transactions.
• Reinforcement Learning
• Reinforcement learning works on a feedback-based
process, in which an AI agent (A software component)
automatically explore its surrounding by hitting & trail,
taking action, learning from experiences, and improving
its performance. Agent gets rewarded for each good action and
get punished for each bad action; hence the goal of
reinforcement learning agent is to maximize the rewards.
• The reinforcement learning process is similar to a human being;
for example, a child learns various things by experiences in his
day-to-day life. An example of reinforcement learning is to play a
game, where the Game is the environment, moves of an agent at
each step define states, and the goal of the agent is to get a high
score. Agent receives feedback in terms of punishment and
rewards.
• Due to its way of working, reinforcement learning is employed in
different fields such as Game theory, Operation Research,
• Reinforcement learning is categorized mainly into two
types of methods/algorithms:
• Positive Reinforcement Learning: Positive
reinforcement learning specifies increasing the
tendency that the required behaviour would occur again
by adding something. It enhances the strength of the
behaviour of the agent and positively impacts it.
• Negative Reinforcement Learning: Negative
reinforcement learning works exactly opposite to the
positive RL. It increases the tendency that the specific
behaviour would occur again by avoiding the negative
condition.
• Real-world Use cases of Reinforcement Learning
• Video Games:
RL algorithms are much popular in gaming applications. It is used to gain
super-human performance. Some popular games that use RL algorithms
are AlphaGO and AlphaGO Zero.
• Resource Management:
The "Resource Management with Deep Reinforcement Learning" paper
showed that how to use RL in computer to a
• utomatically learn and schedule resources to wait for different jobs in order
to minimize average job slowdown.
• Robotics:
RL is widely being used in Robotics applications. Robots are used in the
industrial and manufacturing area, and these robots are made more
powerful with reinforcement learning. There are different industries that
have their vision of building intelligent robots using AI and Machine learning
technology.
• Text Mining
Text-mining, one of the great applications of NLP, is now being implemented
with the help of Reinforcement Learning by Salesforce company.
• Advantages and Disadvantages of Reinforcement
Learning
• Advantages
• It helps in solving complex real-world problems which
are difficult to be solved by general techniques.
• The learning model of RL is similar to the learning of
human beings; hence most accurate results can be
found.
• Helps in achieving long term results.
Disadvantage
RL algorithms are not preferred for simple problems.
• RL algorithms require huge data and computations.
• Common issues in Machine Learning
• Although machine learning is being used in every industry and helps organizations make more informed
and data-driven choices that are more effective than classical methodologies, it still has so many problems
that cannot be ignored. Here are some common issues in Machine Learning that professionals face to
inculcate ML skills and create an application from scratch.
• 1. Inadequate Training Data
• The major issue that comes while using machine learning algorithms is the lack of quality as well as
quantity of data. Although data plays a vital role in the processing of machine learning algorithms, many
data scientists claim that inadequate data, noisy data, and unclean data are extremely exhausting the
machine learning algorithms. For example, a simple task requires thousands of sample data, and an
advanced task such as speech or image recognition needs millions of sample data examples. Further, data
quality is also important for the algorithms to work ideally, but the absence of data quality is also found in
Machine Learning applications. Data quality can be affected by some factors as follows:
• Noisy Data- It is responsible for an inaccurate prediction that affects the decision as well as accuracy in
classification tasks.
• Incorrect data- It is also responsible for faulty programming and results obtained in machine learning
models. Hence, incorrect data may affect the accuracy of the results also.
• Generalizing of output data- Sometimes, it is also found that generalizing output data becomes
complex, which results in comparatively poor future actions.
• 2. Poor quality of data
• As we have discussed above, data plays a significant role in machine learning, and it must be of good
quality as well. Noisy data, incomplete data, inaccurate data, and unclean data lead to less accuracy in
classification and low-quality results. Hence, data quality can also be considered as a major common
problem while processing machine learning algorithms.
• 3. Non-representative training data
• To make sure our training model is generalized well or not, we have to ensure that sample training data
must be representative of new cases that we need to generalize. The training data must cover all cases
that are already occurred as well as occurring.
• 4. Overfitting and Underfitting
• Overfitting:
• Overfitting is one of the most common issues faced by Machine Learning engineers and data scientists.
Whenever a machine learning model is trained with a huge amount of data, it starts capturing noise and
inaccurate data into the training data set. It negatively affects the performance of the model. Let's
understand with a simple example where we have a few training data sets such as 1000 mangoes, 1000
apples, 1000 bananas, and 5000 papayas. Then there is a considerable probability of identification of an
apple as papaya because we have a massive amount of biased data in the training data set; hence prediction
got negatively affected. The main reason behind overfitting is using non-linear methods used in machine
learning algorithms as they build non-realistic data models. We can overcome overfitting by using linear and
parametric algorithms in the machine learning models.
• Methods to reduce overfitting:
• Increase training data in a dataset.
• Reduce model complexity by simplifying the model by selecting one with fewer parameters
• Ridge Regularization and Lasso Regularization
• Reduce the noise
• Reduce the number of attributes in training data.
• Underfitting:
• Underfitting is just the opposite of overfitting. Whenever a machine learning model is trained with fewer
amounts of data, and as a result, it provides incomplete and inaccurate data and destroys the accuracy of
the machine learning model.
• Underfitting occurs when our model is too simple to understand the base structure of the data, just like an
undersized pant. This generally happens when we have limited data into the data set, and we try to build a
linear model with non-linear data. In such scenarios, the complexity of the model destroys, and rules of the
machine learning model become too easy to be applied on this data set, and the model starts doing wrong
predictions as well.
• Methods to reduce Underfitting:
• Increase model complexity
• Remove noise from the data
• Trained on increased and better features
• Reduce the constraints
• Increase the number of epochs to get better results.
• 5. Monitoring and maintenance
• As we know that generalized output data is mandatory
for any machine learning model; hence, regular
monitoring and maintenance become compulsory for
the same. Different results for different actions require
data change; hence editing of codes as well as
resources for monitoring them also become necessary.
6. Lack of skilled resources
Although Machine Learning and Artificial Intelligence are continuously
growing in the market, still these industries are fresher in comparison to
others. The absence of skilled resources in the form of manpower is also
an issue. Hence, we need manpower having in-depth knowledge of
mathematics, science, and technologies for developing and managing
scientific substances for machine learning.

7. Process Complexity of Machine Learning


• The machine learning process is very complex, which is also another
major issue faced by machine learning engineers and data scientists.
However, Machine Learning and Artificial Intelligence are very new
technologies but are still in an experimental phase and continuously
being changing over time. There is the majority of hits and trial
experiments; hence the probability of error is higher than expected.
Further, it also includes analyzing the data, removing data bias, training
data, applying complex mathematical calculations, etc., making the
procedure more complicated and quite tedious.
• 8. Data Bias
• Data Biasing is also found a big challenge in Machine Learning. These errors exist when
certain elements of the dataset are heavily weighted or need more importance than
others. Biased data leads to inaccurate results, skewed outcomes, and other analytical
errors. However, we can resolve this error by determining where data is actually biased
in the dataset. Further, take necessary steps to reduce it.
• Use multi-pass annotation such as sentiment analysis, content moderation, and intent
recognition.
• 9. Lack of Explainability
• This basically means the outputs cannot be easily comprehended as it is programmed
in specific ways to deliver for certain conditions. Hence, a lack of explainability is also
found in machine learning algorithms which reduce the credibility of the algorithms.
• 10. Slow implementations and results
• This issue is also very commonly seen in machine learning models. However, machine
learning models are highly efficient in producing accurate results but are time-
consuming. Slow programming, excessive requirements' and overloaded data take
more time to provide accurate results than expected. This needs continuous
maintenance and monitoring of the model for delivering accurate results.
• 11. Irrelevant features
• Although machine learning models are intended to give the best possible outcome, if
we feed garbage data as input, then the result will also be garbage. Hence, we should
use relevant features in our training sample. A machine learning model is said to be
good if training data has a good set of features or less irrelevant features.
• Advantages Of Python Over Other Languages
1.Simple and Readable Syntax: Python has a straightforward and easy-
to-read syntax, which makes it suitable for beginners and experienced
developers alike. Its syntax emphasizes readability and reduces the cost
of program maintenance and development.
2.Extensive Libraries and Frameworks: Python boasts a rich ecosystem
of libraries and frameworks for various tasks such as data analysis,
machine learning, web development, scientific computing, and more.
Popular libraries like NumPy, pandas, TensorFlow, and Django enable
developers to build complex applications with minimal effort.
3.Platform Independence: Python is a platform-independent language,
meaning that Python code can run on any platform with the appropriate
interpreter installed. This portability allows developers to write code once
and deploy it across multiple platforms without modification.
4.Community Support: Python has a vibrant and active community of
developers, researchers, and enthusiasts who contribute to its growth and
development. The community provides extensive documentation,
tutorials, forums, and third-party packages, making it easier for
developers to learn and troubleshoot problems.
5. Ease of Learning and Adoption: Python's simplicity and readability
make it an ideal language for beginners to learn programming. Its gentle
learning curve and extensive resources make it accessible to individuals
from various backgrounds, including students, professionals, and hobbyists.
6. Versatility: Python is a versatile language that can be used for a wide
range of applications, including web development, desktop applications,
automation, scripting, data analysis, machine learning, artificial intelligence,
and more. Its flexibility allows developers to work on diverse projects without
switching to different languages.
7. Integration Capabilities: Python seamlessly integrates with other
programming languages and technologies, making it easy to incorporate
existing code and systems into Python-based projects. It supports
interoperability with languages like C/C++, Java, and .NET, enabling
developers to leverage the strengths of different languages within a single
application.
8. Scalability: Python is scalable, allowing developers to build small scripts
or large-scale applications efficiently. Its modular and object-oriented design
facilitates code organization and maintenance, making it easier to scale
projects as they grow in complexity.
SCIKIT_LEARN
• Scikit-learn is a free, open-source machine-learning library for the Python programming language
• Installation
1. Install Python: Ensure you have Python installed on your system. You can download and install Python from
the official website: https://www.python.org/
2. Install scikit-learn: You can install scikit-learn using pip, which is Python's package manager. Open your
terminal or command prompt and run the following command:
pip install scikit-learn
3. NumPy and SciPy: scikit-learn depends on NumPy and SciPy for numerical computations and scientific
algorithms. You can install them using pip:
pip install numpy scipy
4. Pandas: Pandas is a powerful data manipulation library often used in conjunction with scikit-learn for data
preprocessing and analysis:
pip install pandas
• Matplotlib: Matplotlib is a plotting library for creating static, interactive, and animated visualizations in
Python. It provides a MATLAB-like interface and supports a wide range of plots, including line plots,
scatter plots, bar plots, histograms, and more. Scikit-learn often uses Matplotlib for visualizing data and
model evaluation.
pip install matplotlib

You might also like