ML Unit-1
ML Unit-1
These are just a few examples of how machine learning and AI are being
applied across various industries. The potential applications of these
technologies are extensive and continue to evolve as technology advances.
The choice of data representation depends on the nature of the data and the
specific machine learning task. The goal is to represent the data in a way that
preserves relevant information, reduces noise or redundancy, and allows the
machine learning algorithms to effectively learn patterns and make accurate
predictions.
When discussing the diversity of data, it can be categorized into two main types:
structured data and unstructured data. These types represent different formats,
characteristics, and challenges in data representation and analysis. Let's explore
the differences between structured and unstructured data:
1. Structured Data:
Definition: Structured data refers to data that has a predefined and
well-organized format. It follows a consistent schema or data model.
Characteristics: Structured data is typically organized into rows and
columns, similar to a traditional relational database. Each column
In machine learning, there are several forms or types of learning algorithms that
are used to train models and make predictions based on data. Here are some
common forms of learning in machine learning:
Machine learning and data mining are closely related fields that involve
extracting knowledge, patterns, and insights from data. While there is overlap
between the two, they have distinct focuses and techniques. Here's an overview
of machine learning and data mining:
Data mining techniques can be used to explore and analyze structured, semi-
structured, and unstructured data. It involves preprocessing the data, applying
algorithms to discover patterns, evaluating and interpreting the results, and
presenting the findings to stakeholders.
These are just a few examples of how linear algebra concepts are applied in
machine learning. Understanding and applying linear algebra operations and
concepts allow for efficient manipulation of data, designing models, solving
optimization problems, and gaining insights from the data in the field of
machine learning.
UNIT-II
1. Data Collection: Gather a dataset that contains input features and their
associated output labels. The dataset should be representative of the problem
you are trying to solve.
2. Data Preprocessing: Clean the data by handling missing values, outliers,
and irrelevant features. It may involve techniques like data normalization,
feature scaling, or feature engineering to prepare the data for modeling.
3. Training-Validation Split: Split the dataset into two parts: a training set
and a validation set. The training set is used to train the model, while the