Full Data Science Internship Report
Full Data Science Internship Report
DATA SCIENCE
submitted in partial fulfillment of the requirements
for the award of the degree of
BACHELOR OF TECHNOLOGY
in
ELECTRICAL AND ELECTRONICS ENGINEERING
By
Data Science is a multidisciplinary field that combines statistics, computer science, and
domain knowledge to analyze and interpret complex data.
In today's data-driven world, organizations use data science to make informed decisions,
optimize operations, and uncover hidden patterns.
This report outlines the theoretical background, core components, and practical
implementation of data science principles.
It emphasizes the significance of data preprocessing, exploratory data analysis, machine
learning, and model evaluation.
The report also provides an overview of industry applications in sectors like healthcare,
finance, and manufacturing,
and discusses the ethical and policy considerations in handling data. The internship
provided hands-on experience in data visualization,
model building, and deploying data-driven solutions using Python-based tools and libraries.
1. INTRODUCTION ON DATA SCIENCE
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error
data = pd.read_csv('data.csv')
data.dropna(inplace=True)
X = data[['Experience']]
y = data['Salary']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=0)
model = LinearRegression()
model.fit(X_train, y_train)
predictions = model.predict(X_test)
print("MSE:", mean_squared_error(y_test, predictions))
CONCLUSION