Mini Project Report On
Mini Project Report On
BACHELOR OF TECHNOLOGY
in
Computer Science and Engineering
(Artificial Intelligence)
2023-24
By
<Riya Gupta><2200681520081>3rd sem
<Raziya><2200681520080>3rd sem
JAN 2024
TABLE OF CONTENT
DECLARATION Ii
CERTIFICATE Iii
ACKNOWLEDGEMENT iv
ABSTRACT
CHAPTER 1- INTRODUCTION
CHAPTER 2- WORKFLOW OF PROJECT
CHAPTER 3- TECHNOLOGY USED
CHAPTER 4- DATA DESCRIPTION
CHAPTER 5- PROJECT DESCRIPTION
APPENDICES- IMPLEMENTATION CODE
REFERENCES
DECLARATION
• Clean and pre-process data: handle missing values, transform data, handle
outliers.
4. Notebook Environments:
• Jupyter Notebooks: These interactive notebooks allow combining code,
visualizations, and narrative text, making them popular for data
exploration.
DATA DESCRIPTION
• IT HAS 15 COLUMNS.
2. Gender
3. Ethnic Group
4. Parent Edu
5. Lunch Type
6. Test Prep
8. Practice Sport
9. Is First Child
10. Nr Siblings
Data Reduction:
Some columns or variables can be dropped if they do not add value to our analysis.
EDA Univariate Analysis
• Univariate analysis can be done for both Categorical and Numerical variables.
• Categorical variables can be visualized using a Count plot, Bar Chart, Pie Plot, etc.
• Numerical Variables can be visualized using Histogram, Box Plot.
COUNT PLOT:
#from the above chart we have concluded that education of the parents have a
good impact on their scores
IMPLEMENTATION CODE
• import numpy as np
• import pandas as pd
• df = pd.read_csv("student_scores.csv")
• print(df.head())
• df.describe()
• df.info()
• df.isnull().sum()
• print(df.head())
• plt.figure(figsize = (5,5))
• ax.bar_label(ax.containers[0])
• plt.title("Gender Distribution")
• plt.show()
• gb = df.groupby("ParentEduc").agg({"MathScore":'mean', "ReadingScore":"mean",
"WritingScore":"mean"})
• print(gb)
• plt.show
• print(gb1)
• sns.heatmap(gb, annot = True)
• plt.show
• plt.show
• plt.show
• print(df["EthnicGroup"].unique())
• l = ["group A", "group B", "group C", "group D", "group E"]
• plt.show
• ax.bar_label(ax.containers[0])
REFERENCES
1. https://www.kaggle.com/
2. https://www.geeksforgeeks.org/
3. https://towardsdatascience.com/exploratory-data-ana
lysis-eda-python