DSBDA Assignment 3 Jupyter Notebook

The document is a Jupyter Notebook that analyzes two datasets: student performance and Iris. It imports pandas and NumPy, reads both CSV files, computes the mean, median, minimum, maximum, and standard deviation of math scores grouped by gender, and reports per-species statistics for the Iris measurements.

Uploaded by

sumeet


DSBDA-Assignment-3 - Jupyter Notebook http://localhost:8888/notebooks/DSBDA-Assignment-3...

In [1]: import pandas as pd
        import numpy as np

In [3]: df = pd.read_csv("StudentsPerformance.csv")

In [4]: df

Out[4]:
     gender  race/ethnicity  parental level of education  lunch         test preparation course  math score  reading score  writing score
0    female  group B         bachelor's degree            standard      none                     72          72             74
1    female  group C         some college                 standard      completed                69          90             88
2    female  group B         master's degree              standard      none                     90          95             93
3    male    group A         associate's degree           free/reduced  none                     47          57             44
4    male    group C         some college                 standard      none                     76          78             75
...  ...     ...             ...                          ...           ...                      ...         ...            ...
995  female  group E         master's degree              standard      completed                88          99             95
996  male    group C         high school                  free/reduced  none                     62          55             55
997  female  group C         high school                  free/reduced  completed                59          71             65
998  female  group D         some college                 standard      completed                68          78             77
999  female  group D         some college                 free/reduced  none                     77          86             86

1000 rows × 8 columns

In [5]: df.head()

Out[5]:
     gender  race/ethnicity  parental level of education  lunch         test preparation course  math score  reading score  writing score
0    female  group B         bachelor's degree            standard      none                     72          72             74
1    female  group C         some college                 standard      completed                69          90             88
2    female  group B         master's degree              standard      none                     90          95             93
3    male    group A         associate's degree           free/reduced  none                     47          57             44
4    male    group C         some college                 standard      none                     76          78             75

1 of 3 20/02/25, 11:03

In [6]: df.tail()

Out[6]:
     gender  race/ethnicity  parental level of education  lunch         test preparation course  math score  reading score  writing score
995  female  group E         master's degree              standard      completed                88          99             95
996  male    group C         high school                  free/reduced  none                     62          55             55
997  female  group C         high school                  free/reduced  completed                59          71             65
998  female  group D         some college                 standard      completed                68          78             77
999  female  group D         some college                 free/reduced  none                     77          86             86
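The `head()` and `tail()` calls above use their default of five rows. The following is a minimal sketch of the same inspection that runs without the CSV file. It assumes a small in-memory CSV (`csv_text`, built from the first five rows displayed above) in place of `StudentsPerformance.csv`; the names `csv_text` and `demo` are introduced only for this illustration.

```python
import io

import pandas as pd

# Hypothetical stand-in for StudentsPerformance.csv: only the five rows
# shown by df.head() above, not the full 1000-row file.
csv_text = """gender,race/ethnicity,parental level of education,lunch,test preparation course,math score,reading score,writing score
female,group B,bachelor's degree,standard,none,72,72,74
female,group C,some college,standard,completed,69,90,88
female,group B,master's degree,standard,none,90,95,93
male,group A,associate's degree,free/reduced,none,47,57,44
male,group C,some college,standard,none,76,78,75
"""

demo = pd.read_csv(io.StringIO(csv_text))

print(demo.shape)    # (rows, columns) of the loaded frame
print(demo.head(2))  # first two rows instead of the default five
print(demo.tail(2))  # last two rows
```

Passing an explicit row count to `head()` or `tail()` keeps the output short when only a quick look at the column layout is needed.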

In [8]: statistics = df.groupby('gender')['math score'].agg(['mean','median',

statistics

Out[8]:
        mean       median  min  max  std
gender
female  63.633205  65.0    0    100  15.491453
male    68.728216  69.0    27   100  14.356277
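The `agg` call in cell 8 is cut off in this copy of the notebook. The sketch below assumes the list continued with `'min'`, `'max'`, and `'std'`, which is inferred only from the column headers of the output above and is not confirmed by the source. It runs on a hypothetical ten-row sample (`sample`) taken from the rows displayed earlier, so its numbers differ from the full-dataset results shown in `Out[8]`.

```python
import pandas as pd

# Hypothetical stand-in: the ten rows displayed in Out[4], not the
# full 1000-row StudentsPerformance.csv used by the notebook.
sample = pd.DataFrame({
    "gender": ["female", "female", "female", "male", "male",
               "female", "male", "female", "female", "female"],
    "math score": [72, 69, 90, 47, 76, 88, 62, 59, 68, 77],
})

# Assumed completion of the truncated list, inferred from the
# output columns (mean, median, min, max, std).
stats_by_gender = sample.groupby("gender")["math score"].agg(
    ["mean", "median", "min", "max", "std"]
)
print(stats_by_gender)
```

Selecting the `'math score'` column before calling `agg` keeps the result to one row per gender and one column per statistic, which matches the shape of the output above.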

In [9]: data = pd.read_csv("Iris.csv")

In [10]: data

Out[10]:
     Id   SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm  Species
0    1    5.1            3.5           1.4            0.2           Iris-setosa
1    2    4.9            3.0           1.4            0.2           Iris-setosa
2    3    4.7            3.2           1.3            0.2           Iris-setosa
3    4    4.6            3.1           1.5            0.2           Iris-setosa
4    5    5.0            3.6           1.4            0.2           Iris-setosa
...  ...  ...            ...           ...            ...           ...
145  146  6.7            3.0           5.2            2.3           Iris-virginica
146  147  6.3            2.5           5.0            1.9           Iris-virginica
147  148  6.5            3.0           5.2            2.0           Iris-virginica
148  149  6.2            3.4           5.4            2.3           Iris-virginica
149  150  5.9            3.0           5.1            1.8           Iris-virginica

150 rows × 6 columns


In [12]: selected_species = ['Iris-setosa','Iris-versicolor','Iris-virginica']
         filtered_df = data[data['Species'].isin(selected_species)]
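Because `selected_species` names all three species that appear in the results, the `isin` filter in cell 12 keeps every row. The sketch below shows the same filtering pattern on a hypothetical four-row sample (`iris_sample`, taken from rows displayed above), once with the full list and once with a single species, so the effect of a narrower selection is visible. The names `all_species`, `keep_all`, and `setosa_only` are introduced only for this illustration.

```python
import pandas as pd

# Hypothetical stand-in: four rows copied from the Iris output above.
iris_sample = pd.DataFrame({
    "Id": [1, 2, 146, 147],
    "PetalLengthCm": [1.4, 1.4, 5.2, 5.0],
    "Species": ["Iris-setosa", "Iris-setosa",
                "Iris-virginica", "Iris-virginica"],
})

# Same list as the notebook: every species is selected, so nothing is dropped.
all_species = ["Iris-setosa", "Iris-versicolor", "Iris-virginica"]
keep_all = iris_sample[iris_sample["Species"].isin(all_species)]

# A narrower list keeps only the matching rows.
setosa_only = iris_sample[iris_sample["Species"].isin(["Iris-setosa"])]

print(len(keep_all), len(setosa_only))  # prints: 4 2
```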

In [14]: species_stats = filtered_df.groupby('Species').agg(['quantile','mean'

print("\nBasic Statistical Details for Selected Species :\n",species_stats

Basic Statistical Details for Selected Species :

Id
                 quantile  mean   median  min  max
Species
Iris-setosa      25.5      25.5   25.5    1    50
Iris-versicolor  75.5      75.5   75.5    51   100
Iris-virginica   125.5     125.5  125.5   101  150

SepalLengthCm
                 quantile  mean   median  min  max
Species
Iris-setosa      5.0       5.006  5.0     4.3  5.8
Iris-versicolor  5.9       5.936  5.9     4.9  7.0
Iris-virginica   6.5       6.588  6.5     4.9  7.9

...

PetalLengthCm
                 quantile  mean   median  min  max
Species
Iris-setosa      1.50      1.464  1.50    1.0  1.9
Iris-versicolor  4.35      4.260  4.35    3.0  5.1
Iris-virginica   5.55      5.552  5.55    4.5  6.9

PetalWidthCm
                 quantile  mean   median  min  max
Species
Iris-setosa      0.2       0.244  0.2     0.1  0.6
Iris-versicolor  1.3       1.326  1.3     1.0  1.8
Iris-virginica   2.0       2.026  2.0     1.4  2.5

[3 rows x 25 columns]
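In the output above, the `quantile` and `median` columns are identical. That is expected: when `'quantile'` is passed to `agg` as a string, pandas calls it with its default `q=0.5`, which is the median. The `Id` column is also summarized even though it is only a row identifier. The sketch below is one possible variation, not the notebook's own code: it drops `Id` and uses two small helper functions (`q25` and `q75`, hypothetical names) to report the lower and upper quartiles. It runs on a hypothetical ten-row sample (`iris_sample`) copied from the rows displayed earlier, so its values differ from the full-dataset output.

```python
import pandas as pd

# Hypothetical stand-in: the ten Iris rows displayed in Out[10].
iris_sample = pd.DataFrame({
    "Id": [1, 2, 3, 4, 5, 146, 147, 148, 149, 150],
    "SepalLengthCm": [5.1, 4.9, 4.7, 4.6, 5.0, 6.7, 6.3, 6.5, 6.2, 5.9],
    "PetalLengthCm": [1.4, 1.4, 1.3, 1.5, 1.4, 5.2, 5.0, 5.2, 5.4, 5.1],
    "Species": ["Iris-setosa"] * 5 + ["Iris-virginica"] * 5,
})


def q25(series):
    """Lower quartile (25th percentile) of a column within one group."""
    return series.quantile(0.25)


def q75(series):
    """Upper quartile (75th percentile) of a column within one group."""
    return series.quantile(0.75)


# 'quantile' as a string uses q=0.5, so it repeats the median.
check = iris_sample.groupby("Species")["SepalLengthCm"].agg(
    ["quantile", "median"]
)

# Drop the identifier column, then aggregate each measurement per species.
species_stats = (
    iris_sample.drop(columns="Id")
    .groupby("Species")
    .agg([q25, "median", q75, "mean", "min", "max"])
)

print(check)
print(species_stats)
```

Named helper functions are used instead of lambdas so that each result column gets a distinct, readable label (`q25`, `q75`).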

In [ ]:


