0% found this document useful (0 votes)
52 views15 pages

Data Analysis by Using Python

The document discusses using Python for data analysis. It describes collecting, cleaning, and modeling data to discover useful information. Some applications of data analysis mentioned include bank data, student attendance, and food ordering. The document then focuses on analyzing student recruitment data, including identifying problems like empty/incorrect values, and using Python tools like Pandas and Matplotlib to clean the data, apply criteria to filter it, and create a graphical representation. The conclusion states that data analysis helps provide secure and correct data formats.

Uploaded by

Nagateja Banda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
52 views15 pages

Data Analysis by Using Python

The document discusses using Python for data analysis. It describes collecting, cleaning, and modeling data to discover useful information. Some applications of data analysis mentioned include bank data, student attendance, and food ordering. The document then focuses on analyzing student recruitment data, including identifying problems like empty/incorrect values, and using Python tools like Pandas and Matplotlib to clean the data, apply criteria to filter it, and create a graphical representation. The conclusion states that data analysis helps provide secure and correct data formats.

Uploaded by

Nagateja Banda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Data analysis by using python

 SUBMITED BY :  SUBMITED TO :
K. SANKARA RAO Dr.J.PARDHA SARADHI
L20AEE483 ASST.PROF EEE DEPT
4/4 EEE-A
M.TECH Ph.D
BAPATLA ENGINEERING COLLEGE
BAPATLA ENGINEERING COLLEGE
CONTENTS :

 INTRODUCTION
 APPILICATIONS
 DATA ANALYSIS PROCESS
 PROJECT
 INTRODUCTION
 PROBLEM STATEMENT & DRAW BACKS
 ELIGIBLE CRITERIA
 DATA SET CREATION
 OVER COME
 GRAPHICAL REPRESENTATION
 CONCLUSION
INTRODUCTION

Data analysis is a process of collecting ,transforming, cleaning and modelling data


with goal of discovering the required information.
Data analysis is very important in any field. Because every field consists of some
data. Based on that data they take further decisions. So this data need to be correct.
To make this data correct by data analysis by using python.
Python is a programming language it is used for data analysis Because of the more
flexible ,easy to learn, it’s a open source.
APPILICATIONS

1. Bank Data Analysis


2. Bank Balance Analysis
3. Students Attendance Analysis
4. Data Feedback Analysis
5. Company TRP Analysis
6. Student Recurrent Process
7. Data Processing of Employee
8. Food Ordering Data Analytics etc.…..
DATA ANALYSIS PROCESS
INTRODUCTION TO STUDENT
RECRUITING PROCESS :
 Student recruiting process is a process of selecting correct students for a job .
 This selection process involves so much of data about students .
 This data is very important while selecting students for a company .
 For this purpose their is a need to analyze the students data and to give correct data to the company.
I analysing students data by using python .
 At least to represent the data in the form of
graph.
PROBLEM STATEMENT &
DRAWBACKS:
 According to project we collect some data about students recruitment .
 In this data we have some good data(wrong formarte,empty cells etc..) and bad data also .
 We want only the good data instead of bad data in our final data set.
 So there is need to remove it and the type of wrong data in our data set is empty cells and wrong data
& also data in wrong format .
ELIGIBILE CRITERIA :

 According to company job requirement we select the students which have above 60
percentage in all academics and their data should correct .
PROCESS OF DATA SET
CREATION :
The data set can be create with the help
of Numpy and Pandas .
In the real world a pandas data frame
will be created by loading the data sets
from existing storage.
Storage can be SQL data base, CSV file
& excel file .
Pandas data frame created from the lists
dictionary & from a list of dictionary
etc..
i create a dataset with 50 rows and 13
columns .
Rows represents the student names.
Columns represents the qualification and
details of the student .
DRAWBACKS &OVERCOME :

 We have bad data in our data set. This bad data we remove by analyzing and clearing the
Data Frame or Data Set.
 Analyzing the data set :
First of all analyzing means deep scruting on data by analyzing over data.
Methods perform for cleaning &analyzing:
1. info() : It gives the information about our data set.
2. isnull() : it is a method to find out the empty cells.
3. fillna() : it is a process of assign some value to the empty cells. Fillna by using pad & bfill.
4. dropna() : it is a method to remove all the non value in our data set.
5 data filtering : the data filtering is done by the eligible criteria .
GRAPHICAL
REPRESENT
ATION OF
DATA SET :
By using matplotib we
represent out data set in
graphical form .
Matplotib is a low level
graph ploting library in
python that serves as a
visualization utility take
a bar graph picture.
Code:
import numpy as np
 import pandas as pd  #Count the no.of students who are eligible d=(c['10th percent'] >
a=pd.read_excel("/content/SRP DATA.xlsx") 60.0).value_counts() e=(c['Inter percent'] > 60.0).value_counts() f=(c['UG
percent'] > 60.0).value_counts()
print(a) 
a.info()  print(d) 
a=a.fillna(50)  l=[]
print(a)
 l.append(int(d))
b=a.replace(to_replace=50,value=60)
print(b)   l.append(int(e))
c=b[(b['10th percent']>60)&(b['Inter percent']>60) &(b['UG percent']>60)]  l.append(int(f))
print(c)
c['Email id'] = c['Email id'].replace(to_replace=60,
 print(l) 
value='[email protected]')  import matplotlib.pyplot as plt
print(c) 
 C=['10th percent','Inter percent','UG Percent']
c['D.O.B'] = c['D.O.B'].replace(to_replace=60, value='25-06-2001')
plt.bar(C,l,color='orange',edgecolor='Black')
 print(c)  
c['Personal Info'] = c['Personal Info'].replace(to replace=60, value='clg
 plt.show()
admin')
 print(c)
 
CONCLUSION :

Finally we Conclude that we can make the data be filter and that data was to be used for the
purpose of analysis and the usage of analysis is to provides that data to be in a Secure or
correct Format.

As more and more data is generated and collected, data analysis requires scalable, flexible,
and high performing tools to provide insights in a timely fashion. However, organizations are
facing a growing big data ecosystem where new tools emerge and become outdated very
quickly.

You might also like