Pandas Library

Uploaded by

heyimchad28
Copyright
© All Rights Reserved
PANDAS LIBRARY

- Salma Shaikh
Pandas Logo:
Introduction:

1. Pandas is a Python library for working with data sets. It has
functions for analyzing, cleaning, exploring, and manipulating data.
The name "Pandas" refers to both "Panel Data" and "Python Data
Analysis". The library was created by Wes McKinney in 2008.
2. The Pandas library has grown into a powerhouse for data
manipulation tasks in Python since it was developed in 2008. With
its intuitive syntax and flexible data structures, it is easy to learn
and enables faster data computation.
3. It is mainly used for analyzing and manipulating tabular
data in DataFrames. Pandas can import data from various
file formats and sources, such as comma-separated values
(CSV), JSON, Parquet, SQL database tables or queries, and
Microsoft Excel.
4. Pandas is one of the most widely used Python libraries in
data science. It provides high-performance, easy-to-use data
structures and data analysis tools.
5. Pandas is an open-source library that provides
high-performance, easy-to-use data structures and data
analysis tools for the Python programming language.
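
As a minimal sketch of the importing described in point 3, the example below reads the same small table from CSV and from JSON. The sample data and the variable names (csv_text, json_text, df_csv, df_json) are hypothetical, and the text is held in memory with io.StringIO so that no files are needed.

```python
import io

import pandas as pd

# Hypothetical sample data, kept in memory so the sketch needs no files.
csv_text = "name,score\nAda,90\nLinus,85\n"
json_text = '[{"name": "Ada", "score": 90}, {"name": "Linus", "score": 85}]'

# read_csv and read_json both accept file paths or file-like objects.
df_csv = pd.read_csv(io.StringIO(csv_text))
df_json = pd.read_json(io.StringIO(json_text), orient="records")

print(df_csv)
print(df_json)
```

With real files, the io.StringIO objects would be replaced by file paths. Pandas also offers read_parquet, read_sql, and read_excel for the other sources, which may need extra packages installed.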
Features of pandas:

1. Fast and efficient DataFrame object with default and
customized indexing.
2. Data alignment and integrated handling of missing data.
3. Reshaping and pivoting of data sets.
4. Label-based slicing, indexing and subsetting of large data
sets.
5. Columns from a data structure can be deleted or inserted.
6. Pandas has a large variety of built-in tools for reading
and writing data.
7. Pandas works well with other helpful Python libraries such
as Matplotlib and NumPy.
8. The code for Pandas is written in Python, with
performance-critical parts in Cython and C, which makes it fast
and responsive.
9. Pandas offers built-in plotting abilities that help you plot
your data and examine the resulting graphs.
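
Several of the features above can be sketched in one short example: customized indexing, missing-data handling, label-based slicing, inserting and deleting a column, and pivoting. The data, the column names, and the row labels below are hypothetical and chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical data: one row per (city, year), with one missing value.
df = pd.DataFrame(
    {
        "city": ["Pune", "Pune", "Delhi", "Delhi"],
        "year": [2022, 2023, 2022, 2023],
        "sales": [10.0, np.nan, 7.0, 9.0],
    },
    index=["r1", "r2", "r3", "r4"],  # customized index labels
)

# Handling of missing data: replace NaN with 0.
df["sales"] = df["sales"].fillna(0.0)

# Label-based slicing: rows r2 through r3 (both ends are included).
subset = df.loc["r2":"r3", ["city", "sales"]]

# Insert a column, then delete it again.
df["sales_x2"] = df["sales"] * 2
df = df.drop(columns=["sales_x2"])

# Reshaping: one row per city, one column per year.
wide = df.pivot(index="city", columns="year", values="sales")

print(subset)
print(wide)
```

Filling missing values with 0 is only one possible choice here. Depending on the data, dropping the rows with dropna() may be more appropriate.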
Advantages of pandas:
1. Excellent representation of data:
The Pandas library is a good tool for anyone who wants to
get into data science or data analysis because of the
different ways it can represent and organize data. This
matters because data cannot be analyzed or even read
properly unless it is represented well. A clean,
well-organized data set is essential, since messy data is
confusing to analyze and read.
2. Less coding done, more work accomplished:
By writing 1-2 lines of code in Pandas, you can accomplish
tasks that would require about 10-15 lines of code in C++ or
Java, maybe even more. Data science involves a great deal of
practice, so this conciseness is especially useful for people
just getting into the field.
3. Extensive feature set:
Pandas has helped data analysis reach an entirely new
level. It helps you filter data according to the conditions
you set, as well as segregate and segment your data
according to your own preference.
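
The filtering and segmenting mentioned in point 3, and the short code mentioned in point 2, can be sketched as follows. The table, its column names, and the threshold of 45 are hypothetical.

```python
import pandas as pd

# Hypothetical table of salaries per department.
df = pd.DataFrame(
    {
        "dept": ["A", "B", "A", "B"],
        "salary": [50, 60, 70, 40],
    }
)

# Filtering: keep only the rows that satisfy a condition (one line).
high = df[df["salary"] > 45]

# Segmenting: split the rows by department and total each group (one line).
totals = df.groupby("dept")["salary"].sum()

print(high)
print(totals)
```

Each task takes a single expression here, which is the kind of saving over a hand-written loop that the second advantage describes.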
Disadvantages of pandas:

1. A complex syntax which is not always in line with Python:
Although Pandas is a part of the Python ecosystem, some of
its syntax is complex and differs from ordinary Python. This
is a problem because many users cannot switch efficiently
and seamlessly between normal Python code and Pandas.
However, the problem arises mainly with advanced use of
Pandas.
2. Steep learning curve:
Pandas initially has a mild learning slope. But as you go
deeper into the library, the learning slope becomes steeper. The
functionality can become confusing and cause beginners some
problems. However, with determination, it can be overcome.
3. Bad documentation:
Without good documentation, it is difficult to learn a new
library. The Pandas documentation is not much help in
understanding the harder functions of the library, which slows
down the learning process.
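Simple program:
# Import the pandas library under the alias pd
import pandas as pd
import numpy as np
# Build a NumPy array of strings to use as the Series values
data = np.array(['a', 'b', 'c', 'd'])
# Create a Series with custom integer index labels
s = pd.Series(data, index=[100, 101, 102, 103])
print(s)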

Output:
100    a
101    b
102    c
103    d
dtype: object
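
As a follow-up sketch, the same Series can be read back by its index labels or by position. The variable names (one_value, label_slice, first_value) are hypothetical. The example uses .loc and .iloc, because with an integer index these accessors make it explicit whether a number is a label or a position.

```python
import pandas as pd

# Same values and index labels as the simple program above.
s = pd.Series(["a", "b", "c", "d"], index=[100, 101, 102, 103])

# Label-based access: 100 is an index label, not a position.
one_value = s.loc[100]

# Label-based slice: both end labels are included.
label_slice = s.loc[101:102]

# Position-based access: 0 is the first element.
first_value = s.iloc[0]

print(one_value)
print(label_slice.tolist())
print(first_value)
```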
Thank You
