
Pandas

Pandas is a popular Python library used for data manipulation and analysis. It introduces two primary data structures - Series for one-dimensional labeled arrays and DataFrame for two-dimensional labeled data like tables. Pandas provides functions for reading, writing, cleaning, filtering, aggregating, merging, reshaping and analyzing structured data. CSV files store tabular data as plain text, with each row on a new line separated by commas or other delimiters. Pandas makes it convenient to work with CSV data. Wes McKinney created pandas in 2008 to provide a flexible tool for data analysis in Python, filling a gap between Python and R. Pandas can be installed via pip and imported for use.

Uploaded by

Akash Singh
Copyright
© All Rights Reserved

What is pandas?

 In Python, "pandas" refers to the pandas library, which is a
popular open-source data manipulation and analysis tool. It
provides data structures and functions designed to make working
with structured data, such as tabular data, easier and more
efficient.

 Pandas introduces two primary data structures: the Series and the
DataFrame.
1. Series: A Series is a one-dimensional labeled array that can
hold any data type. It is similar to a column in a
spreadsheet or a single column in a database table.

2. DataFrame: A DataFrame is a two-dimensional labeled data
structure that consists of columns, each of which can hold
different data types. It can be thought of as a tabular
representation of data, similar to a spreadsheet or a SQL
table.
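
As a minimal sketch of the two structures described above, the snippet below builds one Series and one DataFrame. The fruit names, prices, and variable names are made-up illustration data, not part of the original notes.

```python
import pandas as pd

# A Series: one-dimensional, labeled values (the labels are hypothetical).
prices = pd.Series([10.5, 20.0, 7.25], index=["apple", "banana", "cherry"])

# A DataFrame: two-dimensional, and each column may hold its own data type.
fruits = pd.DataFrame(
    {
        "name": ["apple", "banana", "cherry"],
        "price": [10.5, 20.0, 7.25],
        "in_stock": [True, False, True],
    }
)

print(prices["banana"])        # look up a value by its label
print(fruits.dtypes)           # one data type per column
print(type(fruits["price"]))   # a single column of a DataFrame is a Series
```

Selecting one column of the DataFrame gives back a Series, which is why a Series is often compared to a single spreadsheet column.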

 Pandas provides a wide range of functionalities for data
manipulation and analysis, including data cleaning, data filtering,
aggregation, merging, reshaping, and more. It also integrates well
with other popular Python libraries such as NumPy and Matplotlib.
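
The operations listed above can be sketched roughly as follows. The sales and managers tables, their column names, and the threshold of 50 are assumptions made up for this example; the pandas calls used (dropna, boolean filtering, groupby, merge) are standard ones.

```python
import pandas as pd

# Hypothetical data: one amount is missing on purpose.
sales = pd.DataFrame(
    {
        "region": ["north", "south", "north", "south", "east"],
        "amount": [100.0, None, 250.0, 80.0, 40.0],
    }
)
managers = pd.DataFrame(
    {"region": ["north", "south", "east"], "manager": ["Asha", "Ben", "Chen"]}
)

# Cleaning: drop rows where the amount is missing.
clean = sales.dropna(subset=["amount"])

# Filtering: keep rows with an amount of at least 50.
large = clean[clean["amount"] >= 50]

# Aggregation: total amount per region.
totals = large.groupby("region", as_index=False)["amount"].sum()

# Merging: attach the manager for each region.
report = totals.merge(managers, on="region", how="left")
print(report)
```

Each step returns a new DataFrame and leaves the original unchanged, so the intermediate results can be inspected one at a time.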
What is CSV?
 A CSV (Comma-Separated Values) file is a plain text file that stores
tabular data (data organized in rows and columns) in a structured
format. It is a commonly used file format for storing and
exchanging data between different software applications.
 In a CSV file, each line represents a row of data, and the values
within each row are separated by commas (or other delimiters like
semicolons or tabs). Each line typically represents a record, and
each value represents a field or column within that record.
 Python's pandas library, for example, provides functions to read
and write CSV files, making it convenient to work with tabular data
stored in this format.
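
A small sketch of reading and writing CSV data with pandas is shown below. The CSV text and names are invented for illustration, and io.StringIO is used in place of a real file so the example runs on its own; with a real file, a path such as a hypothetical "people.csv" would be passed instead.

```python
import io

import pandas as pd

# Hypothetical CSV text: a header line, then one record per line.
csv_text = "name,age,city\nAsha,30,Delhi\nBen,25,Mumbai\n"

# Reading: read_csv accepts a file path or a file-like object.
people = pd.read_csv(io.StringIO(csv_text))
print(people)

# Writing: with no path given, to_csv returns the CSV text as a string.
# sep=";" switches the delimiter from a comma to a semicolon.
semicolon_text = people.to_csv(index=False, sep=";")
print(semicolon_text)

# Reading data that uses a different delimiter needs the same sep value.
people_again = pd.read_csv(io.StringIO(semicolon_text), sep=";")
```

Passing index=False keeps the row labels out of the written file, which is usually what is wanted when the labels are just 0, 1, 2 and so on.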
History of pandas:-
 It was created by Wes McKinney and initially released in 2008. The
development of pandas was motivated by the need for a flexible
and efficient tool to handle and analyze structured data in Python.
 Wes McKinney, while working as a quantitative analyst at AQR
Capital Management, found the existing tools for data analysis in
Python to be lacking. He wanted a library that could provide a
similar experience to working with data in R, a popular language
for statistical computing. Thus, he started developing pandas to fill
this gap and provide a powerful and intuitive tool for data analysis
in Python.
Installation Method:-
1. Pandas Environment Setup:-
 pip install pandas
2. How we can use it:-
 import pandas as pd
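
One simple way to confirm that the installation worked, offered here as a suggestion and not as part of the original steps, is to import the library and print its version:

```python
import pandas as pd

# If the import succeeds and a version string is printed, pandas is installed.
print(pd.__version__)
```

If the import fails with a ModuleNotFoundError, pandas was most likely installed into a different Python environment from the one running the script.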
