0% found this document useful (0 votes)
52 views

Pandas (Software)

Pandas is a software library written for the Python programming language used for data manipulation and analysis. It offers data structures and operations for working with numerical tables and time series data. Pandas was created by Wes McKinney in 2008 at AQR Capital Management to allow for quantitative analysis on financial data. It allows importing data from various file formats and performing operations like merging, reshaping, selecting, cleaning and wrangling data stored in DataFrames, similar to R. Pandas is built upon NumPy and is widely used for data analysis.

Uploaded by

levin696
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
52 views

Pandas (Software)

Pandas is a software library written for the Python programming language used for data manipulation and analysis. It offers data structures and operations for working with numerical tables and time series data. Pandas was created by Wes McKinney in 2008 at AQR Capital Management to allow for quantitative analysis on financial data. It allows importing data from various file formats and performing operations like merging, reshaping, selecting, cleaning and wrangling data stored in DataFrames, similar to R. Pandas is built upon NumPy and is widely used for data analysis.

Uploaded by

levin696
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

pandas (software)

pandas is a software library written for the Python


programming language for data manipulation and pandas
analysis. In particular, it offers data structures and
operations for manipulating numerical tables and time
series. It is free software released under the three-clause
BSD license.[2] The name is derived from the term
"panel data", an econometrics term for data sets that
include observations over multiple time periods for the
same individuals.[3] Its name is a play on the phrase
Original author(s) Wes McKinney
"Python data analysis" itself.[4] Wes McKinney started
building what would become pandas at AQR Capital Developer(s) Community
while he was a researcher there from 2007 to 2010.[5] Initial release 11 January 2008
Stable release 2.0.3[1] / 28 June 2023
DataFrames Preview release 2.0rc1 / 15 March 2023
Repository github.com/pandas-dev
Pandas is mainly used for data analysis and associated
manipulation of tabular data in DataFrames. Pandas /pandas (https://github.co
allows importing data from various file formats such as m/pandas-dev/pandas)
comma-separated values, JSON, Parquet, SQL Written in Python, Cython, C
database tables or queries, and Microsoft Excel.[6]
Operating system Cross-platform
Pandas allows various data manipulation operations
such as merging,[7] reshaping,[8] selecting,[9] as well as Type Technical computing
data cleaning, and data wrangling features. The License New BSD License
development of pandas introduced into Python many
Website pandas.pydata.org (http
comparable features of working with DataFrames that
were established in the R programming language. The s://pandas.pydata.org)
pandas library is built upon another library, NumPy,
which is oriented to efficiently working with arrays instead of the features of working on DataFrames.

History
Developer Wes McKinney started working on pandas in 2008 while at AQR Capital Management out of
the need for a high performance, flexible tool to perform quantitative analysis on financial data. Before
leaving AQR he was able to convince management to allow him to open source the library.

Another AQR employee, Chang She, joined the effort in 2012 as the second major contributor to the
library.
In 2015, pandas signed on as a fiscally sponsored project of NumFOCUS, a 501(c)(3) nonprofit charity in
the United States.[10]

See also
matplotlib
NumPy
Dask
SciPy
R (programming language)
scikit-learn
statsmodels
List of numerical analysis software

References
1. "Release 2.0.3" (https://github.com/pandas-dev/pandas/releases/tag/v2.0.3). 28 June 2023.
Retrieved 1 July 2023.
2. "License – Package overview – pandas 1.0.0 documentation" (https://pandas.pydata.org/pa
ndas-docs/stable/overview.html#license). pandas. 28 January 2020. Retrieved 30 January
2020.
3. Wes McKinney (2011). "pandas: a Foundational Python Library for Data Analysis and
Statistics" (https://www.dlr.de/sc/Portaldata/15/Resources/dokumente/pyhpc2011/submissio
ns/pyhpc2011_submission_9.pdf) (PDF). Retrieved 2 August 2018.
4. McKinney, Wes (2017). Python for Data Analysis, Second Edition. O'Reilly Media. p. 5.
ISBN 9781491957660.
5. Kopf, Dan. "Meet the man behind the most important tool in data science" (https://qz.com/11
26615/the-story-of-the-most-important-tool-in-data-science/). Quartz. Retrieved 17 November
2020.
6. "IO tools (Text, CSV, HDF5, …) — pandas 1.4.1 documentation" (https://pandas.pydata.org/p
andas-docs/stable/user_guide/io.html).
7. "Merge, join, concatenate and compare — pandas 1.4.1 documentation" (https://pandas.pyd
ata.org/pandas-docs/stable/user_guide/merging.html).
8. "Reshaping and pivot tables — pandas 1.4.1 documentation" (https://pandas.pydata.org/pan
das-docs/stable/user_guide/reshaping.html).
9. "Indexing and selecting data — pandas 1.4.1 documentation" (https://pandas.pydata.org/pan
das-docs/stable/user_guide/indexing.html).
10. "NumFOCUS – pandas: a fiscally sponsored project" (https://www.numfocus.org/project/pan
das/). NumFOCUS. Retrieved 3 April 2018.

Further reading
McKinney, Wes (2017). Python for Data Analysis : Data Wrangling with Pandas, NumPy,
and IPython (2nd ed.). Sebastopol: O'Reilly. ISBN 978-1-4919-5766-0.
Molin, Stefanie (2019). Hands-On Data Analysis with Pandas: Efficiently perform data
collection, wrangling, analysis, and visualization using Python. Packt. ISBN 978-1-7896-
1532-6.
Chen, Daniel Y. (2018). Pandas for Everyone : Python Data Analysis. Boston: Addison-
Wesley. ISBN 978-0-13-454693-3.
VanderPlas, Jake (2016). "Data Manipulations with Pandas". Python Data Science
Handbook: Essential Tools for Working with Data. O'Reilly. pp. 97–216. ISBN 978-1-4919-
1205-8.
Pathak, Chankey (2018). Pandas Cookbook. pp. 1–8.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Pandas_(software)&oldid=1160289806"

You might also like