Exp1-ref-doc-installation
visualization?
(Pandas, NumPy, Matplotlib)
Python Pandas
Pandas is an open-source library for high-performance data manipulation in Python. This
tutorial is intended for both beginners and experts.
The name Pandas is derived from the term "Panel Data", an econometrics term for multidimensional
data. It was created in 2008 by Wes McKinney and is used for data analysis in Python.
Data analysis requires processing such as restructuring, cleaning, and merging. NumPy, SciPy, Cython,
and Pandas are just a few of the fast data processing tools available. Key features of Pandas:
o It has a fast and efficient DataFrame object with both default and custom indexing.
o It is used for reshaping and pivoting data sets.
o It supports group-by operations on data for aggregations and transformations.
o It is used for data alignment and for integrated handling of missing data.
o It provides time series functionality.
o It processes a variety of data sets in various formats, such as matrix data, heterogeneous tabular data, and
time series.
o It handles multiple operations on data sets, including subsetting, slicing, filtering, groupBy, reordering,
and reshaping.
o It integrates with other libraries such as SciPy and scikit-learn.
o It performs quickly, and Cython can be used to accelerate it even further.
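A few of the features listed above can be sketched in a short example. The data set, its column names (city, year, sales), and its values are made up purely for illustration; fillna, groupby, and pivot are the pandas methods being demonstrated.

```python
import numpy as np
import pandas as pd

# Hypothetical sales data; one value is missing (NaN).
df = pd.DataFrame({
    "city": ["Pune", "Pune", "Delhi", "Delhi"],
    "year": [2022, 2023, 2022, 2023],
    "sales": [10.0, np.nan, 7.0, 9.0],
})

# Missing data: replace NaN in the sales column with 0.0.
filled = df.fillna({"sales": 0.0})

# Group by: total sales per city.
totals = filled.groupby("city")["sales"].sum()

# Reshaping: one row per city, one column per year.
wide = filled.pivot(index="city", columns="year", values="sales")

print(totals)
print(wide)
```

Replacing missing values with zero is only one possible choice; depending on the analysis, dropping the rows or filling with a mean may be more appropriate.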
1) Series
A Series is defined as a one-dimensional array capable of storing a variety of data types. The row labels of a
Series are called the index. We can easily convert a list, tuple, or dictionary into a Series using the Series()
constructor. A Series cannot contain multiple columns. Its main parameter is the data itself, which can be a
list, tuple, dictionary, NumPy array, or scalar value.
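The conversions mentioned above can be sketched as follows. The values and variable names are illustrative only.

```python
import pandas as pd

# A list and a tuple both get default integer labels 0, 1, 2, ...
from_list = pd.Series([10, 20, 30])
from_tuple = pd.Series((1.5, 2.5))

# A dictionary supplies its keys as the index labels.
from_dict = pd.Series({"a": 1, "b": 2})

print(from_list.index.tolist())
print(from_dict.index.tolist())
print(from_dict["b"])  # look up a value by its label
```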
How to Check Python Version in Windows
Most out-of-the-box Windows installations do not come with Python pre-installed. However, it is always a good
idea to check.
python --version
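The same check can also be made from inside a Python program, which may help when an interpreter is available (for example inside an IDE) but the python command is not found at the prompt. This is a minimal sketch using only the standard library; the variable name version is just illustrative.

```python
import sys

# sys.version_info starts with (major, minor, micro).
version = "{}.{}.{}".format(*sys.version_info[:3])
print("Python", version)
```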
Before creating a Series from an array, we first have to import the numpy module and then use its array()
function in the program.
import pandas as pd
import numpy as np

# Build a NumPy array of strings, then wrap it in a Series
info = np.array(['Dr','V','i','j','a','y'])
a = pd.Series(info)
print(a)
output
0    Dr
1     V
2     i
3     j
4     a
5     y
dtype: object
Python Pandas Data Frame
It is a widely used data structure of pandas and works with a two-dimensional array with labeled
axes (rows and columns).
import pandas as pd
# a list of strings
x = ['Dr.Vijay', 'Kumar']
# Calling DataFrame constructor on list
df = pd.DataFrame(x)
print(df)
output
          0
0  Dr.Vijay
1     Kumar
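The list example above produces a single column labeled 0. To illustrate the labeled axes, here is a sketch that builds a DataFrame from a dictionary instead. The column names, row labels (r1, r2), and values are assumptions made for the example.

```python
import pandas as pd

# Hypothetical data: each dictionary key becomes a column label.
data = {"name": ["Vijay", "Kumar"], "age": [40, 35]}
df = pd.DataFrame(data, index=["r1", "r2"])

print(df)
print(df.columns.tolist())    # column labels
print(df.index.tolist())      # row labels
print(df.loc["r2", "name"])   # select one cell by row and column label
```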
Python NumPy
NumPy stands for numeric python which is a python package for the computation and processing of the
multidimensional and single dimensional array elements.
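As a brief illustration of single-dimensional and multidimensional arrays, the sketch below creates one of each and applies element-wise operations. The values are arbitrary.

```python
import numpy as np

one_d = np.array([1, 2, 3])                 # single-dimensional array
two_d = np.array([[1, 2, 3], [4, 5, 6]])    # two-dimensional array

print(one_d.ndim, one_d.shape)
print(two_d.ndim, two_d.shape)

# Arithmetic is applied element by element.
doubled = one_d * 2

# axis=0 sums down each column of the 2-D array.
column_sums = two_d.sum(axis=0)

print(doubled)
print(column_sums)
```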
NumPy doesn't come bundled with Python. We have to install it using pip, the Python package installer.
Installing NumPy
With pip set up, you can install NumPy from the command line:
pip install numpy
Before working with the matplotlib library, we need to install it in our Python environment. One way of
installing the matplotlib library is described below.
Using Anaconda distribution of Python
The Anaconda distribution is the easiest way to get the matplotlib library because matplotlib comes
pre-installed with it, so no further installation is needed.
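Whichever installation route is used, it can be useful to confirm that the libraries are actually available to the interpreter. The following is a minimal sketch using the standard-library importlib module; the helper name is_installed is hypothetical and not part of any of the libraries discussed here.

```python
import importlib.util

def is_installed(module_name):
    """Return True if the module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None

# Report the status of the three libraries used in this exercise.
for name in ("numpy", "pandas", "matplotlib"):
    status = "installed" if is_installed(name) else "missing"
    print(name, "->", status)
```

Any library reported as missing can then be installed with pip or obtained through Anaconda.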