manishadav
COLLEGE OF APPLIED SCIENCES
a. Create a two-dimensional array, ARR1, having random values from 0 to 1. Compute the mean,
standard deviation, and variance of ARR1 along the second axis.
b. Create a 2-dimensional array of m x n integer elements. Print the shape, type and data
type of the array, then reshape it into an n x m array, where m and n are user inputs given at
run time.
c. Test whether the elements of a given 1D array are zero, non-zero and NaN. Record the indices of
these elements in three separate arrays.
d. Create three random arrays of the same size: Array1, Array2 and Array3. Subtract Array2 from
Array3 and store the result in Array4. Create another array, Array5, having two times the values in Array1. Find
the covariance and correlation of Array1 with Array4 and Array5 respectively.
e. Create two random arrays of the same size 10: Array1, and Array2. Find the sum of the first half
of both the arrays and product of the second half of both the arrays.
f. Create an array with random values. Determine the size of the memory occupied by the array.
g. Create a 2-dimensional array of size m x n having integer elements in the range (10,100). Write
statements to swap any two rows, reverse a specified column and store updated array in another
variable.
a. Code
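One possible solution to part (a), sketched with NumPy. The 4 x 5 shape and the seed are arbitrary choices made here for illustration, not values given in the question.

```python
import numpy as np

rng = np.random.default_rng(seed=0)   # seed fixed only so the run is repeatable
ARR1 = rng.random((4, 5))             # assumed shape; values lie in [0, 1)

# axis=1 is the second axis, so each statistic is computed per row
row_mean = ARR1.mean(axis=1)
row_std = ARR1.std(axis=1)
row_var = ARR1.var(axis=1)

print("ARR1:\n", ARR1)
print("Mean along axis 1:", row_mean)
print("Std  along axis 1:", row_std)
print("Var  along axis 1:", row_var)
```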
OUTPUT
b. Code
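A minimal sketch for part (b). The helper name build_and_reshape is hypothetical, and the array is filled with consecutive integers as a placeholder since the question does not say what values to use. The sizes are passed as arguments here; at run time they could be read with input() as shown in the comment.

```python
import numpy as np

def build_and_reshape(m, n):
    """Create an m x n integer array, report its properties, return it with its n x m reshape."""
    arr = np.arange(m * n).reshape(m, n)   # placeholder values 0 .. m*n-1
    print("Shape:", arr.shape)
    print("Type:", type(arr))
    print("Data type:", arr.dtype)
    reshaped = arr.reshape(n, m)
    return arr, reshaped

# To take the sizes from the user at run time:
#   m = int(input("Enter m: "))
#   n = int(input("Enter n: "))
arr, reshaped = build_and_reshape(2, 3)
print(reshaped)
```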
OUTPUT
c. Code
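A possible approach to part (c), using an assumed sample array. Note that NaN compares as not equal to zero, so it has to be excluded explicitly from the non-zero indices.

```python
import numpy as np

# Sample data chosen for illustration only
arr = np.array([1.0, 0.0, np.nan, 3.5, 0.0, np.nan, -2.0])

zero_idx = np.where(arr == 0)[0]
nan_idx = np.where(np.isnan(arr))[0]
# (arr != 0) is True for NaN as well, so mask NaN out
nonzero_idx = np.where((arr != 0) & ~np.isnan(arr))[0]

print("Zero indices:    ", zero_idx)
print("Non-zero indices:", nonzero_idx)
print("NaN indices:     ", nan_idx)
```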
OUTPUT
d.Code
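A sketch for part (d). The array size of 10 and the seed are assumptions. The wording of the question is ambiguous about which statistic goes with which pair, so both covariance and correlation are computed for both pairs here.

```python
import numpy as np

rng = np.random.default_rng(seed=1)   # seed fixed only for repeatability
Array1 = rng.random(10)               # size 10 is an assumed choice
Array2 = rng.random(10)
Array3 = rng.random(10)

Array4 = Array3 - Array2
Array5 = 2 * Array1

# np.cov and np.corrcoef return 2 x 2 matrices; [0, 1] is the cross term
cov_1_4 = np.cov(Array1, Array4)[0, 1]
corr_1_4 = np.corrcoef(Array1, Array4)[0, 1]
cov_1_5 = np.cov(Array1, Array5)[0, 1]
corr_1_5 = np.corrcoef(Array1, Array5)[0, 1]

print("Array1 vs Array4: cov =", cov_1_4, "corr =", corr_1_4)
print("Array1 vs Array5: cov =", cov_1_5, "corr =", corr_1_5)
```

Since Array5 is an exact multiple of Array1, its correlation with Array1 should come out as 1 up to floating-point rounding.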
OUTPUT
e. Code
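A sketch for part (e), under one reading of the question: the first halves of the two arrays are added element-wise and the second halves are multiplied element-wise. Other readings (for example, a single scalar sum) are possible.

```python
import numpy as np

rng = np.random.default_rng(seed=2)   # seed fixed only for repeatability
Array1 = rng.random(10)
Array2 = rng.random(10)

half = Array1.size // 2
first_half_sum = Array1[:half] + Array2[:half]          # element-wise sum
second_half_product = Array1[half:] * Array2[half:]     # element-wise product

print("Sum of first halves:     ", first_half_sum)
print("Product of second halves:", second_half_product)
```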
OUTPUT
f. Code
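A short sketch for part (f). The 3 x 4 shape is an assumption; the memory used by the array data is the element count times the size of one element, which NumPy also exposes directly as nbytes.

```python
import numpy as np

arr = np.random.default_rng(seed=3).random((3, 4))   # assumed shape, float64 values

bytes_manual = arr.size * arr.itemsize   # number of elements x bytes per element
print("Memory (size * itemsize):", bytes_manual, "bytes")
print("Memory (nbytes):         ", arr.nbytes, "bytes")
```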
OUTPUT
g. Code
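A possible solution to part (g). The sizes m = 4 and n = 3, the rows being swapped (0 and 2) and the column being reversed (1) are all arbitrary choices for illustration. The range is taken as 10 inclusive to 100 exclusive, which is one reading of "(10,100)".

```python
import numpy as np

rng = np.random.default_rng(seed=4)   # seed fixed only for repeatability
m, n = 4, 3                           # assumed sizes
arr = rng.integers(10, 100, size=(m, n))   # integers in [10, 100)

updated = arr.copy()                  # keep the original untouched
updated[[0, 2]] = updated[[2, 0]]     # swap rows 0 and 2
updated[:, 1] = updated[::-1, 1].copy()   # reverse column 1

print("Original:\n", arr)
print("Updated:\n", updated)
```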
OUTPUT
a. Create a series with 5 elements. Display the series sorted on index and also sorted on values
separately.
b. Create a series with N elements with some duplicate values. Find the minimum and maximum
ranks assigned to the values using ‘first’ and ‘max’ methods.
c. Display the index value of the minimum and maximum element of a Series.
a. Code
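A minimal sketch for part (a) of the Series question. The values and index labels are made up for illustration.

```python
import pandas as pd

# Sample series with an unsorted index and unsorted values
s = pd.Series([40, 10, 50, 20, 30], index=["d", "a", "e", "c", "b"])

by_index = s.sort_index()
by_value = s.sort_values()

print("Sorted on index:\n", by_index)
print("Sorted on values:\n", by_value)
```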
OUTPUT
b. Code
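A sketch for part (b), using Series.rank with an assumed six-element series. With method="first", tied values get distinct ranks in order of appearance; with method="max", every tied value gets the highest rank in its group.

```python
import pandas as pd

s = pd.Series([7, 3, 7, 1, 3, 9])   # sample data with duplicates

rank_first = s.rank(method="first")
rank_max = s.rank(method="max")

print("Ranks (first):", rank_first.tolist())
print("  min rank:", rank_first.min(), " max rank:", rank_first.max())
print("Ranks (max):  ", rank_max.tolist())
print("  min rank:", rank_max.min(), " max rank:", rank_max.max())
```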
OUTPUT
c. Code
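For part (c), idxmin and idxmax return the index labels of the smallest and largest elements. The series below is sample data only.

```python
import pandas as pd

s = pd.Series([40, 10, 50, 20, 30], index=["d", "a", "e", "c", "b"])

min_label = s.idxmin()
max_label = s.idxmax()

print("Index of minimum element:", min_label)
print("Index of maximum element:", max_label)
```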
OUTPUT
3. Create a data frame having at least 3 columns and 50 rows to store numeric data generated using
a random function. Replace 10% of the values with null values whose index positions are generated
using a random function. Do the following:
c. Identify the row label having maximum of the sum of all values in a row and drop that row.
f. Find the correlation between the first and second columns and the covariance between the second and third
columns.
a.Code
OUTPUT
b. Code
OUTPUT
c. Code
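A sketch for part (c) that also builds the data frame described in the question. The column names A, B, C and the seed are assumptions. Row sums skip null values, which is the pandas default.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(seed=5)   # seed fixed only for repeatability
df = pd.DataFrame(rng.random((50, 3)), columns=["A", "B", "C"])

# Null out 10% of the cells at randomly generated positions
n_null = df.size // 10
flat_pos = rng.choice(df.size, size=n_null, replace=False)
rows, cols = np.unravel_index(flat_pos, df.shape)
values = df.to_numpy(copy=True)
values[rows, cols] = np.nan
df = pd.DataFrame(values, columns=df.columns)

# Row label with the largest row sum, then drop that row
row_sums = df.sum(axis=1)      # NaN values are skipped
max_label = row_sums.idxmax()
df_dropped = df.drop(index=max_label)

print("Row with maximum sum:", max_label)
print("Rows remaining:", len(df_dropped))
```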
OUTPUT
d. Code
OUTPUT
e. Code
OUTPUT
f. Code
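A sketch for part (f), again with assumed column names A, B, C. Series.corr and Series.cov ignore any pair in which either value is null.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(seed=6)   # seed fixed only for repeatability
df = pd.DataFrame(rng.random((50, 3)), columns=["A", "B", "C"])

# Replace 10% of the cells with nulls at random positions
null_mask = np.zeros(df.size, dtype=bool)
null_mask[rng.choice(df.size, size=df.size // 10, replace=False)] = True
df = df.mask(null_mask.reshape(df.shape))

corr_ab = df["A"].corr(df["B"])   # first vs second column
cov_bc = df["B"].cov(df["C"])     # second vs third column

print("Correlation(A, B):", corr_ab)
print("Covariance(B, C): ", cov_bc)
```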
OUTPUT
g. Code
OUTPUT
4. Consider two Excel files having the attendance of two workshops, each of duration 5 days. Each file
has three fields, ‘Name’, ‘Date’ and ‘Duration’ (in minutes), where names may be repeated within a file.
Note that duration may take one of three values (30, 40, 50) only. Import the data into two data
frames and do the following:
a. Perform merging of the two data frames to find the names of students who had attended both
workshops.
b. Find names of all students who have attended a single workshop only.
c. Merge two data frames row-wise and find the total number of records in the data frame.
d. Merge two data frames row-wise and use two columns viz. names and dates as multi-row indexes.
Generate descriptive statistics for this hierarchical data frame.
a.Code
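A sketch for part (a). The two frames are built in memory with made-up names and dates so the example runs on its own; with real files they would instead be loaded with pd.read_excel (the file names in the comment are hypothetical).

```python
import pandas as pd

# With real data: ws1 = pd.read_excel("workshop1.xlsx"), ws2 = pd.read_excel("workshop2.xlsx")
ws1 = pd.DataFrame({
    "Name": ["Asha", "Ravi", "Asha", "Meena"],
    "Date": ["2024-01-01", "2024-01-01", "2024-01-02", "2024-01-02"],
    "Duration": [30, 40, 50, 30],
})
ws2 = pd.DataFrame({
    "Name": ["Ravi", "Kiran", "Asha", "Ravi"],
    "Date": ["2024-02-05", "2024-02-05", "2024-02-06", "2024-02-06"],
    "Duration": [40, 30, 50, 50],
})

# Names repeat within a file, so de-duplicate before the inner merge
names1 = ws1[["Name"]].drop_duplicates()
names2 = ws2[["Name"]].drop_duplicates()
both = pd.merge(names1, names2, on="Name", how="inner")

print("Attended both workshops:", both["Name"].tolist())
```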
OUTPUT
b.Code
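For part (b), one option is an outer merge with indicator=True, which adds a _merge column saying whether each name came from the left frame, the right frame, or both. The data below is made up for illustration.

```python
import pandas as pd

# Stand-in name columns; real data would come from the two Excel files
ws1 = pd.DataFrame({"Name": ["Asha", "Ravi", "Asha", "Meena"]})
ws2 = pd.DataFrame({"Name": ["Ravi", "Kiran", "Asha", "Ravi"]})

merged = ws1.drop_duplicates().merge(
    ws2.drop_duplicates(), on="Name", how="outer", indicator=True
)
# Keep names that appear in exactly one of the two frames
single = merged.loc[merged["_merge"] != "both", "Name"]

print("Attended a single workshop only:", sorted(single))
```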
OUTPUT
c.Code
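Part (c) can be sketched with pd.concat, which stacks the frames row-wise. The sample frames are placeholders.

```python
import pandas as pd

ws1 = pd.DataFrame({"Name": ["Asha", "Ravi", "Asha", "Meena"],
                    "Duration": [30, 40, 50, 30]})
ws2 = pd.DataFrame({"Name": ["Ravi", "Kiran", "Asha", "Ravi"],
                    "Duration": [40, 30, 50, 50]})

combined = pd.concat([ws1, ws2], ignore_index=True)   # row-wise merge
total_records = len(combined)

print("Total number of records:", total_records)
```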
OUTPUT
d.Code
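A sketch for part (d): concatenate row-wise, move Name and Date into a two-level row index, then call describe(). All values are made up for illustration.

```python
import pandas as pd

ws1 = pd.DataFrame({
    "Name": ["Asha", "Ravi", "Asha", "Meena"],
    "Date": ["2024-01-01", "2024-01-01", "2024-01-02", "2024-01-02"],
    "Duration": [30, 40, 50, 30],
})
ws2 = pd.DataFrame({
    "Name": ["Ravi", "Kiran", "Asha", "Ravi"],
    "Date": ["2024-02-05", "2024-02-05", "2024-02-06", "2024-02-06"],
    "Duration": [40, 30, 50, 50],
})

combined = pd.concat([ws1, ws2], ignore_index=True)
hier = combined.set_index(["Name", "Date"]).sort_index()   # hierarchical row index
stats = hier.describe()

print(hier)
print(stats)
```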
OUTPUT
5. Using Iris data, plot the following with proper legend and axis labels: (Download IRIS data from:
https://archive.ics.uci.edu/ml/datasets/iris or import it from sklearn datasets).
a. Load the data into a pandas data frame. Use the DataFrame.info() method to look at the info on datatypes in
the dataset.
b. Find the number of missing values in each column (Check number of null values in a column using
df.isnull().sum()).
c. Plot a bar chart to show the frequency of each class label in the data.
d. Draw a scatter plot for Petal Length vs Sepal Length and fit a regression line.
f. Use a pair plot to show pairwise bivariate distribution in the Iris Dataset.
h. Compute mean, mode, median, standard deviation, confidence interval and standard error for
each numeric feature.
a.Code
OUTPUT
b.Code
OUTPUT
c.Code
OUTPUT
d.Code
OUTPUT
e.Code
OUTPUT
f.Code
OUTPUT
g.Code
OUTPUT
h.Code
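A sketch for part (h). The helper summarize is hypothetical, and the small frame below is only a stand-in so the example runs without downloading anything; in practice the frame would be the Iris data loaded in part (a). The confidence interval uses a normal approximation (mean ± 1.96 standard errors for roughly 95%), which is an assumption; a t-based interval may be preferred for small samples.

```python
import pandas as pd

def summarize(df, z=1.96):
    """Summary statistics for every numeric column of df.

    z=1.96 gives an approximate 95% interval (normal approximation, assumed).
    """
    num = df.select_dtypes(include="number")
    se = num.sem()   # standard error of the mean
    return pd.DataFrame({
        "mean": num.mean(),
        "median": num.median(),
        "mode": num.mode().iloc[0],   # first mode when there are ties
        "std": num.std(),
        "std_error": se,
        "ci_low": num.mean() - z * se,
        "ci_high": num.mean() + z * se,
    })

# Stand-in rows with Iris-like column names (placeholder values)
sample = pd.DataFrame({
    "sepal_length": [5.1, 4.9, 4.7, 5.1, 5.0],
    "petal_length": [1.4, 1.4, 1.3, 1.5, 1.4],
    "species": ["setosa"] * 5,
})
summary = summarize(sample)
print(summary)
```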
OUTPUT
i.Code
OUTPUT
6. Using the Titanic dataset, do the following:
a. Clean the data by dropping the column which has the largest number of missing values.
f. Draw a scatter plot for passenger fare paid by Female and Male passengers separately.
h. Draw a pie chart for three groups labelled as class 1, class 2, class 3 respectively, displayed in
different colours. The occurrence of each group, converted into a percentage, should be displayed in
the pie chart. Label the chart appropriately.
i. Find % of survived passengers for each class and answer the question “Did class play a role in
survival?”.
a,b,c,d:Code
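A sketch for part (a) only. The tiny frame is a stand-in with made-up values; the column names Age, Cabin and Fare are assumed to match the commonly distributed Titanic CSV and may differ in your copy.

```python
import numpy as np
import pandas as pd

# Stand-in for the Titanic data (placeholder values)
df = pd.DataFrame({
    "Age": [22.0, np.nan, 26.0, 35.0],
    "Cabin": [np.nan, "C85", np.nan, np.nan],
    "Fare": [7.25, 71.28, 7.92, 53.10],
})

missing = df.isna().sum()          # missing values per column
worst = missing.idxmax()           # column with the most missing values
cleaned = df.drop(columns=worst)

print("Missing values per column:\n", missing)
print("Dropped column:", worst)
```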
a,b,c,d:OUTPUT
e.Code
OUTPUT
f.Code
OUTPUT
g.Code
OUTPUT
h.Code
OUTPUT
i.Code
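A sketch for part (i). The column names Pclass and Survived are assumed (they are the names used in the commonly distributed Titanic CSV), and the ten rows are made up, so the printed percentages say nothing about the real data. Because Survived is 0 or 1, its mean within each class is the survival fraction.

```python
import pandas as pd

# Stand-in data; real values would come from the Titanic dataset
df = pd.DataFrame({
    "Pclass":   [1, 1, 1, 2, 2, 3, 3, 3, 3, 3],
    "Survived": [1, 1, 0, 1, 0, 0, 0, 1, 0, 0],
})

pct_survived = df.groupby("Pclass")["Survived"].mean() * 100

print("Percentage survived per class:\n", pct_survived.round(2))
```

Whether class played a role in survival has to be judged from the percentages obtained on the actual dataset: a large gap between classes would suggest that it did.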
OUTPUT
7. Consider the following data frame containing a family name, gender of the family member and
her/his monthly income in each record.
b. Display the highest and lowest monthly income for each family name.
c. Calculate and display the monthly income of all members earning less than Rs. 80000.00.
d. Display the total number of females along with their average monthly income.
e. Delete rows with monthly income less than the average income of all members.
a. Code
OUTPUT
b. Code
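A sketch for part (b). The data frame referred to in the question is not reproduced here, so the frame below, including its column names FamilyName, Gender and MonthlyIncome, is entirely hypothetical.

```python
import pandas as pd

# Hypothetical stand-in for the frame given in the question
df = pd.DataFrame({
    "FamilyName": ["Shah", "Shah", "Vats", "Vats", "Kumar", "Kumar"],
    "Gender": ["Male", "Female", "Female", "Male", "Male", "Female"],
    "MonthlyIncome": [114000.0, 65000.0, 43150.0, 69500.0, 155000.0, 103000.0],
})

income_range = df.groupby("FamilyName")["MonthlyIncome"].agg(["max", "min"])
print(income_range)
```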
OUTPUT
c. Code
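For part (c), a boolean filter on the income column is enough. The frame and its column names are hypothetical, since the original table is not shown.

```python
import pandas as pd

# Hypothetical stand-in for the frame given in the question
df = pd.DataFrame({
    "FamilyName": ["Shah", "Shah", "Vats", "Vats", "Kumar", "Kumar"],
    "Gender": ["Male", "Female", "Female", "Male", "Male", "Female"],
    "MonthlyIncome": [114000.0, 65000.0, 43150.0, 69500.0, 155000.0, 103000.0],
})

below_80k = df[df["MonthlyIncome"] < 80000.00]
print(below_80k[["FamilyName", "Gender", "MonthlyIncome"]])
```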
OUTPUT
d. Code
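A sketch for part (d), again on a hypothetical frame. It assumes gender is stored as the string "Female".

```python
import pandas as pd

# Hypothetical stand-in for the frame given in the question
df = pd.DataFrame({
    "FamilyName": ["Shah", "Shah", "Vats", "Vats", "Kumar", "Kumar"],
    "Gender": ["Male", "Female", "Female", "Male", "Male", "Female"],
    "MonthlyIncome": [114000.0, 65000.0, 43150.0, 69500.0, 155000.0, 103000.0],
})

females = df[df["Gender"] == "Female"]
n_females = len(females)
avg_female_income = females["MonthlyIncome"].mean()

print("Number of females:", n_females)
print("Average monthly income of females:", round(avg_female_income, 2))
```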
OUTPUT
e. Code
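A sketch for part (e) on a hypothetical frame: compute the overall average, then drop the rows that fall below it.

```python
import pandas as pd

# Hypothetical stand-in for the frame given in the question
df = pd.DataFrame({
    "FamilyName": ["Shah", "Shah", "Vats", "Vats", "Kumar", "Kumar"],
    "Gender": ["Male", "Female", "Female", "Male", "Male", "Female"],
    "MonthlyIncome": [114000.0, 65000.0, 43150.0, 69500.0, 155000.0, 103000.0],
})

avg_income = df["MonthlyIncome"].mean()
below_avg = df[df["MonthlyIncome"] < avg_income].index
kept = df.drop(index=below_avg)    # rows at or above the average remain

print("Average income:", round(avg_income, 2))
print(kept)
```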
OUTPUT