INTRODUCTION TO DATA SCIENCE
BY MICROSOFT
WEBSITE: https://www.edx.org/
DURATION: 6 Weeks
AMOUNT PAID: Yes
• Select cell D2 and then hold the Shift and Ctrl keys and press the Down-Arrow key to select all the values in the
Temperature column.
• On the Home tab of the ribbon, in the Conditional Formatting drop-down list, point to Color Scales, and select
the Red-White Color Scale (with red at the top and white at the bottom). The Temperature cells are reformatted
so that the hottest days are colored an intense red, and the coolest days are much lighter in color intensity.
Scrolling through the data now, it is easier to find days that are particularly hot or cool.
• Select all the values in the Rainfall column, and then in the Conditional Formatting drop-down list, point to
Data Bars, and select the Light Blue Data Bar gradient fill. The cells are formatted with a visual indication of
the comparative level of rainfall for each day.
• Select all the values in the Sales column, and then in the Conditional Formatting drop-down list, point to
Top/Bottom Rules, and select Top 10%. Then in the Top 10% dialog box, select Green Fill with Dark Green
Text and click OK. The cells containing sales values in the top 10% are highlighted in green (you may need to
scroll to see them).
• Reselect the values in the Sales column if you deselected them, and then in the Conditional Formatting drop-
down list, point to Top/Bottom Rules, and select Bottom 10%. Then in the Bottom 10% dialog box, select Red
Fill with Dark Red Text and click OK. The cells containing sales values in the bottom 10% are highlighted in
red.
MODULE 2: DATA ANALYSIS FUNDAMENTALS
Aggregating Data
Grouping and Summarizing Data
Visualizing Data
Analyzing Data in Excel Online
Analyzing Data with a PivotTable
• PivotTables are one of Excel's most powerful features. A PivotTable lets us
extract the significance from a large, detailed data set by grouping its rows
and summarizing the values in each group.
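The grouping and summarizing that a PivotTable performs can be sketched outside Excel as well. The Python below is only a minimal illustration: the column names (month, product, sales) and every value are made up for the example and do not come from the course workbook.

```python
from collections import defaultdict

# Hypothetical rows: (month, product, sales). Values are invented for illustration.
rows = [
    ("Jan", "Lemonade", 10),
    ("Jan", "Lemonade", 15),
    ("Jan", "Orange", 7),
    ("Feb", "Lemonade", 20),
    ("Feb", "Orange", 5),
    ("Feb", "Orange", 8),
]


def pivot_sum(records):
    """Sum the value for each (row label, column label) pair."""
    table = defaultdict(lambda: defaultdict(int))
    for row_label, col_label, value in records:
        table[row_label][col_label] += value
    # Convert the nested defaultdicts to plain dicts for easy printing.
    return {r: dict(cols) for r, cols in table.items()}


summary = pivot_sum(rows)
for month, totals in summary.items():
    print(month, totals)
```

Here the month plays the role of the PivotTable row field, the product is the column field, and the sum of sales is the value field.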
Descriptive statistics help you understand the “shape” or distribution of your data; for
example, by finding measures of central tendency (the “typical” values, such as the mean,
median, and mode) and measures of variance (how far the other values spread above or
below those typical values).
EXAMPLES:
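As a small illustration of these measures, the sketch below uses Python's standard statistics module. The sales figures are hypothetical and chosen only to keep the arithmetic easy to follow.

```python
import statistics

# Hypothetical daily sales figures, invented for illustration.
sales = [10, 12, 12, 15, 18, 20, 25]

# Measures of central tendency.
mean = statistics.mean(sales)      # arithmetic average
median = statistics.median(sales)  # middle value of the sorted data
mode = statistics.mode(sales)      # most frequent value

# Measures of variance (sample versions, dividing by n - 1).
variance = statistics.variance(sales)
stdev = statistics.stdev(sales)

print("mean:", mean, "median:", median, "mode:", mode)
print("variance:", round(variance, 2), "standard deviation:", round(stdev, 2))
```

In Excel the corresponding worksheet functions are AVERAGE, MEDIAN, MODE, VAR.S, and STDEV.S.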
Data visualization is a general term that describes any effort to help people
understand the significance of data by placing it in a visual context.
Different types of charts provided by Excel:
• Histogram
• Pie Chart
• Column Chart
• Line Chart
• Bar Chart
• Area Chart
• Scatter Chart
Working with Samples:
Until now, we have worked with the full population of data. From here on, we
work with a sample drawn from that population rather than with every record.
Steps to create a random sample:
• Add a new column within the spreadsheet and name it Random_number
• In the first cell underneath your heading row, type “=RAND()”
• Press “Enter,” and a random number will appear in the cell
• Copy and paste the first cell into the other cells in this column
• Once each row contains a random number, sort the records by the
Random_number column
• Choose the first 500 rows. For a list of 3000 emails, for example, those
rows are a random sample of 500 emails.
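The same sort-by-random-number idea can be sketched in Python. This is a minimal illustration, not part of the course material: the function name, the seed parameter, and the generated email addresses are all assumptions made for the example.

```python
import random


def sample_by_random_key(records, k, seed=None):
    """Attach a random number to each record, sort by it, keep the first k."""
    rng = random.Random(seed)  # a seed makes the example repeatable
    keyed = [(rng.random(), record) for record in records]
    keyed.sort(key=lambda pair: pair[0])
    return [record for _, record in keyed[:k]]


# Hypothetical population of 3000 email addresses.
emails = [f"user{i}@example.com" for i in range(3000)]
chosen = sample_by_random_key(emails, 500, seed=42)
print(len(chosen), "emails selected")
```

In everyday Python code, random.sample(emails, 500) gives an equivalent sample without replacement; the longer version is shown only because it mirrors the spreadsheet steps.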
Inferential Statistics and Hypothesis Testing:
• To select the z-test tool, click the Data tab’s Data Analysis command button.
• When Excel displays the Data Analysis dialog box, select the z-Test: Two
Sample for Means tool and then click OK.
• In the Variable 1 Range and Variable 2 Range text boxes, identify the sample
values by telling Excel in what worksheet ranges you’ve stored the two
samples.
• Use the Hypothesized Mean Difference text box to state the difference you
hypothesize between the two means (enter 0 if you hypothesize that the means
are equal).
• Use the Variable 1 Variance (Known) and Variable 2 Variance (Known) text
boxes to provide the population variance for the first and second samples.
• In the Alpha text box, state the significance level for your z-test calculation
(for example, 0.05 for a 95% confidence level).
• In the Output Options section, indicate where the z-test tool results should be
stored.
• Click OK.
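The calculation behind a two-sample z-test for means can be sketched as follows. This is a minimal illustration that assumes known population variances, as the dialog box does; the function name and the sample values are hypothetical, and the output is not meant to reproduce Excel's report layout.

```python
from math import sqrt
from statistics import NormalDist, mean


def two_sample_z_test(sample1, sample2, var1, var2, hypothesized_diff=0.0):
    """Return the z statistic and two-tailed p-value for two sample means.

    var1 and var2 are the known population variances.
    """
    n1, n2 = len(sample1), len(sample2)
    standard_error = sqrt(var1 / n1 + var2 / n2)
    z = (mean(sample1) - mean(sample2) - hypothesized_diff) / standard_error
    p_two_tail = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_two_tail


# Hypothetical samples and variances, invented for illustration.
first = [5, 6, 7, 8, 9]
second = [4, 5, 6, 7, 8]
z, p = two_sample_z_test(first, second, var1=2.5, var2=2.5)

alpha = 0.05  # significance level, as entered in the Alpha text box
print("z =", round(z, 3), "p =", round(p, 4))
print("reject equal means" if p < alpha else "fail to reject equal means")
```

Comparing the two-tailed p-value with alpha is one common way to read the result; comparing z with the critical z value leads to the same decision.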
MODULE 4: INTRODUCTION TO MACHINE LEARNING