Data Science Course Content
This course introduces the main tools and ideas required of Data Scientists, Business Analysts, Data
Analysts, Analytics Managers, Actuarial Scientists, and Business Analytics Practitioners. It gives an
overview of the data, questions, and tools that data analysts and data scientists work with, and combines
data science concepts such as machine learning, visualization, data mining, programming, and data munging.
There are three components to this course. The first is a conceptual introduction to the ideas behind
turning data into actionable knowledge. The second is a set of manual calculations that show how the
formulae behind the logic are applied. The third is a practical introduction to the tools used in the
program, such as R programming and Excel.
Course features:
✓ Exclusive doubt-clarification session every weekend
✓ Real Time Case Study driven approach
✓ Placement Assistance
Pre-Requisite / Qualification:
✓ Any Graduate. No programming or statistics knowledge is required
INTRODUCTION
• What is Data Science? – Introduction.
• What background is required?
• Why Data Science?
• Importance of Data Science.
• Demand for Data Science Professionals.
• Brief Introduction to Big data and Data Analytics.
• Lifecycle of data science.
• Tools and Technologies used in Data Science.
• What is Machine Learning?
• Different types of Data Science Tasks.
QSHORE TECHNOLOGIES
Reach us at 9030821111 Email: [email protected]
BUSINESS STATISTICS
• Descriptive statistics and Inferential Statistics
• Sample and Population
• Variables and Data types
• Percentiles
• Measures of Central Tendency
• Measures of Spread
• Skewness, Kurtosis
• Degrees of freedom
• Variance, Covariance, Correlation
• Standardization/Scaling
• Probability
• Expected value of ‘x’
• Sampling Distribution
• Standard Probability Distribution Functions
• Bernoulli, Binomial, Normal distributions
• Standard Normal Deviate
• Decision Making Rules
• Test of Hypothesis
• One sample t-Test, Chi-square
• Two sample t-Test, Analysis of Variance (ANOVA)
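As a small illustration of several topics above (central tendency, spread, standardization, and a one sample t statistic), the following Python sketch uses only the standard library. The sample values and the hypothesized mean of 50 are made up for illustration and are not course data.

```python
import math
import statistics

# Hypothetical sample (illustrative values only)
sample = [52, 48, 55, 51, 49, 53, 50, 54]

mean = statistics.mean(sample)      # measure of central tendency
median = statistics.median(sample)
sd = statistics.stdev(sample)       # sample standard deviation (n - 1 degrees of freedom)

# Standardization: z-score of each observation
z_scores = [(x - mean) / sd for x in sample]

# One sample t statistic against a hypothesized population mean (assumed value)
mu0 = 50
t_stat = (mean - mu0) / (sd / math.sqrt(len(sample)))

print(mean, median, round(sd, 3), round(t_stat, 3))
```

The t statistic would then be compared with a critical value from the t distribution with n - 1 degrees of freedom to reach a decision.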
DATA PRE-PROCESSING
• Data Types and Conversions
• Binning, Scaling, Standardization, Normalization
• Min-max Scaling
• Missing values Treatment
• Imputation
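The pre-processing steps listed above can be sketched in a few lines of plain Python. The column values, the choice of mean imputation, and the bin cut points are assumptions made for illustration only.

```python
# Hypothetical column with one missing value (None); values are illustrative only
ages = [22, 35, None, 58, 41]

# Imputation: replace missing values with the mean of the observed values
observed = [a for a in ages if a is not None]
mean_age = sum(observed) / len(observed)
imputed = [mean_age if a is None else a for a in ages]

# Min-max scaling: map values onto the range [0, 1]
lo, hi = min(imputed), max(imputed)
min_max = [(a - lo) / (hi - lo) for a in imputed]

# Binning: assign each value to a labelled interval (cut points are assumptions)
def age_bin(a):
    if a < 30:
        return "young"
    if a < 50:
        return "middle"
    return "senior"

bins = [age_bin(a) for a in imputed]
print(imputed, min_max, bins)
```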
PREDICTION ANALYTICS
➢ Simple Linear Regression
➢ Variable Selection
➢ Multicollinearity – VIF
➢ Polynomial Regression
➢ Transformations
a. Bulging Rules
b. Box-Tidwell
c. Box-Cox
d. Weighted Least Squares
➢ Dummy variables
➢ Assessing Performance
➢ Logistic Regression
A Case Study will be presented on Logistic Regression
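For Simple Linear Regression and Assessing Performance, a minimal sketch of the least-squares calculation in plain Python might look like the following. The data values and variable names are hypothetical and chosen only to show the arithmetic.

```python
# Hypothetical data: advertising spend (x) and sales (y); values are illustrative only
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Least-squares estimates: slope = Sxy / Sxx, intercept = y_bar - slope * x_bar
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Assessing performance: coefficient of determination (R squared)
fitted = [intercept + slope * xi for xi in x]
ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
ss_tot = sum((yi - y_bar) ** 2 for yi in y)
r_squared = 1 - ss_res / ss_tot

print(round(slope, 3), round(intercept, 3), round(r_squared, 4))
```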
MACHINE LEARNING
Introduction to Supervised and Unsupervised Learning
➢ Neural Networks
a. Network Topology
b. Single Layer Perceptron
c. Multi-Layer Perceptron
d. Feed forward and Back propagation Models
➢ Decision Tree
a. Finding Root Node, Intermediate Nodes, Terminal Nodes
b. Construction of Rules
c. Misclassification
d. Gini Index
e. Overfitting and Pruning
f. Regression Trees
➢ Cluster Analysis
a. Hierarchical Clustering
b. Linkage Methods
c. Non-Hierarchical Clustering
d. K-Means Clustering
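The Gini Index listed under Decision Tree can be illustrated with a short Python sketch. It computes the Gini impurity of a node and the size-weighted impurity of a candidate split; the function names and the "yes"/"no" records are hypothetical.

```python
from collections import Counter

def gini_impurity(labels):
    """Gini impurity of a node: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def weighted_gini(left, right):
    """Impurity of a candidate split, weighted by the size of each child node."""
    n = len(left) + len(right)
    return len(left) / n * gini_impurity(left) + len(right) / n * gini_impurity(right)

# Hypothetical node with 4 "yes" and 4 "no" records
parent = ["yes"] * 4 + ["no"] * 4
# A candidate split that separates the classes imperfectly
left = ["yes", "yes", "yes", "no"]
right = ["yes", "no", "no", "no"]

print(gini_impurity(parent))       # impurity before the split
print(weighted_gini(left, right))  # impurity after the split (lower is better)
```

A tree-building procedure would typically evaluate many candidate splits this way and keep the one with the lowest weighted impurity.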
➢ Text Mining / Natural Language Processing
a. Unstructured Data
b. Text Analytics
c. Cleaning Text data
d. Tokenization
e. Pre-processing
f. Word counts and word clouds
g. Sentiment Analysis
h. Text classification
i. Distance measures
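Cleaning, tokenization, and word counts from the list above can be sketched with the Python standard library. The review text and the short stop-word list are assumptions for illustration, not a recommended vocabulary.

```python
import re
from collections import Counter

# Hypothetical review text (illustrative only)
text = "The course was great. Great mentors, great projects, and the pace was fine!"

# Cleaning and tokenization: lower-case the text and keep alphabetic tokens
tokens = re.findall(r"[a-z]+", text.lower())

# Pre-processing: drop a few common stop words (this short list is an assumption)
stop_words = {"the", "was", "and", "a", "an"}
words = [t for t in tokens if t not in stop_words]

# Word counts, the kind of input a word cloud is typically built from
counts = Counter(words)
print(counts.most_common(3))
```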
➢ PYTHON - PROGRAMMING
➢ Introduction to NumPy
• One-dimensional Array
• Two-dimensional Array
• Pre-defined functions (arange, reshape, zeros, ones, empty)
• Basic Matrix operations
• Scalar addition, subtraction, multiplication, division
• Matrix addition, subtraction, multiplication, division and transpose
• Slicing
• Indexing
• Looping
• Shape Manipulation
• Stacking
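A brief sketch of the NumPy topics above, assuming NumPy is installed and imported as np. The array values are arbitrary.

```python
import numpy as np

a = np.arange(6)               # one-dimensional array: 0, 1, ..., 5
m = a.reshape(2, 3)            # two-dimensional array with 2 rows and 3 columns
z = np.zeros((2, 3))           # array of zeros with the same shape

scaled = m * 10                # scalar multiplication applies element-wise
total = m + np.ones((2, 3))    # element-wise matrix addition
product = m @ m.T              # matrix multiplication with the transpose (2x3 times 3x2)

first_row = m[0]               # indexing: first row
last_two_cols = m[:, 1:]       # slicing: all rows, columns 1 onward

stacked = np.vstack([m, z])    # stacking: resulting shape is (4, 3)
print(product)
print(stacked.shape)
```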
➢ Introduction to Pandas
• Series
• DataFrame
• df.groupby
• pd.crosstab
• df.apply
• df.map
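The pandas topics above can be sketched as follows, assuming pandas is installed and imported as pd. The records and column names are hypothetical. Note that crosstab is called as a top-level pandas function, and map is shown here on a single column (a Series).

```python
import pandas as pd

# Hypothetical records (illustrative only)
df = pd.DataFrame({
    "city": ["Hyderabad", "Pune", "Hyderabad", "Pune"],
    "segment": ["A", "A", "B", "A"],
    "sales": [100, 150, 200, 50],
})

s = df["sales"]                                  # a single column is a Series

by_city = df.groupby("city")["sales"].sum()      # group rows and aggregate
table = pd.crosstab(df["city"], df["segment"])   # frequency table of two columns

# apply runs a function over each row (axis=1); map transforms a Series element-wise
df["label"] = df.apply(lambda row: f"{row['city']}-{row['segment']}", axis=1)
df["segment_name"] = df["segment"].map({"A": "Retail", "B": "Corporate"})

print(by_city)
print(table)
```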
What is Spark
Introduction to Spark RDD
Introduction to Spark SQL and Dataframes
Using R-Spark for machine learning
Hands-on:
Installation and configuration of Spark
Hands-on Spark RDD programming
Hands-on Spark SQL
Dataframe programming
Using R-Spark for machine learning programming
➢ R – PROGRAMMING
1. Getting R
1.1 Downloading R
1.2 R Version
1.3 32-bit versus 64-bit
1.4 Installing
2. The R Environment
2.1 Command Line Interface
2.2 RStudio
3. R Packages
3.1 Installing Packages
3.2 Loading Packages
6. Basics of R
6.1 Basic Math
6.2 Variables
6.3 Data Types
6.4 Vectors
6.5 Calling Functions
6.6 Function Documentation
6.7 Missing Data
7. Control Statements
7.1 if and else
7.2 switch
7.3 ifelse
8. Loops
8.1 for Loops
8.2 while Loops
8.3 Controlling Loops
9. Group Manipulation
9.1 Apply Family
9.2 aggregate
Course Highlights
✓ A Dedicated Portal For Practicing.
✓ Real-Time Project Data Models to Work On
✓ 1-1 Mentorship
✓ Internship Offers for Freshers.
✓ Weekly Assignments.
✓ Weekly Doubt Sessions
✓ Resume Preparation Tips
✓ Interview Guidance And Support.
✓ Dedicated HR Team for Job Support And Placement Assistance.