Data Science Imp Questions and Answers

1. Joint probability is the probability that two random variables take on specific values at the same time. It is represented by P(X=x, Y=y) and describes the bivariate probability distribution between two random variables.
2. A probability density function (PDF) describes the relative likelihood of a continuous random variable taking on a given value, while a cumulative distribution function (CDF) describes the probability that a random variable is less than or equal to a particular value.
3. Expected value is the average or mean (μ) of a random variable and represents the value we expect the variable to take on average over many trials. Variance measures how far values of a random variable are from the expected

Uploaded by

Raghu Nandan Lal Garikipati

Data Science imp questions and answers

CO-3

1) What is joint probability, with examples?

A) The joint probability mass function of the discrete random variables X and Y,
denoted f_XY(x, y), satisfies:
* f_XY(x, y) ≥ 0
* Σ_x Σ_y f_XY(x, y) = 1
* f_XY(x, y) = P(X = x, Y = y)
The joint probability distribution of two random variables is also called a
bivariate probability distribution.
The joint probability distribution of two discrete random variables is usually
written as P(X = x, Y = y).
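
As a small illustration of the three properties above, the sketch below builds a joint PMF for a made-up example that is not part of the original notes: two fair coin flips, with X the result of the first flip and Y the total number of heads. The variable names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Hypothetical example (not from the notes): flip two fair coins.
# X = result of the first flip (1 = heads, 0 = tails)
# Y = total number of heads across both flips
joint_pmf = {}
for first, second in product([0, 1], repeat=2):
    key = (first, first + second)
    # Each of the four outcomes has probability 1/4.
    joint_pmf[key] = joint_pmf.get(key, Fraction(0)) + Fraction(1, 4)

# Property 1: every joint probability is non-negative.
all_non_negative = all(p >= 0 for p in joint_pmf.values())

# Property 2: the probabilities sum to 1 over all (x, y) pairs.
total = sum(joint_pmf.values())

# Property 3: f_XY(x, y) is P(X = x, Y = y); here P(X = 1, Y = 2).
p_x1_y2 = joint_pmf.get((1, 2), Fraction(0))

print(all_non_negative, total, p_x1_y2)
```

Pairs that cannot occur, such as (X = 0, Y = 2), are simply absent from the table and have probability 0.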

2) What are PDF and CDF?

A) PDF (Probability Density Function):
• Example: Suppose X is a continuous random variable that gives the bus travel
time from Bangalore to Hyderabad, which is between 10 and 15 hours.
1. What is the probability that the bus arrives the next day in exactly 13 hours?
0. For a continuous random variable the probability of any single exact value is
zero; probabilities are defined only over ranges of values.
2. What is the probability that the bus arrives the next day in 11 to 13 hours?
This is the area under the PDF between 11 and 13 hours.
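
The answer to the 11 to 13 hour question depends on the shape of the PDF, which the notes do not specify. The sketch below assumes, purely for illustration, that the travel time is uniformly distributed between 10 and 15 hours; under that assumption the probability is the width of the interval divided by the width of the whole range.

```python
# Assumption (not from the notes): travel time is uniform on [10, 15] hours.
LOW, HIGH = 10.0, 15.0


def uniform_pdf(x):
    """Density of the assumed uniform distribution at x."""
    return 1.0 / (HIGH - LOW) if LOW <= x <= HIGH else 0.0


def prob_between(a, b):
    """P(a <= X <= b): area under the flat density, clipped to [LOW, HIGH]."""
    a, b = max(a, LOW), min(b, HIGH)
    return max(b - a, 0.0) / (HIGH - LOW)


p_exactly_13 = prob_between(13, 13)  # a single point has zero width
p_11_to_13 = prob_between(11, 13)    # 2 hours out of a 5-hour range

print(p_exactly_13, p_11_to_13)
```

With a different density (for example, one peaked around 12 hours) the second probability would change, but the probability of exactly 13 hours would still be zero.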
CDF (Cumulative Distribution Function):
We have seen how to describe distributions for discrete and continuous random
variables separately. The CDF works for both:
it describes the distribution of a random variable whether it is continuous or
discrete. It gives the fraction of values that are less than or equal to a
particular value.
For example: take the age variable from the Haberman dataset. Writing
P(age ≤ 50) = 0.60 means that 60% of the patients in the dataset are aged 50 or
younger.
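
A minimal sketch of an empirical CDF follows. The ages are made up for illustration and are not taken from the Haberman dataset; the function name `empirical_cdf` is likewise hypothetical.

```python
# Hypothetical ages, invented for illustration (not the Haberman data).
ages = [30, 34, 38, 42, 45, 47, 52, 58, 61, 70]


def empirical_cdf(values, x):
    """Fraction of observations that are less than or equal to x."""
    return sum(1 for v in values if v <= x) / len(values)


cdf_at_50 = empirical_cdf(ages, 50)  # share of patients aged 50 or younger

print(cdf_at_50)
```

The CDF is 0 below the smallest observation, 1 at or above the largest, and never decreases in between.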
3) Functions of a random variable?

4) Expected value of a random variable?

• The expected value is the average or mean (µ) of a random variable X.
• It's sometimes called a "weighted average" because more frequent values
of X are weighted more highly in the average.
• It's also how we expect X to behave on average over the long run
(the "frequentist" view again).
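
The "weighted average" idea can be sketched for a discrete variable as E[X] = Σ x · P(X = x). The fair six-sided die used here is an illustrative assumption, not an example from the notes.

```python
from fractions import Fraction

# Hypothetical example: a fair six-sided die, each face with probability 1/6.
pmf = {face: Fraction(1, 6) for face in range(1, 7)}

# E[X] = sum of x * P(X = x): each value is weighted by its probability.
expected_value = sum(x * p for x, p in pmf.items())

print(expected_value)
```

The result, 7/2, is not a face of the die; the expected value describes the long-run average, not a value that must actually occur.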
5) What is variance, and the sum of variances, of a random variable?

• The variance of a random variable X is a measure of how spread
out it is. Are the values of X clustered tightly around their mean, or
can we commonly observe values of X a long way from the mean
value?
• The variance measures how far the values of X are from their mean,
on average; formally it is the expected squared deviation,
Var(X) = E[(X − µ)²].
• If X has high variance, we can observe values of X a long way from
the mean.
• If X has low variance, the values of X tend to be clustered tightly
around the mean value.
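
The sketch below computes a variance from its definition, Var(X) = E[(X − µ)²], and also illustrates the "sum of variances" part of the question. The notes do not spell that part out, so the example relies on a standard fact stated here as an addition: for independent variables, Var(X + Y) = Var(X) + Var(Y). The fair dice are a hypothetical example.

```python
from fractions import Fraction
from itertools import product


def mean(pmf):
    """E[X] for a discrete PMF given as {value: probability}."""
    return sum(x * p for x, p in pmf.items())


def variance(pmf):
    """Var(X) = E[(X - mu)^2]."""
    mu = mean(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())


# Hypothetical example: one fair six-sided die.
die = {face: Fraction(1, 6) for face in range(1, 7)}

# PMF of the sum of two independent dice.
two_dice = {}
for (a, pa), (b, pb) in product(die.items(), repeat=2):
    two_dice[a + b] = two_dice.get(a + b, Fraction(0)) + pa * pb

var_one = variance(die)       # variance of a single die
var_sum = variance(two_dice)  # variance of the sum of two dice

print(var_one, var_sum)
```

Here the variance of the sum is exactly twice the variance of one die, as the independence rule predicts. If the two variables were not independent, a covariance term would have to be added.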
6) Properties of covariance?
7) What is covariance?
• Covariance signifies the direction of the linear relationship between the
two variables. By direction we mean whether the variables are directly
proportional or inversely proportional to each other (increasing the value
of one variable might have a positive or a negative impact on the value of
the other variable).
• The value of covariance can be any real number, from −∞ to +∞. Also, it's
important to mention that covariance only measures how two variables
change together, not the dependency of one variable on another.
• The covariance between two variables is obtained by summing the products
of the differences from the means of the variables and dividing by n − 1
(the sample form):

Cov(X, Y) = Σ (Xᵢ − x̅)(Yᵢ − ȳ) / (n − 1)

• Xᵢ = Observation point of variable X
• x̅ = Mean of all observations (X)
• Yᵢ = Observation point of variable Y
• ȳ = Mean of all observations (Y)
• n = Number of observations
Example:
• The following data show the number of customers and the corresponding
temperature.
• Mean of X, x̅ = (97+86+89+84+94+74)/6 = 524/6 = 87.333
• Mean of Y, ȳ = (14+11+9+9+15+7)/6 = 65/6 = 10.833
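
The covariance for this example can be sketched as follows. The X and Y values are read off the two mean calculations above, and dividing by n − 1 (the sample covariance) is an assumption chosen to match the divisions by 5 used later in the notes.

```python
# Observations taken from the mean calculations in the notes.
x = [97, 86, 89, 84, 94, 74]
y = [14, 11, 9, 9, 15, 7]

n = len(x)
mean_x = sum(x) / n  # 87.333...
mean_y = sum(y) / n  # 10.833...

# Sum of products of deviations from the means.
sum_of_products = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

# Sample covariance: divide by n - 1 (assumed, to match the notes).
cov_xy = sum_of_products / (n - 1)

print(round(mean_x, 3), round(mean_y, 3), round(cov_xy, 2))
```

The positive sign indicates that X and Y tend to increase together in this data.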

8) What is correlation, with examples?

• Correlation analysis is a method of statistical evaluation used to study the
strength of a relationship between two numerically measured,
continuous variables.
• It shows not only the kind of relation (in terms of direction) but also how
strong the relationship is. Correlation values are standardized and range
from -1 to +1, whereas covariance values are not standardized
and cannot be used to compare how strong or weak a relationship is,
because their magnitude has no direct significance.
• To determine whether the covariance of the two variables is large or
small, we need to assess it relative to the standard deviations of the two
variables: correlation = Cov(X, Y) / (σx · σy).
• For example: sales might increase if a lot of money is spent on product
marketing.
• Why is it useful?
• 1. If two variables are closely correlated, then we can predict one variable
from the other.
• 2. Correlation plays a vital role in locating the important variables on
which other variables depend.
• 3. It's used as the foundation for various modeling techniques.
• 4. Proper correlation analysis leads to a better understanding of the data.
• 5. Correlation contributes towards the understanding of causal
relationships (if any).
• COV(x, y) = 22.46
• σx = √(331.28/5) = √66.26 ≈ 8.14
• σy = √(48.78/5) = √9.76 ≈ 3.12
• correlation = 22.46/(8.14 × 3.12) = 22.46/25.40 ≈ 0.88
• 0.88 shows that the correlation between temperature and
number of customers is strong and positive.
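
The same calculation can be checked with a short sketch. The data are the X and Y values from the covariance example, and the n − 1 divisor is assumed throughout to match the divisions by 5 above. Working from unrounded means gives sums of squares of about 331.33 and 48.83, slightly different from the rounded figures in the notes.

```python
import math

# Observations taken from the covariance example in the notes.
x = [97, 86, 89, 84, 94, 74]
y = [14, 11, 9, 9, 15, 7]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Sample covariance and sample standard deviations (divide by n - 1).
cov_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / (n - 1)
std_x = math.sqrt(sum((xi - mean_x) ** 2 for xi in x) / (n - 1))
std_y = math.sqrt(sum((yi - mean_y) ** 2 for yi in y) / (n - 1))

# Pearson correlation: covariance scaled by both standard deviations.
correlation = cov_xy / (std_x * std_y)

print(round(std_x, 2), round(std_y, 2), round(correlation, 2))
```

Because the covariance is divided by both standard deviations, the result has no units and always lies between -1 and +1.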

CO-4
