Statistics Hand Written Notes
Statistics:
Statistics is an area of applied mathematics concerned with data collection, analysis, interpretation and presentation.
Measure of Central Tendency: a statistical measurement that represents a summary of the centre of the distribution of data (a column). There are three measures:
1) mean
2) median
3) mode
Mean: the sum of all observations divided by the number of observations, represented as μ = (Σ obs_i) / n.
Median: the central value of the dataset, once it is sorted, is called the median.
Odd number of observations: the value at position (n + 1)/2 is the median.
Ex: [10, 20, 30, 40, 50] => (5 + 1)/2 = 3rd position value => median = 30
Even number of observations: the average of the two middle values is the median.
Ex: [10, 20, 30, 40, 50, 60] => (30 + 40)/2 = 35
Both are measures of central tendency.
Mean: gets impacted by the presence of outliers.
Median: does not get impacted by outliers.
Time complexity:
Mean: O(n), less, because one pass over the values is enough.
Median: O(n log n), more, because it needs the values sorted first.
Mode: the value which is repeated most often in a dataset, i.e. the most frequent element in the data.
Ex: [1, 2, 3, 4, 4, 3, 4, 2, 5, 7, 1] -> sort first
[1, 1, 2, 2, 3, 3, 4, 4, 4, 5, 7] => mode = 4 (unimodal)
If two values tie for the highest count the data is bimodal; if three values tie it is trimodal, and the mode is reported as the set of tied values.
Effect of outliers:
Mean: affected when outliers are present.
Median: not affected.
Mode: not affected.
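The effect of outliers on the three measures can be checked with a minimal sketch like the one below. It uses Python's standard `statistics` module; the value 1000 is a made-up outlier added only for illustration and is not part of the notes.

```python
from statistics import mean, median, multimode

data = [10, 20, 30, 40, 50]
with_outlier = data + [1000]  # hypothetical outlier, not from the notes

print(mean(data), median(data))                  # centre without the outlier
print(mean(with_outlier), median(with_outlier))  # mean jumps, median barely moves

# multimode returns every value tied for the highest count
marks = [1, 2, 3, 4, 4, 3, 4, 2, 5, 7, 1]
print(multimode(marks))            # unimodal data: one value
print(multimode([1, 1, 2, 2, 3]))  # bimodal data: two values
```

Using `multimode` instead of `mode` keeps the bimodal and trimodal cases visible, because it returns all tied values.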
Measure of Spread: it describes how similar or varied the values are for a particular variable.
We have four measures:
1) Range (Max - Min)
2) Interquartile range (IQR)
3) Variance
4) Standard deviation
Ex: [10, 20, 30, 40, 50] => [50 - 10] => 40 (Range)
In a dataset, if we analyse one column of data it is called univariate analysis; two columns is bivariate, and more than two columns is multivariate.
On a univariate variable we can perform:
1) Measures of central tendency: mean, median, mode
2) Measures of spread: range, IQR, variance, sd (σ)
Percentile: it indicates the percentage of scores that fall below a particular value. It tells where a score stands relative to other scores.
Ex: if we attend an exam of 100 marks, score 98/100 and are ranked 1st, then we are at the 100th percentile. It tells us that no one has marks above that value.
If anyone is at the 75th percentile, it means 75% of the data fall below that value and the remaining 25% is above it.
Quartile means the data is divided into 4 equal parts.
Quintile means the data is divided into 5 equal parts.
Decile means the data is divided into 10 equal parts.
[Figure: sorted data split into quartiles, with Q1 at 25%, Q2 = median at 50% and Q3 at 75%]
IQR: divide the ranked data into 4 equal parts; the cut points are denoted (Q1, Q2, Q3).
IQR = Q3 - Q1 = 75th percentile - 25th percentile
Simply, we can say the IQR is the middle 50% of the data.
When outliers are present, the IQR is not affected.
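As a rough illustration of quartiles and the IQR, the sketch below uses `statistics.quantiles`. The nine values and the outlier 900 are invented for the example, and the `"inclusive"` method is one of several quartile conventions, so other tools may return slightly different cut points.

```python
from statistics import quantiles

data = [1, 2, 3, 4, 5, 6, 7, 8, 9]  # illustrative values, not from the notes

# n=4 asks for the three cut points Q1, Q2 (median) and Q3
q1, q2, q3 = quantiles(data, n=4, method="inclusive")
iqr = q3 - q1
print(q1, q2, q3, iqr)

# Replace the largest value with an extreme one: the range explodes,
# but the quartiles (and so the IQR) stay where they were.
with_outlier = data[:-1] + [900]
o1, o2, o3 = quantiles(with_outlier, n=4, method="inclusive")
print(max(with_outlier) - min(with_outlier), o3 - o1)
```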
Variance: it is a measure of spread; it tells how far away the data points are from the mean of the dataset.
Ex: 10, 20, 30, 40, 50 => Mean = 30
Deviations from the mean: -20, -10, 0, 10, 20 points
Variance is represented as σ²:
σ² = Σ (obs_i - μ)² / N
Standard Deviation: it is the square root of the variance.
Denoted as σ = √Variance
Effect of outliers:
Range: affected when outliers are present.
IQR: not affected if an outlier is present.
Variance: affected by outliers.
Std: affected by outliers.
MAD (median absolute deviation): not affected.
Mean Deviation: it is the average distance between each data value and the mean.
Median Absolute Deviation: it shows how far the data points are from the median of the dataset.
MAD = median(|obs_i - median|)
Because it is built on medians, it ignores the pull of the outliers.
Glanderd Dewialen qr Nishibuteo) a
With all of the above we have covered univariate analysis, which refers to:
Measure of spread
Measure of central tendency
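The spread measures above can be sketched as follows. The helper `median_abs_deviation` is a name chosen here for illustration, and the value 500 is a made-up outlier; the variance uses the population formula (division by N) given in the notes.

```python
from statistics import median, pstdev, pvariance

def median_abs_deviation(values):
    """MAD = median of the absolute distances from the median."""
    m = median(values)
    return median([abs(v - m) for v in values])

data = [10, 20, 30, 40, 50]
outlier_data = [10, 20, 30, 40, 500]  # hypothetical outlier

for values in (data, outlier_data):
    print(
        max(values) - min(values),     # range
        pvariance(values),             # sigma squared, divides by N
        round(pstdev(values), 2),      # sigma
        median_abs_deviation(values),  # stays put when the outlier appears
    )
```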
M. of Relation:
1) Covariance
2) Correlation
For bivariate analysis we find covariance and correlation, the measures of relation.
Univariate means we deal with only one column, i.e. one variable, whereas bivariate means we deal with two variables, i.e. two columns.
Ex: if we have a dataset which has height and weight, we have two variables (x, y).
[Figure: scatter plots of weight against height; fig (a) positive covariance, fig (b) negative covariance]
In fig (a), when height increases weight also increases, so the covariance is positive.
In fig (b), when one variable increases the other decreases, so the covariance is negative.
Covariance tells the relation between two variables.
If we observe the variance in univariate analysis: σ² = Σ (x_i - x̄)(x_i - x̄) / N
Similarly, for bivariate analysis: Cov(x, y) = Σ (x_i - x̄)(y_i - ȳ) / N
which combines the difference between the mean of x and each data point in x with the difference between the mean of y and each point in y.
Correlation: it measures the degree to which two variables move in relation to each other.
It is also called the Pearson correlation coefficient and is denoted as ρ (rho).
Correlation: ρ_xy = Cov(x, y) / (σ_x × σ_y)
The correlation value lies in the range -1 ≤ ρ_xy ≤ +1.
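A small sketch of the two formulas, written from scratch so that each term is visible. The height and weight values are hypothetical, and both functions divide by N as in the notes (library routines often divide by n - 1 instead).

```python
from math import sqrt

def covariance(xs, ys):
    """Cov(x, y) = sum((x_i - mean_x) * (y_i - mean_y)) / N."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n

def correlation(xs, ys):
    """Pearson rho = Cov(x, y) / (sigma_x * sigma_y); Cov(x, x) is the variance."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))

heights = [150, 160, 170, 180, 190]  # hypothetical data
weights = [52, 57, 68, 72, 83]

print(covariance(heights, weights))   # positive: both rise together
print(correlation(heights, weights))  # close to +1, always within [-1, 1]
print(correlation(heights, [-h for h in heights]))  # perfect negative relation
```

Covariance changes if the units change (cm to m), while the correlation does not, which is why correlation is the one used to judge strength.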
It looks similar to covariance when we plot the two variables:
[Figure: fig (a) positive correlation, when height increases weight also increases; fig (b) negative correlation, when one increases the other decreases; fig (c) no relation, the correlation is zero]
Difference between covariance and correlation:
Covariance is nothing but a measurement of correlation; it indicates only the direction of the linear relationship between variables.
Correlation, on the other hand, measures both the strength and the direction of the linear relationship between two variables.
The correlation coefficient supports linear relationships.
If we observe figures (a) and (b), covariance tells us the direction but does not tell how strong the relation is, whereas correlation tells both.
Both tell us the direction of the linear relationship between two variables.
A correlation of (-1) indicates a perfect negative correlation, that is, as one variable increases the other one decreases. In this case all the data points lie on one line.
[Figure: scatter plot of y against x with all points on one falling straight line]
The Pearson correlation coefficient only supports linear relationships; when it gets a non-linear relationship it fails.
Ip. LQ cnly Lay = value in Older wee
Probability: it is a measure of uncertainty.
Random Experiment: it is a process for which the outcome cannot be predicted with certainty.
Ex: tossing a coin, rolling a die, picking an object
Sample Space: set of all the possible outcomes of a random experiment.
Ex: rolling a die: {1, 2, 3, 4, 5, 6}
Event: any subset of the sample space.
Ex: tossing two coins = random experiment
S = {HH, HT, TH, TT}
E1 = {HT, TH}: getting exactly one head => 2/4
E2 = {HH}: getting two heads => 1/4
E3 = {TT}: getting two tails => 1/4
Probability = Favorable outcomes / Total outcomes
In case of an event E: P(E) = size of event / size of sample space
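The counting definition can be sketched by listing the sample space explicitly. The helper name `probability` is chosen here for illustration; `Fraction` keeps the results exact.

```python
from fractions import Fraction
from itertools import product

# Sample space of tossing two coins: HH, HT, TH, TT
sample_space = ["".join(p) for p in product("HT", repeat=2)]

def probability(event):
    """P(E) = size of event / size of sample space."""
    favourable = [outcome for outcome in sample_space if event(outcome)]
    return Fraction(len(favourable), len(sample_space))

p_one_head = probability(lambda o: o.count("H") == 1)  # {HT, TH}
p_two_heads = probability(lambda o: o == "HH")         # {HH}
print(sample_space, p_one_head, p_two_heads)
```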
Rules (Axioms) of probability:
1) The probability of an event always lies between 0 and 1, because an event is a subset of the sample space and its value cannot go beyond the sample space: 0 ≤ P(E) ≤ 1.
2) The probability of the sample space is always 1: P(S) = 1.
3) For any sequence of events E1, E2, ..., En that are mutually exclusive (Ei ∩ Ej = ∅ for i ≠ j):
P(E1 ∪ E2 ∪ E3 ∪ ... ∪ En) = P(E1) + P(E2) + ... + P(En) = Σ P(Ei)
Ex: when tossing two coins,
E1 = both heads = {HH} => P(E1) = 1/4
E2 = {HT, TH} => P(E2) = 2/4
If we observe an event E3 that is the union of E1 and E2:
E3 = E1 ∪ E2 => P(E1 ∪ E2) = P(E1) + P(E2) - P(E1 ∩ E2)
Here E1 ∩ E2 = ∅, so P(E3) = 1/4 + 2/4 - 0 = 3/4
Ex: what is the probability of getting a prime number when rolling a die?
Sol: RE = rolling a die (getting a prime number = event)
SS = {1, 2, 3, 4, 5, 6}
Event = getting a prime number = {2, 3, 5}
P(E) = 3/6 = 1/2
If we want the probability of getting a prime number given that the roll is an even number:
Event B = getting an even number = {2, 4, 6}
Event A = getting a prime number = {2, 3, 5}
In these scenarios we use conditional probability:
P(A|B) = P(A ∩ B) / P(B)
If we take event B as getting an even number, P(B) = 3/6; likewise P(A) = 3/6.
A ∩ B = {2, 3, 5} ∩ {2, 4, 6} = {2}, so only 1 of all the possible outcomes is in both: P(A ∩ B) = 1/6
P(A|B) = conditional probability = P(getting prime | even number)
P(A|B) = (1/6) / (3/6) = 1/3
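The prime-given-even calculation can be reproduced with sets, as in the minimal sketch below. The function names `p` and `conditional` are illustrative choices.

```python
from fractions import Fraction

sample_space = {1, 2, 3, 4, 5, 6}  # rolling one die
prime = {2, 3, 5}                  # event A
even = {2, 4, 6}                   # event B

def p(event):
    """P(E) = size of event / size of sample space."""
    return Fraction(len(event & sample_space), len(sample_space))

def conditional(a, b):
    """P(A|B) = P(A and B) / P(B)."""
    return p(a & b) / p(b)

print(p(prime))                  # probability of a prime number
print(conditional(prime, even))  # probability of a prime given an even roll
```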
CRISP-DM Framework:
Cross Industry Standard Process for Data Mining
1) Business understanding: it means that when we start work on a project we first have to understand the business problem. Once that is clear, we are either given the data or we go to the database and try to fetch the data; if there is no data, then we have to collect it.
2) Data understanding: by EDA, analysis, visualization and statistics.
3) Data preparation: treat outliers and missing values, convert the data.
4) Machine learning modelling
5) Evaluation of the ML model
6) Deployment
Random Variable: it is a variable whose possible values are numerical outcomes of a random phenomenon.
Ex: tossing a coin
Random experiment: tossing a coin
Sample space: {H, T}
Random variable X: the number of heads: {H, T} -> {1, 0}
Computation: compute the probability of each value:
P(X = 0) = P({T}) = 1/2
P(X = 1) = P({H}) = 1/2
Plotting: plot the probability distribution function (p.d.f) of the random variable.
[Figure: bar plot of P(X) against X with two bars of height 1/2, at X = 0 and X = 1]
Ex 2: rolling a die
SS = {1, 2, 3, 4, 5, 6}
RV: getting a '6' or not
P(X = 0) = P({1, 2, 3, 4, 5}) = 5/6
P(X = 1) = P({6}) = 1/6
Plotting: there is a 1/6 chance of success.
Simply, we say a random variable is a function which takes the sample space as input and gives some output in the form of a number.
Classification of random variables:
Discrete random variable: Bernoulli, Binomial
Continuous random variable: Uniform, Normal
Distplot: distribution plot. It consists of three parts, including the pdf and the cdf. (The distplot function is to be discontinued in future, so we have to use its replacement.)
Discrete Random Variable: it has a countable number of possible values. The probability of each value of a discrete random variable is between 0 and 1, and the sum of the probabilities is equal to 1.
Bernoulli Random Variable: if a random variable can take only two values (a finite set of two outcomes), such as failure and success, it is called a Bernoulli random variable (BRV).
Ex: {0, 1}: here '0' represents failure and '1' represents success, so the variable contains only two values.
PMF: the probability mass function is used when we have discrete random variables. For a Bernoulli variable:
P(X = 1) = p (success)
P(X = 0) = 1 - p (failure)
Binomial Random Variable:
It is a collection of Bernoulli random variables (X), so it is called a binomial R.V.
Ex: if we toss a coin, each toss contains {H, T}, i.e. success or failure; the combination of successes and failures over several tosses is simply a binomial R.V.
1. RE: tossing two coins
2. SS: HH, HT, TH, TT
3. RV: counting the number of heads: HH -> 2, HT and TH -> 1, TT -> 0 => {2, 1, 0}
4. Compute the probability of the RV:
P(X = 0) = P(TT) = 1/4
P(X = 1) = P(HT, TH) = 2/4
P(X = 2) = P(HH) = 1/4
5. Plot the PDF: P({0, 1, 2}). It is not Bernoulli, because it is a collection of Bernoulli variables; it is simply called binomial.
PDF of a binomial random variable:
P(X = k) = nCk × p^k × (1 - p)^(n - k)
k = number of successes
p = probability of success
1 - p = probability of failure
n = number of trials
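The binomial formula can be checked against the two-coin example with a short sketch. `binomial_pmf` is an illustrative name; `math.comb` supplies nCk.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) = nCk * p^k * (1 - p)^(n - k)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Two tosses of a fair coin, X = number of heads
pmf = [binomial_pmf(k, n=2, p=0.5) for k in range(3)]
print(pmf)  # probabilities of 0, 1 and 2 heads

# With n = 1 the same formula reduces to the Bernoulli case
print(binomial_pmf(1, n=1, p=0.3), binomial_pmf(0, n=1, p=0.3))
```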
Continuous Random Variable: Uniform, Normal
A random variable whose set of possible values is infinite or uncountable is called a continuous random variable.
Uniform Random Variable:
The probability is uniformly distributed, i.e. the random variable falls with equal probability anywhere within a given interval.
[Figure: pdf of a uniform random variable X, flat between its lower and upper bound]
Normal Random Variable: X ~ N(μ, σ)
[Figure: bell curve centred on the mean, marked at 1, 2 and 3 std away from the mean on the left and right side]
Simply, we can say:
within 1 std of the mean lies 68% of the data of a variable,
within 2 std lies 95% of the data,
within 3 std lies 99.7% of the data.
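The 68 / 95 / 99.7 figures follow from the normal cdf. A minimal check with `statistics.NormalDist` (the function name `coverage` is an illustrative choice):

```python
from statistics import NormalDist

def coverage(k, mu=0.0, sigma=1.0):
    """Share of a normal variable lying within k std of the mean."""
    dist = NormalDist(mu, sigma)
    return dist.cdf(mu + k * sigma) - dist.cdf(mu - k * sigma)

for k in (1, 2, 3):
    print(k, round(coverage(k) * 100, 1))  # percent of data within k std
```

The result does not depend on the particular mean and standard deviation, only on how many standard deviations are taken.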
Q. What happens to the mean, median and mode when you have a positively or negatively skewed distribution?
[Figure: positively skewed distribution, probability of RV against RV, with the mode at the peak, then the median, then the mean pulled towards the long right tail by the impact of outliers]
Mode: the most occurring element.
When we observe salary, house prices or car prices, we get this type of distribution. According to my observation, the mean gets impacted by the presence of outliers, whereas the median and mode do not.
In some cases the mean and median are very close to each other.
Skewness: it refers to a distortion or asymmetry that deviates from the symmetrical bell curve, or normal distribution, in a set of data.
If the curve is shifted to the left or to the right, it is said to be skewed. Skewness can be quantified as a representation of the extent to which a given distribution varies from a normal distribution.
A normal distribution has a skew of zero.
If the skewness value lies between -0.5 and +0.5, the variable is roughly symmetric; if the skewness of a variable is less than -0.5 or more than +0.5, it is not symmetric.
If the skewness value is below -1.0 or above +1.0, it is called heavily negatively skewed or heavily positively skewed respectively.
Kurtosis: it is a statistical measure that defines how heavily the tails of a distribution differ from the tails of a normal distribution.
Kurtosis identifies whether the tails of a given distribution contain extreme values.
For a normal distribution the (excess) kurtosis value is 0. If the kurtosis value is positive the distribution looks peaked; if the value is negative it looks flat.
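Skewness and kurtosis can be sketched with central moments. This is the plain moment-based (population) version; libraries often apply small-sample corrections, so their numbers can differ slightly. The two datasets are invented for illustration.

```python
def central_moment(values, k):
    """Average of (x - mean)^k over the data."""
    n = len(values)
    m = sum(values) / n
    return sum((v - m) ** k for v in values) / n

def skewness(values):
    """Third standardized moment: 0 for symmetric data."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Fourth standardized moment minus 3, so a normal curve scores 0."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3

symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 2, 2, 3, 20]  # one large value stretches the right tail

print(skewness(symmetric), excess_kurtosis(symmetric))  # zero skew, flat (negative)
print(skewness(right_skewed))                           # above +1: heavily positive
```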
Standard Normal Distribution: a random variable that follows a normal distribution with a mean of zero and a standard deviation of 1 is called a standard normal variable, and its distribution is known as the standard normal distribution.
X ~ N(μ = 0, σ = 1)
Z-score: it is a measure of how many standard deviations away a point is from the mean: z = (x - μ) / σ
Summary so far:
Statistics
  Univariate
    M. of central tendency: mean, median, mode
    M. of spread: range, IQR
  Bivariate
    M. of relationships: cov, correlation
Probability
  Basics: RE, SS, event
  Axioms of probability
  Conditional probability
  Random variable
    Discrete: PMF -> prob. distribution
    Continuous: PDF -> plotting p.d.
To check whether a variable is normally distributed or not, we use a test called the QQ plot.
Quantile-Quantile (QQ) plot = test of normality
If we plot a graph of the two sets of quantiles and all the data points lie on the same line (the 45° line in the graph), then we say the distribution is a normal distribution. In case there are any discrepancies, like data points scattered away from the line, then we say the data is not normally distributed.
How to get a QQ plot:
1) We have to take the theoretical quantiles from a normal distribution.
2) The observed quantiles come from the given distribution (column).
3) Sort the values of the theoretical quantiles and the observed quantiles.
4) After that, draw a scatterplot of the theoretical quantiles against the observed quantiles (column). If all the data points lie on a straight line then the column is normally distributed, otherwise not.
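The QQ steps above can be sketched without plotting. Two assumptions to note: the theoretical quantiles are taken exactly from `NormalDist().inv_cdf` rather than from random normal draws, and the "points on a line" check is replaced by a correlation score (`straightness`, a name invented here), which is only a rough stand-in for looking at the plot.

```python
from math import sqrt
from statistics import NormalDist, mean, pstdev

def qq_points(values):
    """Pairs of (theoretical normal quantile, observed z-score), both sorted."""
    n = len(values)
    mu, sigma = mean(values), pstdev(values)
    observed = sorted((v - mu) / sigma for v in values)  # z-scores of the column
    theoretical = [NormalDist().inv_cdf((i + 0.5) / n) for i in range(n)]
    return list(zip(theoretical, observed))

def straightness(points):
    """Pearson correlation of the QQ points; near 1 means close to a line."""
    xs, ys = zip(*points)
    mx, my = mean(xs), mean(ys)
    num = sum((x - mx) * (y - my) for x, y in points)
    den = sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return num / den

# Hypothetical columns: one bell shaped, one with a long right tail
normal_like = [NormalDist(50, 10).inv_cdf((i + 0.5) / 40) for i in range(40)]
skewed = [2**i for i in range(12)]

print(straightness(qq_points(normal_like)))  # essentially 1
print(straightness(qq_points(skewed)))       # clearly lower
```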
The theoretical quantiles always follow a normal distribution, because we generate them with np.random.normal(loc=0, scale=1).
Log-normal distribution and Pareto distribution: they contain a large number of data points in a long tail.
If we draw the salary, house price, car price etc. columns, we get this type of distribution.
If we observe these two distributions, they contain more outliers. The count of outliers is high, so it is not possible to delete them: a lot of data would be lost (i.e. we would predict a wrong analysis).
Box-Cox Transformation: it is a transformation technique that turns a non-normal dependent variable (like log-normal or Pareto data) into a normal shape. Normality is an important assumption for many statistical techniques.
[Figure: a skewed distribution with outliers on the left, transformed by Box-Cox into a normal distribution on the right]
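The notes do not write out the transform itself. The usual textbook form is y = (x^λ - 1) / λ for λ ≠ 0 and y = ln(x) for λ = 0, for positive x; the sketch below assumes that form. The data and the choice λ = 0 are illustrative only; in practice λ is estimated from the data (for example by `scipy.stats.boxcox`).

```python
from math import log

def box_cox(values, lam):
    """Box-Cox transform; defined only for strictly positive values."""
    if any(v <= 0 for v in values):
        raise ValueError("Box-Cox needs strictly positive values")
    if lam == 0:
        return [log(v) for v in values]
    return [(v**lam - 1) / lam for v in values]

def skewness(values):
    """Moment-based skewness, used here only to compare before and after."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2**1.5

prices = [1, 2, 4, 8, 16, 32, 64]   # hypothetical long-tailed column
transformed = box_cox(prices, lam=0)

print(skewness(prices))       # strongly positive before the transform
print(skewness(transformed))  # about zero afterwards
```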
Summary:
Some more distributions: log-normal, Pareto
QQ plot: used for the test of normality
Box-Cox: transformation technique; it converts log-normal and Pareto data to a normal distribution and treats the outliers
[Figure: QQ plot of theoretical quantiles against observed quantiles, and a Pareto distribution transformed by Box-Cox into a normal distribution as treatment of outliers]
Inferential Statistics: Estimating or Claiming
It deals with inferences and predictions about a population, based on a sample of data taken from the population. By using point estimation, we can determine inferential statistics.
Point Estimation: if we want to find the mean (μ) of a population, firstly we draw a sample and then find the sample mean (x̄); using x̄ as the value of the population mean (μ ≈ x̄) is called a point estimate.
Population: it is a set of similar items or events.
Sample: it is a set of uniformly, randomly selected items from the population.
The population mean is denoted by 'μ' and the sample mean by 'x̄'.
Population mean: μ = Σ obs_i / N
Sample mean: x̄ = Σ obs_i / n
Sample variance: s² = Σ (obs_i - x̄)² / (n - 1)
Note: we divide by (n - 1) because of bias (to remove the bias).
We have different types of sampling techniques, such as convenience sampling, voluntary sampling and uniform random sampling. Convenience and voluntary samples cannot give you the perfect value of the mean (i.e. sample mean ≠ population mean). Uniform random sampling gives you a close, approximate mean value, but it also has some bias; for that we use (n - 1) in the denominator of the sample variance.
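The difference between dividing by N and by (n - 1) can be seen directly. In the `statistics` module, `pvariance` uses the population formula and `variance` uses the sample formula; `sample_variance` below is a hand-written version of the same thing, named here for illustration.

```python
from statistics import mean, pvariance, variance

def sample_variance(values):
    """s^2 = sum((obs_i - x_bar)^2) / (n - 1)."""
    x_bar = mean(values)
    return sum((v - x_bar) ** 2 for v in values) / (len(values) - 1)

sample = [10, 20, 30, 40, 50]

print(pvariance(sample))        # divides by N
print(variance(sample))         # divides by n - 1, slightly larger
print(sample_variance(sample))  # matches statistics.variance
```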
Central Limit Theorem: if we have a large population and we divide it into many samples, then the mean of all the sample means from the population will be almost equal to the mean of the entire population, and the sample means are normally distributed.
μ_x̄ ≈ μ_pop
μ_pop: mean of the population
μ_x̄: mean of the sample means
If we have a population and we select samples uniformly and randomly from it, then whatever the population distribution is, we get a sampling distribution (X̄) which is a normal distribution, X̄ ~ N(μ, σ²/n), according to the central limit theorem.
[Figure: a population distribution on the left and the sampling distribution of the mean, a normal distribution, on the right]
CLT:
1) Sampling distribution mean: μ_x̄ = μ_pop (population mean)
2) Sampling distribution variance: σ²_x̄ = σ²/n, where n = size of each sample and σ² = population variance
3) σ_x̄ = σ/√n, also called the standard error
4) Sampling distribution: X̄ ~ N(μ_pop, σ²/n)
After building the sampling distribution, the mean of the sample means equals the mean of the population. But if we consider only one sample from the sampling distribution, that sample can be biased (a convenience or voluntary sample) and so can its point estimate; that is why we have to go for a confidence interval.
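A small simulation can make the theorem concrete. Everything below is an illustrative assumption: the population is 10,000 draws from an exponential (clearly non-normal) distribution, samples have size 30, and the seed is fixed only so that the run is repeatable.

```python
import random
from statistics import fmean, pstdev

rng = random.Random(0)  # fixed seed for a repeatable sketch

# A skewed, non-normal population
population = [rng.expovariate(1.0) for _ in range(10_000)]

n = 30  # size of each sample
sample_means = [fmean(rng.choices(population, k=n)) for _ in range(2_000)]

standard_error = pstdev(population) / n**0.5  # sigma / sqrt(n)

print(fmean(population), fmean(sample_means))  # the two means nearly agree
print(standard_error, pstdev(sample_means))    # spread of the means is sigma / sqrt(n)
```

A histogram of `sample_means` would look bell shaped even though the population itself is heavily skewed.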
Point estimate: x̄ ≈ μ, but the sample can contain bias.
Confidence interval: μ = (x̄ ± something) with y% confidence.
[Figure: normal distribution with the point estimate at the centre and the confidence interval marked as a range of the mean around it]
The interval represents how many standard deviations away from the mean the range extends.
If the standard deviation of the population is given:
μ = (x̄ ± z_(α/2) × σ/√n) with y% of confidence
If we find the z-score value, we can easily represent how much confidence we have: if we go ±1 std from the mean, it represents 68% of confidence.
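The z-based interval above can be sketched as follows. The inputs (sample mean 50, population std 10, n = 100) are hypothetical numbers chosen for the example, and `z_critical` and `z_confidence_interval` are illustrative names.

```python
from math import sqrt
from statistics import NormalDist

def z_critical(confidence):
    """z value that leaves (1 - confidence) / 2 in each tail."""
    alpha = 1 - confidence
    return NormalDist().inv_cdf(1 - alpha / 2)

def z_confidence_interval(sample_mean, pop_std, n, confidence=0.95):
    """x_bar +/- z * sigma / sqrt(n), valid when the population std is known."""
    margin = z_critical(confidence) * pop_std / sqrt(n)
    return sample_mean - margin, sample_mean + margin

low, high = z_confidence_interval(sample_mean=50, pop_std=10, n=100)
print(round(low, 2), round(high, 2))

print(round(z_critical(0.68), 2))  # about 1 std, matching the 68% remark
```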
Ex: consider one college where students got placed, and the salary packages (in LPA) of some of the students are listed as follows: (3.5, 6, 2.5, 3.5, 3.7).
If we calculate the mean of it, x̄ = 3.84.
In this case we do not know the salary package of every student who got a job, so whatever salaries we have taken are convenience samples. So x̄ = 3.84 is a point estimate and is biased, and we need to go for a confidence interval: (x̄ ± something) with y% confidence. Here we do not know the value of σ_pop, because we only have the sample values.
So in this case we have to calculate s² = sample variance. We have the mean of the sample, so we can easily calculate s² = Σ (obs_i - x̄)² / (n - 1); if the sample changes, s also changes.
If we do not know the value of σ_pop, we use the sample standard deviation (s), and instead of the z-score we use the t-score:
μ = (x̄ ± t_(α/2) × s/√n)
x̄: sample mean, s: sample standard deviation, n: sample size
So here the confidence can be represented through the significance level:
confidence level = 100% - significance level, e.g. 100% - 10% = 90%
10% is the significance level (α).
In the t-table we look up α/2, because the significance level is split over the two tails: α/2 = 10%/2 = 5% = 0.05
DOF (degrees of freedom) = sample size - 1 = 5 - 1 = 4
We find the t-value from the t-table using the DOF and α/2: t_(0.05, 4) = 2.132
Putting in all the values gives (3.84 ± t × s/√n) with 90% confidence.
Salary can never lie in the negative range, so the lower bound is put as 0.
The range will be [0, 8.57] with 90% confidence.
The standard normal distribution (μ = 0, σ = 1) is called the z-distribution.
From the t-score values we plot the t-distribution. It is similar to the normal distribution.
[Figure: z-distribution (normal distribution) and t-distribution drawn on the same axis, both centred at μ = 0]
i) The z-distribution is taller than the t-distribution.
ii) Both distributions lie on the same mean, i.e. μ = 0 (σ = 1 for the normal distribution).
iii) For both distributions the values lie within about 3 std of the mean; for the normal distribution that range contains 99.7% of the data.
iv) If we observe both, the tail parts of the t-distribution are longer than those of the normal distribution. Because of this there is more chance of points lying far away from the mean, and the area under the centre of the t-distribution is less compared to the z-distribution, because of the long tails.
For the above-mentioned reason we get an error in the std; it is called the standard error.
By the standard error we get the confidence interval between two values; in the above example this gave a range from 0 up to about 8.
Hypothesis Testing:
H0: Null hypothesis = ground truth => uses =, ≥, ≤
H1: Alternate hypothesis = bold claim => uses ≠, >, <; it is the opposite of H0.
Ex: Osmania College announces that the average package of its 2020 batch is at least 10 LPA.
It is a bold claim. In this case their statement is considered as H1.
They said "at least 10 LPA", so the package may be above 10 LPA or equal to 10 LPA, which we would formulate as "greater than or equal". In the alternate hypothesis we do not use that operator, so we have to use the symbol >:
H1: average > 10 LPA
Now we have to take the ground truth (i.e. H0), which is the opposite of the alternate hypothesis (H1):
H0: average ≤ 10 LPA
Step 1: formulate H0 and H1.
Step 2: take samples (x = 3.5, 6, 2.5, 3.5, 3.7) and compute the mean of the sample, x̄ = 3.84.
Step 3: we have to apply the statistical operations. In that observation our bold claim (H1) is not close to the sample value, so in the final step we conclude that the hypothesis is rejected, i.e. H1 is rejected. If H1 is rejected then H0 is accepted; in statistical terminology, instead of "accept" we say we "fail to reject the null hypothesis (H0)".
Ex 2: if we place orders on Zomato and we say that the packet of chicken biryani does not contain 500 g, then in this scenario our claim is the alternative hypothesis, H1: μ ≠ 500 (H0, the ground truth, is μ = 500).
Then we order some samples of biryani from Zomato, weigh what they contain, and calculate the mean of the sample. Consider two cases for the sample mean:
i) x̄1
ii) x̄2
Here H1: μ ≠ 500; H0: μ = 500.
But if we observe the two cases (x̄1, x̄2), x̄1 is much nearer to 500 as compared to x̄2, so simply we can say, according to our hypothesis:
x̄1 => fail to reject H0
x̄2 => reject H0 (accept H1)
When H1 contains ≠, at a given significance level this is going to be a two-tail test; if H1 contains > then do a right-tail test; if H1 contains < then do a left-tail test.
[Figure: rejection regions for significance level α = 10%. If H1 contains ≠: two-tail test, α/2 = 5% in each tail, both ends are rejection regions for H0 with the acceptance region for H0 in between. If H1 contains >: one-tail test, a single 10% rejection region for H0 in the right tail]
Significance level (α): it measures the strength of the evidence that must be present in the sample before we reject the null hypothesis; simply, it is the rejection region of a parameter.
Confidence interval: it estimates the range of an unknown parameter with some confidence level.
Confidence level: it is the value which shows the probability that the interval contains the estimated location of the parameter.
Critical value: found from the degrees of freedom and the significance level. It is a value which represents the lower bound and upper bound of the confidence interval.
Degrees of freedom: (n - 1). It represents the maximum number of logically independent values.
Steps in hypothesis testing:
1) Alternate hypothesis H1 => >, <, ≠ (bold claim)
2) Null hypothesis H0 => ≤, ≥, = (ground truth)
Take a sample of size n and compute the mean x̄ from the sample.
3) Compute the test statistic:
If the population variance is known, use the z-score: z = (x̄ - μ) / (σ/√n), where x̄ = mean of the sample, μ = actual mean (H0), σ = std of the population, n = sample size.
If the population variance is unknown, use the t-score: t = (x̄ - μ) / (s/√n), where s = std of the sample.
4) Decide the significance level (α): it means how much evidence you need to reject the null hypothesis; the significance level tells us the rejection region for the null hypothesis (H0).
Hypothesis testing: it is an act in statistics where an analyst tests an assumption regarding a population parameter. It is used to assess the plausibility of a hypothesis by using sample data.
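The steps can be sketched for the z-score case (population std known). All numbers in the usage lines are hypothetical: H0 says the mean is 500, the population std is assumed to be 20, and 25 packets are sampled. The function name and the `tail` argument are illustrative choices.

```python
from math import sqrt
from statistics import NormalDist

def one_sample_z_test(sample_mean, mu0, pop_std, n, alpha=0.05, tail="two"):
    """Return (z, critical value, reject H0?) for a one-sample z-test."""
    z = (sample_mean - mu0) / (pop_std / sqrt(n))
    dist = NormalDist()
    if tail == "two":      # H1 contains "not equal"
        critical = dist.inv_cdf(1 - alpha / 2)
        reject = abs(z) > critical
    elif tail == "right":  # H1 contains ">"
        critical = dist.inv_cdf(1 - alpha)
        reject = z > critical
    elif tail == "left":   # H1 contains "<"
        critical = dist.inv_cdf(alpha)
        reject = z < critical
    else:
        raise ValueError("tail must be 'two', 'right' or 'left'")
    return z, critical, reject

# Hypothetical sample means: one close to 500, one far from it
print(one_sample_z_test(498, mu0=500, pop_std=20, n=25))  # fail to reject H0
print(one_sample_z_test(450, mu0=500, pop_std=20, n=25))  # reject H0
```

When the population std is unknown, the same structure applies with s in place of σ and a t critical value (looked up with n - 1 degrees of freedom) in place of the z critical value.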
Difference between parametric and non-parametric tests:
Parametric tests are based on assumptions about the distribution of the population from which the sample was taken.
Non-parametric tests are not based on such assumptions: the data can be collected from a sample that does not follow a specific distribution.
A parametric test assumes a normal distribution of values, i.e. a bell-shaped curve, whereas non-parametric tests are used in cases where parametric tests are not appropriate.
The t-test, z-test, F-test and ANOVA tests are parametric tests.
The chi-square test is a non-parametric test, also called a distribution-free test; it is used when the measurement of the variable is nominal or ordinal.
T-test:
- The t-test is based on the t-distribution.
- The t-test is used when the sample size is small (n < 30), and we have to calculate the degrees of freedom (n - 1).
- The t-test is used for testing the significance of a regression coefficient.
- The population variance is unknown.
- If we test that the correlation coefficient of the population is zero, then we use the t-test.
— ef we Considuy ove Continous Varalele- Gord fw woos sone pournednit
Wan we Ue Fe fesk
Z-test:
- If the population correlation coefficient is not zero then we use the z-test; for this the sample size is large (n > 30).
- The z-test is used to determine whether 2 population means are different when the population variance is known.
- The z-test is based on the standard normal distribution, and it is also called a large-sample test.
F-test: (variance ratio test)
- The F-test is also called the ANOVA test (one type of it).
- The F-test is used to test two independent estimations of population variance.
- It tests whether two samples have the same variance.
- The F-test is a small-sample test.
F = larger sample variance / smaller sample variance = s1² / s2²
- Degrees of freedom for the larger variance: v1 = n1 - 1; for the smaller variance: v2 = n2 - 1.
- The null hypothesis of the F-test is that the population variances are equal: H0: σ1² = σ2².
- The F-test is computed as the ratio of two variances.
- The samples must be independent.
- The F value can never be negative, because both variances are non-negative; and since the upper value is always the greater one (s1² (larger) / s2² (smaller)), F is at least 1.
Be rs AAR
T-Tect Bo Tete F-Test
© +e Sort Sumpte. Jarge Sampla Spelt Semple
© poputaten Coeffeous — pepulatay Corelabeg “THY tndipend
° Est Mon
a Coefficenrne 7s POE t
Cevrecdatnm Ts Zero i pep '
@® voviewe fs Cokmew — Variance know) Same Varies
. all.
® My mustiple regression with °3' Cibree) “Testing fer Ove
Findiv PAua ea nilicedne :
Non-parametric Test (Chi-square):
- Chi-square is denoted as χ². It is a sampling analysis for testing the significance of a population variance.
- As a non-parametric test, it can be used as a test of goodness of fit and as a test of whether variables are dependent or independent; it is preferred for the test of independence.
- It uses the simple random sampling method.
- The chi-square test value lies between 0 and infinity. Its distribution appears like a log-normal distribution, and it is mostly used as an independence test between variables.
Chi-square Test (χ²):
It is a hypothesis testing method used to compare observed results with expected results. The purpose of this test is to determine whether a difference between observed data and expected data is due to chance, or whether it is due to a relationship between the variables which we are studying.
χ² = Σ (O_i - E_i)² / E_i
χ² = chi-square
O_i = observed value
E_i = expected value
When we plot the χ² values from a test of the relationship between variables (whether dependent or independent), it mostly looks like a log-normal distribution.
[Figure: chi-square distribution with the critical value on the x axis and the rejection region in the right tail; different types of plot of the chi-square distribution; the test is right tailed]
Conditions for the chi-square test (χ²):
1) The total frequency (sample size) is large.
2) The samples are independent.
3) The cell frequencies are linear.
Steps (test of independence):
Step 1: decide the null hypothesis (H0), the ground truth, and the alternate hypothesis (H1), the bold claim.
Step 2: sampling: collect a sample of size (n).
Step 3: test statistic: χ² = Σ (O_i - E_i)² / E_i
Step 4: decide the significance level (α); by this, together with the degrees of freedom, we compute the χ² critical value.
Step 5: decision rule: plot the distribution with the calculated value. If χ²_test > χ²_critical => reject the null hypothesis; if χ²_test < χ²_critical => fail to reject it.
P-value test (p = 1 - cdf, where cdf is the cumulative distribution function; the cdf is another method to describe the distribution of random variables):
if p-value < α => reject the null hypothesis (H0), or accept the alternate hypothesis.
The p-value test shows the strength of the evidence for rejecting the null hypothesis.
Note: if we have two categorical features and are trying to come to a conclusion about them, then we use the chi-square (χ²) test.
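The test statistic for a test of independence can be sketched as below. Two things are assumptions added here rather than taken from the notes: the expected count of a cell is computed as (row total × column total) / grand total, and the degrees of freedom as (rows - 1) × (columns - 1), which are the usual textbook rules. The 2×2 table is invented, and 3.841 is the standard table value for α = 0.05 with 1 degree of freedom.

```python
def chi_square_independence(table):
    """Return (chi-square statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Hypothetical counts for two categorical features with two levels each
observed_table = [[10, 20],
                  [20, 10]]

statistic, dof = chi_square_independence(observed_table)
critical = 3.841  # chi-square table value for alpha = 0.05, dof = 1
print(statistic, dof, statistic > critical)  # True means: reject H0 (independence)
```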
CRISP-DM Framework
[Figure: CRISP-DM cycle]
1) Business understanding
2) Data exploration / understanding
   - basic statistics
   - univariate, bivariate and multivariate analysis
   - complete EDA analysis
3) Data preparation
   - normalization / standardization
   - handle missing values
   - outlier treatment
   - split data into train data and test data
4) Model building (machine learning)
5) Evaluation
6) Deployment