Sem-6_Statistical-Analysis_26089
Date: __ - __ - 2025
(Arvind Kumar Patel)
Practical_01
Aim: Generate sequences of N random numbers, M (at least 10000) times, from
different distributions (e.g. Binomial, Poisson, Normal). Compute the arithmetic mean of each
random vector (of size N) and plot the distribution of the arithmetic means. Verify the
Central Limit Theorem (CLT) for each distribution. Show that the CLT is violated for the
Cauchy-Lorentz distribution.
The Central Limit Theorem (CLT) is a fundamental concept in statistics. It states that:
When we take a large number of samples (each of size N) of independent and
identically distributed (i.i.d.) random variables drawn from any population with finite mean
and variance, the distribution of their sample means tends to a Normal (Gaussian)
distribution, regardless of the original distribution.
Works for: Binomial, Poisson, Normal (because they have finite mean and variance).
Fails for: Cauchy-Lorentz distribution (mean and variance are undefined → CLT doesn’t
apply).
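Formally, if X₁, X₂, …, X_N are i.i.d. with mean μ and variance σ², then the standardized
mean √N (X̄ − μ)/σ converges in distribution to N(0, 1) as N → ∞. The Cauchy-Lorentz
distribution has no finite μ or σ²; in fact, the mean of N Cauchy variables is itself
Cauchy-distributed, so the sample means never concentrate, no matter how large N is.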
This program verifies this concept by generating sample means from different distributions and
plotting them.
Algorithm:
Step 1: Import necessary libraries:
- numpy for random number generation and statistics
- matplotlib for plotting
- scipy's norm for the Gaussian curve
Step 2: For each distribution, generate M random vectors of size N and compute the
arithmetic mean of each vector.
Step 3: Plot a histogram of the M sample means.
Step 4: Overlay a Gaussian curve (using scipy's norm) to check agreement; repeat for the
Cauchy-Lorentz distribution, where no such agreement appears.
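Code:
A minimal sketch of the procedure above; the sample sizes (M = 10000, N = 1000), the seed,
and the distribution parameters are assumed example values.
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Assumed sample sizes (not specified above)
M, N = 10000, 1000
rng = np.random.default_rng(0)

# Distribution samplers; parameters are assumed example values
distributions = {
    "Binomial(10, 0.5)": lambda: rng.binomial(10, 0.5, (M, N)),
    "Poisson(4)": lambda: rng.poisson(4, (M, N)),
    "Normal(0, 1)": lambda: rng.normal(0, 1, (M, N)),
    "Cauchy": lambda: rng.standard_cauchy((M, N)),  # CLT should fail here
}

fig, axes = plt.subplots(2, 2, figsize=(10, 8))
for ax, (name, sampler) in zip(axes.ravel(), distributions.items()):
    means = sampler().mean(axis=1)  # M arithmetic means, one per random vector
    if name == "Cauchy":
        # Clip the heavy tails so the histogram is readable
        means = means[np.abs(means) < 10]
    ax.hist(means, bins=50, density=True, alpha=0.7)
    # Overlay a Gaussian fitted to the sample means for comparison
    x = np.linspace(means.min(), means.max(), 200)
    ax.plot(x, norm.pdf(x, means.mean(), means.std()), "r-")
    ax.set_title(name)
plt.tight_layout()
plt.show()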
Output:
Project_02
Aim: Simulate tosses of a possibly biased coin and use a binomial hypothesis test to decide
whether the observed tosses support the claim that the coin is fair.
Theory:
Hypothesis testing is used to decide whether data support a certain claim or assumption (called a
hypothesis).
We use a Binomial Test to compute the p-value — the probability of getting the observed
result (or more extreme) under the assumption H₀ is true.
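For a fair coin, P(X = k) = C(n, k) (0.5)ⁿ, where X is the number of heads in n tosses.
The two-sided p-value sums P(X = i) over every outcome i that is no more probable than the
observed count, which is how binomtest computes the 'two-sided' alternative.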
Algorithm:
Step 1: Import Required Libraries
- numpy: for simulating random coin tosses.
- scipy.stats.binomtest: for performing the binomial hypothesis test.
Step 2: Initialize Parameters
- n: number of coin tosses (e.g., 100).
- q: actual probability of getting heads (e.g., 0.6).
- Set a random seed using np.random.seed() to ensure reproducibility.
Step 3: Simulate Coin Tosses
- Use np.random.binomial(1, q, size=n) to simulate n tosses.
- Each toss returns 1 for head and 0 for tail.
- Store the toss results in a variable (e.g., `tosses`).
Step 4: Count Number of Heads
- Use np.sum(tosses) to count how many times head appeared.
- Store this in a variable (e.g., `heads`).
Step 5: Perform Binomial Test
- Use binomtest(k=heads, n=n, p=0.5, alternative='two-sided') to test:
H₀: The coin is fair (q = 0.5)
H₁: The coin is biased (q ≠ 0.5)
- Store the result in a variable (e.g., `test`).
Step 6: Print Results
- Display the number of heads and total tosses.
- Print the p-value of the test.
- If p-value < 0.05 (5% significance level), conclude that we "Reject H₀".
- Otherwise, conclude "Fail to Reject H₀" (not enough evidence to say it's biased).
Code:
import numpy as np
from scipy.stats import binomtest
# Parameters
n = 100 # number of coin tosses
q = 0.6 # actual probability of heads
np.random.seed(1)
# Simulate coin tosses (1=head, 0=tail)
tosses = np.random.binomial(1, q, n)
heads = np.sum(tosses)
# Perform two-sided binomial test
test = binomtest(heads, n, 0.5, alternative='two-sided')
print(f"Heads: {heads}/{n}")
print(f"p-value: {test.pvalue:.4f}")
print("Result:", "Reject H₀" if test.pvalue < 0.05 else "Fail to
Reject H₀")
print("Conclusion: The coin is biased" if test.pvalue < 0.05
else "Conclusion: The coin is fair")
Output:
PS D:/B.Sc_6th_sem/Core/stat-analysis/unit2.1/Hypothesis_testing.py
Heads: 49/100
p-value: 0.9204
Result: Fail to Reject H₀
Conclusion: The coin is fair
PS D:/B.Sc_6th_sem/Core/stat-analysis/unit2.1/Hypothesis_testing.py
Heads: 61/100
p-value: 0.0352
Result: Reject H₀
Conclusion: The coin is biased
Project_03
Aim: Write a code to generate a Markov chain by defining a finite number M (say 2) of states.
Encode each state as a number and assign the probability of changing from state i to
state j. Compute the transition matrix for 1, 2, …, N steps. Following this rule, write a code
for the Markovian Brownian motion of a particle.
Theory:
Markov Chain:
A Markov Chain is a stochastic process that moves through a set of discrete states with
transition probabilities. The key property is:
Markov Property: The future state depends only on the present state, not on the sequence of
events that preceded it.
The transition matrix (P) defines the probability of moving from one state to another:
P[i][j] = probability of moving from state i to state j
● The next state depends only on the current state and the transition matrix P.
Over time, this stochastic process resembles a random walk (Brownian motion), but with
memory entering only through the transition probabilities, hence "Markovian".
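By the Chapman-Kolmogorov equation, the probability of going from state i to state j in n
steps is obtained by summing over all intermediate states, which is exactly matrix
multiplication. The n-step transition matrix is therefore the matrix power Pⁿ, which is why
the code below uses np.linalg.matrix_power(P, n).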
Algorithm:
Step 1: Import necessary libraries
- numpy for matrix operations and random number generation
- matplotlib.pyplot for plotting motion
Step 2: Define the 2-state transition matrix P.
Step 3: Compute and print Pⁿ for n = 1, 2, …, N using np.linalg.matrix_power.
Step 4: Simulate the particle's motion: at each time step, draw the next state from the
current row of P, move the position left (state 0) or right (state 1), and record it;
finally, plot the trajectory.
Code:
import numpy as np
import matplotlib.pyplot as plt
# Transition matrix for 2 states
P = np.array([[0.7, 0.3],
              [0.4, 0.6]])
# Compute transition matrices up to N steps
N = 5
for n in range(1, N + 1):
    print(f"Step {n}:\n{np.linalg.matrix_power(P, n)}\n")
# Simulate Markovian Brownian motion (state 0: step left, state 1: step right)
state, pos, motion = 0, 0, [0]
for _ in range(100):
    state = 0 if np.random.rand() < P[state][0] else 1  # transition using current row of P
    pos += -1 if state == 0 else 1
    motion.append(pos)
# Plot the motion
plt.plot(motion)
plt.title("Markovian Brownian Motion")
plt.xlabel("Time")
plt.ylabel("Position")
plt.grid(True)
plt.show()
Output:
Step 1:
[[0.7 0.3]
[0.4 0.6]]
Step 2:
[[0.61 0.39]
[0.52 0.48]]
Step 3:
[[0.583 0.417]
[0.556 0.444]]
Step 4:
[[0.5749 0.4251]
[0.5668 0.4332]]
Step 5:
[[0.57247 0.42753]
[0.57004 0.42996]]
Result: The transition matrices for steps 1 through N correctly show the evolving
state-to-state probabilities over time.
The simulated Markovian Brownian motion reflects a random walk where the particle’s
direction depends on state transitions. The motion plot shows fluctuating position over time,
confirming the behavior of a Markov-driven stochastic process.
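This agrees with theory: the rows of Pⁿ converge to the stationary distribution π satisfying
πP = π. For this P, solving 0.3π₀ = 0.4π₁ with π₀ + π₁ = 1 gives π = (4/7, 3/7) ≈
(0.5714, 0.4286), matching the Step 5 matrix above.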
Project_04
Aim: Write a code to minimize the cost function (mean squared error) in the linear
regression using gradient descent (an iterative optimization algorithm, which finds the
minimum of a differentiable function) with at least two independent variables. Determine
the correlation matrix for the regression parameters.
Theory:
Linear Regression using Gradient Descent
Linear regression estimates the relationship between a dependent variable Y and independent
variables X₁, X₂, …. With two independent variables, the model is:
Y = θ₀ + θ₁X₁ + θ₂X₂
Gradient descent minimizes the mean squared error (MSE) cost by repeatedly applying the update
θ = θ - α * ∇J(θ)
where α is the learning rate and ∇J(θ) is the gradient of the MSE cost.
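In matrix form, with m samples and a design matrix X whose first column is all ones (for the
intercept θ₀), the cost and its gradient are:
J(θ) = (1/m) ‖Xθ − Y‖²
∇J(θ) = (2/m) Xᵀ(Xθ − Y)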
Algorithm:
Step 1: Import numpy for calculations.
Step 2: Generate synthetic data with two independent variables (X1, X2) and one dependent
variable Y.
Step 3: Stack a column of ones with X1 and X2 to form the design matrix X (the ones column
multiplies the intercept θ₀).
Step 4: Initialize the parameter vector theta = [θ₀, θ₁, θ₂] with zeros.
Step 5: Define the learning rate and number of iterations for gradient descent.
Step 6: In each iteration, compute the predictions X·θ and the gradient of the MSE cost.
Step 7: Update θ = θ - α·∇J(θ) and repeat until the iterations are exhausted.
Step 8: Calculate and display the correlation matrix using np.corrcoef() for [X1, X2, Y].
Code:
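A minimal sketch consistent with the algorithm above; the true parameters (1, 3, 2), the
noise level, the learning rate, and the iteration count are assumed values chosen to produce
output of the reported kind.
import numpy as np

# Generate synthetic data: Y = 1 + 3*X1 + 2*X2 + noise (assumed true parameters)
np.random.seed(0)
m = 1000
X1 = np.random.rand(m)
X2 = np.random.rand(m)
Y = 1 + 3 * X1 + 2 * X2 + 0.1 * np.random.randn(m)

# Design matrix with a bias column of ones
X = np.column_stack([np.ones(m), X1, X2])

# Gradient descent on the MSE cost
theta = np.zeros(3)
alpha, iterations = 0.1, 10000
for _ in range(iterations):
    gradient = (2 / m) * X.T @ (X @ theta - Y)
    theta -= alpha * gradient

print("Estimated Parameters (theta):", theta)
print("Correlation Matrix:")
print(np.corrcoef([X1, X2, Y]))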
Output:
Estimated Parameters (theta): [1.02250584 2.98536193 1.98830098]
Correlation Matrix:
[[ 1. -0.0248661 0.80470196]
[-0.0248661 1. 0.56574719]
[ 0.80470196 0.56574719 1. ]]
Result:
After training, the model estimates the parameters close to the true values used for
generating data (approximately θ ≈ [1, 3, 2]).
The correlation matrix shows a strong linear relationship between the dependent variable Y
and the independent variables X₁ and X₂, validating the effectiveness of the regression model.
– Thank You, Sir
___________