Qualification Exam Question: 1 Statistical Models and Methods

This document contains a qualification exam question covering three topics: [1] Statistical Models and Methods, [2] Learning Theory, and [3] Decision Processes. For each topic, it lists several core concepts and problems, asking the test taker to define terms, derive solutions, compare approaches, and discuss tradeoffs. The exam questions gauge understanding of fundamental machine learning techniques like cross-validation, Bayes classifiers, kernel methods, VC dimension, reinforcement learning approaches, and more.


Qualification Exam Question

1 Statistical Models and Methods


1.1 Core
1. Cross-validation We would like to perform k-fold cross-validation. What should k be?
Discuss the pros and cons of large or small values of k.
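As a concrete reference point for this question, a minimal sketch of k-fold cross-validation (the `fit`/`error` interface and the toy data are illustrative placeholders, not part of the exam):

```python
import random

def k_fold_cv(data, k, fit, error):
    """Estimate generalization error by k-fold cross-validation.

    data: list of (x, y) examples; fit: trains a model on a list of
    examples; error: evaluates a model on held-out examples.
    """
    random.shuffle(data)
    folds = [data[i::k] for i in range(k)]  # k roughly equal folds
    errors = []
    for i in range(k):
        held_out = folds[i]
        train = [ex for j, f in enumerate(folds) if j != i for ex in f]
        model = fit(train)
        errors.append(error(model, held_out))
    # Large k (extreme: leave-one-out) means larger training sets (low bias)
    # but k model fits and higher-variance estimates; small k is cheaper but
    # pessimistically biased because each training set is smaller.
    return sum(errors) / k

# Toy usage: the "model" is just the mean label, error is squared deviation.
data = [(x, 2.0 * x) for x in range(20)]
fit = lambda train: sum(y for _, y in train) / len(train)
error = lambda m, held: sum((y - m) ** 2 for _, y in held) / len(held)
cv_err = k_fold_cv(data, k=5, fit=fit, error=error)
```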
2. Bayes classifier

(a) Write down the Bayes classifier f : X → Y (the classifier that minimizes the expected loss E[L(Y, f(X))]) for binary classification Y ∈ {−1, +1} with a non-0-1 loss (a is the loss for falsely predicting negative and b is the loss for falsely predicting positive). Simplify the classification rule as much as you can.
(b) If P(X|Y = y) is a multivariate Gaussian and assuming the 0/1 loss, write the Bayes classifier as f(X) = sign(h(X)) and simplify h as much as possible. What is the geometric shape of the decision boundary?
(c) Repeat (b) when the two Gaussians have identical covariance matrices. What is the geometric shape of the decision boundary?
(d) Repeat (b) when the two Gaussians have covariance matrices equal to the identity matrix. Describe the geometric shape of the decision boundary as much as possible.
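For part (a), a hedged sketch of where the derivation leads, writing η(x) = P(Y = +1 | X = x) and comparing the conditional risks of the two possible predictions:

```latex
% Risk of predicting -1 is a * P(Y=+1 | x); risk of predicting +1 is b * P(Y=-1 | x).
f(x) = \begin{cases}
  +1 & \text{if } a\,\eta(x) \ge b\,\bigl(1 - \eta(x)\bigr)
       \iff \eta(x) \ge \dfrac{b}{a+b},\\
  -1 & \text{otherwise,}
\end{cases}
```

which recovers the familiar 1/2 threshold when a = b (the 0-1 loss).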

3. Multiclass classification
Multiclass classification tries to assign one of several class labels (rather than binary labels) to an object. Can you give two ways to use binary classifiers to solve the multiclass classification problem? What are the pros and cons of these different methods (e.g., in terms of computational complexity or the applicability of the method)? Besides using binary classifiers, do you have any other ideas on how to build a multiclass classifier?
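Two standard reductions the question is pointing at are one-vs-rest and one-vs-one; a minimal sketch with stub classifiers (the `train_binary`/`train_pair` interfaces and the toy 1-d "classifiers" are illustrative assumptions):

```python
from collections import Counter
from itertools import combinations

def one_vs_rest(classes, train_binary, x):
    """Train one binary classifier per class (class vs. everything else)
    and predict the class with the highest score: |classes| classifiers.
    train_binary(c) returns a scoring function for 'is x in class c'."""
    scores = {c: train_binary(c)(x) for c in classes}
    return max(scores, key=scores.get)

def one_vs_one(classes, train_pair, x):
    """Train one binary classifier per pair of classes and predict by
    majority vote: |classes| choose 2 classifiers, each trained on less data.
    train_pair(a, b) returns a function voting a or b."""
    votes = Counter(train_pair(a, b)(x) for a, b in combinations(classes, 2))
    return votes.most_common(1)[0][0]

# Toy usage with stub classifiers that key off distance on a 1-d feature.
classes = [0, 1, 2]
train_binary = lambda c: (lambda x: -abs(x - c))   # closer to c => higher score
train_pair = lambda a, b: (lambda x: a if abs(x - a) < abs(x - b) else b)
pred_ovr = one_vs_rest(classes, train_binary, 1.2)
pred_ovo = one_vs_one(classes, train_pair, 1.2)
```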

1.2 Methods and Models


1. Kernel methods
Consider two machine learning models for 2-class classification. The first is a support vector
machine with Gaussian kernel. The second is kernel discriminant analysis (a Bayes classifier
with a kernel density estimator for each class), where the bandwidth may vary for each
dimension, and possibly also for each data point. Which is the more expressive, or powerful
model? Compare and discuss the pros and cons of each.
2. Bayes Rule
Let φ(y; µ, σ²) = (2πσ²)^{−1/2} exp(−(y − µ)²/(2σ²)) denote the density of a random variable y with a Gaussian distribution N(µ, σ²). Suppose that we have three related random variables, X, Y and Z,

• Random variable X has a Gaussian distribution N (0, σ 2 );
• Given random variable X = x, random variable Y has a Gaussian distribution N (x, σ 2 );
• Given random variable Y = y, random variable Z is a mixture of two Gaussians with
density

p(z|Y = y) = (1 − α)φ(z; 0, σ 2 ) + αφ(z; y, σ 2 ). (1)

• Conditioned on random variable Y , random variables X and Z are independent.

Given n i.i.d. samples z1, . . . , zn from the mixture density (1), answer the following questions.

(a) If n = 2, derive the posterior distribution of X conditioned on (z1, . . . , zn) exactly, up to a scalar multiple.
(b) If n = 10 (or in general when n is large), what is the computational problem associated with computing the posterior distribution of X?
(c) Propose approximation algorithms to deal with the computational problem when n is large.
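A small numerical illustration of the issue behind part (b): conditioned on Y = y, the joint likelihood of (z1, . . . , zn) expands into 2^n Gaussian terms, one per assignment of each zi to the background or the y-centered component, so the exact posterior over X is a mixture whose size grows exponentially in n (the sample values and α, σ below are placeholders):

```python
import math
from itertools import product

def phi(y, mu, var):
    """Gaussian density N(mu, var) evaluated at y."""
    return math.exp(-(y - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def likelihood_terms(zs, y, alpha, var):
    """Expand prod_i [(1-alpha) phi(z_i; 0, var) + alpha phi(z_i; y, var)]
    into its 2^n summands, one per component assignment of each z_i."""
    terms = []
    for assign in product([0, 1], repeat=len(zs)):  # 0 = background, 1 = y
        t = 1.0
        for z, a in zip(zs, assign):
            t *= alpha * phi(z, y, var) if a else (1 - alpha) * phi(z, 0, var)
        terms.append(t)
    return terms

zs = [0.5, -1.0, 2.0]                     # toy sample, n = 3
terms = likelihood_terms(zs, y=1.0, alpha=0.3, var=1.0)
n_terms = len(terms)                      # already 2**3 = 8 summands
total = sum(terms)                        # equals the compact product form:
direct = 1.0
for z in zs:
    direct *= 0.7 * phi(z, 0, 1.0) + 0.3 * phi(z, 1.0, 1.0)
```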

3. Dependent noise model


Let X1 , . . . , Xn be n determinations of a physical constant θ. Consider the model,

Xi = θ + ei , i = 1, . . . , n

and assume
ei = αei−1 + βei−2 + εi , i = 1, . . . , n, e0 = 0, e−1 = 0
with the εi 's i.i.d. standard normal, and α and β known constants. What is the maximum likelihood estimate of θ? Carefully justify each step of your derivation/calculation.
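A hedged sketch of the structure one should reach: since α and β are known, stacking X = (X1, . . . , Xn)ᵀ gives X ∼ N(θ1, Σ) with Σ computable from the AR(2) recursion, so maximizing the Gaussian likelihood reduces to generalized least squares:

```latex
\hat{\theta}_{\mathrm{MLE}}
  = \arg\min_{\theta}\; (X - \theta\mathbf{1})^{\top} \Sigma^{-1} (X - \theta\mathbf{1})
  = \frac{\mathbf{1}^{\top} \Sigma^{-1} X}{\mathbf{1}^{\top} \Sigma^{-1} \mathbf{1}}.
```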

2 Learning Theory
1. VC dimension
(a) What is the VC-dimension of axis-parallel rectangles in R^3? Specifically, a legal target function is specified by three intervals [xmin, xmax], [ymin, ymax], and [zmin, zmax], and classifies an example (x, y, z) as positive iff x ∈ [xmin, xmax], y ∈ [ymin, ymax], and z ∈ [zmin, zmax].
(b) Describe the importance of VC-dimension for Machine Learning.

2. Mistake-bound model.
(a) k-CNF is the class of Conjunctive Normal Form formulas in which each clause has size at most k. E.g., x4 ∧ (x1 ∨ x2) ∧ (x2 ∨ x̄3 ∨ x5) is a 3-CNF. Give an algorithm to learn 5-CNF formulas over n Boolean features in the mistake-bound model. Your algorithm should run in polynomial time per example (so the “halving algorithm” is not allowed). How many mistakes does it make at most?
(b) What is the relationship between the mistake-bound model and the PAC learning model?
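A hedged sketch of the standard elimination learner this question is after, shown with k = 2 over three variables to keep the run small (the same scheme applies to 5-CNF; clause and example representations here are illustrative choices):

```python
from itertools import combinations, product

def all_clauses(n, k):
    """All clauses (disjunctions) of at most k literals over x1..xn.
    A literal is (index, polarity); a clause is a frozenset of literals."""
    lits = [(i, b) for i in range(n) for b in (True, False)]
    cls = set()
    for size in range(1, k + 1):
        for combo in combinations(lits, size):
            if len({i for i, _ in combo}) == size:  # no repeated variable
                cls.add(frozenset(combo))
    return cls

def satisfies(clause, x):
    return any(x[i] == b for i, b in clause)

def learn_kcnf(n, k, examples):
    """Mistake-bound learner: start with every clause of size <= k and
    delete clauses falsified by positive examples. The hypothesis is always
    at least as strict as the target, so mistakes occur only on positives
    and each removes a clause: at most |all_clauses(n, k)| = O((2n)^k)."""
    h = all_clauses(n, k)
    mistakes = 0
    for x, label in examples:
        pred = all(satisfies(c, x) for c in h)
        if pred != label:
            mistakes += 1  # can only be predicting negative on a positive
        if label:  # keep h consistent with every positive example
            h = {c for c in h if satisfies(c, x)}
    return h, mistakes

# Toy run: target 2-CNF (x1 or x2) and (not x3) over n = 3 variables.
target = lambda x: (x[0] or x[1]) and (not x[2])
examples = [(x, target(x)) for x in product([False, True], repeat=3)]
h, mistakes = learn_kcnf(3, 2, examples)
consistent = all(all(satisfies(c, x) for c in h) == y for x, y in examples)
```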

3. Consistency Problem for 2-term DNF formulas
(a) Prove that the consistency problem for 2-term DNF formulas is NP-hard.
(b) Is the class of 2-term DNF formulas PAC-learnable? Explain why or why not.

3 Decision Processes
The theme is scalability, and you aren’t getting out of it.

1. Scaling up reinforcement learning


Machine learning algorithms have traditionally had difficulty scaling to large problems. In
classification and traditional supervised learning this problem arises with data that exist in
very high dimensional spaces or when there are many data points for computing, for example,
estimates of conditional densities. In reinforcement learning this is also the case, arising when,
for example, there are many, many states or when actions are at a very low level of abstraction.

• Typical approaches to addressing such problems in RL include function approximation and problem decomposition. Compare and contrast these two approaches. What problems of scale do these approaches address? What are their strengths and weaknesses? Are they orthogonal approaches? Can they work well together?
• What are the differences between hierarchical and modular reinforcement learning? Explain both the theoretical and practical limits of these approaches.

2. Learning from demonstrations


As in Question 1, machine learning algorithms have traditionally had difficulty scaling to large problems; in reinforcement learning this arises when there are very many states or when actions are at a very low level of abstraction.
Imagine that we want to leverage domain knowledge from humans in order to attack this problem of scalability. One mechanism we might use is Learning from Demonstration, where humans
demonstrate correct behavior; however, complex tasks can require more examples of complete
behavior than is practical to obtain. Given that you will only be able to extract so much time
from your human teachers, what are at least two ways you might still take advantage of their
ability to give demonstrations, even for complex tasks? For each proposed method, describe
strengths and possible pitfalls.

3. Learning with Options


As in the previous questions, reinforcement learning has difficulty scaling when there are very many states or when actions are at a very low level of abstraction.
One mechanism for addressing such concerns in RL is to use so-called options. Options are a
mechanism for incorporating temporally-extended actions into the RL framework.

• Formally define an option.


• What are the advantages and limits of options? Be specific.
• Describe at least two ways one might automate the process of creating options.
• Although options are defined in a very specific way, would you argue that different options might serve different purposes? If so, do these different kinds of options have identifiably different properties? If you believe that different options do not serve different purposes, argue for that position as well.
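For orientation only, a minimal sketch of the standard option triple ⟨I, π, β⟩ from Sutton, Precup and Singh as a data structure (the class and field names are illustrative, not a standard API):

```python
from dataclasses import dataclass
from typing import Any, Callable, Set

State = Any
Action = Any

@dataclass
class Option:
    """An option <I, pi, beta> (Sutton, Precup & Singh, 1999):
    I: initiation set of states where the option may be invoked,
    pi: intra-option policy mapping states to actions,
    beta: per-state termination probability in [0, 1]."""
    initiation_set: Set[State]
    policy: Callable[[State], Action]
    termination: Callable[[State], float]

    def can_start(self, s: State) -> bool:
        return s in self.initiation_set

# Toy usage: a "go right until the wall" option on a 1-d corridor 0..4.
go_right = Option(
    initiation_set={0, 1, 2, 3},
    policy=lambda s: "right",
    termination=lambda s: 1.0 if s == 4 else 0.0,
)
ok = go_right.can_start(2)
```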
