0% found this document useful (0 votes)

65 views7 pages

Machine Learning in Investment Analysis

The document discusses machine learning techniques including unsupervised learning, dimension reduction, LASSO, classification and regression trees, hierarchical clustering, neural networks, and reinforcement learning. It provides examples, explanations, and key differences between various machine learning methods and how they can be applied to investment problems.

Uploaded by

Kmalk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views7 pages

Machine Learning in Investment Analysis

Uploaded by

Kmalk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

CFA

CHAPTER 3

MACHINE LEARNING

1. (B) unsupervised learning.

Explanation
Dimension reduction and clustering are examples of unsupervised learning
algorithms.
(Module 3.3, LOS 3.d)
Related Material
SchweserNotes - Book 1

2. (C) Dimension reduction.

Explanation
Big Data refers to very large data sets which may include both structured
(e.g. spreadsheet) data and unstructured (e.g. emails, text, or pictures) data and
includes a large number of features as well as number of observations. Dimension
reduction seeks to remove the noise (i.e., those attributes that do not contain
much information) when the number of features in a data set
(its dimension) is excessive.
(Module 3.3, LOS 3.d)
Related Material
SchweserNotes - Book 1

3. (A) least absolute shrinkage and selection operator (LASSO).

Explanation
LASSO (least absolute shrinkage and selection operator) is a popular type of
penalized regression in which the penalty term comprises summing the absolute
values of the regression coefficients. The more included features, the larger the
penalty will be. The result is that a feature needs to make a sufficient contribution
to model fit to offset the penalty from including it.
(Module 3.2, LOS 3.c)
Related Material
SchweserNotes - Book 1

Hanna Kowalski is a senior fixed-income portfolio analyst at Czarnaskala BP.

Kowalski supervises Lena Nowak, who is a junior analyst.

Quantitative Methods 1 Machine Learning

CFA
Over the past several years, Kowalski has become aware that investment firms are
increasingly to improve their investment decision making. Kowalski has become
particularly interested in machine learning techniques and how they might be
applied to investment management applications.
Kowalski has read a number of articles about machine learning in various journals
for financial analysts. However, she has only a minimal knowledge of how she
might source appropriate model inputs, interpret model outputs, and translate
those outputs into investment actions.
Kowalski and Nowak meet to discuss plans for incorporating machine learning into
their investment model. Kowalski asks Nowak to research machine learning and
report back on the types of investment problems that machine learning can
address, how the algorithms work, and what the various terminology means.
After spending a few hours researching the topic, Nowak makes a number of
statements to Kowalski on the topics of:
• Classification and regression trees (CART)
• Hierarchical clustering
• Neural networks
• Reinforcement learning (RL) algorithms.
Kowalski is left to work out which of Nowak's statements are fully accurate and
which are not.

4. (A) discrete target variable, producing a cardinal tree.

Explanation
Classification and regression trees (CART) are generally applied to predict either a
continuous target variable, producing a regression tree, or a categorical target
variable, producing a classification tree.
(Module 3.2, LOS 3.c)
Related Material
SchweserNotes - Book 1

5. (B) Bottom-up hierarchical clustering begins with each observation being its own
cluster.
Explanation
Agglomerative (bottom-up) hierarchical clustering begins with each observation
being its own cluster. Then, the algorithm finds the two closest clusters, and
combines them into a new, larger cluster. Hierarchical clustering is an
unsupervised iterative algorithm. Divisive (top-down) hierarchical clustering
progressively partitions clusters into smaller clusters until each cluster contains
only one observation.
(Module 3.3, LOS 3.d)
Related Material
SchweserNotes - Book 1

Quantitative Methods 2 Machine Learning

CFA
6. (A) are effective in tasks with non-linearities and complex interactions among
variables.
Explanation
Neural networks have been successfully applied to solve a variety of problems
characterized by non-linearities and complex interactions among variables. Neural
networks have three types of layers: an input layer, hidden layers, and an output
layer. The hidden layer nodes (not the input layer nodes) each consist of a
summation operator and an activation function; these nodes are where learning
takes place.
(Module 3.3, LOS 3.e)
Related Material
SchweserNotes - Book 1

7. (C) take into consideration the constraints of its environment.

Explanation
The reinforcement learning (RL) algorithm involves an agent that will perform
actions that will maximize its rewards over time, taking into consideration the
constraints of the environment. Unlike supervised learning, reinforcement learning
has neither instantaneous feedback nor direct labeled data for each observation.
(Module 3.3, LOS 3.e)
Related Material
SchweserNotes - Book 1

8. (A) Regression Classification

Explanation
When the Y-variable is continuous, the appropriate approach is that of regression
(used in a broad, ML context). When the Y-variable is categorical (i.e., belonging to a
category or classification) or ordinal (i.e., ordered or ranked), a classification model
is used.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

9. (C) supervised learning.

Explanation
Supervised learning is a machine learning technique in which a machine is given
labelled input and output data and models the output data based on the input data.
In unsupervised learning, a machine is given input data in which to identify patterns
and relationships, but no output data to model. Deep learning is a technique to
identify patterns of increasing complexity and may use supervised or unsupervised
learning.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

Quantitative Methods 3 Machine Learning

CFA
10. (B) principal components analysis.
Explanation
Principal components analysis (PCA) is an unsupervised machine learning algorithm
that reduces highly correlated features into fewer uncorrelated composite variables
by transforming the feature covariance matrix. K-means partitions observations into
a fixed number (k) of non-overlapping clusters. Hierarchical clustering is an
unsupervised iterative algorithm used to build a hierarchy of clusters.
(Module 3.3, LOS 3.d)
Related Material
SchweserNotes - Book 1

11. (C) There is no labeled data.

Explanation
In unsupervised learning, the ML program is not given labeled training data. Instead,
inputs are provided without any conclusions about those inputs. In the absence of
any tagged data, the program seeks out structure or inter-relationships in the data.
Clustering is one example of the output of unsupervised ML program while
classification is suited for supervised learning.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

12. (B) reduce signal-to-noise ratio.

Explanation
Random forest is a collection of randomly generated classification trees from the
same data set. A randomly selected subset of features is used in creating each tree
and hence each tree is slightly different from the others. Since each tree only uses a
subset of features, random forests can mitigate the problem of over fitting. Because
errors across different trees tend to cancel each other out, using random forests can
increase the signal-to-noise ratio.
(Module 3.2, LOS 3.c)
Related Material
SchweserNotes - Book 1

13. (C) bias error plus variance error plus base error.
Explanation
Out-of-sample error equals bias error plus variance error plus base error. Bias error
is the extent to which a model fits the training data. Variance error describes the
degree to which a model's results change in response to new data from validation
and test samples. Base error comes from randomness in the data.
(Module 3.1, LOS 3.b)
Related Material
SchweserNotes - Book 1

Quantitative Methods 4 Machine Learning

CFA
Joyce Tan manages a medium-sized investment fund at Marina Bay Advisors that
specializes in international large cap equities. Over the four years that she has
been portfolio manager, Tan has been invested in approximately 40 stocks at
time.
Tan has used a number of methodologies to select investment opportunities from
the universe of investable stocks. In some cases, Tan uses quantitative measures
such as accounting ratios to find the most promising investment candidates. In
other cases, her team of analysts suggest investments based on qualitative factors
and various investment hypotheses.
Tan begins to wonder if her team could leverage financial technology to make
better decisions. Specifically, she has read about various machine learning
techniques to extract useful information from large financial datasets, in order to
uncover new sources of alpha.

14. (A) continuous.

Explanation
Supervised learning can be divided into two categories: regression and
classification. If the target variable is categorical or ordinal (e.g., determining a firm's
rating), then it is a classification problem. If the target variable to be predicted is
continuous, then the task is one of regression.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

15. (C) k – 1 samples will be used as training samples.

Explanation
In the K-fold cross-validation technique, the data is shuffled randomly and then
divided into k equal sub-samples. One sample is saved to be used as a validation
sample, and the other k — 1 samples are used as training samples.
(Module 3.1, LOS 3.b)
Related Material
SchweserNotes - Book 1

16. (B) more accurate and more stable.

Explanation
Ensemble learning, which is a technique of combining the predictions from a
number of models, generally results in more accurate and more stable predictions
than a single model.
(Module 3.2, LOS 3.c)
Related Material
SchweserNotes - Book 1

Quantitative Methods 5 Machine Learning

CFA
17. (C) Neural networks work well in the presence of non-linearities and complex
interactions among variables.
Explanation
Neural networks have been successfully applied to a variety of investment tasks
characterized by non-linearities and complex interactions among variables.
Neural networks with at least three hidden layers are known as deep learning nets
(DLNs). Reinforcement learning algorithms use an agent that will maximize its
rewards over time, within the constraints of its environment.
(Module 3.3, LOS 3.e)
Related Material
SchweserNotes - Book 1

18. (A) "find the pattern, apply the pattern."

Explanation
One elementary way to think of ML algorithms is to "find the pattern, apply the
pattern." Machine learning attempts to extract knowledge from large amounts of
data by learning from known examples in order to determine an underlying
structure in the data. The focus is on generating structure or predictions without
human intervention.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

19. (B) higher forecasting accuracy in out-of-sample data.

Explanation
Over fitting results when a large number of features (i.e., independent variables) are
included in the data sample. The resulting model can use the "noise" in the
dependent variables to improve the model fit. Overfitting the model in this way will
actually decrease the accuracy of model forecasts on other (out-of-sample) data.
(Module 3.1, LOS 3.b)
Related Material
SchweserNotes - Book 1

20. (B) Typical data analytics tasks for supervised learning include classification and
prediction.
Explanation
Supervised learning utilizes labeled training data to guide the ML program but does
not need “human intervention.” Typical data analytics tasks for supervised learning
include classification and prediction.
(Module 3.1, LOS 3.a)
Related Material
SchweserNotes - Book 1

Quantitative Methods 6 Machine Learning

CFA
21. (B) support vector machine (SVM).
Explanation
Support vector machine (SVM) is a linear classifier that aims to seek the optimal
hyperplane, i.e. the one that separates the two sets of data points by the maximum
margin. SVM is typically used for classification.
(Module 3.2, LOS 3.c)
Related Material
SchweserNotes - Book 1

22. (A) generalization.

Explanation
Generalization describes the degree to which, when predicting out-of-sample, a
machine learning model retains its explanatory power.
(Module 3.1, LOS 3.b)
Related Material
SchweserNotes - Book 1

23. (C) reinforcement learning.

Explanation
Reinforcement learning algorithms involve an agent that will perform actions that
will maximize its rewards over time, taking into consideration the constraints of its
environment. Neural networks consist of nodes connected by links; learning takes
place in the hidden layer nodes, each of which consists of a summation operator
and an activation function. Neural networks with many hidden layers (often more
than 20) are known as deep learning nets (DLNs) and used in artificial intelligence.
(Module 3.3, LOS 3.e)
Related Material
SchweserNotes - Book 1

Quantitative Methods 7 Machine Learning

Machine Learning Quiz Questions and Answers
No ratings yet
Machine Learning Quiz Questions and Answers
11 pages
Supervised Learning in Investment Analysis
No ratings yet
Supervised Learning in Investment Analysis
12 pages
Machine Learning Models Explained
No ratings yet
Machine Learning Models Explained
11 pages
CFA Machine Learning Exam Questions
No ratings yet
CFA Machine Learning Exam Questions
4 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
11 pages
Linear Regression and Clustering in ML
No ratings yet
Linear Regression and Clustering in ML
12 pages
Data Analysis and Machine Learning Guide
No ratings yet
Data Analysis and Machine Learning Guide
48 pages
Machine Learning vs Econometrics Explained
No ratings yet
Machine Learning vs Econometrics Explained
18 pages
Understanding Linear Regression in ML
No ratings yet
Understanding Linear Regression in ML
67 pages
Stochastic Processes and Time Series Analysis
No ratings yet
Stochastic Processes and Time Series Analysis
5 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
24 pages
Supervised vs Unsupervised Learning Techniques
No ratings yet
Supervised vs Unsupervised Learning Techniques
5 pages
Machine Learning in Investment Analysis
No ratings yet
Machine Learning in Investment Analysis
9 pages
Overview of Machine Learning Algorithms
No ratings yet
Overview of Machine Learning Algorithms
23 pages
Unsupervised Learning in Machine Learning
No ratings yet
Unsupervised Learning in Machine Learning
49 pages
Data Analytics: Regression & Classification Techniques
No ratings yet
Data Analytics: Regression & Classification Techniques
78 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
11 pages
Machine Learning in Investment Strategies
No ratings yet
Machine Learning in Investment Strategies
9 pages
Machine Learning Algorithms Overview
100% (1)
Machine Learning Algorithms Overview
13 pages
Machine Learning for Stock Prediction
No ratings yet
Machine Learning for Stock Prediction
8 pages
Supervised Machine Learning Algorithms
No ratings yet
Supervised Machine Learning Algorithms
6 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
12 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
19 pages
Supervised Machine Learning Algorithms Overview
No ratings yet
Supervised Machine Learning Algorithms Overview
33 pages
Understanding Estimators in ML
100% (1)
Understanding Estimators in ML
38 pages
Machine Learning Overview and Techniques
No ratings yet
Machine Learning Overview and Techniques
22 pages
Machine Learning Overview by Bhavya Sethi
No ratings yet
Machine Learning Overview by Bhavya Sethi
12 pages
Machine Learning in Investment Management
No ratings yet
Machine Learning in Investment Management
11 pages
Machine Learning Methods Overview
No ratings yet
Machine Learning Methods Overview
20 pages
KNN and Regression Techniques Explained
No ratings yet
KNN and Regression Techniques Explained
80 pages
Machine Learning Overview and Types
No ratings yet
Machine Learning Overview and Types
15 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
36 pages
Agglomerative vs Divisive Clustering Explained
No ratings yet
Agglomerative vs Divisive Clustering Explained
8 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
47 pages
Machine Learning for Time Series Forecasting
No ratings yet
Machine Learning for Time Series Forecasting
43 pages
PerceptiLabs Machine Learning Handbook
No ratings yet
PerceptiLabs Machine Learning Handbook
31 pages
Tesla Stock Marketing Price Prediction
No ratings yet
Tesla Stock Marketing Price Prediction
62 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
58 pages
Building Classification Models in Python
No ratings yet
Building Classification Models in Python
33 pages
Supervised Learning in Machine Learning
No ratings yet
Supervised Learning in Machine Learning
45 pages
Machine Learning Basics for Data Mining
No ratings yet
Machine Learning Basics for Data Mining
4 pages
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
17 pages
Machine Learning Concepts and Techniques
No ratings yet
Machine Learning Concepts and Techniques
8 pages
Understanding Linear Regression Techniques
No ratings yet
Understanding Linear Regression Techniques
35 pages
Machine Learning Categories Explained
No ratings yet
Machine Learning Categories Explained
11 pages
KNN vs K-means: A Comparative Study
No ratings yet
KNN vs K-means: A Comparative Study
9 pages
Machine Learning Concepts Explained
No ratings yet
Machine Learning Concepts Explained
1 page
Understanding Machine Learning Basics
No ratings yet
Understanding Machine Learning Basics
29 pages
Machine Learning in Trading Basics
No ratings yet
Machine Learning in Trading Basics
3 pages
Supervised Learning Techniques Overview
No ratings yet
Supervised Learning Techniques Overview
5 pages
Machine Learning: Encoding & Regularization Techniques
No ratings yet
Machine Learning: Encoding & Regularization Techniques
19 pages
Best Algorithms for Prediction in ML
No ratings yet
Best Algorithms for Prediction in ML
30 pages
Key Concepts in Machine Learning
No ratings yet
Key Concepts in Machine Learning
47 pages
Machine Learning Fundamentals Overview
No ratings yet
Machine Learning Fundamentals Overview
37 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
141 pages
Unsupervised Machine Learning Overview
No ratings yet
Unsupervised Machine Learning Overview
44 pages
CFA Quantitative Methods in Big Data
No ratings yet
CFA Quantitative Methods in Big Data
4 pages
Point Estimation in Statistics
No ratings yet
Point Estimation in Statistics
11 pages
Econ 102A: Statistical Methods Overview
No ratings yet
Econ 102A: Statistical Methods Overview
8 pages
Business Analytics Course Overview
No ratings yet
Business Analytics Course Overview
8 pages
Econometrics Concepts Review Guide
No ratings yet
Econometrics Concepts Review Guide
33 pages
Business Statistics: Key Formulas & Concepts
No ratings yet
Business Statistics: Key Formulas & Concepts
21 pages
Statistical Techniques with SPSS Analysis
No ratings yet
Statistical Techniques with SPSS Analysis
35 pages
Class 12 Business Mathematics Test 2022
No ratings yet
Class 12 Business Mathematics Test 2022
3 pages
MStat PSA 2025
No ratings yet
MStat PSA 2025
12 pages
Understanding Correlation and Regression
No ratings yet
Understanding Correlation and Regression
28 pages
Data Analysis Template for Attendance Impact
No ratings yet
Data Analysis Template for Attendance Impact
7 pages
Instrument Calibration and Sensitivity Analysis
No ratings yet
Instrument Calibration and Sensitivity Analysis
2 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
37 pages
Statistics Industry: Multiple Regression
No ratings yet
Statistics Industry: Multiple Regression
37 pages
Business Management Quantitative Methods Assignment
No ratings yet
Business Management Quantitative Methods Assignment
3 pages
Project Quality Management Overview
No ratings yet
Project Quality Management Overview
7 pages
Item-Total Statistics Analysis
No ratings yet
Item-Total Statistics Analysis
1 page
Time Series Analysis and Forecasting
No ratings yet
Time Series Analysis and Forecasting
76 pages
Understanding Positive Skewness in Psychology
No ratings yet
Understanding Positive Skewness in Psychology
4 pages
EDA with Python Syllabus for B. Tech
No ratings yet
EDA with Python Syllabus for B. Tech
3 pages
Descriptive Statistics and Regression Analysis
No ratings yet
Descriptive Statistics and Regression Analysis
39 pages
Medical Biostatistics Overview
No ratings yet
Medical Biostatistics Overview
30 pages
Engineering Experiments Homework ME 451
No ratings yet
Engineering Experiments Homework ME 451
4 pages
Electricity Generation's Impact on Nigeria's Economy
No ratings yet
Electricity Generation's Impact on Nigeria's Economy
19 pages
Nutrition Literacy in College Students
No ratings yet
Nutrition Literacy in College Students
11 pages
Holt-Winters Forecasting for E-Commerce
No ratings yet
Holt-Winters Forecasting for E-Commerce
5 pages
Estimation Exercises Overview
No ratings yet
Estimation Exercises Overview
4 pages
Classifying Salmon and Sea Bass Sizes
No ratings yet
Classifying Salmon and Sea Bass Sizes
13 pages
FDI and Economic Indicators Analysis
No ratings yet
FDI and Economic Indicators Analysis
39 pages
AI & ML Diploma Syllabus 2025-26
No ratings yet
AI & ML Diploma Syllabus 2025-26
104 pages
Statistics Test: Mean, Variance, Normal Distribution
No ratings yet
Statistics Test: Mean, Variance, Normal Distribution
4 pages

Machine Learning in Investment Analysis

Uploaded by

Machine Learning in Investment Analysis

Uploaded by

CFA

1. (B) unsupervised learning.

2. (C) Dimension reduction.

3. (A) least absolute shrinkage and selection operator (LASSO).

Hanna Kowalski is a senior fixed-income portfolio analyst at Czarnaskala BP.

Quantitative Methods 1 Machine Learning

4. (A) discrete target variable, producing a cardinal tree.

Quantitative Methods 2 Machine Learning

7. (C) take into consideration the constraints of its environment.

8. (A) Regression Classification

9. (C) supervised learning.

Quantitative Methods 3 Machine Learning

11. (C) There is no labeled data.

12. (B) reduce signal-to-noise ratio.

Quantitative Methods 4 Machine Learning

14. (A) continuous.

15. (C) k – 1 samples will be used as training samples.

16. (B) more accurate and more stable.

Quantitative Methods 5 Machine Learning

18. (A) "find the pattern, apply the pattern."

19. (B) higher forecasting accuracy in out-of-sample data.

Quantitative Methods 6 Machine Learning

22. (A) generalization.

23. (C) reinforcement learning.

Quantitative Methods 7 Machine Learning

You might also like