Forecasting
Q: What kind of data would you need for linear regression forecasting?
A: For linear regression, you need historical data where there is a clear relationship between the independent
variable(s) (predictor) and the dependent variable (target). For time series forecasting, you'd need data points
from consistent intervals (e.g., daily, monthly) to capture trends and patterns.
Q: How would you handle missing values in the data?
A: If there are missing values, I would first try to impute them using methods like mean or median imputation,
or even forward/backward filling (for time series data). If the missing values are too significant, I may remove
those rows or use regression imputation to predict the missing values based on other features.
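A minimal pandas sketch of these imputation options (the DataFrame and column names are hypothetical):

import pandas as pd
import numpy as np

# Hypothetical monthly sales series with gaps
df = pd.DataFrame({
    "month": pd.date_range("2023-01-01", periods=6, freq="MS"),
    "sales": [200.0, np.nan, 250.0, np.nan, 300.0, 320.0],
})

df["sales_mean"] = df["sales"].fillna(df["sales"].mean())      # mean imputation
df["sales_median"] = df["sales"].fillna(df["sales"].median())  # median imputation
df["sales_ffill"] = df["sales"].ffill()  # forward fill, common for time series
df["sales_bfill"] = df["sales"].bfill()  # backward fill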
Q: How do you perform exploratory data analysis (EDA) before applying linear regression?
A: I would start by visualizing the data using plots (scatter plots, time series plots) to observe trends, patterns,
and any outliers. I would also compute correlation between variables to check the strength of relationships and
ensure there is a linear relationship between the independent and dependent variables.
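As a sketch of that EDA workflow in Python (assuming a hypothetical sales.csv with month, ad_spend, and sales columns):

import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("sales.csv", parse_dates=["month"])  # hypothetical file and columns

# Time series plot to inspect trends, patterns, and outliers
df.plot(x="month", y="sales", title="Sales over time")

# Scatter plot and correlation between a predictor and the target
df.plot.scatter(x="ad_spend", y="sales")
print(df[["ad_spend", "sales"]].corr())  # values near ±1 suggest a strong linear relationship
plt.show()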
Q: What would you do if the data doesn't show a clear trend or pattern?
A: If the data doesn’t show a clear trend, I would analyze it further for seasonality or irregularity. For time
series, I could try differencing or transformations like log transformation to stabilize variance. If no pattern is
evident, I might consider using more complex models, like ARIMA or machine learning techniques.
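A small sketch of those transformations with pandas/numpy (made-up values):

import numpy as np
import pandas as pd

sales = pd.Series([100, 120, 160, 230, 310, 450],
                  index=pd.date_range("2023-01-01", periods=6, freq="MS"))

log_sales = np.log(sales)           # log transform to stabilize growing variance
diff_sales = sales.diff().dropna()  # first difference to remove a linear trend
# For monthly data with yearly seasonality (needs more than 12 points):
# seasonal_diff = sales.diff(12).dropna()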
3. Data Preprocessing
Q: What preprocessing steps would you follow before applying linear regression?
A: I would:
Convert any categorical variables to numeric values (e.g., using one-hot encoding).
For time series, I would check for stationarity and transform the data if needed.
Q: Why do you split your data into training and testing sets?
A: Splitting data helps assess how well the model generalizes to unseen data. The training set is used to fit the
model, while the testing set allows us to evaluate the model’s performance and avoid overfitting (where the
model fits the training data too closely but fails to generalize).
Q: How do you determine the optimal split between training and testing data?
A: A common split ratio is 80/20 or 70/30, where 70-80% of the data is used for training, and the remaining 20-
30% is used for testing. However, for small datasets, cross-validation might be used to ensure robustness.
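A minimal scikit-learn sketch of both splits (synthetic data):

import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)  # hypothetical feature, e.g., a time index
y = 3.0 * X.ravel() + np.random.default_rng(0).normal(0, 5, 100)

# Standard random 80/20 split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# For time series, preserve chronological order instead of shuffling
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, shuffle=False)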
Q: How does linear regression work?
A: Linear regression assumes a linear relationship between the independent and dependent variables. It fits a
line to the data using the least squares method, minimizing the sum of squared differences between the
observed and predicted values. The equation is:
y = β₀ + β₁ · x
Where:
y = predicted value
β₀ = intercept
β₁ = slope (the coefficient of x)
x = independent variable (predictor)
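A minimal sketch of fitting this equation with scikit-learn (made-up data points):

import numpy as np
from sklearn.linear_model import LinearRegression

x = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)  # predictor
y = np.array([2.1, 4.0, 6.2, 7.9, 10.1])      # target

model = LinearRegression().fit(x, y)  # ordinary least squares fit
print("intercept (β₀):", model.intercept_)
print("slope (β₁):", model.coef_[0])
print("prediction for x = 6:", model.predict([[6]])[0])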
Q: How would you handle a situation where the relationship between variables isn't linear?
A: If the relationship isn’t linear, I would either try transformations on the data (e.g., log transformation) to
make it linear or use more advanced models like polynomial regression or machine learning models (like
decision trees or neural networks) to capture complex relationships.
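For illustration, a polynomial regression sketch with scikit-learn (synthetic quadratic data; the degree is an assumption you would tune):

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

x = np.linspace(0, 5, 30).reshape(-1, 1)
y = 1.5 * x.ravel() ** 2 - 2 * x.ravel() + np.random.default_rng(1).normal(0, 1, 30)

# Polynomial regression: non-linear in x, but still linear in the coefficients
poly_model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
poly_model.fit(x, y)
print(poly_model.predict([[6.0]]))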
Q: What are the key metrics you use to evaluate the performance of a linear regression model?
A:
Mean Squared Error (MSE): Measures the average squared difference between the actual and
predicted values.
Root Mean Squared Error (RMSE): The square root of MSE, which gives error in the same units as the
original data.
Q: How do you check whether your model is overfitting or underfitting?
A: Overfitting occurs when the model performs very well on the training set but poorly on the testing set.
Underfitting occurs when the model performs poorly on both the training and testing sets. To check, I would:
Plot the residuals: Random scatter of residuals indicates a good fit, while patterns may indicate
overfitting or underfitting.
Compare training and testing performance, use cross-validation, and adjust model complexity.
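A self-contained residual-plot sketch (synthetic data):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = np.arange(50).reshape(-1, 1)
y = 2.0 * X.ravel() + rng.normal(0, 3, 50)

model = LinearRegression().fit(X, y)
pred = model.predict(X)
residuals = y - pred  # random scatter around 0 suggests a good fit

plt.scatter(pred, residuals)
plt.axhline(0, color="red", linestyle="--")
plt.xlabel("Predicted values")
plt.ylabel("Residuals")
plt.show()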
Q: How do you use the trained model to forecast future values?
A: After training the model, you use it to predict future values by plugging new values of the independent
variable(s) (predictor) into the regression equation. For example, if I have monthly sales data, I would predict
the sales for the next month using the model’s equation.
Q: How would you deal with seasonality or trends in time series forecasting using linear regression?
A: Consider adding seasonal components or time variables (e.g., month or quarter) as additional
features.
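One way to sketch this feature engineering in Python (hypothetical 36 months of sales):

import pandas as pd
from sklearn.linear_model import LinearRegression

df = pd.DataFrame({
    "date": pd.date_range("2021-01-01", periods=36, freq="MS"),
    "sales": range(100, 136),
})
df["t"] = range(len(df))           # time index captures the trend
df["month"] = df["date"].dt.month  # month number captures seasonality

# One-hot encode the month so each season gets its own coefficient
X = pd.get_dummies(df[["t", "month"]], columns=["month"], drop_first=True)
model = LinearRegression().fit(X, df["sales"])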
8. Model Improvement
Q: How would you improve your linear regression model if it doesn't provide accurate forecasts?
A: I would:
Add more relevant features (e.g., time-related variables, lag variables, external factors).
Check for and remove multicollinearity (when independent variables are highly correlated with each
other).
Consider using regularization techniques (like Lasso or Ridge regression) to prevent overfitting.
If linear regression is not sufficient, try non-linear models or machine learning methods (e.g.,
Random Forest, XGBoost).
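A short sketch of the regularization step mentioned above (synthetic data; the alpha values are arbitrary and would normally be tuned by cross-validation):

import numpy as np
from sklearn.linear_model import Lasso, Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = X @ np.array([3.0, 0.0, 0.0, 1.5, 0.0]) + rng.normal(0, 1, 100)

ridge = make_pipeline(StandardScaler(), Ridge(alpha=1.0)).fit(X, y)  # shrinks coefficients
lasso = make_pipeline(StandardScaler(), Lasso(alpha=0.1)).fit(X, y)  # can zero out irrelevant ones
print(lasso.named_steps["lasso"].coef_)  # some coefficients driven to exactly 0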
Q: Can you use multiple regression in forecasting, and how would that help?
A: Yes, multiple regression can be used when you have multiple independent variables (predictors). It helps in
improving the model by considering the impact of more than one factor (e.g., both price and advertising spend
on sales). The equation becomes:
y = β₀ + β₁ · x₁ + β₂ · x₂ + ⋯ + βₙ · xₙ
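A minimal multiple-regression sketch matching the price/advertising example (made-up numbers):

import pandas as pd
from sklearn.linear_model import LinearRegression

df = pd.DataFrame({
    "price":    [9.9, 10.5, 10.0, 11.0, 9.5, 10.8],
    "ad_spend": [200, 250, 220, 300, 180, 280],
    "sales":    [520, 560, 530, 610, 490, 590],
})

model = LinearRegression().fit(df[["price", "ad_spend"]], df["sales"])
print(model.intercept_, model.coef_)  # β₀, then β₁ (price) and β₂ (ad_spend)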
Linear regression is a statistical technique used to model the relationship between a dependent variable (also
known as the target) and one or more independent variables (predictors or features). The goal is to find the
best-fitting line that minimizes the error (difference between predicted and actual values).
Linear regression relies on the following assumptions:
1. Linearity: The relationship between the dependent and independent variable(s) is linear.
2. Independence: The residuals (errors) are independent of one another.
3. Homoscedasticity: The variance of the residuals (errors) is constant across all values of the
independent variable(s).
A: Multicollinearity occurs when independent variables are highly correlated with each other. To detect
multicollinearity:
1. Correlation Matrix: Check correlation between predictors; values close to ±1 suggest multicollinearity.
2. Variance Inflation Factor (VIF): A VIF above roughly 5-10 indicates problematic multicollinearity.
3. Condition Index: A high condition index (above 30) may also indicate multicollinearity.
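A sketch of the VIF check with statsmodels (synthetic data where x2 is nearly a copy of x1):

import numpy as np
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

rng = np.random.default_rng(0)
x1 = rng.normal(size=100)
X = pd.DataFrame({
    "x1": x1,
    "x2": 0.9 * x1 + rng.normal(0, 0.1, 100),  # highly correlated with x1
    "x3": rng.normal(size=100),
})

Xc = add_constant(X)  # include an intercept column
vifs = pd.Series(
    [variance_inflation_factor(Xc.values, i) for i in range(1, Xc.shape[1])],
    index=X.columns,
)
print(vifs)  # VIFs above roughly 5-10 flag multicollinearity (x1 and x2 here)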
Q: What is the difference between simple and multiple linear regression?
A:
Simple Linear Regression: Models the relationship between one independent variable and one
dependent variable.
Multiple Linear Regression: Models the relationship between two or more independent variables and
a dependent variable.
After fitting the model, it’s important to evaluate its performance. Here are the key performance metrics used
for linear regression:
1. R-squared (R²)
Definition: R² measures the proportion of variance in the dependent variable that is explained by the
independent variables.
Formula:
R² = 1 − Σ(yᵢ − ŷᵢ)² / Σ(yᵢ − ȳ)²
Where:
o yᵢ = actual values
o ŷᵢ = predicted values
o ȳ = mean of the actual values
Interpretation:
o R² = 1: Perfect fit, meaning the model explains all the variance in the data.
o R² = 0: The model does not explain any variance, similar to predicting the mean value.
o Higher R² means better fit, but not always the best measure for model performance.
Q: How do you use R² to evaluate a model, and what are its limitations?
A: R² is used to assess how well the independent variables explain the variation in the dependent variable.
However, R² alone is not enough. It can be misleading if the model is overfitting or if there are irrelevant
predictors.
Q: What is the difference between R² and Adjusted R²?
A:
R² can increase with the addition of more predictors, even if they aren’t improving the model’s
performance.
Adjusted R² adjusts for the number of predictors, so it penalizes adding irrelevant variables. It’s a
better measure when comparing models with different numbers of predictors.
2. Mean Squared Error (MSE)
Definition: MSE measures the average squared difference between the actual and predicted values.
Formula:
MSE = (1/n) Σ (yᵢ − ŷᵢ)²
Where:
o n = number of observations
o yᵢ = actual values
o ŷᵢ = predicted values
Interpretation:
o Lower MSE indicates better performance; because errors are squared, larger errors are penalized more heavily.
Q: When is MSE a useful metric?
A: MSE is useful when you want a measure of the average squared error between predicted and actual values.
However, it doesn’t give you an easy-to-interpret error in the same units as the dependent variable. For that,
we use RMSE.
3. Root Mean Squared Error (RMSE)
Definition: RMSE is the square root of the MSE. It provides the error in the same units as the
dependent variable.
Formula:
RMSE = √MSE = √( (1/n) Σ (yᵢ − ŷᵢ)² )
Interpretation:
o RMSE provides a more interpretable metric because it’s in the same units as the target
variable.
Q: When would you prefer RMSE?
A: RMSE is often preferred when you need the error in the same units as the target variable, making it easier to
understand. It’s a good metric when the cost of large errors is significant.
4. Mean Absolute Error (MAE)
Definition: MAE measures the average of the absolute differences between the actual and predicted
values.
Formula:
MAE = (1/n) Σ |yᵢ − ŷᵢ|
Interpretation:
o Like MSE and RMSE, lower MAE indicates better model performance.
o MAE is less sensitive to outliers compared to MSE and RMSE because it doesn’t square the
errors.
Q: When is MAE the right metric?
A: MAE is useful when you want to avoid the influence of outliers on your error metric. It provides a direct
interpretation of how much, on average, your model's predictions are off from the true values.
5. Adjusted R²
Definition: Adjusted R² adjusts the R² statistic by accounting for the number of predictors in the
model, making it more reliable when comparing models with different numbers of independent
variables.
Formula:
Adjusted R² = 1 − (1 − R²) · (n − 1) / (n − p − 1)
Where:
o n = number of observations
o p = number of predictors
Interpretation:
o Adjusted R² will never be greater than R² and may decrease if unnecessary predictors are
added.
Q: When would you use Adjusted R² instead of R²?
A: Adjusted R² is preferred when comparing models with different numbers of predictors, as it penalizes the
inclusion of irrelevant variables, making it a more reliable measure of model quality.
Summary of metrics:
R²: Proportion of variance explained by the model. Use to assess how well your independent variables explain the target variable, but beware of overfitting.
Adjusted R²: Adjusted for the number of predictors. Use when comparing models with different numbers of predictors.
MSE: Mean squared error between actual and predicted values. Use when you want to penalize larger errors more heavily, but it's sensitive to outliers.
RMSE: Square root of MSE (in the same units as the dependent variable). Use when you need error in the same units as the target variable.
MAE: Average of absolute errors. Use when you want to avoid the influence of outliers and need a direct interpretation of average errors.
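All of these metrics can be computed in a few lines with scikit-learn (made-up actual/predicted values; p is the assumed number of predictors):

import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_true = np.array([100, 120, 140, 160, 180])
y_pred = np.array([105, 118, 138, 165, 170])
n, p = len(y_true), 2

r2 = r2_score(y_true, y_pred)
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p - 1)  # penalizes extra predictors
mse = mean_squared_error(y_true, y_pred)
rmse = np.sqrt(mse)  # same units as the target
mae = mean_absolute_error(y_true, y_pred)
print(f"R²={r2:.3f}  Adj R²={adj_r2:.3f}  MSE={mse:.1f}  RMSE={rmse:.1f}  MAE={mae:.1f}")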
Linear regression is a statistical method used to predict the relationship between a dependent variable (target)
and one or more independent variables (predictors). In the context of forecasting:
1. Prepare Data: Ensure you have historical data with a time-based variable (e.g., dates) and a numeric
target variable (e.g., sales).
2. Model Creation: Apply linear regression to model the relationship between the target and time (or
other predictors). The model finds the best-fitting line (y = mx + b), where:
o m is the slope (the change in the target per time period).
o b is the intercept.
3. Make Predictions: Use the model to predict future values by inputting future time periods into the
regression equation.
4. Evaluate Performance: Use metrics like R-squared (R²), Mean Absolute Error (MAE), and Root Mean
Squared Error (RMSE) to assess the model’s accuracy.
Key Points:
Linear Regression assumes a linear relationship between the dependent and independent variables.
R-squared (R²) measures how well the model fits the data (0 = no fit, 1 = perfect fit).
MAE and RMSE measure the accuracy of the forecast by comparing predicted and actual values.
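Putting the four steps together, a minimal end-to-end sketch (hypothetical 12 months of sales, forecasting 3 months ahead):

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

df = pd.DataFrame({
    "month": pd.date_range("2024-01-01", periods=12, freq="MS"),
    "sales": [200, 210, 205, 220, 235, 230, 245, 260, 255, 270, 280, 290],
})
df["t"] = np.arange(len(df))  # numeric time index as the predictor

model = LinearRegression().fit(df[["t"]], df["sales"])  # y = b + m·t

# Forecast the next 3 months by extending the time index
future = pd.DataFrame({"t": np.arange(len(df), len(df) + 3)})
print(model.predict(future))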
Forecasting – Power BI
How to Do Forecasting in Power BI
1. Using Power BI’s Built-in Forecasting Feature: Power BI has a forecasting feature built into line charts.
It uses the Exponential Smoothing (ETS) model to forecast future values based on historical data.
Here's how you can use it:
Steps:
o Create a line chart (the built-in forecast is available only on line charts with a time-based axis).
o Drag your date/time field to the Axis and the measure (like sales, revenue, etc.) to the Values
field well.
o Click on the Analytics pane (located on the right side of the visual, next to the Format pane).
o Expand the Forecast section and click Add.
o In the settings, you can specify the forecast length, seasonality, and whether you want to
display the forecast with confidence intervals.
o Power BI will automatically generate the forecast for the specified period.
2. Custom Forecasting with DAX (Data Analysis Expressions): For more control over forecasting, you can
create custom forecasting models using DAX formulas. This is especially useful when you want to
apply more complex forecasting models or use specific statistical functions to predict future data.
Example: You could create a measure for calculating Moving Averages, or use a formula that adjusts based on
recent trends.
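Inside Power BI this would be written as a DAX measure; purely to illustrate the moving-average logic, here is the equivalent computation in Python/pandas (made-up values):

import pandas as pd

sales = pd.Series([200, 210, 205, 220, 235, 230, 245, 260],
                  index=pd.date_range("2024-01-01", periods=8, freq="MS"))

ma3 = sales.rolling(window=3).mean()  # 3-month moving average smooths short-term noise
print(ma3)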
3. Using Power Query to Prepare Data for Forecasting: Before forecasting, you can prepare your data in
Power Query. This includes handling missing data, creating time-based columns, transforming data to
make it stationary (for time series), and so on.
4. Integration with Azure Machine Learning: If you need more advanced forecasting models like ARIMA,
Machine Learning, or Neural Networks, Power BI integrates with Azure Machine Learning. This allows
you to bring in machine learning models from Azure and apply them directly within Power BI.
Key characteristics of Power BI's built-in forecasting:
Forecasting length: The forecast length in Power BI is limited to a specific number of future periods
(months, days, etc.). While it's useful for short-term forecasting, longer-term forecasting might require
more advanced methods.
User-friendly: The forecasting feature is easy to apply with no need for deep statistical knowledge. It’s
integrated with visuals, making it easy to explore predictions alongside historical data.
Visualization: You can visualize both historical data and forecasts in the same chart, making it easier to
interpret and make decisions.
Automation: Power BI automatically updates the forecast as new data is loaded, making it an efficient
tool for real-time or periodic forecasting.
Example Scenario
Suppose you have sales data for the past 12 months, and you want to forecast the next 3 months. You can use
the line chart with a time-based axis (Month) and sales as the values. After enabling the forecasting feature,
Power BI will predict the next 3 months of sales based on historical trends.
Q: What forecasting method does Power BI use?
A: Power BI uses Exponential Smoothing (ETS) for forecasting, which is suitable for data with trends or
seasonality. ETS is a smoothing technique that weighs recent observations more heavily than older ones.
Q: What forecast settings can you customize?
A: You can customize the forecast length (how many periods ahead you want to predict), adjust the seasonality
(automatic or manual), and choose whether to display confidence intervals.
Q: Can Power BI handle advanced time series forecasting like ARIMA or SARIMA?
A: Not directly within Power BI. For advanced forecasting models like ARIMA, SARIMA, or neural networks, you
would need to integrate Power BI with external tools like R, Python, or Azure Machine Learning.
Q: How do you handle data quality issues before forecasting in Power BI?
A: In Power BI, you can clean and transform data using Power Query before applying the forecasting model.
You can fill missing values with appropriate techniques like imputation or forward-fill, or handle outliers if
necessary.
Power BI allows you to perform forecasting directly on a time series dataset using a built-in feature. Here’s a
detailed explanation of the steps involved:
Ensure Data Quality: Make sure that your data is cleaned—handle missing values, remove outliers,
and format the data correctly.
Q: Why is time-based data essential for forecasting?
A: Time-based data is critical for forecasting because Power BI uses historical trends and patterns in the data to
predict future values. Without a proper time-series structure (with consistent intervals like months or days),
forecasting won’t work effectively.
Visual Setup: Create a line chart with your date/time field on the X-axis and the measure you want to forecast on the Y-axis.
Q: Why are line charts used for forecasting?
A: Line charts are used because they represent trends over time effectively. Power BI’s forecasting feature
applies time-series analysis, which works best with a continuous, time-based dataset like the one represented
in a line chart.
Analytics Pane: Once your line chart is set up, go to the Analytics pane (found on the right side of the
visual settings, next to the Format pane), expand the Forecast section, and click Add. Then configure:
o Length: Define how far into the future you want to forecast (e.g., 3 months, 1 year).
o Seasonality: Let Power BI detect it automatically, or set the number of periods per cycle manually.
o Confidence Interval: You can choose to display confidence intervals (typically 95%
confidence) to show the range of potential future values.
Q: What is seasonality, and why does it matter?
A: Seasonality refers to periodic fluctuations in data (e.g., higher sales in certain months due to holidays or
weather patterns). It's important because it helps the forecasting model predict patterns that repeat over time,
improving forecast accuracy.
Adjust Parameters: Depending on your dataset and needs, you can adjust the following:
o Confidence Interval: Shows the range of possible future values based on the model’s
uncertainty.
o Forecast Length: Choose how many periods ahead you want to predict (e.g., if you have
monthly data, you can forecast for the next 12 months).
Review Forecasting Results: Power BI will automatically generate the forecast and display it as an
extension of your existing data in the line chart.
Q: What model does Power BI apply behind the scenes?
A: Power BI uses an Exponential Smoothing (ETS) model for forecasting, which gives more weight to recent
data points. This method is useful when there’s seasonality or a trend in the data. Power BI automatically
handles the modeling process in the background.
Power BI will plot the forecasted values on the chart, usually in a lighter shade or a different color to
differentiate it from the historical data.
You can also add confidence intervals to show the range within which the actual future values might
fall.
Q: What does the confidence interval represent?
A: The confidence interval shows the range of possible future values based on the model's uncertainty. For
example, with a 95% confidence interval, there’s a 95% chance that the true value will fall within this range.
Reevaluate: Check how well your forecast aligns with the historical data and adjust the parameters as
needed.
Validation: To validate the forecast, you could compare it with real data as it becomes available in the
future.
Q: How do you evaluate forecast accuracy?
A: You can evaluate forecast accuracy by comparing the predicted values to actual outcomes once you have
new data. You can also use performance metrics like Mean Absolute Error (MAE) or Root Mean Squared Error
(RMSE) to quantify the error between the forecast and actual values.
Q1: What forecasting method does Power BI use for time series data?
A: Power BI uses the Exponential Smoothing (ETS) method for forecasting. This method is useful for time series
data with trends and seasonality, and it assigns higher weight to more recent data points.
Q2: How does Power BI handle seasonality in forecasting?
A: Seasonality can be automatically detected by Power BI, but you can also manually define it based on your
data. For example, if you know there’s a yearly pattern in your sales data (e.g., higher sales during the
holidays), you can specify a seasonality period that matches this pattern.
Q3: What is the significance of the confidence interval in Power BI’s forecasting?
A: The confidence interval represents the range of possible future values based on the forecast model’s
uncertainty. For instance, a 95% confidence interval suggests there is a 95% chance the actual value will fall
within this range, helping businesses to understand the potential variability of future predictions.
Q4: How do you evaluate the accuracy of your forecast in Power BI?
A: You can evaluate the forecast accuracy by comparing the predicted values with actual values once they
become available. Performance metrics like Mean Absolute Error (MAE) or Root Mean Squared Error (RMSE)
can be used to measure the accuracy of the forecast quantitatively.
Q5: Can Power BI forecast multiple time series at once?
A: While Power BI can forecast a single time series (one line chart), you can create multiple visualizations for
different time series, such as sales for different regions or products. However, Power BI doesn’t support multi-
variable time series forecasting out of the box. For this, you would need to use external models like Azure
Machine Learning or R/Python integration.
Q6: How do you handle missing data in time series forecasting in Power BI?
A: In Power BI, you can handle missing data by filling missing values or applying techniques like forward fill or
interpolation using Power Query before applying the forecasting model. It's essential to clean the data to
ensure the forecast is based on accurate information.
Q7: How do you visualize the forecast and historical data together in Power BI?
A: Power BI will automatically display the forecast alongside the historical data in the same line chart. The
forecast is usually shown in a different color and can also include confidence intervals to indicate the possible
range of future values.
Power BI allows you to perform time series forecasting using the Exponential Smoothing
(ETS) method, which is ideal for data with trends and seasonality. The key steps are:
1. Prepare Your Data: Ensure you have time-based (date) and numeric value columns
(e.g., sales, traffic).
2. Create a Line Chart: Visualize your data with a line chart, with time on the X-axis and
the numeric values on the Y-axis.
3. Enable Forecasting: In the Analytics pane, add the forecasting option to the chart,
define the forecast length, and set the seasonality and confidence interval.
4. Review and Interpret: The forecasted values will appear alongside historical data,
with confidence intervals showing the potential range of future values.
Key takeaway:
Power BI's forecasting feature is great for quick, straightforward predictions, but for
advanced models, integration with Azure Machine Learning or R/Python is needed.
----------------------------------------------------------------------------------------------------------------------
1. Time Series: A sequence of data points measured at successive time intervals (e.g.,
monthly sales, daily temperature).
2. Trend: The long-term upward or downward movement in the data over time.
3. Seasonality: Regular, repeating patterns or cycles in the data over specific periods
(e.g., higher sales during holidays, winter months).
4. Noise: Random fluctuations in the data that do not follow any specific pattern.
5. Stationarity: When the statistical properties of a time series (mean, variance) remain
constant over time. Most forecasting models require the data to be stationary.
6. Autocorrelation: A measure of how correlated a time series is with its own past
values (lagged values).
1. Data Collection: Gather historical data with time intervals (e.g., daily, monthly).
2. Data Preprocessing:
o Check for Stationarity: Ensure the data’s mean and variance do not change
over time. If not, apply transformations like Differencing.
o Decompose the Time Series: Break it down into trend, seasonality, and
residual (noise) components (see the sketch after this list).
3. Model Selection: Choose a model suited to the data (e.g., linear regression,
exponential smoothing, ARIMA).
4. Model Training: Fit the chosen model to the historical data.
5. Forecasting: Use the trained model to predict future values based on past data.
6. Evaluation:
o Performance Metrics: Use metrics like R², Mean Absolute Error (MAE), Root
Mean Squared Error (RMSE) to assess the model’s accuracy.
7. Prediction & Deployment: Make predictions for the future, and update the model
periodically as new data becomes available.
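A sketch of the stationarity check and decomposition steps above, using statsmodels (synthetic monthly data with a trend and yearly seasonality):

import numpy as np
import pandas as pd
from statsmodels.tsa.seasonal import seasonal_decompose
from statsmodels.tsa.stattools import adfuller

idx = pd.date_range("2020-01-01", periods=48, freq="MS")
y = pd.Series(100 + 2 * np.arange(48)
              + 10 * np.sin(2 * np.pi * np.arange(48) / 12), index=idx)

# Decompose into trend, seasonal, and residual components
result = seasonal_decompose(y, model="additive", period=12)

# Augmented Dickey-Fuller test: small p-value (< 0.05) suggests stationarity
p_value = adfuller(y)[1]
if p_value >= 0.05:
    y_stationary = y.diff().dropna()  # difference to remove the trend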
R² (R-Squared): Measures how well the model explains the variance in the data
(value between 0 and 1).
MAE (Mean Absolute Error): Measures the average of absolute errors (easy to
interpret).
RMSE (Root Mean Squared Error): Measures the square root of the average squared
differences between predicted and actual values (penalizes larger errors).
Summary:
Time series forecasting is about predicting future values based on patterns in historical data.
You need to understand the trend, seasonality, and noise in the data. Various models can be
used based on the complexity of the data, and performance can be evaluated using metrics
like R², MAE, and RMSE.