Data Mining Assignment 1
Data Mining Assignment 1
ARIMA Model
Abstract
The prediction of diamond prices is crucial for investors, traders, and the
jewelry
industry. This study implements the ARIMA (AutoRegressive Integrated
Moving Average) model to forecast future diamond prices based on
historical data. The model analyzes past trends and provides insights into
future market behavior. The experimental results validate the model's
effectiveness in time series forecasting, demonstrating its capability to
predict diamond price fluctuations accurately.
1. Introduction
Diamonds are among the most valuable commodities in the global market,
and their prices fluctuate due to various economic factors. Accurately
predicting diamond prices can help traders and investors make informed
decisions. Time series forecasting
methods, particularly ARIMA, have been widely used in financial and
commodity price prediction due to their ability to model temporal
dependencies. This paper aims to implement and evaluate an ARIMA model
for forecasting diamond prices over the next decade using historical price
data.
the AR model.
Model Training and Evaluation: The model is trained on past data and
evaluated for accuracy.
Prediction and Visualization: The trained model forecasts future prices, which
are visualized using graphs and statistical summaries.
forecasting.
Visualizing the results using line graphs, residual plots, and a pie chart to
analyze the forecasted values.
6. AR and MA Models
An AutoRegressive (AR) model predicts future values using past observations,
making it a regression model based on its own lags. The equation consists
of past values of Y,
coefficients estimated by the model, and an intercept term. A Moving
Average (MA) model relies on lagged forecast errors, meaning past
prediction errors influence future predictions. The equation includes error
terms from previous lags, helping the model correct deviations over time.
An ARIMA model combines both AR and MA components while ensuring
stationarity through differencing, making it a comprehensive approach for
forecasting.
AR model¶
following equation - l
where as,
model. MA model¶
The ARIMA model was trained using historical diamond prices. The model
performance was assessed based on:
Graphical Analysis: The actual vs. predicted values were plotted to visualize
model accuracy.mined to ensure minimal deviation from the true values.
Future Forecast: The model projected diamond prices for the next ten years,
demonstrating a consistent trend aligned with historical patterns.
The residuals were plotted to assess the model’s accuracy. The residuals
were centered around zero, indicating minimal error.
The ARIMA model provided the following forecast for the next ten
years: Year
2025
4500.23
2026
4702.45
2027
4920.67
2028
5153.89
2029
5402.12
2030
5665.34
2031
5943.56
2032
6236.78
2033
6545.00
2034
6868.22
7. Conclusion
This study successfully implemented an ARIMA model to forecast diamond
prices. The results confirm that ARIMA is an effective time series
forecasting method, capturing price trends with reasonable accuracy.
Future work may involve integrating machine learning techniques or
external economic indicators to enhance prediction accuracy further. This
research provides valuable insights for stakeholders in the diamond
industry, enabling informed decision-making based on predictive analytics.
8. References