EV Charge Paper forIEEEBDS2023
EV Charge Paper forIEEEBDS2023
net/publication/370978369
CITATIONS READS
3 1,443
5 authors, including:
Jerry Gao
San Jose State University
352 PUBLICATIONS 5,880 CITATIONS
SEE PROFILE
All content following this page was uploaded by Jerry Gao on 23 May 2023.
Arul Palaniappan, Purnima Bhukya, Sai Kiran Chitti,Vinupriya Sanjay Kumar Jerry Gao
Department of Applied Data Science Department of Applied Data Science,
San Jose State University Department of Computer Engineering
San Jose, United States San Jose State University
(arul.palaniappan, purnima.bhukya, saikiran.chitti,vinupriya.sanjaykumar)@sjsu.edu San Jose, United States
Corresponding Author:
[email protected]
Abstract—To reduce the current crisis on energy and environ- emissions come from road vehicles. The biggest way to solve
mental problems, the adoption of electric vehicles is an effective this specific problem is for consumers to start buying electric
way. But limited access and availability of charging infrastructure vehicles (EVs) because EVs are battery powered and do not
are restricting the EV market development. This paper presents
data-driven analysis of current EV charging infrastructure to emit any GHGs when generating energy to move.
plan future EV infrastructure in the city. This paper tackles However, there are a few problems that act as barriers for
the problem using three approaches. The first approach that is the growth of electric vehicles. One of the main problems
covered in this paper is to predict the number of EV charging that must be addressed as people start switching to EVs is
stations that need to be implemented in a geographic location.
This is done using the SARIMAX model on the California EV to implement better EV infrastructure in the cities and near
charging station dataset that is preprocessed to be split into zip highways, because people need to charge their vehicle in the
codes. The second approach that is tackled by this paper is the middle of a long trip just as they would be refueling a gasoline
energy consumption prediction by an EV charging station. This powered vehicle. A positive to EVs is that most of them are
information is useful to allow planners to understand how much sold with a Level 1 charger that can be plugged into an outlet
energy is consumed by an EV station across multiple different
time periods. An optimized Meta-Fusion regression model based at home, so some people can use this to charge at home. But
on a stacked model is proposed for this approach and is applied this is the slowest method of charging and is not useful for
to the Palo Alto charging station dataset. The third approach long road trips. Another problem with charging EVs is that
utilized the K-means clustering algorithm on the Palo Alto it takes much more time to charge an electric vehicle to full
charging station dataset to predict the optimal location for EV capacity than it takes a gasoline-powered vehicle to refuel at
charging stations in that city. The SARIMAX mode had an R2
score of 0.97, a MAE of 6.50, and RMSE of 17.43 when predicting a gas station. There are currently three types of EV chargers,
EV station counts in the zip code 92101. The stacked model had and the following table shows the features of each type. The
an R2 score of 0.98, a MAE of 0.91 and an RMSE of 2.26 when variations in the times listed in the table below are caused by
predicting the daily energy consumption by an EV station. the different battery types used in each vehicle. In our project,
Keywords—Big Data, EV Charging Infrastructure, EV Charg- we are primarily focusing on Level 2 and DC fast charging
ing, Data-Driven Analysis, EV Electricity Prediction, Machine stations because those are the fastest type of chargers available.
Learning.
TABLE I
I. I NTRODUCTION OVERVIEW OF E LECTRIC C HARGES (USDOT)
Climate change is one of the most pressing issues in the Type Level 1 Level 2 Level 3(DC Fast)
world today. This issue is mainly due to the large amounts Power 1 kW 7 - 19 kW 50 - 350 kW
of carbon dioxide and other greenhouse gasses (GHG) being
Time Taken to
let into the air. According to the NIEHS [1], air pollution 20 - 60 min
Charge a Battery 40 - 50 hours 4 -10 hrs
causes over 6.5 million deaths each year globally. According (0 to 80% cap)
from 0% Capacity
to the WHO, this is because 99% of the population in the
Range per Hour
world is breathing in air that exceeds their limits on breathable 2 - 5 miles 10 - 20 miles 180 - 240 miles
of Charge
air. One of the major contributors of GHG is emissions from
Home ,
road transportation. Road transportation uses a lot of fossil Typical Locations
Home Office , Public
fuels and emits GHG when the fuels are converted into energy. Placed
According to Gurwa et. al (2019)[2], 27% of the world’s GHG Public
The above problems motivated us to come up with a solution our paper, a stacked model which combines multiple different
for the planning of EV infrastructure with an AI powered ensemble models, including the Random Forest and XGBoost
model using Machine Learning and Deep Learning Tech- is introduced and compared among the standard models.
niques. The main objective of this research is to create a Erotokritos Xydas et.al (2015)[7] used EV charging event
model that can optimally predict where EV infrastructure can data, EV charging station data and weather data. First, they
be placed in a city. The model will take into consideration did some manual data preprocessing to split the data into a
various factors related to EV charging such as the location of monthly and an hourly time series. They also combined the
current stations, EV energy consumption. weather data into some of the EV data into a daily time
This research also considered the weather data to understand series. Each of the series was sent to a different module
how EV charging demand is affected by weather. The optimal for data mining. The correlation model finds the correlation
EV infrastructure network can cater to the charging needs and between weather data and peak daily usage, the regression
reduce the life cycle costs of the transportation system. Recent model finds the patterns for monthly charging demands, and
advancement in machine learning helps analyze present data the clustering model finds the charging demands based on
and come up with efficient solutions that solve the planning geographic location. All three modules are fed into a fuzzy
of charging infrastructure in a city. It can help in planning and characterization model to combine each of the series into a
management such as prediction of location of new charging charging demand risk level. The problem with this paper is that
infrastructure, demand for charging and scheduling. Hence, they didn’t use multiple models to evaluate the performance.
knowledge of machine learning models and AI is a key In this paper, a stacked model, which combines multiple
solution for enhancing the sustainability of EV transportation. models is introduced and is compared to the individual models
Our project can be used for a few applications. We expect separately to evaluate which model was better and confirmed
other cities around the world to expand on our model and that stacked model performs better.
use it to build new EV infrastructure in their cities. Another Table II here gives a list of papers we researched, and it
application that can be created with the help of our model is includes their objectives, Region, different types of datasets
a mobile application that can be used to help owners to get they used, models, and evaluation metrics.
the various charging options when they don’t have access to
charging stations.
B. Technology Survey
This paper is divided into the following sections. Section II 1) Current Live Technologies: There are few companies
describes a literature survey of previous related work. Section that have created a software to plan EV charging infrastructure
III provides an overview of the technical survey and prior art. around various places in the United States. Almost all these
Section IV gives an overview of the Methodology, which is software are not available to the public and are proprietary. We
subdivided into Dataset, Data Preprocessing which is again are not able to get information on how they are implemented.
subdivided as Data Cleaning and Data Merging. Section V The National Renewable Energy Laboratory (NREL) created
discusses the machine learning model used in the project for one of these tools called Electric Vehicle Infrastructure –
load forecasting, forecasting of energy consumption and loca- Projection Tool (EVI-Pro)[28].
tion recommendation. Section VI details the results obtained
by the machine learning model. Section VII describes future The website states that users can use the software to analyze
improvements to the project and concludes with a summary travel patterns, charging demand, and design the infrastructure
of the project. capable of meeting demand. This tool was used to help analyze
the requirements of EV infrastructure planning in the entire
II. R ELATED W ORK state of California. The US Department of Energy has a tool
A. Literature Survey called EVI-Pro Lite which is based on EVI-Pro and is available
In this section, we have reviewed many papers related to for public use[29]. This is a web tool that allows users to plan
Electric vehicles, The EV charging infrastructure, types of the number of charging stations required in a city or state.
EV chargers to understand how previous research has been This tool has few pre-selected urban areas. Once you select
conducted in this area and how to proceed with our research. the location, then you enter in the number of vehicles that you
Mariz B et.al(2016)[6] analyzed the characteristics of EV want to support there, and then click on the Calculate button.
charging demand and identified the factors that affect it, such It will do some calculations and show you the number of Level
as weather conditions, time of day, and location. They use data 2 and Level 3 charging stations that need to be implemented
preprocessing to group their traffic data into clusters and then in the location. There are some parameters related to the
plot the traffic volumes across the time of day for each cluster. amount and types of EVs to support in the location that can be
Then, they used relational analysis to find the relationship modified on the results page before recalculating the results.
between the features in the weather dataset. Finally, they used We also did some research on a couple of methods that are
a decision tree to combine all these things to plot the graphs of used to plan EV infrastructure in a region which are listed
charging demand in a day. The problem with their data is that in Table III. The first paper talks about methods used to
they didn’t use different models to compare their results. In plan for locating where future EV infrastructure would go.
TABLE II
C OMPARISION OF E XISTING R ESEARCH PAPERS
Ref Objective Region Dataset ML Models Deep Learning Other Models Eval Metrics
Calculating the daily charging Forecasting model with
Traffic
[6] demand of EV stations South Korea N N big data technologies Compared to standard
Weather models
based on weather and traffic data P-median
Calculate the energy demand
risk level of an EV charging station Charging Event
[7] United Clustering N N Corealation
wrt to Charging event and Kingdom Charging Station Coefficient
station data.
Include supply and demand EV ranges
Set and Max Covering
[8] constraints when planning new Beijing Population N N Best solution satisfy-
P-median ing restrictions
EV charging stations Govt budget constraints
Using allocation objectives Location- Compared to standard
[9] Hong Kong EV usage rate survey N N
to plan EV infrastructure Allocation model
RMSE
Spatial and temporal information Spatial
[10] Dundee, Ireland EV infrastructure N N MAE
to predict EV station availability Temporal CGN
R2̂
ELaadNL
Predictive model for estimating LinReg RMSE
[11] Netherlands Google Maps N N
the EV infrastructure usage XGBoost MAE
Nokia HERE
Accuracy
EVnetNL LogReg
Evaluate factors affecting EV Precision
[12] Netherlands OpenGIS Rand Forest N N
infrastructure popularity Sensitivity
EV Charging Records GBRT
MCC
Predict energy demand for next KNN Pattern Sequence
[13] UCLA EV Charging Records N Accuracy
24 hours at each charging outlet ARIMA Forecasting
EV Infrastructure planning based Shanghai International Mixed- Best model
[14] Shanghai N N
on user travel data Motor City Integer Nonlinear given constraints
CEC Rand Forest Precision
Bridge the gap for EV access
[15] Orange County Energy Dept Multi-Log N N Recall
between the wealthy and the poor
U.S. Census Regression F1-score
EMD- MAE
Predict charging demand and EV
[16] Georgia Tech Charging station usage N AOA- N RMSE
infrastructure placement
LSTM Accuracy
Data-driven long-term EV Traffic Flow
[17] Sydney N CNN Cournot Competition Model
infrastructure planning Distribution Network
Models classifying EV charging Charging station BERT Accuracy
[18] United States N N
user reviews user reviews XLNet F1 score
RNN MSE
Predicting charging duration at Rabat,
[19] Real-time EV station N LSTM N RMSE
public chargers Morocco
GRU MAE
LinReg RMSE
[20] Predicting PEV charging demand Nebraska Level 2 EV stations GBRT N N MAE
SVM R2̂
Infrastructure planning for
[21] Vienna Taxi dispatch center N N Decision Support System
electric taxi vehicles
Accuracy
LogReg
Describe the urban context and Precision
[22] Netherlands EVnetNL Rand Forest N N
activities near EV infrastructure Sensitivity
GBRT
MCC
Minimize the total charging cost LTA CSPP Most EVs
[23] Singapore N N
of EVs Stats Dept Nash Transfer Fit Budget
CG - SMV RMSE
Comparing 3 ML models to Agartala,
[24] Traffic Flow Bagged and N N MAE
understand EV charging demand India
Boosted Tree R2̂
Precision
Predict charging station Logistic
[25] Italy EV Charge Transactions N N Recall
availability using ML Big Data Regression
F1 score
Ensemble RMSE
Predicting charging times for
[26] Japan Auto Research Institute Rand Forest N N MAE
different EV charging modes
Grad Boost R2̂
Mixed-
EV planning from the perspective Transport & Highest investor
[27] China Integer NLP N N
of private investor Sioux-Fall Networks profit
Huff model
TABLE III
A PPROACHES TO PLAN FUTURE EV INFRASTRUCTURE
It describes the three different approaches used for locating Electric vehicles, and the amount of Energy that is consumed
charging infrastructure. The second paper talks about the EV by each charging transaction. More information about these
infrastructure planning with respect to the type of vehicles and datasets can be found in the table below.
how to utilize the charging to its maximum level.
TABLE IV
III. DATA E NGINEERING DATASET C OLLECTION
A. Data Collection San Jose Vehicle Palo Alto
The primary focus of this research is on the state of Dataset EV Charging population EV Charging
California, specifically in the cities and zip codes of San Jose stations data - zip codes Station Usage data
and Palo Alto. We have decided to use three different datasets Data type csv xlsx csv
to solve our problems. The first dataset is from the Alternative Size 15.7 MB 4.4 MB 13.3MB
Fuels Data Center website from the US Department of Energy. No of rows 14444 172582 259,443
TABLE V
E VALUATION OF EV CHARGING STATION COUNT PREDICTION
VI. C ONCLUSION
This paper presents three different approaches to address the
challenges of planning new EV charging station infrastructure.
The first challenge that was tackled is predicting the number of
EV charging stations that should be placed in a particular zip
code in the next three years. This was done by calculating the
Fig. 10. Test and Prediction data results on Monthly Basis historical number of EV charging stations in the zip code and
then using the SARIMAX model to forecast the future counts.
The second challenge which was addressed is predicting how
Alto and applied the K-means clustering algorithm to identify much energy (kWh) is consumed by a charging station in
the clusters of high EV traffic and low EV charging station daily, weekly, and monthly time periods based on different
density. weather conditions. The proposed stacked model, called Meta
The centroids of each individual cluster are identified, and Fusion Regression, provided the best prediction results. The
the count of EV charging stations needed is calculated by third challenge utilized the K-means clustering algorithm to
dividing the total EV charging time by the number of unique predict the optimal locations for EV charging stations.
EV charging stations identified in that cluster. The obtained Overall, the above approaches helped to provide valuable
number is then rounded up to the nearest integer. The results information in planning new EV charging infrastructure in a
obtained are plotted over a Palo-Alto centered map which was zip code. In the future, live traffic data can be incorporated
created using the Folium library. into the approaches in order to provide better understanding
As shown in the below figure, the current EV charging of how drivers behave when they want to charge their electric
stations are displayed using green icons, and the predicted vehicles.
EV charging stations are displayed using red icons. R EFERENCES
[1] International Symposium “Transforming Mobility for Humanity” Com-
memorating the 50th Anniversary of The Club of Rome “The Limits to
Growth” 14 December 2022 Venue: Chulalongkorn University, Bangkok
/ Online
[2] N. Garwa and K. R. Niazi, “‘impact of EV on integration with grid
system – A review,’” in 2019 8th International Conference on Power
Systems(ICPS), 2019, pp. 1–6
[3] “Electric vehicle charging speeds,” Transportation.gov. [Online]. Avail-
able: https://www.transportation.gov/rural/ev/toolkit/ev-basics/charging-
speeds.[Accessed: 22-Oct-2022].
[4] “The current state of EV battery swapping,” Nio.com. [Online]. Avail-
able: https://www.nio.com/blog/current-state-ev-battery-swapping. [Ac-
cessed: 19-Oct-2022].
[5] N. I. Nimalsiri, E. L. Ratnam, C. P. Mediwaththe, D. B. Smith, and S. K.
Halgamuge, “Coordinated charging and discharging control of electric
Fig. 11. Optimal EV Charging station Location Prediction vehicles to manage supply voltages in distribution networks: Assessing
the customer benefit,” Appl. Energy, vol. 291, no. 116857, p. 116857,
2021.
[6] Mariz B. Arias, Sungwoo Bae, Electric vehicle charging de- Electrical Engineering and 2021 IEEE Industrial and Commercial Pow-
mand forecasting model based on big data technologies, Ap- erSystems Europe (EEEIC / ICPS Europe), 2021.
plied Energy, Volume 183, 2016, Pages 327-339, ISSN 0306-2619, [26] I. Ullah, K. Liu, T. Yamamoto, M. Zahid, and A. Jamal, “Prediction of
https://doi.org/10.1016/j.apenergy.2016.08.080. electric vehicle charging duration time using ensemble machine learning
[7] Erotokritos Xydas, Charalampos Marmaras, Liana M. Cipcigan, Nick algorithm and Shapley additive explanations,” Int. J. Energy Res., vol.
Jenkins, Steve Carroll, Myles Barker, A data-driven approach for char- 46, no. 11, pp. 15211–15230, 2022.
acterising the charging demand of electric vehicles: A UK case study, [27] Y. Zhao, Y. Guo, Q. Guo, H. Zhang, and H. Sun, “Deployment of the
Applied Energy, Volume 162, 2016, Pages 763-771, ISSN 0306-2619, electric vehicle charging station considering existing competitors,” IEEE
https://doi.org/10.1016/j.apenergy.2015.10.151. Trans. Smart Grid, vol. 11, no.5, pp. 4236–4248, 2020.
[8] S. Y. He, Y.-H. Kuo, and D. Wu, “Incorporating institutional and spatial [28] “EVI-Pro: Electric Vehicle Infrastructure – Projection Tool,” Nrel.gov.
factors in the selection of the optimal locations of public electric vehicle [Online]. Available: https://www.nrel.gov/transportation/evi-pro.html.
charging facilities: A case study of Beijing, China,” Transp. Res.Part C [Accessed: 25-Oct-2022].
Emerg. Technol., vol. 67,pp. 131–148, 2016. [29] “Electric vehicle infrastructure projection tool (EVI-pro) lite,”
[9] S. Y. He, Y.-H. Kuo, and K. K. Sun, “The spatial planning of public Energy.gov. [Online]. Available: https://afdc.energy.gov/evi-pro-
electric vehicle charging infrastructure in a high-density city using a lite.[Accessed: 05-Oct-2022].
contextualised location-allocation model,” Transp. Res. Part A Policy [30] M.-O. Metais, O. Jouini, Y. Perez, J. Berrada, and E. Suomalainen,
Pract., vol. 160, pp. 21–44, 2022. “Planning EV charging infrastructures: A literature review,” in 23rd
[10] R. Luo et al.,“Deep learning approach for long- term prediction of EUROWorking Group on Transportation Meeting, EWGT 2020, 2020.
electric vehicle (EV) charging station availability,” in 2021IEEE Inter- [31] M. O. Metais, O. Jouini, Y. Perez, J. Berrada, and E. Suomalainen, “Too
national Intelligent Transportation Systems Conference (ITSC), 2021. much or not enough? Planning electric vehicle charging infrastructure:
[11] D. Pevec, J. Babic, M. A. Kayser, A. Carvalho, Y. Ghiassi-Farrokhfal, A review of modeling options,” Renew. Sustain. Energy Rev., vol. 153,
and V. Podobnik, “A data-driven statistical approach for extending elec- no. 111719, p. 111719, 2022.
tric vehicle charging infrastructure: A data-driven statistical approach for [32] California Energy Commission, “Zero emission vehicle and in-
extending electric vehicle charging infrastructure,” Int. J. Energy Res., frastructure statistics,” California Energy Commission. [Online].
vol. 42, no. 9, pp. 3102–3120, 2018. Available: https://www.energy.ca.gov/data-reports/energy-almanac/zero-
[12] M. Straka et al., “Predicting popularity of electric vehicle charging emission-vehicle-and-infrastructure-statistics. [Accessed: 13-Oct-2022].
infrastructure in urban context,” IEEE Access, vol. 8, pp. 11315–11327, [33] “Electric Vehicle Charging Station usage,” [Online]. Available:
2020. https://data.cityofpaloalto.org/dataviews/257812/electric-vehicle-
[13] M. Majidpour, C. Qiu, P. Chu, R. Gadh, and H. R. Pota, “Modified pat- charging-station-usage-july-2011-dec-2020/. [Accessed: 18-Oct-2022]
tern sequence-based forecasting for electric vehicle charging stations,” [34] G. Lawton, “Data preprocessing: Definition, key steps and
in 2014 IEEE International Conference on Smart Grid Communica- concepts,” SearchDataManagement, 31-Jan-2022. [Online]. Available:
tions(SmartGridComm), 2014. https://www.techtarget.com/searchdatamanagement/definition/data-
[14] Pan, A., Tian, M., Tang, B., amp; Yang, X. (2019). Urban Electric preprocessing. [Accessed: 06-Nov-2022].
Vehicle (EV) charging point design based on User Travel Big Data. IOP [35] B. Artley, “Time Series Forecasting with Arima , Sarima
Conference Series: Earth and Environmental Science, 300(4), 1–10. and SARIMAX,” Medium, 27-Jun-2022. [Online]. Available:
[15] A. Roy and M. Law, “Examining spatial disparities in electric vehicle https://towardsdatascience.com/time-series-forecasting-with-arima-
charging station placements using machine learning,” Sustain. Cities sarima-and-sarimax-ee61099e78f6. [Accessed: 24-Feb-2023].
Soc., vol. 83, no. 103978, p. 103978, 2022. [36] “Simple model stacking, explained and automated.” [Online]. Avail-
[16] J. Shanmuganathan, A. A. Victoire, G. Balraj, and A. Victoire, “Deep able: https://towardsdatascience.com/simple-model-stacking-explained-
learning LSTM recurrent neural network model for prediction of electric and-automated-1b54e4357916. [Accessed: 7-Nov-2022]
vehicle charging demand,” Sustainability, vol. 14, no. 16, p. 10207, 2022. [37] “Improving Collaborative Filtering With Clustering.” [Online].
[17] C. Li, Z. Dong, G. Chen, B. Zhou, J. Zhang, and X. Yu, “Data- Available: https://medium.com/@jwu2/improving-collaborative-
driven planning of electric vehicle charging infrastructure: A case study filtering-with-clustering-88c63bdae7cc
of Sydney, Australia,” IEEE Trans. Smart Grid, vol. 12, no. 4, pp.
3289–3304, 2021.
[18] O. I. Asensio, D. J. Marchetto, S. Ha, and S. Dharur, “Extracting user
behavior at electric vehicle charging stations with transformer deep
learning models,” in CARMA 2020 - 3rd International Conference on
Advanced Research Methods and Analytics, 2020.
[19] B. Mouaad, M. Farag, B. Kawtar, K. Tarik, and Z. Malika, “A deep
learning approach for electric vehicle charging duration prediction at
public charging stations: The case of Morocco,” ITM Web Conf., vol.
43, p. 01024, 2022.
[20] A. Almaghrebi, F. Aljuheshi, M. Rafaie, K. James, and M. Alahmad,
“Data-driven charging demand prediction at public charging stations
using supervised machine learning regression methods,” Energies, vol.
13, no. 16, p. 4231, 2020.
[21] J. Asamer, M. Reinthaler, M. Ruthmair, M. Straub, and J. Puchinger,
“Optimizing charging station locations for urban taxi providers,” Transp.
Res. Part A Policy Pract., vol.85, pp. 233–246, 2016.
[22] S. Deb, K. Tammi, K. Kalita, and P. Mahanta, “Charging station
placement for electric vehicles: A case study of Guwahati city, India,”
IEEE Access, vol. 7, pp.100270–100282, 2019.
[23] Y. Xiong, J. Gan, B. An, C. Miao, and A. L. C. Bazzan, “Optimal
electric vehicle fast charging station placement based on game theoret-
ical framework,” IEEE Trans. Intell. Transp. Syst., vol. 19, no. 8, pp.
2493–2504, 2018.
[24] D. Adhya, A. Pal, A. K. Chakraborty, and A. Bhattacharya, “Machine
learning application for prediction of EV charging demand for the
scenario of agartala, India,” in 2022 4th International Conference on
Energy, Power and Environment (ICEPE), 2022.
[25] F. Soldan, E. Bionda, G. Mauri, and S. Celaschi, “Short-term forecast
of electric vehicle charging stations occupancy using big data streaming
analysis,” in 2021 IEEE International Conference on Environmentand