Non-Intrusive Load Monitoring (NILM) Using Deep
arXiv:2306.05017v1 [eess.SP] 8 Jun 2023

Abstract: Demand-side management now encompasses more residential loads. To efficiently apply demand response strategies, it's essential to periodically observe the contribution of various domestic appliances to total energy consumption. Non-intrusive load monitoring (NILM), also known as load disaggregation, is a method for decomposing the total energy consumption profile into individual appliance load profiles within the household. It has multiple applications in demand-side management, energy consumption monitoring, and analysis. Various methods, including machine learning and deep learning, have been used to implement and improve NILM algorithms. This paper reviews some recent NILM methods based on deep learning and introduces the most accurate methods for residential loads. It summarizes public databases for NILM evaluation and compares methods using standard performance metrics.

Index Terms: Smart Grids, NILM, Deep Learning, Energy Management.

I. INTRODUCTION

The non-intrusive load monitoring (NILM) method has gained popularity in recent years as a way to monitor the energy usage of appliances and electrical utilities in buildings, together with their on/off events, using a single energy meter. If consumers had data on appliance-level energy usage, they could better understand their energy consumption behavior and take action to reduce it. The aim of this study is to present an overview of the latest algorithms currently being investigated by researchers to create a precise NILM method for effective energy management. The article discusses the potential applications of NILM across different fields, along with future research objectives. The development of sustainable and smart cities has been made possible by advancements in artificial intelligence (AI), smart meters, the internet of things (IoT), and smart grids [1], [2]. Effective energy management is a crucial component of sustainable city development, which aims to utilize resources responsibly, protect the environment, and enhance society's well-being. The objective of energy management is to promote energy system self-reliance and sustainability [1].

Energy management involves monitoring and controlling electrical utilities to optimize energy use and reduce consumption. However, with the increase in energy needs, energy conservation has become a challenge in recent years [3]. Greater energy use can lead to an energy crisis, climate change, and a negative impact on the economy [4]. It is estimated that the rise in carbon emissions will increase global temperatures by 2.5 to 10 °C this century, causing more frequent floods, droughts, a rise in sea level, and the spread of infectious illnesses [5]. Therefore, it is essential to reduce carbon emissions across all sectors, including construction, industry, and transportation, to mitigate climate change. Researchers are working on developing technology solutions for energy conservation [3]. Buildings are one of the major contributors to energy consumption [6], with energy consumption in this sector steadily increasing over time. In order to mitigate carbon emissions, optimizing energy consumption in residential and commercial buildings is crucial. This can be achieved through the construction or design of energy-efficient structures, as well as improving energy usage in existing buildings.

The paper is organized as follows. Section II introduces the mathematical definition of the NILM problem. Section III discusses deep learning-based NILM methods. Section IV provides a summary of the public NILM datasets. Section V presents a comparison study of NILM methods, and finally, Section VI concludes the paper.

II. NILM PROBLEM DEFINITION

A. Mathematical Problem Definition

The problem can be described as follows: at a given time $t$, the total active power consumed by a system is represented by $y(t)$, while $y_i(t)$ represents the active power consumed by the $i$th appliance at the same time. The overall load is the sum of the power consumed by individual appliances and an unmeasured residual load, expressed as:

$$y(t) = \sum_{i=1}^{N} y_i(t) + e(t), \tag{1}$$

where $N$ denotes the number of appliances considered, and $e(t)$ represents the undetermined residual load. The aim is to estimate $F(y(t))$ by determining the values of $y_i(t)$, given only the value of $y(t)$, as:

$$y_1(t), y_2(t), \ldots, y_i(t), \ldots, y_N(t) = F(y(t)), \tag{2}$$
where F is an operator that produces N distinct values when
applied to the total active power. These numbers represent
the most accurate estimate of the power consumed by each
appliance. It should be noted that $y_i(t)$ typically does not
reflect the entire set of home appliances but rather a subset of
them. As a result, the unknown term e(t) takes into account
the loads caused by unmonitored appliances. If simultaneous
measurements of the aggregate consumption and load of each
appliance are available, approximating the F operator can
be considered a supervised learning problem. When mainly concerned with activation times and cumulative consumption, as is the case in real situations, the estimated individual appliance consumption $\hat{y}_i(t)$ can be obtained using functions that are constant over the device's activation period:

$$\hat{y}_i(t) = p_i \, \hat{a}_i(t), \tag{3}$$

where $p_i$ represents the average consumption of appliance $i$, and $\hat{a}_i(t)$ represents an estimate of the activation state of that appliance at time $t$. Its value is one if the device is in use and consuming energy, and zero otherwise. Therefore, starting from the aggregate load, the task becomes deriving the most accurate feasible estimate of the activation state of the appliances:

$$\hat{a}_1(t), \hat{a}_2(t), \ldots, \hat{a}_i(t), \ldots, \hat{a}_N(t) = F_a(y(t)). \tag{4}$$

After learning the average nominal consumption of the considered equipment, one can use Equation (3) to estimate consumption.

B. Appliance types

Based on their operational characteristics, appliances can be classified into four types as discussed in [7]. Type I appliances have two modes of operation, on and off. These include appliances such as kettles, toasters, and light bulbs, which consume energy only when turned on. Type I appliances are predominantly resistive with few linear reactive components. Type II appliances are characterized as multi-state or finite state machines with a limited number of operational states that may be run repeatedly. Changes in these appliances' states can be observed by monitoring the falling and rising edges of the power consumption over time. Stove burners, refrigerators, and washing machines are some examples of Type II appliances [7], [8]. Figure 1 demonstrates the distinct appliance operation conditions.

Fig. 1: A pictorial representation illustrates the different types of appliances categorized based on their operating states.

Type III appliances, also known as Continuously Variable Devices (CVDs), exhibit a non-repetitive power usage pattern, which poses a challenge for energy consumption disaggregation. Examples of Type III appliances include power drills and dimmer lights [8]. Type IV appliances are those that run continuously for extended periods of time, typically lasting several days or weeks. Examples of Type IV equipment include wireless telephone devices and cable TV receivers [8]. Therefore, the NILM system is required to have the ability to differentiate between various types of appliance events, which may happen concurrently or independently and at varying time intervals.

III. DEEP LEARNING BASED NILM METHODS

NILM techniques can be broadly categorized into two groups: supervised and unsupervised methods [9]. In supervised NILM, individual appliance power usage is used to train the models. On the other hand, unsupervised methods can only utilize aggregate power usage data. Examples of unsupervised NILM techniques include hidden Markov models (HMM) [10], [11], factorial HMM (FHMM) [12], [13], and techniques based on event detection and clustering [14], [15]. These techniques have been thoroughly examined in previous studies [9], [16]. With the advent of deep neural networks (DNNs), many neural network-based supervised NILM techniques have been developed [17], [18]. Convolutional neural networks (CNN) have also recently made significant advances [19], [20]. Graph signal processing [21], HMM [12], [13], [15], [22], [23], and DNNs [24], [25] are commonly used in proposed NILM approaches. As the cost of employing appliance data for training has grown dramatically, researchers have focused on developing unsupervised approaches and incorporating appliance models. Despite the significant progress made in NILM research in recent years, challenges remain in terms of application, identification accuracy, training time, and online deployment techniques in smart metering frameworks.

A. Event-Based Non-Intrusive Detection

The event-based NILM method is based on the concept of detecting and categorizing events within a combined electrical signal. Figure 2 shows the block diagram of this approach. A robust event detector should cope with noisy fluctuations and identify events with decay and growth patterns, which is a bottleneck and inherent difficulty in existing event detectors [26]. One approach includes the steps of event detection, extraction, clustering, and matching in the event-based block [27]. It should be noted that the accuracy of previous event-based frameworks depends on the power features that are introduced. Since some appliances may have identical active power curves but radically distinct reactive power trends, increasing the number of features can enhance the accuracy of the appliance model, particularly for non-linear
Fig. 2: Block diagram scheme of event-based NILM.
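As an illustration of how the event-based idea connects to Equations (3) and (4), the following minimal sketch detects rising and falling edges in an aggregate power signal, turns the edges that match one two-state (Type I) appliance into an activation state, and multiplies that state by the nominal power. It is a simplification, not the pipeline of [27]: the clustering step is replaced by a direct match against an assumed nominal power. The function names, the 50 W detection threshold, the 100 W tolerance, and the sample signal are all hypothetical.

```python
from typing import List, Tuple


def detect_events(power: List[float], threshold: float) -> List[Tuple[int, float]]:
    """Return (index, step) for each sample-to-sample change of at least `threshold` watts."""
    events = []
    for t in range(1, len(power)):
        step = power[t] - power[t - 1]
        if abs(step) >= threshold:
            events.append((t, step))
    return events


def activation_state(n_samples: int, events: List[Tuple[int, float]],
                     nominal_power: float, tolerance: float) -> List[int]:
    """Binary activation state for one two-state appliance.

    An edge whose magnitude is within `tolerance` of the nominal power is
    attributed to the appliance: a rising edge switches it on, a falling
    edge switches it off. All other edges are ignored.
    """
    steps = dict(events)
    state = []
    is_on = False
    for t in range(n_samples):
        step = steps.get(t)
        if step is not None and abs(abs(step) - nominal_power) <= tolerance:
            is_on = step > 0
        state.append(1 if is_on else 0)
    return state


def estimate_consumption(state: List[int], nominal_power: float) -> List[float]:
    """Equation (3): estimated power = nominal power * activation state."""
    return [nominal_power * a for a in state]


# Hypothetical aggregate signal in watts: 100 W base load, a 2000 W kettle,
# and an unmonitored 150 W load that ends up in the residual e(t).
aggregate = [100.0, 100.0, 2100.0, 2100.0, 2100.0, 100.0, 100.0, 250.0, 250.0, 100.0]

events = detect_events(aggregate, threshold=50.0)
kettle_state = activation_state(len(aggregate), events, nominal_power=2000.0, tolerance=100.0)
kettle_power = estimate_consumption(kettle_state, nominal_power=2000.0)
residual = [y - y_hat for y, y_hat in zip(aggregate, kettle_power)]

print(kettle_state)
print(residual)
```

A fixed step threshold is the simplest possible detector and would struggle with the noisy or slowly growing and decaying events mentioned above; it is used here only to keep the link between edges, activation states, and estimated consumption visible.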
Fig. 3: Outline of network structure for NILM by CNN.
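The comparison study reports two figures per appliance: the mean absolute error (MAE) in watts and an F1 score. The sketch below shows one common way to compute both. It assumes that F1 is taken over per-sample on/off states obtained by thresholding the power signal, which may differ from the exact definition used by the compared methods. The function names, the 50 W on-threshold, and the sample values are hypothetical.

```python
from typing import List


def mean_absolute_error(y_true: List[float], y_pred: List[float]) -> float:
    """Average absolute difference between true and estimated power, in watts."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


def to_state(power: List[float], on_threshold: float) -> List[int]:
    """Map power samples to on/off states (1 if at or above the threshold)."""
    return [1 if p >= on_threshold else 0 for p in power]


def f1_score(true_state: List[int], pred_state: List[int]) -> float:
    """Harmonic mean of precision and recall for the 'on' class."""
    tp = sum(1 for t, p in zip(true_state, pred_state) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(true_state, pred_state) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(true_state, pred_state) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical sub-metered and estimated power for one appliance, in watts.
true_power = [0.0, 0.0, 2000.0, 2000.0, 2000.0, 0.0]
pred_power = [0.0, 1900.0, 1900.0, 1900.0, 0.0, 0.0]

mae = mean_absolute_error(true_power, pred_power)
f1 = f1_score(to_state(true_power, 50.0), to_state(pred_power, 50.0))
print(mae, f1)
```

The two metrics answer different questions: MAE measures how far the estimated power is from the measured power, while F1 measures only whether on/off periods were identified, so a method can rank well on one and poorly on the other.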
B. Performance Evaluation and Comparison Study

In this part, the disaggregation findings for the NILM approaches using the REDD, UK-DALE, and REFIT datasets are shown. The performance indicators obtained by executing the tests on these datasets show that the best F1 results were obtained for refrigerators, air conditioners, freezers, televisions, and washing machines across the three datasets, with values greater than 0.70. Toasters and electronics, on the other hand, have lower F1 scores of roughly 0.25 owing to misclassification caused by the non-uniform pattern of these items.

The accuracy metrics of the findings were compared for published methods: on-line NILM [27]; NILM-TK [14], an FHMM implementation; and Neural-NILM [7], a DNN adaptation for energy estimation. Neural-NILM used three DNN architectures: i) long short-term memory, ii) de-noising auto-encoders, and iii) rectangles. Rectangle networks, in particular, regress the start time, end time, and average power of an appliance activation.

In experiments using the UK-DALE dataset, the on-line NILM, NILM-TK, and Neural-NILM methods are compared for five appliances (fridge, washing machine, dishwasher, microwave, and kettle). The microwave yields the lowest scores for all three methods. NILM-TK reports an MAE of roughly 195 W and an F1 score of about 0.01. Neural-NILM shows the best MAE, 6 W, with an F1 score of 0.21. With an F1 score of about 0.35, the on-line NILM method outperforms the other two. The MAE and F1 scores reported by NILM-TK are roughly 67 W and 0.55, respectively. Neural-NILM has an MAE of 18 W and an F1 score of 0.82. In terms of energy estimation, Neural-NILM outperformed the suggested technique, particularly for complicated equipment like dishwashers and washing machines. Nonetheless, training the neural network and generating the models takes considerable time and computational resources and requires a large amount of appliance-level data. The on-line NILM technique, on the other hand, can generate appliance models using aggregate data without the need for appliance-level sub-metered data.

VI. CONCLUSION

The three-phase distribution network often feeds the final residential customers through single-phase cables. It is important to keep the loads balanced. Demand-side management and different optimisation techniques would help with this. However, understanding the contribution of different appliances to energy consumption is beneficial. NILM is demonstrated to be a good approach to this end. The accuracy of NILM depends on the method applied. This paper reviewed some deep learning-based methods which outperform other existing NILM algorithms. The paper compared the results of applying these advanced methods to provide a basis for future implementation. The datasets considered are publicly accessible and widely used in the NILM literature. Several performance criteria are formulated to analyze the performance of the methods.

REFERENCES

[1] A. Garulli, S. Paoletti, and A. Vicino, “Models and techniques for electric load forecasting in the presence of demand response,” IEEE Transactions on Control Systems Technology, vol. 23, no. 3, pp. 1087–1097, 2014.
[2] A. Estebsari and R. Rajabi, “Single residential load forecasting using deep learning and image encoding techniques,” Electronics, vol. 9, no. 1, p. 68, 2020.
[3] R. Rajabi and A. Estebsari, “Deep learning based forecasting of individual residential loads using recurrence plots,” in 2019 IEEE Milan PowerTech. IEEE, 2019, pp. 1–5.
[4] N. Rezaei, R. Rajabi, and A. Estebsari, “Electricity price forecasting model based on gated recurrent units,” in 2022 IEEE International Conference on Environment and Electrical Engineering and 2022 IEEE Industrial and Commercial Power Systems Europe (EEEIC/I&CPS Europe), 2022, pp. 1–5.
[5] T. Wang, B. Shen, C. H. Springer, and J. Hou, “What prevents us from taking low-carbon actions? a comprehensive review of influencing factors affecting low-carbon behaviors,” Energy Research & Social Science, vol. 71, p. 101844, 2021.
[6] I. Ishak, N. S. Othman, and N. H. Harun, “Forecasting electricity consumption of malaysia’s residential sector: Evidence from an exponential smoothing model,” F1000Research, vol. 11, no. 54, p. 54, 2022.
[7] D. Shi, R. Li, R. Shi, and F. Li, “Analysis of the relationship between load profile and weather condition,” in 2014 IEEE PES General Meeting - Conference & Exposition. IEEE, 2014, pp. 1–5.
[8] M. U. Fahad and N. Arbab, “Factor affecting short term load forecasting,” Journal of Clean Energy Technologies, vol. 2, no. 4, pp. 305–309, 2014.
[9] R. Bonfigli, S. Squartini, M. Fagiani, and F. Piazza, “Unsupervised algorithms for non-intrusive load monitoring: An up-to-date overview,” in 2015 IEEE 15th international conference on environment and electrical engineering (EEEIC). IEEE, 2015, pp. 1175–1180.
[10] O. Parson, S. Ghosh, M. Weal, and A. Rogers, “Non-intrusive load monitoring using prior models of general appliance types,” in Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012.
[11] O. Parson, S. Ghosh, M. Weal, and A. Rogers, “An unsupervised training method for non-intrusive appliance load monitoring,” Artificial Intelligence, vol. 217, pp. 1–19, 2014.
[12] J. Z. Kolter and T. Jaakkola, “Approximate inference in additive factorial hmms with application to energy disaggregation,” in Artificial intelligence and statistics. PMLR, 2012, pp. 1472–1482.
[13] X. Sun, P. B. Luh, K. W. Cheung, W. Guan, L. D. Michel, S. Venkata, and M. T. Miller, “An efficient approach to short-term load forecasting at the distribution level,” IEEE Transactions on Power Systems, vol. 31, no. 4, pp. 2526–2537, 2015.
TABLE II: Comparison of Non-Intrusive Load Monitoring (NILM) Methods

| Method | Pros | Cons | Data Requirements | Real-time Performance | Interpretability | Evaluation Performance |
|---|---|---|---|---|---|---|
| Template Matching | Simple, easy to implement | Requires pre-defined templates for each appliance, limited accuracy | None | Fast | High | Limited by pre-defined templates |
| Steady-State Analysis | Accurate for steady-state loads | Limited accuracy for transient loads, requires detailed knowledge of the power system | Detailed power system data | Fast | Medium | Limited by power system model |
| HMM and factorial HMM | Can handle transient loads, can identify multiple appliances simultaneously | Requires training data for each appliance, can be computationally expensive | Labeled training data | Slow | Low | Sensitive to initialization and noise |
| Deep Neural Networks | High accuracy, can handle complex appliance behavior, can identify multiple appliances simultaneously | Requires a large amount of training data, can be computationally expensive, limited interpretability | Labeled training data | Fast | Low | Dependent on quality and quantity of training data |
| Autoencoders | Can handle variable appliance behavior, can identify multiple appliances simultaneously, can learn without labeled data | Requires a large amount of training data, can be computationally expensive | Unlabeled training data | Fast | Low | Dependent on quality and quantity of training data |
[14] H. Gonçalves, A. Ocneanu, M. Bergés, and R. Fan, “Unsupervised disaggregation of appliances using aggregated consumption data,” in The 1st KDD workshop on data mining applications in sustainability (SustKDD), 2011.
[15] J. Zheng, C. Xu, Z. Zhang, and X. Li, “Electric load forecasting in smart grids using long-short-term-memory based recurrent neural network,” in 2017 51st Annual Conference on Information Sciences and Systems (CISS). IEEE, 2017, pp. 1–6.
[16] M. Zhuang, M. Shahidehpour, and Z. Li, “An overview of non-intrusive load monitoring: Approaches, business applications, and challenges,” in 2018 international conference on power system technology (POWERCON). IEEE, 2018, pp. 4291–4299.
[17] L. Mauch and B. Yang, “A new approach for supervised power disaggregation by using a deep recurrent lstm network,” in 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP). IEEE, 2015, pp. 63–67.
[18] J. Kelly and W. Knottenbelt, “Neural nilm: Deep neural networks applied to energy disaggregation,” in Proceedings of the 2nd ACM international conference on embedded systems for energy-efficient built environments, 2015, pp. 55–64.
[19] C. Shin, S. Joo, J. Yim, H. Lee, T. Moon, and W. Rhee, “Subtask gated networks for non-intrusive load monitoring,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, 2019, pp. 1150–1157.
[20] C. Zhang, M. Zhong, Z. Wang, N. Goddard, and C. Sutton, “Sequence-to-point learning with neural networks for non-intrusive load monitoring,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, 2018.
[21] E. Busseti, I. Osband, and S. Wong, “Deep learning for time series modeling,” Technical report, Stanford University, pp. 1–5, 2012.
[22] S. Makonin, F. Popowich, I. V. Bajić, B. Gill, and L. Bartram, “Exploiting hmm sparsity to perform online real-time nonintrusive load monitoring,” IEEE Transactions on smart grid, vol. 7, no. 6, pp. 2575–2585, 2015.
[23] H. Kim, M. Marwah, M. Arlitt, G. Lyon, and J. Han, “Unsupervised disaggregation of low frequency power measurements,” in Proceedings of the 2011 SIAM international conference on data mining. SIAM, 2011, pp. 747–758.
[24] J. Kelly and W. Knottenbelt, “Neural nilm: Deep neural networks applied to energy disaggregation,” in Proceedings of the 2nd ACM international conference on embedded systems for energy-efficient built environments, 2015, pp. 55–64.
[25] P. P. M. do Nascimento, “Applications of deep learning techniques on nilm,” Diss. Universidade Federal do Rio de Janeiro, 2016.
[26] S. Henriet, U. Şimşekli, B. Fuentes, and G. Richard, “A generative model for non-intrusive load monitoring in commercial buildings,” Energy and Buildings, vol. 177, pp. 268–278, 2018.
[27] M. A. Mengistu, A. A. Girmay, C. Camarda, A. Acquaviva, and E. Patti, “A cloud-based on-line disaggregation algorithm for home appliance loads,” IEEE Transactions on Smart Grid, vol. 10, no. 3, pp. 3430–3439, 2019.
[28] L. Massidda, M. Marrocu, and S. Manca, “Non-intrusive load disaggregation by convolutional neural network and multilabel classification,” Applied Sciences, vol. 10, no. 4, 2020.
[29] B. Zhao, L. Stankovic, and V. Stankovic, “On a training-less solution for non-intrusive appliance load monitoring using graph signal processing,” IEEE Access, vol. 4, pp. 1784–1799, 2016.
[30] T. Kriechbaumer, D. Jorde, and H.-A. Jacobsen, “Waveform signal entropy and compression study of whole-building energy datasets,” in Proceedings of the Tenth ACM International Conference on Future Energy Systems, 2019, pp. 58–67.
[31] L. Wen, K. Zhou, and S. Yang, “Load demand forecasting of residential buildings using a deep learning model,” Electric Power Systems Research, vol. 179, p. 106073, 2020.
[32] J. Z. Kolter and M. J. Johnson, “Redd: A public data set for energy disaggregation research,” in Workshop on data mining applications in sustainability (SIGKDD), San Diego, CA, vol. 25, no. Citeseer, 2011, pp. 59–62.
[33] K. Anderson, A. Ocneanu, D. Benitez, D. Carlson, A. Rowe, and M. Berges, “Blued: A fully labeled public dataset for event-based non-intrusive load monitoring research,” in Proceedings of the 2nd KDD workshop on data mining applications in sustainability (SustKDD), vol. 7. ACM New York, 2012, pp. 1–5.
[34] S. Makonin, F. Popowich, L. Bartram, B. Gill, and I. V. Bajić, “Ampds: A public dataset for load disaggregation and eco-feedback research,” in 2013 IEEE Electrical Power & Energy Conference, 2013, pp. 1–6.
[35] S. Makonin, “Ampds2: The almanac of minutely power dataset (version 2),” 2021. [Online]. Available: https://dx.doi.org/10.21227/c2rq-bx53
[36] D. Murray, L. Stankovic, and V. Stankovic, “An electrical load measurements dataset of united kingdom households from a two-year longitudinal study,” Scientific data, vol. 4, no. 1, pp. 1–12, 2017.
[37] J. Kelly and W. Knottenbelt, “The uk-dale dataset, domestic appliance-level electricity demand and whole-house demand from five uk homes,” Scientific data, vol. 2, no. 1, pp. 1–14, 2015.