scholarly journals Data Requirements for Applying Machine Learning to Energy Disaggregation

Energies ◽  
2019 ◽  
Vol 12 (9) ◽  
pp. 1696 ◽  
Author(s):  
Changho Shin ◽  
Seungeun Rho ◽  
Hyoseop Lee ◽  
Wonjong Rhee

Energy disaggregation, or nonintrusive load monitoring (NILM), is a technology for separating a household’s aggregate electricity consumption information. Although this technology was developed in 1992, its practical usage and mass deployment have been rather limited, possibly because the commonly used datasets are not adequate for NILM research. In this study, we report the findings from a newly collected dataset that contains 10 Hz sampling data for 58 houses. The dataset not only contains the aggregate measurements, but also individual appliance measurements for three types of appliances. By applying three classification algorithms (vanilla DNN (Deep Neural Network), ML (Machine Learning) with feature engineering, and CNN (Convolutional Neural Network) with hyper-parameter tuning) and a recent regression algorithm (Subtask Gated Network) to the new dataset, we show that NILM performance can be significantly limited when the data sampling rate is too low or when the number of distinct houses in the dataset is too small. The well-known NILM datasets that are popular in the research community do not meet these requirements. Our results indicate that higher quality datasets should be used to expedite the progress of NILM research.

Energies ◽  
2020 ◽  
Vol 13 (9) ◽  
pp. 2148 ◽  
Author(s):  
Pascal A. Schirmer ◽  
Iosif Mporas ◽  
Akbar Sheikh-Akbari

A data-driven methodology to improve the energy disaggregation accuracy during Non-Intrusive Load Monitoring is proposed. In detail, the method uses a two-stage classification scheme, with the first stage consisting of classification models processing the aggregated signal in parallel and each of them producing a binary device detection score, and the second stage consisting of fusion regression models for estimating the power consumption for each of the electrical appliances. The accuracy of the proposed approach was tested on three datasets—ECO (Electricity Consumption & Occupancy), REDD (Reference Energy Disaggregation Data Set), and iAWE (Indian Dataset for Ambient Water and Energy)—which are available online, using four different classifiers. The presented approach improves the estimation accuracy by up to 4.1% with respect to a basic energy disaggregation architecture, while the improvement on device level was up to 10.1%. Analysis on device level showed significant improvement of power consumption estimation accuracy especially for continuous and nonlinear appliances across all evaluated datasets.


Energies ◽  
2020 ◽  
Vol 13 (20) ◽  
pp. 5393
Author(s):  
Philippe Voinov ◽  
Patrick Huber ◽  
Alberto Calatroni ◽  
Andreas Rumsch ◽  
Andrew Paice

Grid-connected photovoltaic (PV) capacity is increasing and is currently estimated to account for 3.0% of worldwide energy generation. One strategy to balance fluctuating PV power is to incentivize self-consumption by shifting certain loads. The potential improvement in the amount of self-consumption is usually estimated using smart meter and PV production data. Smart meter data are usually available only at sampling frequences far below the Nyquist limit. In this paper we investigate how this insufficient sampling rate affects the estimated self-consumption potential of shiftable household appliances (washing machines, tumble dryers and dishwashers). We base our analyses on measured consumption data from 16 households in the UK and corresponding PV data. We found that the simulated results have a marked dependence on the data sampling rate. The amount of self-consumed energy estimated with data sampled every 10 min was overestimated by 30–40% compared to estimations using data with 1 min sampling rate. We therefore recommend to take this factor into account when making predictions on the impact of appliance load shifting on the rate of self-consumption.


2013 ◽  
Vol 341-342 ◽  
pp. 880-886
Author(s):  
Wen Jun Wang ◽  
Xiao Jun Duan ◽  
Ju Bo Zhu

Based on the linear model of guidance instrument error separation, study on the separation accuracy affected by data sampling rate of inertial navigation equipment. First, theoretically proved that the higher data sampling rate is, the higher separation accuracy we can get. Second, a method for determining the optimal sampling rate is presented, whose idea is from the model itself. At last, the simulation results can verify the above two conclusions.


1992 ◽  
Vol 46 (1) ◽  
pp. 49-59 ◽  
Author(s):  
Gary W. Small ◽  
Scott E. Carpenter

Fourier transform infrared (FT-IR) data of pure and mixture samples of benzene and nitrobenzene are used to investigate and improve methods for interferogram-based qualitative analyses. For use in dedicated monitoring applications, the methodology employed is based on the application of pattern recognition analysis to short, digitally filtered interferogram segments. In the work described here, the impact of the interferogram data sampling rate on the analysis is studied. The results of this study indicate that optimal pattern recognition prediction performance is achieved by use of linear discriminants developed from faster sampled interferogram data. These findings suggest that improved performance can be obtained in FT-IR monitoring applications through the use of spectrometer designs based on a decreased interferogram scan length, coupled with faster sampling electronics.


Nowadays, Energy conservation and management are a must practice due to the exponentially increasing energy usage. One solution for providing for energy conservation is appliance load monitoring. Load monitoring approach should be simple and of low cost in order to be massively deployable. Non-Intrusive load monitoring is a better approach since it can disaggregate energy at the cost of single energy meter. A low sampling rate energy meter incurs low cost compared to a high sampling rate energy meter. In this paper a less complex, low cost energy disaggregation approach has been proposed


2019 ◽  
Vol 90 (e7) ◽  
pp. A30.2-A30
Author(s):  
Wenbo Ge ◽  
Deborah Apthorp ◽  
Christian J Lueck ◽  
Hanna Suominen

IntroductionParkinson’s Disease (PD) is associated with increased mortality and reduced quality of life. There is currently no accurate objective measure for use in diagnosis or assessment of severity. Analysis of postural sway may help in this regard. This systematic review aimed to assess the effectiveness of the various features currently used to analyse postural sway.MethodsFive databases were searched for articles that examined postural sway in both PD patients and controls. An effect size (ES) was derived for every feature reported in each article. The most effective features and feature-families were determined, along with the influence on these measures of data sampling rate and experimental condition.Results441 papers were initially retrieved, of which 31 met the requirements for analysis. The most commonly-used features were not the most effective (e.g. PathLength had an ES of 0.47 while TotalEnergy had an ES of 1.78). Decreased sampling rate was associated with decreased ES (e.g. ES of PathLength lowered from 1.12 at 100 Hz to 0.40 at 10 Hz). Being off medication was associated with a larger ES (e.g. ES of PathLength was 0.21 on medication and 0.83 off medication).ConclusionsSome measures of postural sway are better able to distinguish PD patients from controls than others. ES is enhanced by using a higher sampling rate and studying patients off medication. These results will inform future studies looking at postural sway in PD and contribute to the aim of finding an objective marker of the disease.


Author(s):  
Changho Shin ◽  
Sunghwan Joo ◽  
Jaeryun Yim ◽  
Hyoseop Lee ◽  
Taesup Moon ◽  
...  

Non-intrusive load monitoring (NILM), also known as energy disaggregation, is a blind source separation problem where a household’s aggregate electricity consumption is broken down into electricity usages of individual appliances. In this way, the cost and trouble of installing many measurement devices over numerous household appliances can be avoided, and only one device needs to be installed. The problem has been well-known since Hart’s seminal paper in 1992, and recently significant performance improvements have been achieved by adopting deep networks. In this work, we focus on the idea that appliances have on/off states, and develop a deep network for further performance improvements. Specifically, we propose a subtask gated network that combines the main regression network with an on/off classification subtask network. Unlike typical multitask learning algorithms where multiple tasks simply share the network parameters to take advantage of the relevance among tasks, the subtask gated network multiply the main network’s regression output with the subtask’s classification probability. When standby-power is additionally learned, the proposed solution surpasses the state-of-the-art performance for most of the benchmark cases. The subtask gated network can be very effective for any problem that inherently has on/off states.


2019 ◽  
Vol 6 (1) ◽  
Author(s):  
Changho Shin ◽  
Eunjung Lee ◽  
Jeongyun Han ◽  
Jaeryun Yim ◽  
Wonjong Rhee ◽  
...  

Abstract AMI has been gradually replacing conventional meters because newer models can acquire more informative energy consumption data. The additional information has enabled significant advances in many fields, including energy disaggregation, energy consumption pattern analysis and prediction, demand response, and user segmentation. However, the quality of AMI data varies significantly across publicly available datasets, and low sampling rates and numbers of houses monitored seriously limit practical analyses. To address these challenges, we herein present the ENERTALK dataset, which contains both aggregate and per-appliance measurements sampled at 15 Hz from 22 houses. Among the publicly available datasets with both aggregate and per-appliance measurements, 15 Hz was the highest sampling rate. The number of houses (22) was the second-largest where the largest one had a sampling rate of 1 Hz. The ENERTALK dataset is also the first Korean open dataset on residential electricity consumption.


1985 ◽  
Vol 16 (2) ◽  
pp. 77-84
Author(s):  
Aklra Iwata ◽  
Nobutoshi Yamagishi ◽  
Nobuo Suzumura ◽  
Isao Horiba

Sign in / Sign up

Export Citation Format

Share Document