Skip to main content
Log in

Islands of misfit buildings: Detecting uncharacteristic electricity use behavior using load shape clustering

  • Research Article
  • Published:
Building Simulation Aims and scope Submit manuscript

Abstract

Many energy performance analysis methodologies assign buildings a descriptive label that represents their main activity, often known as the primary space usage (PSU). This attribute comes from the intent of the design team based on assumptions of how the majority of the spaces in the building will be used. In reality, the way a building’s occupants use the spaces can be different than what was intended. With the recent growth of hourly electricity meter data from the built environment, there is the opportunity to create unsupervised methods to analyze electricity consumption behavior to understand whether the PSU assigned is accurate. Misclassification or oversimplification of the use of the building is possible using these labels when applied to simulation inputs or benchmarking processes. To work towards accurate characterization of a building’s utilization, we propose a modular methodology for identifying potentially mislabeled buildings using distance-based clustering analysis based on hourly electricity consumption data. This method seeks to segment buildings according to their daily behavior and predict which ones are misfits according to their assigned PSU label. This process finds potentially uncharacteristic behavior that could be an indication of mixed-use or a misclassified PSU. Our results on two public data sets, from the Building Data Genome (BDG) Project and Washington DC (DGS), with 507 and 322 buildings respectively, show that 26% and 33% of these buildings are potentially mislabelled based on their load shape behavior. Such information provides a more realistic insight into their true consumption characteristics, enabling more accurate simulation scenarios. Applications of this process and a discussion of limitations and reproducibility are included.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Ahmad AS, Hassan MY, Abdullah MP, Rahman HA, Hussin F, Abdullah H, Saidur R (2014). A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renewable and Sustainable Energy Reviews, 33: 102–109.

    Article  Google Scholar 

  • Al-Wakeel A, Wu J, Jenkins N (2017). K-means based load estimation of domestic smart meter measurements. Applied Energy, 194: 333–342.

    Article  Google Scholar 

  • Arjunan P, Khadilkar HD, Ganu T, Charbiwala ZM, Singh A, Singh P (2015). Multi-user energy consumption monitoring and anomaly detection with partial context information. In: Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments (BuildSys’15).

    Google Scholar 

  • Batista NdN, Rovere ELL, Aguiar JCR (2011). Energy efficiency labeling of buildings: An assessment of the Brazilian case. Energy and Buildings, 43: 1179–1188.

    Article  Google Scholar 

  • Bennet IE, O’Brien W (2017). Office building plug and light loads: Comparison of a multi-tenant office tower to conventional assumptions. Energy and Buildings, 153: 461–475.

    Article  Google Scholar 

  • Christ M, Braun N, Neuffer J, Kempa-Liehr AW (2018). Time series FeatuRe extraction on basis of scalable hypothesis tests (tsfresh—A python package). Neurocomputing, 307: 72–77.

    Article  Google Scholar 

  • Coakley D, Raftery P, Keane M (2014). A review of methods to match building energy simulation models to measured data. Renewable and Sustainable Energy Reviews, 37: 123–141.

    Article  Google Scholar 

  • BuildSmart DC (2018). Building directory. Available at http://www.buildsmartdc.com/buildings.

  • Dong B, Yan D, Li Z, Jin Y, Feng X, Fontenot H (2018). Modeling occupancy and behavior for better building design and operation—A critical review. Building Simulation, 11: 899–921.

    Article  Google Scholar 

  • Fan C, Xiao F, Li Z, Wang J (2018). Unsupervised data analytics in mining big building operational data for energy efficiency enhancement: a review. Energy and Buildings, 159: 296–308.

    Article  Google Scholar 

  • Goldin DQ, Kanellakis PC (1995). On similarity queries for time-series data: Constraint specification and implementation. In: Montanari U, Rossi F (eds), Principles and Practice of Constraint Programming—CP’95. Lecture Notes in Computer Science, vol 976. Berlin: Springer.

    Google Scholar 

  • Granell R, Axon CJ, Wallom DCH (2015). Impacts of raw data temporal resolution using selected clustering methods on residential electricity load profiles. IEEE Transactions on Power Systems, 30: 3217–3224.

    Article  Google Scholar 

  • Gunay HB, Shen W, Newsham G, Ashouri A (2019). Detection and interpretation of anomalies in building energy use through inverse modeling. Science and Technology for the Built Environment, 25: 488–503.

    Article  Google Scholar 

  • Heidarinejad M, Cedeño-Laurent JG, Wentz JR, Rekstad NM, Spengler JD, Srebric J (2017). Actual building energy use patterns and their implications for predictive modeling. Energy Conversion and Management, 144: 164–180.

    Article  Google Scholar 

  • Hsu D (2015). Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data. Applied Energy, 160: 153–163.

    Article  Google Scholar 

  • Jain AK (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31: 651–666.

    Article  Google Scholar 

  • Kung F, Frank S, Pless S, Judkoff R (2019). Meter-based synthesis of equipment schedules for improved models of electrical demand in multifamily buildings. Journal of Building Performance Simulation, 12: 388–403.

    Article  Google Scholar 

  • Koupaei DM, Hashemi F, Tabard-Fortecoëf V, Passe U (2019). A technique for developing high-resolution residential occupancy schedules for urban energy models. In: Proceedings of the Symposium for Architecture and Urban Design.

    Google Scholar 

  • Leutenegger S, Chli M, Siegwart R (2011). BRISK: Binary Robust Invariant Scalable Keypoints. In: Proceedings of the 13th International Conference on Computer Vision (ICCV 2011), Barcelona, Spain.

    Google Scholar 

  • Lusis P, Khalilpour KR, Andrew L, Liebman A (2017). Short-term residential load forecasting: Impact of calendar effects and forecast granularity. Applied Energy, 205: 654–669.

    Article  Google Scholar 

  • Miller C, Nagy Z, Schlueter A (2015). Automated daily pattern filtering of measured building performance data. Automation in Construction, 49: 1–17.

    Article  Google Scholar 

  • Miller C (2016). Screening meter data: Characterization of temporal energy data from large groups of non-residential buildings. PhD Thesis, ETH Zürich, Switzerland.

    Google Scholar 

  • Miller C, Meggers F (2017). The Building Data Genome Project: An open, public data set from non-residential building electrical meters. Energy Procedia, 122: 439–444.

    Article  Google Scholar 

  • Miller C, Nagy Z, Schlueter A (2018). A review of unsupervised statistical learning and visual analytics techniques applied to performance analysis of non-residential buildings. Renewable and Sustainable Energy Reviews, 81: 1365–1377.

    Article  Google Scholar 

  • O’Brien W, Gunay HB (2019). Do building energy codes adequately reward buildings that adapt to partial occupancy? Science and Technology for the Built Environment, 25: 678–691.

    Article  Google Scholar 

  • Ouf MM, Gunay HB, O'Brien W (2019). A method to generate designsensitive occupant-related schedules for building performance simulations. Science and Technology for the Built Environment, 25: 221–232.

    Article  Google Scholar 

  • Paparrizos J, Gravano L (2015). k-Shape: Efficient and accurate clustering of time series. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data.

    Google Scholar 

  • Park HS, Lee M, Kang H, Hong T, Jeong J (2016). Development of a new energy benchmark for improving the operational rating system of office buildings using various data-mining techniques. Applied Energy, 173: 225–237.

    Article  Google Scholar 

  • Park JY, Yang X, Miller C, Arjunan P, Nagy Z (2019a). Apples or oranges? Identification of fundamental load shape profiles for benchmarking buildings using a large and diverse dataset. Applied Energy, 236: 1280–1295.

    Article  Google Scholar 

  • Park JY, Ouf MM, Gunay B, Peng Y, O’Brien W, Kjrgaard MB, Nagy Z (2019b). A critical review of field implementations of occupant centric building controls. Building and Environment, 165: 106351.

    Article  Google Scholar 

  • Ploennigs J, Chen B, Palmes P, Lloyd R (2015). E2-diagnoser: A system for monitoring, forecasting and diagnosing energy usage. In: Proceedings IEEE International Conference on Data Mining Workshop, Shenzhen, China.

    Google Scholar 

  • Rackes A, Melo AP, Lamberts R (2016). Naturally comfortable and sustainable: Informed design guidance and performance labeling for passive commercial buildings in hot climates. Applied Energy, 174: 256–274.

    Article  Google Scholar 

  • Rousseeuw PJ (1987). Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20: 53–65.

    Article  MATH  Google Scholar 

  • Tardioli G, Kerrigan R, Oates M, O’Donnell J, Finn D (2015). Data driven approaches for prediction of building energy consumption at urban level. Energy Procedia, 78: 3378–3383.

    Article  Google Scholar 

  • Wang S, Yan C, Xiao F (2012). Quantitative energy performance assessment methods for existing buildings. Energy and Buildings, 55: 873–888.

    Article  Google Scholar 

  • Wang Y, Chen Q, Hong T, Kang C (2019). Review of smart meter data analytics: applications, methodologies, and challenges. IEEE Transactions on Smart Grid, 10: 3125–3148.

    Article  Google Scholar 

  • Wu J, Zhao J (2015). Evaluation on building end-user energy consumption using clustering algorithm. Procedia Engineering, 121: 1144–1149.

    Article  Google Scholar 

  • Xiao F, Fan C (2014). Data mining in building automation system for improving building operational performance. Energy and Buildings, 75: 109–118.

    Article  Google Scholar 

  • Xu S, Barbour E, González MC (2017). Household segmentation by load shape and daily consumption. In: Proceedings of ACM SigKDD 2017 Conference.

    Google Scholar 

  • Yang J, Ning C, Deb C, Zhang F, Cheong D, Lee SE, Sekhar C, Tham KW (2017). k-Shape clustering algorithm for building energy usage patterns analysis and forecasting model accuracy improvement. Energy and Buildings, 146: 27–37.

    Article  Google Scholar 

  • Yang Z, Roth J, Jain RK (2018). DUE-B: Data-driven urban energy benchmarking of buildings using recursive partitioning and stochastic frontier analysis. Energy and Buildings, 163: 58–69.

    Article  Google Scholar 

Download references

Acknowledgments

The Ministry of Education (MOE) of the Republic of Singapore (R296000181133) and the National University of Singapore (R296000158646) provided support for the development and implementation of this research. This research was also supported by the Republic of Singapore’s National Research Foundation (NRF) through a grant to the Berkeley Education Alliance for Research in Singapore (BEARS) for the Singapore-Berkeley Building Efficiency and Sustainability in the Tropics 2 (SinBerBEST2) Program. BEARS has been established by the University of California, Berkeley as a center for intellectual excellence in research and education in Singapore.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Clayton Miller.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Quintana, M., Arjunan, P. & Miller, C. Islands of misfit buildings: Detecting uncharacteristic electricity use behavior using load shape clustering. Build. Simul. 14, 119–130 (2021). https://doi.org/10.1007/s12273-020-0626-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12273-020-0626-1

Keywords

Navigation