Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools

Athanasios Tsanas*, Angeliki Xifara

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

We develop a statistical machine learning framework to study the effect of eight input variables (relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, glazing area distribution) on two output variables, namely heating load (HL) and cooling load (CL), of residential buildings. We systematically investigate the association strength of each input variable with each of the output variables using a variety of classical and non-parametric statistical analysis tools, in order to identify the most strongly related input variables. Then, we compare a classical linear regression approach against a powerful state of the art nonlinear non-parametric method, random forests, to estimate HL and CL Extensive simulations on 768 diverse residential buildings show that we can predict HL and CL with low mean absolute error deviations from the ground truth which is established using Ecotect (0.51 and 1.42, respectively). The results of this study support the feasibility of using machine learning tools to estimate building parameters as a convenient and accurate approach, as long as the requested query bears resemblance to the data actually used to train the mathematical model in the first place. (C) 2012 Elsevier B.V. All rights reserved.

Original languageEnglish
Pages (from-to)560-567
Number of pages8
JournalEnergy and buildings
Volume49
Early online date9 Mar 2012
DOIs
Publication statusPublished - Jun 2012

Keywords

  • Building energy evaluation
  • Heating load
  • Cooling load
  • Non-parametric statistics
  • Statistical machine learning
  • CONSUMPTION
  • SIMULATION
  • STANDARD
  • CHINA

Fingerprint

Dive into the research topics of 'Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools'. Together they form a unique fingerprint.

Cite this