Skip navigation

A data-centric approach to terminal unit’s fault categorization and optimal positioning in building HVAC systems using ensemble learning

A data-centric approach to terminal unit’s fault categorization and optimal positioning in building HVAC systems using ensemble learning

Dey, Maitreyee, Patel, Preeti and Rana, Soumya Prakash ORCID logoORCID: https://orcid.org/0000-0002-8014-8122 (2025) A data-centric approach to terminal unit’s fault categorization and optimal positioning in building HVAC systems using ensemble learning. Discover Data, 3 (17). ISSN 2731-6955 (doi:10.1007/s44248-025-00039-1)

[thumbnail of Open Access Article]
Preview
PDF (Open Access Article)
50433 RANA_A_Data-Centric_Approach_To_Terminal_Unit_s_Fault_Categorization_And_Optimal_Positioning_In_Building_HVAC_Systems_(OA)_2025.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

This paper focuses on the fault detection and diagnosis of terminal units (TUs) in a building located in London, utilizing real operational historical data to assess their performance and optimal placement across multiple floors. While precise locations of the TUs are unavailable, our method analyzes their operational behaviour for one month, applying popular machine learning models to detect and analyze faults effectively. By examining each TU individually and in the aggregate, we identify behavioural patterns that inform decisions regarding their positioning within the building. The dataset comprises over 2 million data points collected from 730 TUs, enabling a comprehensive analysis of their functionality and the impact of suboptimal thermostat placements. Our study employs three machine learning models-traditional multi-class Support Vector Machines and two ensemble methods: Random Forest (RF), and Adaptive Boosting (AdaBoost)-to classify TU behaviors into normal operation, heating faults, and cooling faults. Results indicate that RF outperforms the other models with an accuracy of 99.89%, while AdaBoost achieves an accuracy of 85% and SVM shows 47% accuracy. The findings underscore the potential of a data-driven approach to inform retrofitting decisions and enhance the reliability of HVAC systems. This research contributes valuable knowledge toward optimizing TU placement, ultimately leading to improved energy efficiency and indoor environmental quality.

Item Type: Article
Uncontrolled Keywords: fault categorisation, building energy, HVAC, terminal unit, statistical data analysis, ensemble learning
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > T Technology (General)
Faculty / School / Research Centre / Research Group: Faculty of Engineering & Science
Faculty of Engineering & Science > School of Engineering (ENG)
Last Modified: 13 May 2025 14:42
URI: http://gala.gre.ac.uk/id/eprint/50433

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics