Skip navigation

A novel model for hourly PM2.5 concentration prediction based on CART and EELM

A novel model for hourly PM2.5 concentration prediction based on CART and EELM

Shang, Zhigen, Deng, Tong ORCID: 0000-0003-4117-4317, He, Jianqiang and Duan, Xiaohui (2018) A novel model for hourly PM2.5 concentration prediction based on CART and EELM. Science of The Total Environment, 651 (2). pp. 3043-3052. ISSN 0048-9697 (Print), 1879-1026 (Online) (doi:

PDF (Author Accepted Manuscript)
22019 DENG_A_Novel_Model_for_Hourly_PM2.5_Concentration_2018.pdf - Accepted Version

Download (916kB) | Preview


Hourly PM2.5 concentrations have multiple change patterns. For hourly PM2.5 concentration prediction, it is beneficial to split the whole dataset into several subsets with similar properties and to train a local prediction model for each subset. However, the methods based on local models need to solve the global-local duality. In this study, a novel prediction model based on classification and regression tree (CART) and ensemble extreme learning machine (EELM) methods is developed to split the dataset into subsets in a hierarchical fashion and build a prediction model for each leaf. Firstly, CART is used to split the dataset by constructing a shallow hierarchical regression tree. Then at each node of the tree, EELM models are built using the training samples of the node, and hidden neuron numbers are selected to minimize validation errors respectively on the leaves of a sub-tree that takes the node as the root. Finally, for each leaf of the tree, a global and several local EELMs on the path from the root to the leaf are compared, and the one with the smallest validation error on the leaf is chosen. The meteorological data of Yancheng urban area and the air pollutant concentration data from City Monitoring Centre are used to evaluate the method developed. The experimental results demonstrate that the method developed addresses the global-local duality, having better performance than global models including random forest (RF), v-support vector regression (v-SVR) and EELM, and other local models based on season and k-means clustering. The new model has improved the capability of treating multiple change patterns.

Item Type: Article
Uncontrolled Keywords: PM2.5 concentration prediction; Local model; Classification and regression tree (CART); Extreme learning machine (ELM); Ensemble model
Subjects: Q Science > Q Science (General)
Faculty / School / Research Centre / Research Group: Faculty of Engineering & Science
Faculty of Engineering & Science > Wolfson Centre for Bulk Solids Handling Technology
Faculty of Engineering & Science > School of Engineering (ENG)
Last Modified: 18 Sep 2020 23:35

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics