*Corresponding author
This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

It is estimated that the lifetime building maintenance cost is equal to 0.4 times the construction cost (Hughes et al., 2004). Maintenance actions aim to restore every part of a building back to its original status (British Standards Institution, 1993). Building failures and defects are common phenomena in construction. Some of the most typical types of such defects are: wall cracks, faulty electrical wiring, faulty fire/smoke detection system, not working lighting, moisture, blocked or inadequate drainage systems (Ahzahar et al., 2011; Othman, 2015). Such issues increase dramatically the maintenance cost. For example, moisture issues cost billions of dollars in the United States (Kubba, 2008).

Efficient planning of maintenance tasks is essential for the wellbeing of the users of residential and office buildings (Brugge et al., 2010). For example, a well maintained ventilation system can increase productivity by 2.5-5% during an eight hour work period (Park & Yoon, 2011). On the contrary, a delayed response to malfunctions of the heat, ventilation and air conditioning system (HVAC) can cause a productivity loss that ranges between 5% to 9% (Al Horr et al., 2017; Kosonen & Tan, 2004), and Sick Building Syndrome (SBS) (Lan et al., 2011). The latter has a very negative impact to the occupants of buildings, as it is linked to several illnesses, among them eye, nose, and throat initiation, headaches, and allergies (Au-yong et al., 2014). Previous studies show that insufficient maintenance quality, results in poor building indoor environmental quality (Diaz et al., 2018; Rauh et., 2008). Such environmental quality is linked to respiratory illnesses, among them, asthma, rhinitis, bronchitis, common cold and cough (May et al., 2017). In the US, the cost induced by such maintenance-related health issues is estimated to be equal to 168 US dollars (Wargocki, 2018). Similarly, in the UK, the annual costs incurred from asthma treatments are equal to 1 billion British Pounds (Mukherjee et al., 2014).

Maintenance actions are initiated either after an asset fails (corrective) or before (preventive) at regular intervals. In particular, the corrective actions need to be immediately addressed to avoid severe inconvenience to the users of buildings (Le et al., 2018). Such immediate response depends on budget allowance. However, the budget that is allocated for maintenance is limited (Le et al., 2018). This is mainly due to the lack of accurate estimation tools of the required maintenance funds (Yu et al., 2017). Given these budget restrictions, the organisations need to be able to prioritize the most ‘important’ maintenance tasks. However, this is not achievable. It is estimated that 1/3 of the maintenance cost is wasted insufficiently (Mobley, 2002).

Most of the prioritization methods consider the probability of failure and the criticality of an asset (consequences of failure). The criticality of assets varies significantly between organizations, depending on their key performance indicators and maintenance objectives. The criticality is defined by the organisation (or asset owner) based on several parameters, among them safety, downtime cost, asset importance, productivity, whilst the probability of failure is determined based on deterioration models, regular manual inspections, or installed sensors. Current methods evaluate quantitatively the criticality of assets based on elaborate weighted score procedures and extensive maintenance historical data collection efforts (Crespo et al., 2016). However, such data are not always available. This lack of information it is confirmed in a recent study, which focuses on a three-step-methodology for planning the maintenance of the facades of 30 buildings (Madureira et al., 2017). In addition, the assessment of the consequences of assets’ failure is subject to experts’ subjectivity (Stadnicka et al., 2014). On that respect, Parlikad & Srinivasan (2016) propose a dynamic criticality-based method to optimize maintenance plans in terms of defining the optimal repair/replacement time for an asset. Sweis et al. (2014) developed a multi-attribute prioritization model that relies on predefined major criteria of major public healthcare facilities. Ratnayake & Antosz (2017) combine fuzzy logic with a risk matrix to optimize the prioritization of machinery maintenance. Chang et al. (2004) in a different approach, propose a knowledge-based method to prioritize the maintenance needs of public universities. Babashamsi et al. (2016) apply fuzzy analytic hierarchy process in combination with a multi-criteria optimization and compromise solution (VIKOR) to prioritize pavement maintenance alternatives. However, unscheduled downtime and waste of resources are the main limitations of the existing prioritization methods that rely on heuristic rules and experience-based common sense (Li & Ni, 2009). In a different approach, Ahmed et al. (2017) and Contreras-nieto et al. (2019) use the analytical hierarchy process method (AHP) to prioritize pavement and bridge maintenance actions respectively. However, such approach is limited by the exponentially increased number of pairwise comparisons. This is due to the increase of the alternatives to rank (Crespo et al., 2016). In addition, to the best of the knowledge of the authors of this paper, little focus is given on assets belonging to residential and office buildings. Current studies are predominantly designed for manufacturing and public infrastructure (e.g. bridges, pavements).

Currently, digital ‘twins’ are presented as a promising solution for improving maintenance decision making (Macchi et al., 2018; Watson, 2011). The ‘digital twin’ is defined as a virtual entity that consists of sensors and transmitted data. This paper presents an innovative machine learning-based approach that exploits the concept of a digital ‘twin’ to address the prioritization of maintenance tasks of residential and office buildings in an efficient way. This paper hypothesizes that the criticality of assets can be inferred through historical data. It features an accuracy of 59% on average in terms of classifying the maintenance tasks as “Urgent”, “non-Urgent”, “Critical”, “High”, “Medium” and “Low”. The remainder of this paper is structured as follows. Section 3 analyses the overall framework proposed in this paper, using the historical data that are provided by the Estate Management of the University of Cambridge. Section 4 presents the performance of the proposed method and lastly, Section 5 summarizes the outcomes of this paper.

The proposed method exploits the IoT in order to establish a continuous ‘communication’ among managers and assets. Such communication will inform managers about the condition of multiple assets real time. Digital profiles of assets are created to achieve such ‘communication’. The users scan unique identification tags to access the digital profiles of the assets (see in Figure 1-a,-b, and-d).

Figure 1

Asset tagging

These tags are directly linked to a software application that manages the digital profiles all assets (The Simple Asset Management Software, 2019) as shown in Figure 1-c above. The digital profiles contain useful information about the location of assets, the past maintenance inspections (e.g. date, issue resolved), and most importantly they allow the users to send feedback about any potential failures.

The users’ feedback is the input of a machine learning (ML) based-method that infers the criticality of every asset failure. The proposed method uses this criticality to prioritize the maintenance tasks, which are provided by the users. This prioritization indicates the response time. Figure 2 illustrates the overall proposed framework proposed of this paper.

Figure 2

Proposed framework

Figure 2

Proposed framework

Close modal

Figure 3 illustrates the machine learning-based proposed method for prioritizing the maintenance tasks. The rectangular shapes refer to input/outputs and the skewed parallelogram to methods. The input of the proposed method is the free text (feedback) that is provided by the users of the assets through the itemit application. The proposed method is trained offline with text-based historical data depicting multiple assets’ failures.

Figure 3

Proposed machine learning-based prioritization of responsive maintenance tasks

Figure 3

Proposed machine learning-based prioritization of responsive maintenance tasks

Close modal

Initially, natural language processing techniques are exploited to pre-process the text-based feedback of users. This includes: a) tokenization for splitting the sentences into a sequence of strings i.e. tokens, b) removal of stop words, c) lowercasing, and d) word stemming for converting the words to their root form. Then, the Paragraph Vector (Mikolov & Com, 2014), an unsupervised algorithm, is trained to convert the pre-processed texts (inputs) into feature vectors. This vectorization method uses stochastic gradient descent that is calculated via backpropagation to achieve this. One of its main advantages is that: a) takes into account word semantics, and b) considers the word order. The Paragraph Vector has two versions, the Distributed Bag of Words (PV-DBOW) and the Distributed Memory (PV-DM). The PV-DM is trained to predict the word following a text window, whereas the PV-DBOW is trained to predict the words within a text window. This paper uses the PV-DM as it proven to perform better (Mikolov & Com, 2014). The scope of this papers is to prioritize a large variety of maintenance complaints. Each of these complaints corresponds to a different type of asset and failure. This implies imbalanced classes. For this reason, a random forest is trained to classify each of the inputs into a response time (prioritization label) lastly. Such classification method is selected as it does not overfit and performs well with imbalanced data (Khoshgoftaar et al., 2007)

This paper achieves this prioritization by following a two-stage classification approach. The classification labels reflect the practises followed by the Estate Management of the University of Cambridge that provided the historical data. At the first stage (Stage A), the proposed method classifies the maintenance tasks as ‘Urgent’ and ‘non-Urgent’. The former depict assets’ failures with a large impact on its users, whilst the latter failures of lower importance. At the second stage (Stage B), each of these two categories is subdivided based on the exact response time. The ‘Urgent’ input is classified as either ‘Critical’ (response within half an hour) or ‘High’ maintenance tasks (response within 2 hours), whilst the ‘nonUrgent’ as ‘Medium’ (response within 2 days) and ‘Low’ (response within 10 days). Table 1 illustrates some examples of users’ feedback as provided by the Estate Management of the University of Cambridge.

Table 1

Example of labelled input training data

Text (input)Label
The fire alarm in the bull pens keeps going into fault.Urgent
South research house lift More than one person stuck in the lift.Urgent
Magnetic door holder - cover has been removed and there are exposed wires. Could BBC engineer attend to fix please.Urgent
Water pump is leaking. Pls could someone attend.non-Urgent
We have some lights/bulbs out room 1.20 x1 corridor outside 1.20 x2non-Urgent
Can you please arrange for someone to visit site and look at the existing heating system as several radiators are not working and may have air in the systemnon-Urgent
After a fire alarm test the panel has gone into alarm with a 'power supply fault' message. They can't find the source of the power supply fault. Nothing has tripped out.Critical
Lift entrapmentCritical
PIN HOLE IN 42ml COPPER PIPE TO CAGE WASHLow
Glass lift requires leveling up in the basement, (as there is half inch gap).Low
Air con is leaking, pls could b & C attend. Dept said they have put bucket and towels underneath, so it should be ok for tonight, but pls could someone attend tomorrow.Medium
MAIN KITCHEN GAS PUMP FAILED UNABLE TO USE COOKERS ECT.Medium

The historical data depict actual maintenance complaints raised over a period of 4 years (2014-2018). They cover a variety of assets and building types (residential, office, teaching). The historical data are prioritized by experts, considering the level of impact that the failures of the assets have on the users of the buildings, such as productivity, health and safety. A percentage of 80% of the training data is randomly selected for training and the remaining 20% is used for testing. Figure 4 displays the frequency the ‘Urgent’ and ‘non-Urgent’ classes of Stage A, whilst Figure 5 presents, the training data of Stage B (‘Critical’, ‘High’, ‘Medium’, and ‘Low’). In these figures, it appears that the training data are strongly imbalanced. This complies with our hypothesis in the previous section. The classes with the largest training data are down-sampled to the size of the smallest class to achieve an accurate performance. The training dataset contains 178 different types of assets in total. Table 2 illustrates some of them categorized as Mechanical, Electrical, Plumbing and Architectural.

Figure 4

Training data frequency (Stage A)

Figure 4

Training data frequency (Stage A)

Close modal
Figure 5

Training data frequency (Stage B)

Figure 5

Training data frequency (Stage B)

Close modal
Table 2

Type of assets

Mechanical AssetsTotalArchitectural AssetsTotal
lift1396door2507
air conditioning1144window722
boiler700roof336
radiator614wall260
chiller366lock162
plant room353gate131
fan320work top89
heater Building113stair75
management system (BMS)99slab71
Air Handling Units (AHU)94shelf69
freezer39desk63
air handling unit35cabinet45
pressurisation unit33whiteboard34
control panel31noticeboard34
cooling unit15seat24
duct11keypad10
dumbwaiter8pin board6
heat exchanger5shelter5
air compressor3paver5
electric heating1dispenser4
Plumbing AssetsElectrical Assets
toilet3049lighting3728
tap544fire system965
sink460outlet384
drain416smoke detector104
PiPe307camera44
shower193intruder alarm39
humidifier81sounder32
cistern70card access32
water pump54cooker19
water tank43security detector10
basin41bell3
sewage28dishwasher1
inverter14  
hot water system11  
water cooler9  
grease trap5  
gulley5  

The performance of the method presented in this paper is evaluated with a python implementation developed in PyCharm framework, running in a Windows 8.1 operating system. Four classification metrics are used to display the performance of the proposed method. These metrics are: a) accuracy (see Equation ((1)), b) precision (see Equation (2) c) recall (see Equation (3)), and d) F1_score (see Equation (4)). Accuracy returns the fraction of correct predictions. Precision depicts how well the proposed method avoids from labelling as positive (TP) a sample that is negative (FP), whilst recall represents the efficiency of the classifier to find all the true positive samples (TP). Lastly, the f1_score is the weighted average of precision and recall. Table 3 shows the performance of the proposed method.

(1)
(2)
(3)
(4)
Table 3

Performance of proposed method

ClassAccuracyPrecisionRecallf1
Urgent0.620.590.690.64
Non-Urgent0.640.540.58
Critical0.570.590.670.63
High0.550.460.50
Medium0.580.580.690.63
Low0.600.480.53
Average0.590.580.590.61

Each stage (A, B) is separately evaluated. Table 3 shows that all classification steps perform equally well under all four metrics. The proposed method returns an accuracy of 62% in terms of classifying the text-based feedback of the users as “Urgent” or “non-Urgent”, it features an accuracy of 57% in classifying the feedback as “Critical” or “High”, and lastly it scores an accuracy of 58% in labelling the feedback as “Medium” or “Low”.

The main limitation of the proposed method is that the accuracy of Stage A affects the performance of Stage B, since samples are first classified as ‘Urgent’ vs ‘non-Urgent’ before being classified as either ‘Critical’ vs ‘High’ or ‘Medium’ vs ‘Low’. One of the main reasons that the proposed method does not achieve a better accuracy, is that the experts were not entirely consistent throughout the years in terms of prioritization. Table 4 presents such examples. In this table, it is shown that the same maintenance issue (i.e. same importance and asset type) it is tagged with different response time. For instance, in #1 and #2, a faulty fire door is labelled as “nonUrgent” (response from 2-10 days), whilst the same issue in #3 and #4 is labelled as ‘Urgent’.

Table 4

Non-consistent labelling of training data

#Text (input)LabelAsset
1GROUND FLOOR CORRIDOR FIRE DOOR DROPPED FRAME ISSUE REPAIR BROKEN DOORnon-UrgentDoor
2GROUND FLOOR TO BASEMENT THE FIRE DOOR IS BROKENnon-UrgentDoor
3Replace worn fire door that is not closing on alarmUrgentDoor
4Replace worn fire door seal to room SW02UrgentDoor
5Taken from Email Good afternoon, We have a blocked toiler in the men’s fitness changing rooms. Would someone be able to come and unblock it? Thank youLowToilet
6Basement toilet is blockedMediumToilet
7The air conditioning unit in the SP clean room is not working. Please can someone attend.MediumAir conditioning
8Air Con is not working in room M113C. Pls could B & C attend.LowAir conditioning
9The radiator in room 416 is not working.LowRadiator
10The heating in this office is not working properly. One of the radiators is like warm, the other is not working at all.MediumRadiator

Up to present, while the maintenance cost remains high the budget that is allocated for it is limited. The wellbeing of occupants is directly linked with a good quality of maintenance management. Several studies are proposed focusing on prioritizing the maintenance works. However, they remain inefficient as they cause downtime and waste of resources. This paper exploits IoT and machine learning inference in order to prioritize maintenance tasks in a time and cost-efficient way. It hypothesizes that the priority of maintenance tasks can be predicted through historical data capturing the importance of the failure of the assets. The proposed method features a classification accuracy of 59% on average. The main limitation of the proposed method is that it is tested on historical maintenance data that are prioritized inconsistently. Future work will focus on: a) standardizing the labelling process in order to avoid such issues and increase the performance, b) focusing on assets with the most frequent issues in order to avoid strongly imbalanced training data, and b) establishing a semantic-based linkage with the digital twin of the building in order to increase the accuracy of estimating the impact that the failure of assets have on its users.

This research is an InnovateUK grant: “Social Networked Smart Infrastructure Assets”, project number #4360. Any opinions, findings, and conclusions or recommendations included in this paper are those of the authors and do not necessarily reflect the views of organizations and people mentioned above.

Ahmed
S
et al.
(
2017
)
Prioritization of pavement maintenance sections using objective based Analytic Hierarchy Process
.
International Journal of Pavement Research and Technology
10
(
2
):
158
170
. .
Ahzahar
N
et al.
(
2011
)
A Study of Contribution Factors to Building Failures and Defects in Construction Industry
.
The 2nd International Building Control Conference
,
Penang, Malaysia
, pp.
249
255
.
Al Horr
Y
et al.
(
2017
)
Occupant productivity and indoor environment quality : A case of GSAS
.
International Journal of Sustainable Built Environment
6
(
2
):
476
490
,
Au-yong
CP
et al.
(
2014
).
Improving occupants’ satisfaction with effective maintenance management of HVAC system in office buildings
.
Automation in Construction
43
:
31
37
, .
Babashamsi
P
et al.
(
2016
)
Integrated fuzzy analytic hierarchy process and VIKOR method in the prioritization of pavement maintenance activities
.
International Journal of Pavement Research and Technology
9
(
2
):
112
120
. .
British Standards Institution
(
1993
)
Glossary of terms used in terotechnology
.
British Standards Institution
,
London, UK
.
Brugge
D
et al.
(
2010
)
Association and Correlation of Self-Reported Home Environmental Factors and Health Symptoms
.
Archives of Environmental & Occupational Health
,
8244
,
Chang
CY
et al.
(
2004
)
Prioritization System for Building Maintenance Applications in Public Universities
.
Proceedings of the 21st International Symposium on Automation and Robotics in Construction
.
Jeju, South Korea
., .
Contreras-Nieto
C
et al.
(
2019
)
Bridge maintenance prioritization using analytic hierarchy process and fusion tables
.
Automation in Construction
101
(
November 2018
):
99
110
, .
Crespo
A
et al.
(
2016
)
Criticality Analysis for Maintenance Purposes: A Study for Complex In-service Engineering Assets
.
Quality and Reliability Engineering International
(
February 2015
), .
Diaz
E
et al.
(
2018
)
Indoor environmental quality in social housing: A literature review
.
Building and Environment
131
(
December 2017
):
231
241
, .
Hughes
W
et al.
(
2004
)
EXPOSING THE MYTH OF THE 1: 5: 200 RATIO RELATING INITIAL COST, MAINTENANCE AND STAFFING COSTS OF OFFICE BUILDINGS
.
Proceedings of the 20th Annual ARCOM Conference
.
London, UK
, pp.
373
381
.
Khoshgoftaar
TM
et al.
(
2007
).
An Empirical Study of Learning from Imbalanced Data Using Random Forest
.
Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence
.
Patras, Greece
, pp.
310
317
, .
Kosonen
R
and
Tan
F
(
2004
)
The effect of perceived indoor air quality on productivity loss
.
Energy and Buildings
36
:
981
986
, .
Kubba
S
(
2008
)
Architectural Forensics
.
McGraw Hill Professional
.
New York, USA
, .
Lan
L
et al.
(
2011
)
Effects of thermal discomfort in an office on perceived air quality, SBS symptoms, physiological responses, and human performance
.
INDOOR AIR
21
(
5
):
376
390
, .
Le
A Domingo
et al.
(
2018
)
Building Maintenance Cost Planning and Estimating: A Literature Review
.
Proceeding of the 34th Annual ARCOM Conference
.
Belfast, UK
, pp.
707
716
.
Li
L
and
Ni
J
(
2009
)
Short-term decision support system for maintenance task prioritization
.
International Journal of Production Economics
121
:
195
202
, .
Macchi
M
et al.
(
2018
)
Exploring the role of Digital Twin for Asset Lifecycle Management
.
IFAC-PapersOnLine
51
(
11
):
790
795
, .
Madureira
S
et al.
(
2017
)
Maintenance planning of facades in current buildings
.
Construction and Building Materials
147
:
790
802
,
May
N
et al.
(
2017
)
Health and Moisture in Buildings
. See http://www.ukcmb.org/health-and-moisture-in-buildings-report (accessed 10/04/2019).
Mikolov
T
et al.
(
2014
)
Distributed Representations of Sentences and Documents
.
Proceedings of the 31st International Conference on International Conference on Machine Learning
.
Beijing, China
, pp.
1188
1196
.
Mobley
RK
(
2002
)
An Introduction to Predictive Maintenance
( (2) nd edition).
Elsevier Science
.
USA
.
Mukherjee
M
et al.
(
2014
)
Estimating the incidence, prevalence and true cost of asthma in the Estimating the incidence, prevalence and true cost of asthma in the UK: secondary analysis of national stand-alone and linked databases in England, Northern Ireland, Scotland and Wales-a study protocol
.
BMJ Open
4
(
11
), .
Othman
NL
et al.
(
2015
)
A Case Study on Moisture Problems and Building Defects
.
Procedia - Social and Behavioral Sciences
170
:
27
36
,
Park
JS
and
Yoon
CH
(
2011
)
The effects of outdoor air supply rate on work performance during 8-h work period
.
INDOOR AIR
284
290
, -0668.2010.00700.x.
Parlikad
AK
and
Srinivasan
R
(
2016
)
Towards Dynamic Criticality-Based Maintenance Towards Criticality-Based Assets
.
IFAC-PapersOnLine
49
(
28
):
103
107
, .
Ratnayake
RMC
and
Antosz
K
(
2017
)
Development of a Risk Matrix and Extending the Risk-based Maintenance Analysis with Fuzzy Logic
.
Procedia Engineering
182
(
1877
):
602
610
, .
Rauh
VA
et al.
(
2008
)
Housing and Health - intersection of Poverty and Environmental Exposures
.
In Reducing the Impact of Poverty on Health and Human Development. Scientific Approaches
.
New York, USA
,
Wiley-Blackwel
, pp.
276
288
.
Stadnicka
D
et al.
(
2014
)
Development of an empirical formula for machine classification: Prioritization of maintenance tasks
.
Safety Science
63
:
34
41
, .
Sweis
G
et al.
(
2014
).
Priority Setting for Healthcare Facilities Maintenance
.
Life Science Journal
11
(
2s
):
54
:-
64
, .
The Simple Asset Management Software
(
2019
) https://itemit.com (accessed 10/04/2019).
Wargocki
P
(
2018
)
Productivity and Health Effects of High Indoor Air Quality
. In
Encyclopedia of Environmental Health
.
Elsevier
.
USA
, pp.
688
693
.
Watson
A
(
2011
)
Digital buildings – Challenges and opportunities
.
Advanced Engineering Informatics
25
:
573
581
, .
Yu
AJ
et al.
(
2017
)
Risk-Based Prioritization of Maintenance Tasks
.
Proceedings of the 2017 Industrial and Systems Engineering Conference
.
Pittsburgh, Pennsylvania, USA
, pp.
794
800
.