Intelligent fault diagnosis for elevator door system based on discriminative multi-scale residual network Open Access

https://doi.org/10.1016/j.ins.2023.119120

Chan

Deng

Yeung

and

Daniel

(

2023

), “

Multi-proxy based deep metric learning

”,

Information Sciences

, Vol.

643

, 119120, doi:

https://doi.org/10.1088/1742-6596/1345/4/042024

Chen

Lan

and

Jiang

(

2019

), “

Elevators Fault diagnosis based on artificial intelligence

”,

Journal of Physics: Conference Series

, Vol.

1345

, 042024, doi:

https://doi.org/10.3390/a17060250

Chen

Ren

and

Cheng

(

2025

), “

Research on distributed fault diagnosis model of elevator based on PCA-LSTM

”,

Algorithms

, Vol.

No.

, 250, doi:

https://doi.org/10.1109/LSP.2024.3364055

Dong

and

Lam

(

2024

), “

Bi-center loss for compound facial expression recognition

”,

IEEE Signal Processing Letters

, Vol.

, pp.

641

645

, doi:

https://doi.org/10.1016/j.asoc.2023.110678

Chu

Tang

Zhou

Chen

and

Sun

(

2023

), “

A lightweight multi-sensory field-based dual-feature fusion residual network for bird song recognition

”,

Applied Soft Computing

, Vol.

146

, 110678, doi:

https://doi.org/10.1016/j.aei.2024.102997

Huang

Ren

Zhu

Lin

Zhu

Zeng

and

Wan

(

2025

), “

Intra-domain self generalization network for intelligent fault diagnosis of bearings under unseen working conditions

”,

Advanced Engineering Informatics

, Vol.

, 102997, doi:

https://doi.org/10.1109/ACCESS.2023.3330137

Kim

Son

and

K.-Y.

(

2023

), “

Margin-maximized hyperspace for fault detection and prediction: a case study with an elevator door

”,

IEEE Access

, Vol.

, pp.

128580

128595

, doi:

https://doi.org/10.1016/j.patcog.2023.109381

Yang

and

Xue

(

2023

), “

Deep metric learning for few-shot image classification: a review of recent developments

”,

Pattern Recognition

, Vol.

138

, 109381, doi:

https://doi.org/10.1109/TBME.2022.3193277

Liu

Yang

Wang

and

(

2022

), “

FBMSNet: a filter-bank multi-scale convolutional neural network for EEG-based motor imagery decoding

”,

IEEE Transactions on Biomedical Engineering

, Vol.

No.

, pp.

436

445

, doi:

https://doi.org/10.1016/j.knosys.2025.113450

Chen

Xiao

and

Wang

, (

2025a

), “

Temporal knowledge graph fusion with neural ordinary differential equations for the predictive maintenance of electromechanical equipment

”,

Knowledge-Based Systems

, Vol.

317

, 113450, doi:

https://doi.org/10.1016/j.isatra.2025.09.041

Zhang

Xiao

and

Wang

(

2025b

), “

A multi-scale convolution capsule network with data augmentation and attention mechanisms for elevator fault diagnosis

”,

ISA Transactions

, Vol.

167

, pp.

1873

1887

, doi:

https://doi.org/10.1016/j.ymssp.2023.110544

Halkon

Feng

and

Nandi

A.K.

(

2023

), “

Physics-Informed Residual Network (PIResNet) for rolling element bearing fault diagnostics

”,

Mechanical Systems and Signal Processing

, Vol.

200

, 110544, doi:

https://doi.org/10.3390/s24072135

Pan

Shao

Dai

Wei

Chen

and

Lin

(

2024

), “

Research on fault prediction method of elevator door system based on transfer learning

”,

Sensors

, Vol.

No.

, 2135, doi:

https://doi.org/10.1016/j.engappai.2025.110641

Pan

Nie

Zhai

and

Ding

(

2025

), “

Classification of power quality disturbances using residual networks with channel attention mechanism

”,

Engineering Applications of Artificial Intelligence

, Vol.

151

, 110641, doi:

https://doi.org/10.1108/JIMSE-09-2023-0006

Ren

and

Niu

(

2024

), “

Suppression of horizontal vibrations in high-speed elevators using active shock absorber to assist traditional damping systems

”,

Journal of Intelligent Manufacturing and Special Equipment

, Vol.

No.

, pp.

170

189

, doi:

https://doi.org/10.1007/s40430-025-05816-2

Wan

Tong

AL-Bukhaiti

Zhou

and

Cheng

(

2025

), “

Intelligent fault diagnosis for elevator door systems using variational mode decomposition and multi-scale convolutional networks

”,

Journal of the Brazilian Society of Mechanical Sciences and Engineering

, Vol.

No.

, 508, doi:

https://doi.org/10.1088/1757-899X/428/1/012028

Wang

Leng

Zhang

Zhu

and

Zhang

(

2018

), “

MCU system-based intelligent high-speed elevator door operator fault analysis and research

”,

IOP Conference Series: Materials Science and Engineering

, Vol.

428

No.

, 012028, doi:

https://doi.org/10.1016/j.ymssp.2021.107650

Wang

Liu

Jiang

and

Jiang

(

2021

), “

Collaborative deep learning framework for fault diagnosis in distributed complex systems

”,

Mechanical Systems and Signal Processing

, Vol.

156

, 107650, doi:

https://doi.org/10.1016/j.eswa.2025.127493

Cheng

and

Wang

(

2025

), “

A contrastive clustering loss function increases class-balanced in time series classification

”,

Expert Systems with Applications

, Vol.

283

, 127493, doi:

https://doi.org/10.1016/j.compbiomed.2023.107120

Yin

Han

Jian

Wang

Chen

and

Wang

(

2023

), “

AMSUnet: a neural network using atrous multi-scale convolution for medical image segmentation

”,

Computers in Biology and Medicine

, Vol.

162

, 107120, doi:

https://doi.org/10.1016/j.eswa.2023.122105

Zhou

Qin

Hou

Dai

Huang

and

Zhang

(

2024

), “

Deep global semantic structure-preserving hashing via corrective triplet loss for remote sensing image retrieval

”,

Expert Systems with Applications

, Vol.

238

, 122105, doi:

2026

Jiefeng Li, He Ren, Qiuyu Song, Liang Ye and Gongning Li

Figure 1

The illustration presents a three-stage workflow diagram for a fault diagnosis system applied to a lift door system. The process is divided into three labeled sections arranged from left to right: (1) Data preparation, (2) Feature extraction, and (3) Fault Classification. The left panel, labeled “(1) Data preparation”, shows the initial data acquisition and preprocessing stage. At the top, a label reads “Failed lift door system”. Below it, an image depicts a lift door mechanism connected to monitoring equipment, including a computer screen and a data acquisition device. An arrow points downward to the next step labeled “Data and Segmentation”. This section shows three time-series signals in different colors representing vibration or sensor measurements. An ellipsis is shown between the second and third series. Red dashed vertical boxes at the left and right on each series highlight segmented portions of the signal, indicating that the raw sensor data are divided into smaller samples for analysis. The segmented signals are labeled “Training data”. Two thick arrows from this panel point rightward to the central panel. The center panel is titled “(2) Feature extraction”. The panel is divided into two parts: the upper section shows neural network feature extraction layers, and the lower section illustrates metric learning in the feature space. At the top, several stacked neural network layers are displayed horizontally from left to right. Each layer is represented by a vertical rounded rectangle containing circular nodes. The first layers contain light yellow and light blue nodes, indicating intermediate feature maps. Solid arrows pointing right show the forward propagation of data from one layer to the next. Between some layers, dotted arrows pointing left indicate back propagation during training. The sequence of layers gradually transforms the input representation until the final layer on the far right, which contains blue circular nodes representing extracted features. Below the neural network layers, a label “Metric learning” marks the next stage. This section is enclosed by a red dashed rectangular boundary, representing the feature embedding space used to separate different health conditions. Within this feature space, several colored shapes represent samples from different health states: orange circles, blue squares, and green triangles. Each class forms clusters around center points, represented by larger outlined symbols such as a circled circle, a square outline, or a triangle outline. Gray arrows point toward these centers. Red dashed curved lines divide the feature space into regions. In the upper portion of the metric learning space, three clusters of different shapes are partially overlapping but being separated by the boundaries. In the lower portion, the clusters become more compact and clearly separated, illustrating how the metric learning process improves class separation. Two thick arrows from this central panel point rightward to the third panel. The right panel is labeled “(3) Fault Classification”. The panel depicts a simplified neural network used to classify system conditions based on extracted features. On the left side, a vertical column of green circular nodes. The circles are stacked vertically inside a rounded rectangular container, with a vertical ellipsis between them indicating additional nodes. From these input nodes, several connection lines extend to a second vertical column of rectangular output nodes, representing the classification layer. The output column is labeled “Predicted Labels” above it. The rectangles represent the predicted categories corresponding to different health states of the system. To the right of the output layer, a green rectangular box contains the symbols “L subscript m” and “L subscript c”, representing the loss functions used during training, typically metric loss and classification loss. A solid arrow pointing right connects the predicted label layer to this loss block. A dotted arrow pointing left indicates the back propagation of gradients from the loss functions back through the network. Below the classification diagram is a legend explaining the graphical symbols used in the overall framework: A solid black arrow represents forward propagation. A dotted arrow pointing left represents back propagation. Outlined symbols—a circle, square, and triangle—represent the centers of different health states. Filled symbols—blue squares, orange circles, and green triangles—represent samples belonging to different health states. A red dashed line represents a decision boundary. A gray arrow indicates distance reduction.

Structure of the proposed DMSRN method. Source(s): Created by authors

Figure 1

Structure of the proposed DMSRN method. Source(s): Created by authors

Figure 2

A neural network architecture diagram with three convolution branches and concatenated feature vectors.

The image shows a neural network architecture diagram arranged vertically. At the top, a rectangular box labeled “Conv-1 D 64 at 1 cross 7” appears, followed by another box labeled “Max Pooling 1 cross 3”. From this point, the structure splits into three parallel columns. In the left column, the first block contains a text box labeled “Conv-1 D 64 at 1 cross 3” followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 64 at 1 cross 3” followed by the arrow “B N plus Re L U”, which leads to a circular plus symbol. Below it, a text box labeled “Conv-1 D 128 at 1 cross 3” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 128 at 1 cross 3” followed by the arrow “B N plus Re L U”, leading to another circular plus symbol. Further below, a text box labeled “Conv-1 D 256 at 1 cross 3” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 256 at 1 cross 3” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. The downward arrow emerges from the arrow connecting “Max Pooling 1 cross 3” to “Conv-1 D 64 at 1 cross 3” and points to the first circular plus symbol. Another downward arrow emerges from the first circular plus symbol and points to the second circular plus symbol. Another downward arrow emerges from the second circular plus symbol and points to the third circular plus symbol. At the bottom of the column, a text box labeled “Average Pooling” appears, followed by another text box labeled “Feature Vector (256 cross 1)”. In the middle column, the first block contains a text box labeled “Conv-1 D 64 at 1 cross 5” followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 64 at 1 cross 5” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. Below it, a text box labeled “Conv-1 D 128 at 1 cross 5” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 128 at 1 cross 5” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. Further below, a text box labeled “Conv-1 D 256 at 1 cross 5” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 256 at 1 cross 5” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. The downward arrow emerges from the arrow connecting “Max Pooling 1 cross 3” to “Conv-1 D 64 at 1 cross 3” and points to the first circular plus symbol. Another downward arrow emerges from the first circular plus symbol and points to the second circular plus symbol. Another downward arrow emerges from the second circular plus symbol and points to the third circular plus symbol. At the bottom of the column, a text box labeled “Average Pooling” appears, followed by another text box labeled “Feature Vector (256 cross 1)”. In the right column, the first block contains a text box labeled “Conv-1 D 64 at 1 cross 7” followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 64 at 1 cross 7” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. Below it, a text box labeled “Conv-1 D 128 at 1 cross 7” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 128 at 1 cross 7” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. Further below, a text box labeled “Conv-1 D 256 at 1 cross 7” appears, followed by the arrow “B N plus Re L U”, then another text box labeled “Conv-1 D 256 at 1 cross 7” followed by the arrow “B N plus Re L U”, leading to a circular plus symbol. The downward arrow emerges from the arrow connecting “Max Pooling 1 cross 3” to “Conv-1 D 64 at 1 cross 3” and points to the first circular plus symbol. Another downward arrow emerges from the first circular plus symbol and points to the second circular plus symbol. Another downward arrow emerges from the second circular plus symbol and points to the third circular plus symbol. At the bottom of the column, a text box labeled “Average Pooling” appears, followed by another text box labeled “Feature Vector (256 cross 1)”. The three “Feature Vector (256 cross 1)” outputs connect to a text box labeled “Concatenate”, which leads to the final text box labeled “Feature Vector (768 cross 1)”.

Details of the one-dimensional MSRN. Source(s): Created by authors

Figure 2

Details of the one-dimensional MSRN. Source(s): Created by authors

Figure 3

A diagram shows three health state clusters and dashed boundaries before and after separation.

The image shows two panels connected by a rightward-pointing arrow in the center. At the bottom, a legend shows a blue circular point labeled “Health state 1”, a green triangular point labeled “Health state 2”, an orange square point labeled “Health state 3”, and a dashed line labeled “Boundary”. In the left panel, orange square points appear mainly in the upper right area, blue circular points appear in the upper left area, and green triangular points appear near the bottom area. Curved dashed boundary lines appear between the groups of points. Inside each group, a cross mark appears with arrows pointing inward from several nearby points toward the cross mark. In the green triangular group near the bottom, three triangular points show arrows pointing inward toward the cross mark. Similar inward arrows appear for the blue circular group and the orange square group toward their respective cross marks. In the right panel, the points appear arranged into three clearer clusters where blue circular points gather in the upper left area, orange square points gather in the upper right area, and green triangular points gather in the lower area. Each cluster contains a cross mark with arrows pointing inward from surrounding points toward the cross mark. Curved dashed boundary lines separate the clusters in both panels. The visible spaces appear between the dashed boundary lines around the clusters in the right panel.

Schematic diagram of central loss. Source(s): Created by authors

Figure 4

A flowchart shows training phase and test phase steps for a D M S R N fault diagnosis process.

The image shows a flowchart divided into two horizontal sections labeled “Training phase” and “Test phase”. In the “Training phase”, four rectangular boxes appear in sequence, connected by right-pointing arrows. The first box reads “Step 1: Collecting vibration data”. The arrow points to the second box labeled “Step 2: Splitting data into training samples”. Another arrow points to the third box labeled “Step 3: Inputting training samples into the proposed D M S R N for training”. A final arrow points to the fourth box labeled “Step 4: Outputting the trained D M S R N”. In the “Test phase”, four rectangular boxes also appear in sequence, connected by right-pointing arrows. The first box reads “Step 1: Collecting target vibration data”. The arrow points to the second box labeled “Step 2: Splitting data into test samples”. Another arrow points to the third box labeled “Step 3: Inputting test samples into the trained D M S R N for test”. A final arrow points to the fourth box labeled “Step 4: Outputting fault diagnosis result”.

Diagnosis process using proposed DMSRN method. Source(s): Created by authors

Figure 5

A labeled image showing elevator car door components including drive belt, door vane assembly, and car door panel.

The image contains two side by side panels showing an elevator car door system. The left panel shows a photograph of a vertical sliding elevator door structure with a metal frame, rails, belts, and door panels inside an industrial setting. The right panel shows a labeled schematic diagram of the elevator car door components. Several arrows point to labeled parts. The labels read “Drive belt”, “Door vane assembly”, “Door operator assembly”, “Car door guide rail”, “Permanent magnet motor”, “Car door panel”, “Door slider”, and “Sill”. The diagram shows the vertical door panels connected to a mechanical system at the top and a horizontal base labeled “Sill”.

Test bench of elevator door system. Source(s): Created by authors

Figure 6

A line graph shows training accuracy and training loss across epochs.

The image shows a line graph with the horizontal axis labeled “Epoch” and the vertical axes labeled “Training accuracy” on the left and “Training loss” on the right. The horizontal axis displays epoch values from 1 to 100 in increments of 11 units. The left vertical axis shows percentages from 0.00 percent to 100.00 percent in increments of 20.00 percent, while the right vertical axis shows values from 0.00 to 8.00 in increments of 1.00 units. Two lines appear on the graph. A legend at the bottom identifies the orange line as “Training accuracy” and the blue line as “Training loss”. The orange “Training accuracy” line starts near the coordinate (1, 35.00 percent), rises sharply during the early epochs passing through (12, 81.00 percent), and gradually increases until it reaches near the coordinate (100, 100.00 percent), where the line becomes almost flat. The blue “Training loss” line starts near the coordinate (1, 7.00), decreases steadily during the early epochs passing through (12, 5.50), and gradually declines toward the coordinate (100, 4.00), where the line becomes nearly stable.

Iterative curves during training phase. Source(s): Created by authors

Figure 7

A box plot compares test accuracy percent for S V M, C N N, M S R N, I D A N, and D M S R N methods.

The image shows a box plot that compares test accuracy (percent) for different methods. The horizontal axis is labeled “Different methods”, and the vertical axis is labeled “Test accuracy (percent)”. The vertical axis ranges from 0 to 100 percent in increments of 10 percent. Five box plots appear along the horizontal axis labeled “S V M”, “C N N”, “M S R N”, “I D A N”, and “D M S R N”. The “S V M” box plot appears around 28 to 30 percent with a central mark at about 29 percent. The “C N N” box plot appears around 44 to 46 percent with a central mark at about 45 percent. The “M S R N” box plot appears around 69 to 71 percent with a central mark near about 70 percent. The “I D A N” box plot appears around 76 to 78 percent with a central mark at about 77 percent. The “D M S R N” box plot appears around 86 to 89 percent, with a central mark at about 88 percent. Each method shows a rectangular box with whisker lines extending above and below the box.

Box plot of diagnostic accuracy and standard deviation of different methods. Source(s): Created by authors

Figure 8

Five confusion matrix heatmaps compare predicted and truth labels for five different models.

The image shows five confusion matrix heatmaps arranged in two rows. Each heatmap has the horizontal axis labeled “Predicted label”, with class labels 0, 1, 2, and 3 from left to right, and the vertical axis labeled “Truth label”, with class labels 0, 1, 2, and 3 from top to bottom. Each matrix contains numerical values inside shaded squares and a vertical color scale bar on the right. The first heatmap, labeled “(a)”, shows the following matrix data. The vertical scale bar shows the lightest blue as 60 and the darkest blue as 120. The table contains four rows and four columns. The row-wise entries in the table are as follows: Row 1: Truth label: 0; Predicted label 0: 138; Predicted label 1: 87; Predicted label 2: 98; Predicted label 3: 57. Row 2: Truth label: 1; Predicted label 0: 67; Predicted label 1: 114; Predicted label 2: 83; Predicted label 3: 116. Row 3: Truth label: 2; Predicted label 0: 126; Predicted label 1: 81; Predicted label 2: 93; Predicted label 3: 80. Row 4: Truth label: 3; Predicted label 0: 71; Predicted label 1: 116; Predicted label 2: 87; Predicted label 3: 106. The second heatmap, labeled “(b)”, shows the following matrix data. The vertical scale bar shows the lightest blue as 50 and the darkest blue as 200. The table contains four rows and four columns. The row-wise entries in the table are as follows: Row 1: Truth label: 0; Predicted label 0: 203; Predicted label 1: 26; Predicted label 2: 138; Predicted label 3: 13. Row 2: Truth label: 1; Predicted label 0: 24; Predicted label 1: 163; Predicted label 2: 48; Predicted label 3: 145. Row 3: Truth label: 2; Predicted label 0: 136; Predicted label 1: 46; Predicted label 2: 146; Predicted label 3: 52. Row 4: Truth label: 3; Predicted label 0: 18; Predicted label 1: 147; Predicted label 2: 48; Predicted label 3: 167. The third heatmap, labeled “(c)”, shows the following matrix data. The vertical scale bar shows the lightest blue as 0 and the darkest blue as 300. The table contains four rows and four columns. The row-wise entries in the table are as follows: Row 1: Truth label: 0; Predicted label 0: 367; Predicted label 1: 0; Predicted label 2: 13; Predicted label 3: 0. Row 2: Truth label: 1; Predicted label 0: 9; Predicted label 1: 179; Predicted label 2: 6; Predicted label 3: 186. Row 3: Truth label: 2; Predicted label 0: 137; Predicted label 1: 3; Predicted label 2: 233; Predicted label 3: 7. Row 4: Truth label: 3; Predicted label 0: 4; Predicted label 1: 124; Predicted label 2: 7; Predicted label 3: 245. The fourth heatmap, labeled “(d)”, shows the following matrix data. The vertical scale bar shows the lightest blue as 0 and the darkest blue as 300. The table contains four rows and four columns. The row-wise entries in the table are as follows: Row 1: Truth label: 0; Predicted label 0: 354; Predicted label 1: 0; Predicted label 2: 26; Predicted label 3: 0. Row 2: Truth label: 1; Predicted label 0: 4; Predicted label 1: 277; Predicted label 2: 5; Predicted label 3: 94. Row 3: Truth label: 2; Predicted label 0: 49; Predicted label 1: 4; Predicted label 2: 321; Predicted label 3: 6. Row 4: Truth label: 3; Predicted label 0: 2; Predicted label 1: 144; Predicted label 2: 14; Predicted label 3: 220. The fifth heatmap, labeled “(e)”, shows the following matrix data. The vertical scale bar shows the lightest blue as 0 and the darkest blue as 300. The table contains four rows and four columns. The row-wise entries in the table are as follows: Row 1: Truth label: 0; Predicted label 0: 369; Predicted label 1: 0; Predicted label 2: 11; Predicted label 3: 0. Row 2: Truth label: 1; Predicted label 0: 0; Predicted label 1: 293; Predicted label 2: 1; Predicted label 3: 86. Row 3: Truth label: 2; Predicted label 0: 12; Predicted label 1: 1; Predicted label 2: 366; Predicted label 3: 1. Row 4: Truth label: 3; Predicted label 0: 0; Predicted label 1: 59; Predicted label 2: 1; Predicted label 3: 320.

Confusion matrices of different methods: (a) SVM, (b) CNN, (c) MSRN, (d) IDAN and (e) DMSRN. Source(s): Created by authors

Figure 8

Confusion matrices of different methods: (a) SVM, (b) CNN, (c) MSRN, (d) IDAN and (e) DMSRN. Source(s): Created by authors

Figure 9

A multi-panel scatter plot compares feature visualization results for S V M, C N N, M S R N, I D A N, and D M S R N.

https://doi.org/10.1088/1742-6596/1906/1/012017

The image shows five scatter plot panels arranged in two rows inside a dashed rectangular boundary. The panels are labeled “(a)”, “(b)”, and “(c)” in the top row and “(d)” and “(e)” in the bottom row. A legend on the right shows four classes represented by different markers: the orange circle labeled “0”, the pink star labeled “1”, the green triangle labeled “2”, and the blue square labeled “3”. In panel “(a)”, the symbols form a flower-shaped cluster with curved loops, and the square markers appear prominently across the structure, which correspond to the blue square markers. In the center of the shape, the blue square markers are also prominent and densely present. In panel “(b)”, a large oval-shaped cluster is shown. The left side of the oval is mainly covered by green triangular markers and orange circular markers, while the right side of the oval is mainly occupied by blue square markers and pink star markers. In panel “(c)”, a large cluster appears on the left side, formed mainly by blue square markers and pink star markers. In the center, a green cluster formed by triangular markers is visible, and on the right side, a separate cluster formed by orange circular markers appears. In panel “(d)”, two large clusters appear, one on the left and one on the right, and both clusters contain markers of all four types, which makes the clusters appear multicolored. In panel “(e)”, an inverted C-shaped curve appears on the bottom left side. The upper tip of this curve contains mainly pink star markers, and as the curve moves toward the right side, it gradually changes to blue square markers. Another curved line appears on the right side, which contains green triangular markers toward the upper portion and orange circular markers toward the lower portion.

Feature visualization of different methods: (a) SVM, (b) CNN, (c) MSRN, (d) IDAN and (e) DMSRN. Source(s): Created by authors

Table 1

Architecture of CNN in this experiment

Layer	Operation and parameters
1-Conv	Kernel 8–15 × 1, stride 2, BN, ReLU
2-Conv	Kernel 16–15 × 1, stride 4, BN, ReLU
3-Conv	Kernel 32–15 × 1, stride 4, BN, ReLU
4-Conv	Kernel 64–15 × 1, stride 4, BN, ReLU
5-AveragePool	Kernel 256–8 × 1, stride 1

Layer	Operation and parameters
1-Conv	Kernel 8–15 × 1, stride 2, BN, ReLU
2-Conv	Kernel 16–15 × 1, stride 4, BN, ReLU
3-Conv	Kernel 32–15 × 1, stride 4, BN, ReLU
4-Conv	Kernel 64–15 × 1, stride 4, BN, ReLU
5-AveragePool	Kernel 256–8 × 1, stride 1

Table 2

Diagnostic accuracy of different methods

Methods	1st time	2nd time	3rd time	4th time	5th time	6th time	Average	Standard deviation
SVM	29.67	30.72	28.3	27.5	30.00	27.5	28.95	1.37
CNN	44.47	44.93	46.97	44.21	43.16	44.67	44.74	1.25
MSRN	69.54	70.26	69.67	70.79	70.2	67.37	69.64	1.20
IDAN	76.72	77.76	76.13	78.37	76.42	77.78	77.20	0.89
DMSRN	88.82	85.07	88.55	87.04	88.62	88.68	87.80	1.49

Methods	1st time	2nd time	3rd time	4th time	5th time	6th time	Average	Standard deviation
SVM	29.67	30.72	28.3	27.5	30.00	27.5	28.95	1.37
CNN	44.47	44.93	46.97	44.21	43.16	44.67	44.74	1.25
MSRN	69.54	70.26	69.67	70.79	70.2	67.37	69.64	1.20
IDAN	76.72	77.76	76.13	78.37	76.42	77.78	77.20	0.89
DMSRN	88.82	85.07	88.55	87.04	88.62	88.68	87.80	1.49

Table 3

Diagnostic accuracy of different branches

Methods	Branch of kernel size 3	Branch of kernel size 5	Branch of kernel size 7	Multi-scale branches
Accuracy	83.75	83.22	79.61	87.76

Table 4

Diagnostic accuracy of different network settings

(Depth, width)	(1,1)	(1,2)	(1,3)	(2,1)	(2,2)	(2,3)	(3,1)	(3,2)	(3,3)
Accuracy	43.20	45.78	56.80	54.30	68.41	70.64	69.60	77.21	87.71

Table 5

The computational burden of different methods

Evaluation criterion	SVM	CNN	MSRN	IDAN	DMSRN
FLOPs/(M)	0.02	2.29	5.97	10.2	7.89
Parameters Number/(M)	0.01	0.13	0.37	0.51	0.42
Storage Occupancy/(MB)	0.01	0.67	1.98	5.11	3.20
Training Time/(s)	1.5	66	99.5	123	102.3
Test Time/(s)	1.0	1.3	1.4	1.5	1.3

Evaluation criterion	SVM	CNN	MSRN	IDAN	DMSRN
FLOPs/(M)	0.02	2.29	5.97	10.2	7.89
Parameters Number/(M)	0.01	0.13	0.37	0.51	0.42
Storage Occupancy/(MB)	0.01	0.67	1.98	5.11	3.20
Training Time/(s)	1.5	66	99.5	123	102.3
Test Time/(s)	1.0	1.3	1.4	1.5	1.3

Table 6

Diagnostic accuracy of different values for weighting coefficient

Value	0.01	0.1	0.5	1
Accuracy	70.02	72.39	82.65	87.14

Table 7

Accuracy (%) with different testing SNRs by different methods

SNR/(dB)	20	10	0	−10
SVM	30.07	29.87	26.91	26.25
CNN	42.96	37.04	25.99	25.13
MSRN	65.39	44.74	27.30	25.66
IDAN	69.11	48.53	32.64	26.87
DMSRN	78.29	61.64	50.20	27.79

Bai

Wang

Liu

and

(

2021

), “

The prediction of the elevator fault based on improved PSO-BP algorithm

”,

Journal of Physics: Conference Series

, Vol.

1906

No.

, 012017, doi:

https://doi.org/10.1016/j.ins.2023.119120

Chan

Deng

Yeung

and

Daniel

(

2023

), “

Multi-proxy based deep metric learning

”,

Information Sciences

, Vol.

643

, 119120, doi:

https://doi.org/10.1088/1742-6596/1345/4/042024

Chen

Lan

and

Jiang

(

2019

), “

Elevators Fault diagnosis based on artificial intelligence

”,

Journal of Physics: Conference Series

, Vol.

1345

, 042024, doi:

https://doi.org/10.3390/a17060250

Chen

Ren

and

Cheng

(

2025

), “

Research on distributed fault diagnosis model of elevator based on PCA-LSTM

”,

Algorithms

, Vol.

No.

, 250, doi:

https://doi.org/10.1109/LSP.2024.3364055

Dong

and

Lam

(

2024

), “

Bi-center loss for compound facial expression recognition

”,

IEEE Signal Processing Letters

, Vol.

, pp.

641

645

, doi:

https://doi.org/10.1016/j.asoc.2023.110678

Chu

Tang

Zhou

Chen

and

Sun

(

2023

), “

A lightweight multi-sensory field-based dual-feature fusion residual network for bird song recognition

”,

Applied Soft Computing

, Vol.

146

, 110678, doi:

https://doi.org/10.1016/j.aei.2024.102997

Huang

Ren

Zhu

Lin

Zhu

Zeng

and

Wan

(

2025

), “

Intra-domain self generalization network for intelligent fault diagnosis of bearings under unseen working conditions

”,

Advanced Engineering Informatics

, Vol.

, 102997, doi:

https://doi.org/10.1109/ACCESS.2023.3330137

Kim

Son

and

K.-Y.

(

2023

), “

Margin-maximized hyperspace for fault detection and prediction: a case study with an elevator door

”,

IEEE Access

, Vol.

, pp.

128580

128595

, doi:

https://doi.org/10.1016/j.patcog.2023.109381

Yang

and

Xue

(

2023

), “

Deep metric learning for few-shot image classification: a review of recent developments

”,

Pattern Recognition

, Vol.

138

, 109381, doi:

https://doi.org/10.1109/TBME.2022.3193277

Liu

Yang

Wang

and

(

2022

), “

FBMSNet: a filter-bank multi-scale convolutional neural network for EEG-based motor imagery decoding

”,

IEEE Transactions on Biomedical Engineering

, Vol.

No.

, pp.

436

445

, doi:

https://doi.org/10.1016/j.knosys.2025.113450

Chen

Xiao

and

Wang

, (

2025a

), “

Temporal knowledge graph fusion with neural ordinary differential equations for the predictive maintenance of electromechanical equipment

”,

Knowledge-Based Systems

, Vol.

317

, 113450, doi:

https://doi.org/10.1016/j.isatra.2025.09.041

Zhang

Xiao

and

Wang

(

2025b

), “

A multi-scale convolution capsule network with data augmentation and attention mechanisms for elevator fault diagnosis

”,

ISA Transactions

, Vol.

167

, pp.

1873

1887

, doi:

https://doi.org/10.1016/j.ymssp.2023.110544

Halkon

Feng

and

Nandi

A.K.

(

2023

), “

Physics-Informed Residual Network (PIResNet) for rolling element bearing fault diagnostics

”,

Mechanical Systems and Signal Processing

, Vol.

200

, 110544, doi:

https://doi.org/10.3390/s24072135

Pan

Shao

Dai

Wei

Chen

and

Lin

(

2024

), “

Research on fault prediction method of elevator door system based on transfer learning

”,

Sensors

, Vol.

No.

, 2135, doi:

https://doi.org/10.1016/j.engappai.2025.110641

Pan

Nie

Zhai

and

Ding

(

2025

), “

Classification of power quality disturbances using residual networks with channel attention mechanism

”,

Engineering Applications of Artificial Intelligence

, Vol.

151

, 110641, doi:

https://doi.org/10.1108/JIMSE-09-2023-0006

Ren

and

Niu

(

2024

), “

Suppression of horizontal vibrations in high-speed elevators using active shock absorber to assist traditional damping systems

”,

Journal of Intelligent Manufacturing and Special Equipment

, Vol.

No.

, pp.

170

189

, doi:

https://doi.org/10.1007/s40430-025-05816-2

Wan

Tong

AL-Bukhaiti

Zhou

and

Cheng

(

2025

), “

Intelligent fault diagnosis for elevator door systems using variational mode decomposition and multi-scale convolutional networks

”,

Journal of the Brazilian Society of Mechanical Sciences and Engineering

, Vol.

No.

, 508, doi:

https://doi.org/10.1088/1757-899X/428/1/012028

Wang

Leng

Zhang

Zhu

and

Zhang

(

2018

), “

MCU system-based intelligent high-speed elevator door operator fault analysis and research

”,

IOP Conference Series: Materials Science and Engineering

, Vol.

428

No.

, 012028, doi:

https://doi.org/10.1016/j.ymssp.2021.107650

Wang

Liu

Jiang

and

Jiang

(

2021

), “

Collaborative deep learning framework for fault diagnosis in distributed complex systems

”,

Mechanical Systems and Signal Processing

, Vol.

156

, 107650, doi:

https://doi.org/10.1016/j.eswa.2025.127493

Cheng

and

Wang

(

2025

), “

A contrastive clustering loss function increases class-balanced in time series classification

”,

Expert Systems with Applications

, Vol.

283

, 127493, doi:

https://doi.org/10.1016/j.compbiomed.2023.107120

Yin

Han

Jian

Wang

Chen

and

Wang

(

2023

), “

AMSUnet: a neural network using atrous multi-scale convolution for medical image segmentation

”,

Computers in Biology and Medicine

, Vol.

162

, 107120, doi:

https://doi.org/10.1016/j.eswa.2023.122105

Zhou

Qin

Hou

Dai

Huang

and

Zhang

(

2024

), “

Deep global semantic structure-preserving hashing via corrective triplet loss for remote sensing image retrieval

”,

Expert Systems with Applications

, Vol.

238

, 122105, doi: