Corporate financial distress prediction: a machine learning approach in the era of big data Open Access

https://doi.org/10.2307/2978933

Altman

E.I.

(

1968

), “

Financial ratios, discriminant analysis and the prediction of corporate bankruptcy

”,

The Journal of Finance

, Vol.

No.

, pp.

589

609

, doi:

https://doi.org/10.1111/j.1467-6281.2007.00234.x

Altman

E.I.

and

Sabato

(

2007

), “

Modelling credit risk for SMEs: evidence from the U.S. Market

”,

Abacus

, Vol.

No.

, pp.

332

357

, doi:

Ashraf

and

Ahmed

(

2020

), “

Machine learning shrewd approach for an imbalanced dataset conversion samples

”,

Journal of Engineering and Technology (JET)

, Vol.

No.

1 SE-Articles

, pp.

https://doi.org/10.1016/j.bar.2005.09.001

Balcaen

and

Ooghe

(

2006

), “

35 Years of studies on business failure: an overview of the classic statistical methodologies and their related problems

”,

The British Accounting Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1111/1475-679X.12292

Bao

Y.J.

and

Zhang

(

2020

), “

Detecting accounting fraud in publicly traded U.S. Firms using a machine learning approach

”,

Journal of Accounting Research

, Vol.

No.

, pp.

199

235

, doi:

https://doi.org/10.1016/j.eswa.2017.04.006

Barboza

Kimura

and

Altman

(

2017

), “

Machine learning models and bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

405

417

, doi:

https://doi.org/10.1177/014920639101700108

Barney

(

1991

), “

Firm resources and sustained competitive advantage

”,

Journal of Management

, Vol.

No.

, pp.

120

, doi:

https://doi.org/10.2307/2490171

Beaver

(

1966

), “

Financial ratios As predictors of failure

”,

Journal of Accounting Research

, Vol.

No.

1966

, pp.

111

, doi:

https://doi.org/10.1007/s11142-004-6341-9

Beaver

McNichols

and

Rhie

J.W.

(

2005

), “

Have financial statements become less informative? Evidence from the ability of financial ratios to predict bankruptcy

”,

Review of Accounting Studies

, Vol.

No.

, pp.

122

, doi:

https://doi.org/10.1016/S0378-4266(02)00319-9

Becchetti

and

Sierra

(

2003

), “

Bankruptcy risk and productive efficiency in manufacturing firms

”,

Journal of Banking and Finance

, Vol.

No.

, pp.

2099

2120

, doi:

https://doi.org/10.1016/j.procs.2016.06.016

Belavagi

M.C.

and

Muniyal

(

2016

), “

Performance evaluation of supervised machine learning algorithms for intrusion detection

”,

Procedia Computer Science

, Vol.

, pp.

117

123

, doi:

https://doi.org/10.1007/s11142-020-09554-9

Bertomeu

(

2020

), “

Machine learning improves accounting: Discussion, implementation and research opportunities

”,

Review of Accounting Studies

, Vol.

No.

, pp.

1135

1155

, doi:

https://doi.org/10.1007/s11142-020-09563-8

Bertomeu

Cheynel

Floyd

and

Pan

(

2021

), “

Using machine learning to detect misstatements

”,

Review of Accounting Studies

, Vol.

No.

, pp.

468

519

, doi:

https://doi.org/10.1016/j.eswa.2013.12.009

Booth

Gerding

and

Mcgroarty

(

2014

Expert Systems with Applications and Seasonality. Expert Systems with Applications

, Vol.

No.

, pp.

3651

3661

, doi:

https://doi.org/10.1007/s00191-011-0224-6

Bottazzi

Grazzi

Secchi

and

Tamagni

(

2011

), “

Financial and economic determinants of firm default

”,

Journal of Evolutionary Economics

, Vol.

No.

, pp.

373

406

, doi:

https://doi.org/10.2307/257138

Bourgeois

L.J.

(

1981

), “On the measurement of organizational slack”,

The Academy of Management Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/S0031-3203(96)00142-2

Bradley

A.E.

(

1997

), “

The use of the area under the ROC curve in the evaluation of machine learning algorithms

”,

Pattern Recognition

, Vol.

No.

, pp.

1145

1159

, doi:

https://doi.org/10.3390/risks8030083

Breiman

(

1996

), “

Bagging predictors

”,

Machine Learning

, Vol.

No.

, pp.

123

140

, doi:

https://doi.org/10.1023/A:1010933404324

Breiman

(

2001

), “

Random forests

”,

Machine Learning

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1111/j.1540-6261.2008.01416.x

Campbell

J.Y.

Hilscher

and

Szilagyi

(

2008

), “

In search of distress risk

”,

The Journal of Finance

, Vol.

No.

, pp.

2899

2939

, doi:

https://doi.org/10.1016/j.iref.2018.03.008

Carmona

Climent

and

Momparler

(

2019

), “

Predicting failure in the U.S. banking sector: an extreme gradient boosting approach

”,

International Review of Economics and Finance

, Vol.

, pp.

304

323

, doi:

https://doi.org/10.1080/0963818042000216811

Charitou

Neophytou

and

Charalambous

(

2004

), “

Predicting corporate failure: empirical evidence for the UK

”,

European Accounting Review

, Vol.

No.

, pp.

465

497

, doi:

https://doi.org/10.1613/jair.953

Chawla

Bowyer

K.W.

Hall

L.O.

and

Kegelmeyer

W.P.

(

2002

), “

SMOTE: synthetic minority over-sampling technique

”,

Journal of Artificial Intelligence Research

, Vol.

, pp.

321

357

, doi:

https://doi.org/10.1016/j.camwa.2011.10.030

Chen

M.Y.

(

2011

), “

Bankruptcy prediction in firms with statistical and intelligent techniques and a comparison of evolutionary computation approaches

”,

Computers and Mathematics with Applications

, Vol.

No.

, pp.

4514

4524

, doi:

https://doi.org/10.1007/s10462-015-9434-x

Chen

Ribeiro

and

Chen

(

2016

), “

Financial credit risk assessment: a recent review

”,

Artificial Intelligence Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.asoc.2017.03.014

Chou

C.H.

Hsieh

S.C.

and

Qiu

C.J.

(

2017

), “

Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction

”,

Applied Soft Computing

, Vol.

, pp.

298

316

, doi:

https://doi.org/10.1016/j.jbusres.2018.11.015

Climent

Momparler

and

Carmona

(

2019

), “

Anticipating bank distress in the eurozone: an extreme gradient boosting approach

”,

Journal of Business Research

, Vol.

101

, pp.

885

896

, doi:

https://doi.org/10.1007/BF00994018

Consiglio Nazionale dei Dottori Commercialisti e degli Esperti Contabili

(

2019

), “Crisi d’Impresa”,

Gli Indici Di Allerta

Cortes

and

Vapnik

(

1995

), “

Support-Vector networks

”,

Machine Learning

, Vol.

No.

, pp.

273

297

, doi:

https://doi.org/10.5465/256801

Daily

C.M.

and

Dalton

D.R.

(

1994

), “

Bankruptcy and corporate governance: the impact of board composition and structure

”,

Academy of Management Journal

, Vol.

No.

, pp.

1603

1617

, doi:

https://doi.org/10.1177/0148558X14560898

Darrat

A.F.

Gray

Park

J.C.

and

(

2016

), “

Corporate governance and bankruptcy risk

”,

Journal of Accounting, Auditing and Finance

, Vol.

No.

, pp.

163

202

, doi:

https://doi.org/10.2307/2490225

Deakin

E.B.

(

1972

), “

A discriminant analysis of predictors of business failure

”,

Journal of Accounting Research

, Vol.

No.

, p.

167

, doi:

https://doi.org/10.1016/j.accinf.2023.100617

Desai

Bucaro

A.C.

Kim

J.W.

Srivastava

and

Desai

(

2023

), “

Toward a better expert system for auditor going concern opinions using bayesian network inflation factors

”,

International Journal of Accounting Information Systems

, Vol.

, p.

100617

, doi:

https://doi.org/10.1016/j.neucom.2009.11.034

Du Jardin

(

2010

), “

Predicting bankruptcy using neural networks and other classification methods: the influence of variable selection techniques on model accuracy

”,

Neurocomputing

, Vol.

Nos

10-12

, pp.

2047

2060

, doi:

https://doi.org/10.1016/j.eswa.2017.01.016

Du Jardin

(

2017

), “

Dynamics of firm financial evolution and bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.dss.2018.01.003

Du Jardin

(

2018

), “

Failure pattern-based ensembles applied to bankruptcy forecasting

”,

Decision Support Systems

, Vol.

107

, pp.

, doi:

https://doi.org/10.1016/j.dss.2011.04.001

Du Jardin

and

Séverin

(

2011

), “

Predicting corporate bankruptcy using a self-organizing map: an empirical study to improve the forecasting horizon of a financial failure model

”,

Decision Support Systems

, Vol.

No.

, pp.

701

711

, doi:

https://doi.org/10.2307/2329929

Edmister

R.O.

(

1972

), “

An empirical test of financial ratio analysis for small business failure prediction

”,

The Journal of Financial and Quantitative Analysis

, Vol.

No.

, pp.

1477

1493

, doi:

https://doi.org/10.1007/s13748-019-00197-9

Faris

Abukhurma

Almanaseer

Saadeh

Mora

A.M.

Castillo

P.A.

and

Aljarah

(

2020

), “

Improving financial bankruptcy prediction in a highly imbalanced class distribution using oversampling and ensemble learning: a case from the spanish market

”,

Progress in Artificial Intelligence

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2013.07.032

Fedorova

Gilenko

and

Dovzhenko

(

2013

), “

Bankruptcy prediction for russian companies: application of combined classifiers

”,

Expert Systems with Applications

, Vol.

No.

, pp.

7285

7293

, doi:

https://doi.org/10.1613/jair.1.11192

Fernandez

Garcia

Herrera

and

Chawla

N.V.

(

2018

), “

SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

”,

Journal of Artificial Intelligence Research

, Vol.

, pp.

863

905

, doi:

https://doi.org/10.1016/j.ejor.2015.09.014

Fitzpatrick

and

Mues

(

2016

), “

An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market

”,

European Journal of Operational Research

, Vol.

249

No.

, pp.

427

439

, doi:

https://doi.org/10.1214/aos/1013203451

Friedman

J.H.

(

2001

), “

Greedy function approximation: a gradient boosting machine author

”,

The Annals of Statistics

, Vol.

No.

, pp.

1189

1232

, doi:

https://doi.org/10.1016/S0167-9473(01)00065-2

Friedman

J.H.

(

2002

), “

Stochastic gradient boosting

”,

Computational Statistics and Data Analysis

, Vol.

No.

, pp.

367

378

, doi:

https://doi.org/10.1016/j.mlwa.2022.100343

Garcia

(

2022

), “

Bankruptcy prediction using synthetic sampling

”,

Machine Learning with Applications

, Vol.

, p.

100343

, doi:

https://doi.org/10.1016/j.procs.2015.06.046

Gepp

and

Kumar

(

2015

), “

Predicting financial distress: a comparison of survival analysis and decision tree techniques

”,

Procedia Computer Science

, Vol.

, pp.

396

404

, doi:

Gloubos

and

Grammatikos

(

1988

), “

The success of bankruptcy prediction models in Greece

”,

Studies in Banking and Finance

, Vol.

, pp.

https://doi.org/10.1016/j.ijforecast.2018.01.009

Gogas

Papadimitriou

and

Agrapetidou

(

2018

), “

Forecasting bank failures and stress testing: a machine learning approach

”,

International Journal of Forecasting

, Vol.

No.

, pp.

440

455

, doi:

https://doi.org/10.1016/j.jobe.2019.100950

Gong

Bai

Qin

Wang

Yang

and

Wang

(

2020

), “

Gradient boosting machine for predicting return temperature of district heating system: a case study for residential buildings in Tianjin

”,

Journal of Building Engineering

, Vol.

, p.

100950

, doi:

https://doi.org/10.1111/acfi.12400

Habib

Costa

M.D.

Huang

H.J.

Bhuiyan

M.B.U.

and

Sun

(

2020

), “

Determinants and consequences of financial distress: Review of the empirical literature

”,

Accounting and Finance

, Vol.

No.

, pp.

1023

1075

, doi:

Hastie

Tibshirani

Friedman

J.H.

and

Friedman

J.H.

(

2009

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Springer

https://doi.org/10.1108/JAOC-03-2024-0105

Hazami-Ammar

(

2024

), “

Related party transactions and financial distress: Role of governance and audit attributes

”,

Journal of Accounting and Organizational Change

, doi:

https://doi.org/10.1109/IJCNN.2008.4633969

Bai

Garcia

E.A.

and

(

2008

), “

ADASYN: Adaptive synthetic sampling approach for imbalanced learning

”,

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, 2008

, pp.

1322

1328

https://doi.org/10.1016/j.asoc.2014.08.009

Heo

and

Yang

J.Y.

(

2014

), “

AdaBoost based bankruptcy forecasting of korean construction companies

”,

Applied Soft Computing

, Vol.

, pp.

494

499

, doi:

https://doi.org/10.1023/B:RAST.0000013627.90884.b7

Hillegeist

S.A.

Keating

E.K.

Cram

D.P.

and

Lundstedt

K.G.

(

2004

), “

Assessing the probability of bankruptcy

”,

Review of Accounting Studies

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2018.09.039

Hosaka

(

2019

), “

Bankruptcy prediction using imaged financial ratios and convolutional neural networks

”,

Expert Systems with Applications

, Vol.

117

, pp.

287

299

, doi:

https://doi.org/10.1016/j.eswa.2006.05.006

Hua

Wang

Zhang

and

Liang

(

2007

), “

Predicting corporate financial distress based on integration of support vector machine and logistic regression

”,

Expert Systems with Applications

, Vol.

No.

, pp.

434

440

, doi:

https://doi.org/10.1016/j.techfore.2021.120658

Jabeur

S.B.

Gharib

Mefteh-Wali

and

Arfi

W.B.

(

2021

), “

CatBoost model and artificial intelligence techniques for corporate failure prediction

”,

Technological Forecasting and Social Change

, Vol.

166

No.

January

, doi:

https://doi.org/10.1016/j.bar.2013.06.009

Jackson

R.H.G.

and

Wood

(

2013

), “

The performance of insolvency prediction and credit risk models in the UK: a comparative study

”,

The British Accounting Review

, Vol.

No.

, pp.

183

202

, doi:

https://doi.org/10.1016/j.asoc.2018.04.033

Jadhav

and

Jenkins

(

2018

), “

Information gain directed genetic algorithm wrapper feature selection for credit rating

”,

Applied Soft Computing

, Vol.

, pp.

541

553

, doi:

https://doi.org/10.1007/s11142-017-9407-1

Jones

(

2017

), “

Corporate bankruptcy prediction: a high dimensional analysis

”,

Review of Accounting Studies

, Vol.

No.

, pp.

1366

1422

, doi:

https://doi.org/10.1016/j.bar.2006.12.003

Jones

and

Hensher

D.A.

(

2007

), “

Modelling corporate failure: a multinomial nested logit analysis for unordered outcomes

”,

The British Accounting Review

, Vol.

No.

, pp.

107

, doi:

https://doi.org/10.1016/j.jbankfin.2015.02.006

Jones

Johnstone

and

Wilson

(

2015

), “

An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes

”,

Journal of Banking and Finance

, Vol.

, pp.

, doi:

https://doi.org/10.1109/ICCONS.2018.8663128

Joshi

Ramesh

and

Tahsildar

(

2019

), “

A bankruptcy prediction model using random Forest

”,

Proceedings of the 2nd International Conference on Intelligent Computing and Control Systems, ICICCS 2018, Iciccs

, pp.

, doi:

https://doi.org/10.1016/0305-0483(90)90020-A

Keasey

McGuinness

and

Short

(

1990

), “

The failure of UK industrial firms for the period 1976–1984, logistic analysis and entropy measures

”,

Omega

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2016.04.027

Kim

H.J.

N.O.

and

Shin

K.S.

(

2016

), “

Optimization of cluster-based evolutionary undersampling for the artificial neural networks in corporate bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

226

234

, doi:

https://doi.org/10.1016/j.eswa.2009.10.012

Kim

and

Kang

D.K.

(

2010

), “

Ensemble with neural networks for bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

No.

, pp.

3373

3379

, doi:

https://doi.org/10.1007/s11156-011-0238-z

Kwak

Shi

and

Kou

(

2012

), “

Bankruptcy prediction for korean firms after the 1997 financial crisis: Using a multiple criteria linear programming data mining approach

”,

Review of Quantitative Finance and Accounting

, Vol.

No.

, pp.

441

453

, doi:

https://doi.org/10.1016/j.ijforecast.2016.02.002

Landry

Erlinger

T.P.

Patschke

and

Varrichio

(

2016

), “

Probabilistic gradient boosting machines for GEFCom2014 wind forecasting

”,

International Journal of Forecasting

, Vol.

No.

, pp.

1061

1066

, doi:

https://doi.org/10.1016/j.eswa.2005.01.004

Lee

Booth

and

Alam

(

2005

), “

A comparison of supervised and unsupervised neural networks in predicting bankruptcy of korean firms

”,

Expert Systems with Applications

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2012.12.009

Lee

and

Choi

W.S.

(

2013

), “

A multi-industry bankruptcy prediction model using back-propagation neural network and multivariate discriminant analysis

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2941

2946

, doi:

https://doi.org/10.1016/j.ejor.2016.01.012

Liang

C.C.

Tsai

C.F.

and

Shih

G.A.

(

2016

), “

Financial ratios and corporate governance indicators in bankruptcy prediction: a comprehensive study

”,

European Journal of Operational Research

, Vol.

252

No.

, pp.

561

572

, doi:

https://doi.org/10.1016/j.jbusres.2020.07.052

Liang

Tsai

C.F.

H.Y.R.

and

Chang

L.S.

(

2020

), “

Combining corporate governance indicators with stacking ensembles for financial distress prediction

”,

Journal of Business Research

, Vol.

120

, pp.

137

146

, doi:

https://doi.org/10.1016/j.knosys.2014.10.010

Liang

Tsai

C.F.

and

H.T.

(

2015

), “

The effect of feature selection on financial distress prediction

”,

Knowledge-Based Systems

, Vol.

No.

, pp.

289

297

, doi:

https://doi.org/10.1111/exsy.12335

Lin

Y.H.

and

Tsai

C.F.

(

2019

), “

Feature selection in single and ensemble learning-based bankruptcy prediction models

”,

Expert Systems

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1109/TSMCC.2011.2170420

Lin

W.Y.

Y.H.

and

Tsai

C.F.

(

2012

), “

Machine learning in financial crisis prediction: a survey

”,

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

, Vol.

No.

, pp.

421

436

, doi:

https://doi.org/10.1016/j.ins.2017.05.008

Lin

W.-C.

Tsai

C.-F.

Y.-H.

and

Jhang

J.-S.

(

2017

), “

Clustering-based undersampling in class-imbalanced data

”,

Information Sciences

, Vols

409-410

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2014.11.025

López Iturriaga

F.J.

and

Sanz

I.P.

(

2015

), “

Bankruptcy visualization and prediction using neural networks: a study of U.S. commercial banks

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2857

2869

, doi:

https://doi.org/10.1016/j.ejor.2018.10.024

Mai

Tian

Lee

and

(

2019

), “

Deep learning models for bankruptcy prediction using textual disclosures

”,

European Journal of Operational Research

, Vol.

274

No.

, pp.

743

758

, doi:

https://doi.org/10.1016/j.eswa.2015.11.024

Maione

De Paula

E.S.

Gallimberti

Batista

B.L.

Campiglia

A.D.

Barbosa

and

Barbosa

R.M.

(

2016

), “

Comparative study of data mining techniques for the authentication of organic grape juice based on ICP-MS analysis

”,

Expert Systems with Applications

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.bar.2019.04.002

Moll

and

Yigitbasioglu

(

2019

), “

The role of internet-related technologies in shaping the work of accountants: New directions for accounting research

”,

The British Accounting Review

, Vol.

No.

, doi:

https://doi.org/10.1111/j.1540-6288.1998.tb01367.x

Mossman

C.E.

Bell

G.G.

Swartz

L.M.

and

Turtle

(

1998

), “

An empirical comparison of bankruptcy models

”,

Financial Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1007/s11187-014-9616-y

Mueller

and

Stegmaier

(

2015

), “

Economic failure and the role of plant age and size

”,

Small Business Economics

, Vol.

No.

, pp.

621

638

, doi:

https://doi.org/10.1080/09638180600555016

Neves

J.C.

and

Vieira

(

2006

), “

Improving bankruptcy prediction with hidden layer learning vector quantization

”,

European Accounting Review

, Vol.

No.

, pp.

253

271

, doi:

https://doi.org/10.2307/2490395

Ohlson

J.A.

(

1980

), “

Financial ratios and the probabilistic prediction of bankruptcy

”,

Journal of Accounting Research

, Vol.

No.

, p.

109

, doi:

https://doi.org/10.1016/j.dss.2011.10.007

Olson

D.L.

and

Meng

(

2012

), “

Comparative analysis of data mining methods for bankruptcy prediction

”,

Decision Support Systems

, Vol.

No.

, pp.

464

473

, doi:

https://doi.org/10.1016/j.eswa.2013.09.004

Oreski

and

Oreski

(

2014

), “

Genetic algorithm-based heuristic for feature selection in credit risk assessment

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2052

2064

, doi:

https://doi.org/10.2308/ajpt-50009

Perols

(

2011

), “

Financial statement fraud detection: an analysis of statistical and machine learning algorithms

”,

AUDITING: A Journal of Practice and Theory

, Vol.

No.

, pp.

, doi:

https://doi.org/10.2308/accr-51562

Perols

J.L.

Bowen

R.M.

Zimmermann

and

Samba

(

2017

), “

Finding needles in a haystack: using data analytics to improve fraud prediction

”,

The Accounting Review

, Vol.

No.

, pp.

221

245

, doi:

https://doi.org/10.1016/j.ijforecast.2019.11.005

Petropoulos

Siakoulis

Stavroulakis

and

Vlachogiannakis

N.E.

(

2020

), “

Predicting bank insolvencies using machine learning techniques

”,

International Journal of Forecasting

, Vol.

No.

, pp.

1092

1113

, doi:

https://doi.org/10.1080/09638180.2022.2137221

Ranta

Ylinen

and

Järvenpää

(

2023

), “

Machine learning in management accounting research: literature review and pathways for the future

”,

European Accounting Review

, Vol.

No.

, pp.

607

636

, doi:

https://doi.org/10.1016/j.ejor.2006.08.043

Ravi Kumar

and

Ravi

(

2007

), “

Bankruptcy prediction in banks and firms via statistical and intelligent techniques—a review

”,

European Journal of Operational Research

, Vol.

180

No.

, pp.

, doi:

https://doi.org/10.1016/j.knosys.2010.05.007

Ravisankar

and

Ravi

(

2010

), “

Financial distress prediction in banks using group method of data handling neural network, counter propagation neural network and fuzzy ARTMAP

”,

Knowledge-Based Systems

, Vol.

No.

, pp.

823

831

, doi:

https://doi.org/10.1109/IWBIS.2018.8471718

Rustam

and

Saragih

G.S.

(

2018

), “

Predicting bank financial failures using random Forest

”, 2018

International Workshop on Big Data and Information Security, IWBIS

, pp.

. doi:

https://doi.org/10.1109/21.97458

Safavian

S.R.

and

Landgrebe

(

1991

), “

A survey of decision tree classifier methodology

”,

IEEE Transactions on Systems, Man, and Cybernetics

, Vol.

No.

, pp.

660

674

, doi:

https://doi.org/10.1023/A:1022648800760

Schapire

(

1990

), “

The strength of weak learnability

”,

Machine Learning

, Vol.

No.

, pp.

197

227

, doi:

https://doi.org/10.7551/mitpress/8291.003.0001

Schapire

and

Freund

(

2012

Boosting: Foundations and Algorithms

MIT Press

, doi:

https://doi.org/10.1016/j.asoc.2020.106852

Shen

Zhao

Kou

and

Alsaadi

F.E.

(

2021

), “

A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique

”,

Applied Soft Computing

, Vol.

, p.

106852

, doi:

https://doi.org/10.1086/209665

Shumway

(

2001

), “

Forecasting bankruptcy more accurately: a simple hazard model

”,

The Journal of Business

, Vol.

No.

, pp.

101

124

, doi:

https://doi.org/10.1007/s10796-020-10031-6

Smiti

and

Soui

(

2020

), “

Bankruptcy prediction using deep learning approach based on borderline SMOTE

”,

Information Systems Frontiers

, Vol.

No.

, pp.

1067

1083

, doi:

https://doi.org/10.1016/j.eswa.2019.07.033

Son

Hyun

Phan

and

Hwang

H.J.

(

2019

), “

Data analytic approach for bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

138

, p.

112816

, doi:

https://doi.org/10.1016/j.cogsys.2018.09.006

Song

Y.G.

Cao

Q. L.

and

Zhang

(

2018

), “

Towards a new approach to predict business performance using machine learning

”,

Cognitive Systems Research

, Vol.

, pp.

1004

1012

, doi:

https://doi.org/10.2307/2392337

Staw

B.M.

Sandelands

L.E.

and

Dutton

J.E.

(

1981

), “Threat rigidity effects in organizational behavior: a multilevel analysis”,

Administrative Science Quarterly

, Vol.

No.

, pp.

501

524

, doi:

https://doi.org/10.1002/for.2661

Tang

Tan

and

Shi

(

2020

), “

Incorporating textual and management factors into financial distress prediction: a comparative study of machine learning methods

”,

Journal of Forecasting

, Vol.

No.

, pp.

769

787

, doi:

https://doi.org/10.1016/j.aci.2018.08.003

Tharwat

(

2018

), “

Classification assessment methods

”,

Applied Computing and Informatics

, Vol.

No.

, pp.

168

192

, doi:

https://doi.org/10.1002/sam.11482

Tsai

C.F.

(

2020

), “

Two-stage hybrid learning techniques for bankruptcy prediction

”,

Statistical Analysis and Data Mining: The ASA Data Science Journal

, Vol.

No.

, pp.

565

572

, doi:

https://doi.org/10.1016/j.dss.2018.06.011

Veganzones

and

Séverin

(

2018

), “

An investigation of bankruptcy prediction in imbalanced datasets

”,

Decision Support Systems

, Vol.

112

No.

May

, pp.

111

124

, doi:

https://doi.org/10.1016/j.knosys.2011.06.020

Wang

Huang

and

(

2012

), “

Two credit scoring models based on dual strategy ensemble trees

”,

Knowledge-Based Systems

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2013.09.033

Wang

and

Yang

(

2014

), “

An improved boosting based on feature selection for corporate bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2353

2361

, doi:

https://doi.org/10.2307/2392987

Weitzel

and

Jonsson

(

1989

), “

Decline in organizations: a literature integration and extension

”,

Administrative Science Quarterly

, Vol.

No.

, pp.

109

, doi:

https://doi.org/10.1016/j.ins.2013.07.011

Yeh

Chi

and

Lin

(

2014

), “

Going-concern prediction using hybrid random forests and rough set approach

”,

Information Sciences

, Vol.

254

, pp.

110

, doi:

https://doi.org/10.1111/j.1468-5957.1985.tb00077.x

Zavgren

C.V.

(

1985

), “

Assessing the vulnerability to failure of American industrial firms: a logistic analysis

”,

Journal of Business Finance and Accounting

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2016.04.001

Ziȩba

Tomczak

S.K.

and

Tomczak

J.M.

(

2016

), “

Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

101

, doi:

https://doi.org/10.2307/2490859

Zmijewski

M.E.

(

1984

), “

Methodological issues related to the estimation of financial distress prediction models

”,

Journal of Accounting Research

, Vol.

, pp.

, doi:

Set of financial ratios: Several indicators map to more than one economic construct (e.g. liquidity, leverage, coverage and efficiency). To avoid over-interpretation, we report for each ratio a primary domain (the construct most closely measured) and cross-links (other relevant domains). Labels are provided for exposition only; the predictive models use all features jointly and are not restricted by this taxonomy.

Appendix 2

Appendix 3

Appendix 4

2025

Gianluca Gabrielli, Andrea Melioli and Flavio Bertini

Figure 1.

The image displays a map of Italy illustrated with regional variations using a gradient of pink and purple shades. The darkest shades are located in northern regions of the country, while lighter shades appear towards the southern regions. There are notable areas in dark pink in the south east. The outline of mainland Italy and the islands of Sardinia and Sicily are visible, with each region distinctly marked by its colour intensity suggesting a range of values or data.

Region heatmap. Observation per regions

Figure 2.

A bar chart displays performance metrics of four machine learning models across three categories of financial data with varying heights representing values.

This bar chart illustrates the performance metrics for four different machine learning models, Random Forest, Gradient Boosting, Decision Tree, and Logistic Regression. The data categories examined are Financial Ratios, Raw Data of Financial Ratios, and Raw Data. Each model is represented by differently coloured bars, with heights indicating their performance metrics ranging from approximately zero point eighty five to one. The chart provides a visual comparison of how these models performed across the specified data categories, with multiple bars grouped under each category to facilitate direct comparisons. Each data category is labelled at the bottom, and the performance values are indicated at the top of each bar for clarity.

AUC results: the random forest outperforms the other classifiers using the unbalance data set consisting of financial ratios

Figure 3.

A bar chart compares four machine learning models' performance across three categories: Financial ratios, Raw Data of financial ratios, and Raw data.

The image is a bar chart displaying the performance metrics of four machine learning models, Random Forest, Gradient Boosting, Decision Tree, and Logistic Regression. Each model is represented by differently styled bars. The categories along the horizontal axis include Financial ratios, Raw Data of financial ratios, and Raw data, while the vertical axis indicates performance scores ranging from zero point seventy five to one. Each modelâ€™s performance is presented as separate bars within each category, showing specific scores, such as Random Forest scoring zero point ninety six in the Financial ratios category. The arrangement allows for a comparison of models across various data types. Individual model scores are shown at the top of each corresponding bar.

AUC results: random forest outperforms the other classifiers after the application of SMOTE

Figure 4.

A bar graph illustrates the performance of four algorithms: Random Forest, Gradient Boosting, Decision Tree, and Logistic Regression, measured against three categories: Financial Ratios, Raw Data of Financial Ratios, and Raw Data.

https://doi.org/10.1016/j.eswa.2017.10.040

The image displays a vertical bar graph comparing the performance of four algorithms, Random Forest, Gradient Boosting, Decision Tree, and Logistic Regression. The y axis indicates the performance scores ranging from zero point seventy five to one, while the x axis categorises the data into three segments, Financial Ratios, Raw Data of Financial Ratios, and Raw Data. Each algorithmâ€™s scores are represented by differently coloured bars, blue for Random Forest, orange for Gradient Boosting, grey for Decision Tree, and yellow for Logistic Regression. The graph highlights specific performance values for each algorithm across the three categories, showing variations in scores. The tallest bars in both the Raw Data categories belong to Random Forest and Gradient Boosting, with unique scores presented next to the corresponding bars, such as zero point ninety nine for both algorithms in the Raw Data category.

AUC results: random forest outperforms the other classifiers after the application of random undersampling

Table 1.

Main methods results

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Unbalanced data set: financial ratios as input variables (15 features)
RF	0.90	0.96	0.90	0.75	0.82	0.95
GBT	0.86	0.87	0.84	0.82	0.83	0.93
DT	0.84	0.87	0.83	0.78	0.80	0.91
LR	0.77	0.73	0.74	0.80	0.77	0.85
SVM	0.73	0.77	0.74	0.68	0.70	N.A.^[2]
Panel B – Unbalanced data set: raw data of financial ratios as input variables (45 features)
RF	0.94	0.95	0.94	0.92	0.93	0.98
GBT	0.93	0.95	0.93	0.89	0.91	0.98
DT	0.91	0.93	0.90	0.87	0.89	0.96
LR	0.82	0.96	0.92	0.59	0.71	0.90
SVM	0.76	0.96	0.92	0.45	0.58	N.A.
Panel C – Unbalanced data set: raw data as input variables (215 features)
RF	0.98	0.99	0.98	0.90	0.94	0.99
GBT	0.97	0.98	0.97	0.89	0.92	0.99
DT	0.96	0.99	0.89	0.77	0.83	0.97
LR	0.90	0.97	0.93	0.72	0.80	0.94
SVM	0.88	0.97	0.93	0.62	0.73	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Unbalanced data set: financial ratios as input variables (15 features)
RF	0.90	0.96	0.90	0.75	0.82	0.95
GBT	0.86	0.87	0.84	0.82	0.83	0.93
DT	0.84	0.87	0.83	0.78	0.80	0.91
LR	0.77	0.73	0.74	0.80	0.77	0.85
SVM	0.73	0.77	0.74	0.68	0.70	N.A.[2]
Panel B – Unbalanced data set: raw data of financial ratios as input variables (45 features)
RF	0.94	0.95	0.94	0.92	0.93	0.98
GBT	0.93	0.95	0.93	0.89	0.91	0.98
DT	0.91	0.93	0.90	0.87	0.89	0.96
LR	0.82	0.96	0.92	0.59	0.71	0.90
SVM	0.76	0.96	0.92	0.45	0.58	N.A.
Panel C – Unbalanced data set: raw data as input variables (215 features)
RF	0.98	0.99	0.98	0.90	0.94	0.99
GBT	0.97	0.98	0.97	0.89	0.92	0.99
DT	0.96	0.99	0.89	0.77	0.83	0.97
LR	0.90	0.97	0.93	0.72	0.80	0.94
SVM	0.88	0.97	0.93	0.62	0.73	N.A.

Table 2.

Results after Smote application

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – SMOTE oversampling: financial ratios as input variables (15 features)
RF	0.88	0.89	0.89	0.88	0.88	0.96
GBT	0.86	0.87	0.87	0.86	0.86	0.94
DT	0.84	0.85	0.85	0.83	0.84	0.92
LR	0.77	0.71	0.74	0.82	0.78	0.85
SVM	0.73	0.72	0.73	0.75	0.74	N.A.
Panel B – SMOTE oversampling: raw data of financial ratios as input variables (45 features)
RF	0.95	0.95	0.95	0.95	0.95	0.99
GBT	0.93	0.93	0.93	0.92	0.93	0.98
DT	0.90	0.91	0.91	0.90	0.90	0.97
LR	0.83	0.94	0.93	0.71	0.80	0.91
SVM	0.77	0.95	0.92	0.59	0.72	N.A.
Panel C – SMOTE oversampling: raw data as input variables (215 features)
RF	0.98	0.98	0.98	0.99	0.98	0.99
GBT	0.96	0.97	0.97	0.95	0.96	0.99
DT	0.96	0.99	0.89	0.77	0.83	0.97
LR	0.90	0.96	0.95	0.83	0.89	0.96
SVM	0.86	0.96	0.95	0.76	0.84	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – SMOTE oversampling: financial ratios as input variables (15 features)
RF	0.88	0.89	0.89	0.88	0.88	0.96
GBT	0.86	0.87	0.87	0.86	0.86	0.94
DT	0.84	0.85	0.85	0.83	0.84	0.92
LR	0.77	0.71	0.74	0.82	0.78	0.85
SVM	0.73	0.72	0.73	0.75	0.74	N.A.
Panel B – SMOTE oversampling: raw data of financial ratios as input variables (45 features)
RF	0.95	0.95	0.95	0.95	0.95	0.99
GBT	0.93	0.93	0.93	0.92	0.93	0.98
DT	0.90	0.91	0.91	0.90	0.90	0.97
LR	0.83	0.94	0.93	0.71	0.80	0.91
SVM	0.77	0.95	0.92	0.59	0.72	N.A.
Panel C – SMOTE oversampling: raw data as input variables (215 features)
RF	0.98	0.98	0.98	0.99	0.98	0.99
GBT	0.96	0.97	0.97	0.95	0.96	0.99
DT	0.96	0.99	0.89	0.77	0.83	0.97
LR	0.90	0.96	0.95	0.83	0.89	0.96
SVM	0.86	0.96	0.95	0.76	0.84	N.A.

Table 3.

Results after random undersampling application

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Random undersampling: financial ratios as input variables (15 features)
RF	0.86	0.87	0.87	0.86	0.86	0.94
GBT	0.85	0.85	0.85	0.86	0.85	0.93
DT	0.83	0.84	0.84	0.82	0.83	0.91
LR	0.76	0.71	0.74	0.82	0.78	0.84
SVM	0.73	0.72	0.72	0.73	0.73	N.A.
Panel B – Random undersampling: raw data of financial ratios as input variables (45 features)
RF	0.94	0.93	0.93	0.94	0.94	0.98
GBT	0.92	0.94	0.93	0.91	0.92	0.98
DT	0.90	0.91	0.91	0.89	0.90	0.96
LR	0.81	0.94	0.92	0.68	0.78	0.90
SVM	0.75	0.95	0.92	0.55	0.69	N.A.
Panel C – Random undersampling: raw data as input variables (215 features)
RF	0.98	0.98	0.98	0.99	0.98	0.99
GBT	0.96	0.97	0.97	0.95	0.96	0.99
DT	0.96	0.97	0.97	0.95	0.96	0.99
LR	0.88	0.95	0.94	0.81	0.87	0.94
SVM	0.86	0.96	0.95	0.76	0.84	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Random undersampling: financial ratios as input variables (15 features)
RF	0.86	0.87	0.87	0.86	0.86	0.94
GBT	0.85	0.85	0.85	0.86	0.85	0.93
DT	0.83	0.84	0.84	0.82	0.83	0.91
LR	0.76	0.71	0.74	0.82	0.78	0.84
SVM	0.73	0.72	0.72	0.73	0.73	N.A.
Panel B – Random undersampling: raw data of financial ratios as input variables (45 features)
RF	0.94	0.93	0.93	0.94	0.94	0.98
GBT	0.92	0.94	0.93	0.91	0.92	0.98
DT	0.90	0.91	0.91	0.89	0.90	0.96
LR	0.81	0.94	0.92	0.68	0.78	0.90
SVM	0.75	0.95	0.92	0.55	0.69	N.A.
Panel C – Random undersampling: raw data as input variables (215 features)
RF	0.98	0.98	0.98	0.99	0.98	0.99
GBT	0.96	0.97	0.97	0.95	0.96	0.99
DT	0.96	0.97	0.97	0.95	0.96	0.99
LR	0.88	0.95	0.94	0.81	0.87	0.94
SVM	0.86	0.96	0.95	0.76	0.84	N.A.

Table 4.

Out-of-sample validation

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Unbalanced data set: financial ratios as input variables (15 features)- Out of sample validation
RF	0.90	0.96	0.86	0.71	0.78	0.94
GBT	0.88	0.95	0.82	0.64	0.72	0.92
DT	0.85	0.92	0.73	0.64	0.68	0.89
LR	0.48	0.42	0.28	0.67	0.39	0.55
SVM	0.49	0.44	0.27	0.63	0.38	N.A.
Panel B – Unbalanced data set: raw data of financial ratios as input variables (45 features) – Out of sample validation
RF	0.92	0.94	0.90	0.89	0.89	0.98
GBT	0.91	0.95	0.91	0.84	0.87	0.97
DT	0.88	0.92	0.85	0.82	0.83	0.94
LR	0.80	0.93	0.81	0.58	0.68	0.86
SVM	0.75	0.93	0.78	0.40	0.53	N.A.
Panel C Unbalanced data set: raw data as input variables (215 features) – Out of sample validation
RF	0.98	0.99	0.98	0.91	0.94	0.99
GBT	0.96	0.99	0.98	0.90	0.94	1.00
DT	0.97	0.99	0.89	0.82	0.85	0.97
LR	0.93	0.98	0.86	0.72	0.78	0.96
SVM	0.84	0.92	0.87	0.62	0.72	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Unbalanced data set: financial ratios as input variables (15 features)- Out of sample validation
RF	0.90	0.96	0.86	0.71	0.78	0.94
GBT	0.88	0.95	0.82	0.64	0.72	0.92
DT	0.85	0.92	0.73	0.64	0.68	0.89
LR	0.48	0.42	0.28	0.67	0.39	0.55
SVM	0.49	0.44	0.27	0.63	0.38	N.A.
Panel B – Unbalanced data set: raw data of financial ratios as input variables (45 features) – Out of sample validation
RF	0.92	0.94	0.90	0.89	0.89	0.98
GBT	0.91	0.95	0.91	0.84	0.87	0.97
DT	0.88	0.92	0.85	0.82	0.83	0.94
LR	0.80	0.93	0.81	0.58	0.68	0.86
SVM	0.75	0.93	0.78	0.40	0.53	N.A.
Panel C Unbalanced data set: raw data as input variables (215 features) – Out of sample validation
RF	0.98	0.99	0.98	0.91	0.94	0.99
GBT	0.96	0.99	0.98	0.90	0.94	1.00
DT	0.97	0.99	0.89	0.82	0.85	0.97
LR	0.93	0.98	0.86	0.72	0.78	0.96
SVM	0.84	0.92	0.87	0.62	0.72	N.A.

Table 5.

Out-of-sample validation after SMOTE application

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – SMOTE oversampling: financial ratios as input variables (15 features) – Out of sample validation
RF	0.85	0.87	0.78	0.82	0.80	0.92
GBT	0.80	0.77	0.68	0.87	0.76	0.91
DT	0.81	0.83	0.60	0.76	0.67	0.89
LR	0.50	0.25	0.41	0.95	0.58	0.85
SVM	0.45	0.16	0.39	0.96	0.56	N.A.
Panel B – SMOTE oversampling raw data of financial ratios as input variables (45 features) – Out of sample Validation
RF	0.91	0.89	0.82	0.94	0.88	0.97
GBT	0.87	0.84	0.75	0.93	0.83	0.96
DT	0.86	0.84	0.75	0.88	0.81	0.94
LR	0.82	0.82	0.71	0.81	0.76	0.87
SVM	0.79	0.80	0.67	0.77	0.72	N.A.
Panel C – SMOTE oversampling: raw data as input variables (215 features) – Out of sample Validation
RF	0.97	0.98	0.86	0.94	0.90	0.99
GBT	0.97	0.97	0.73	0.92	0.82	0.99
DT	0.96	0.95	0.98	0.96	0.97	0.99
LR	0.94	0.95	0.70	0.90	0.79	0.96
SVM	0.94	0.95	0.69	0.87	0.77	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – SMOTE oversampling: financial ratios as input variables (15 features) – Out of sample validation
RF	0.85	0.87	0.78	0.82	0.80	0.92
GBT	0.80	0.77	0.68	0.87	0.76	0.91
DT	0.81	0.83	0.60	0.76	0.67	0.89
LR	0.50	0.25	0.41	0.95	0.58	0.85
SVM	0.45	0.16	0.39	0.96	0.56	N.A.
Panel B – SMOTE oversampling raw data of financial ratios as input variables (45 features) – Out of sample Validation
RF	0.91	0.89	0.82	0.94	0.88	0.97
GBT	0.87	0.84	0.75	0.93	0.83	0.96
DT	0.86	0.84	0.75	0.88	0.81	0.94
LR	0.82	0.82	0.71	0.81	0.76	0.87
SVM	0.79	0.80	0.67	0.77	0.72	N.A.
Panel C – SMOTE oversampling: raw data as input variables (215 features) – Out of sample Validation
RF	0.97	0.98	0.86	0.94	0.90	0.99
GBT	0.97	0.97	0.73	0.92	0.82	0.99
DT	0.96	0.95	0.98	0.96	0.97	0.99
LR	0.94	0.95	0.70	0.90	0.79	0.96
SVM	0.94	0.95	0.69	0.87	0.77	N.A.

Table 6.

Out-of-sample validation after random undersampling validation

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Random undersampling: financial ratios as input variables (15 features) – Out of sample Validation
RF	0.83	0.82	0.73	0.86	0.79	0.92
GBT	0.84	0.85	0.64	0.82	0.72	0.92
DT	0.80	0.81	0.57	0.79	0.66	0.88
LR	0.51	0.26	0.42	0.95	0.58	0.60
SVM	0.45	0.17	0.39	0.94	0.55	N.A.
Panel B – Random undersampling: raw data of financial ratios as input variables (45 features) – Out of sample validation
RF	0.89	0.87	0.79	0.95	0.86	0.97
GBT	0.89	0.87	0.79	0.92	0.85	0.96
DT	0.86	0.85	0.76	0.88	0.82	0.94
LR	0.81	0.81	0.70	0.80	0.75	0.86
SVM	0.77	0.78	0.65	0.74	0.69	N.A.
Panel C – Random undersampling: raw data as input variables (215 features) – Out of sample validation
RF	0.88	0.79	0.79	0.99	0.88	0.99
GBT	0.97	0.97	0.80	0.97	0.88	0.99
DT	0.85	0.75	0.76	0.98	0.86	0.99
LR	0.92	0.93	0.62	0.90	0.73	0.96
SVM	0.92	0.93	0.62	0.88	0.72	N.A.

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Panel A – Random undersampling: financial ratios as input variables (15 features) – Out of sample Validation
RF	0.83	0.82	0.73	0.86	0.79	0.92
GBT	0.84	0.85	0.64	0.82	0.72	0.92
DT	0.80	0.81	0.57	0.79	0.66	0.88
LR	0.51	0.26	0.42	0.95	0.58	0.60
SVM	0.45	0.17	0.39	0.94	0.55	N.A.
Panel B – Random undersampling: raw data of financial ratios as input variables (45 features) – Out of sample validation
RF	0.89	0.87	0.79	0.95	0.86	0.97
GBT	0.89	0.87	0.79	0.92	0.85	0.96
DT	0.86	0.85	0.76	0.88	0.82	0.94
LR	0.81	0.81	0.70	0.80	0.75	0.86
SVM	0.77	0.78	0.65	0.74	0.69	N.A.
Panel C – Random undersampling: raw data as input variables (215 features) – Out of sample validation
RF	0.88	0.79	0.79	0.99	0.88	0.99
GBT	0.97	0.97	0.80	0.97	0.88	0.99
DT	0.85	0.75	0.76	0.98	0.86	0.99
LR	0.92	0.93	0.62	0.90	0.73	0.96
SVM	0.92	0.93	0.62	0.88	0.72	N.A.

Table 7.

Results at regional level

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.88	0.91	0.89	0.90	0.89	0.86	0.90	0.88	0.88	0.88	0.89	0.87	0.90	0.87	0.89	0.87	0.90	0.87	0.90	0.88
Specificity	0.96	0.98	0.98	0.98	0.97	0.97	0.98	0.97	0.97	0.97	0.98	0.97	0.98	0.97	0.98	0.97	0.96	0.96	0.96	0.97
Precision	0.84	0.85	0.87	0.88	0.86	0.88	0.85	0.88	0.86	0.90	0.91	0.86	0.87	0.84	0.86	0.88	0.78	0.86	0.71	0.85
Recall	0.62	0.58	0.54	0.52	0.64	0.57	0.54	0.59	0.57	0.68	0.61	0.59	0.58	0.52	0.54	0.64	0.64	0.63	0.56	0.61
F1-score	0.72	0.69	0.67	0.65	0.74	0.69	0.66	0.71	0.69	0.78	0.73	0.70	0.69	0.64	0.67	0.74	0.71	0.73	0.63	0.71
AUC	0.92	0.92	0.91	0.91	0.93	0.92	0.91	0.92	0.92	0.94	0.93	0.92	0.92	0.91	0.91	0.93	0.93	0.92	0.91	0.93
Panel B – raw data for financial ratios
Accuracy	0.91	0.93	0.91	0.92	0.91	0.89	0.92	0.90	0.90	0.90	0.93	0.90	0.92	0.89	0.92	0.90	0.93	0.90	0.92	0.90
Specificity	0.98	0.98	0.98	0.99	0.98	0.98	0.98	0.98	0.98	0.97	0.99	0.97	0.98	0.98	0.98	0.97	0.98	0.97	0.98	0.98
Precision	0.88	0.88	0.87	0.89	0.89	0.90	0.87	0.89	0.88	0.90	0.93	0.89	0.89	0.88	0.89	0.90	0.88	0.90	0.89	0.89
Recall	0.67	0.67	0.62	0.60	0.68	0.64	0.61	0.64	0.63	0.71	0.67	0.65	0.63	0.56	0.62	0.68	0.66	0.68	0.64	0.65
F1-score	0.76	0.76	0.72	0.71	0.77	0.75	0.72	0.74	0.73	0.80	0.78	0.75	0.73	0.68	0.73	0.77	0.75	0.77	0.75	0.75
AUC	0.94	0.96	0.94	0.94	0.95	0.94	0.94	0.94	0.94	0.95	0.96	0.94	0.95	0.93	0.94	0.94	0.94	0.95	0.94	0.94
Panel C – full raw accounting data
Accuracy	0.95	0.97	0.96	0.96	0.96	0.95	0.96	0.95	0.96	0.96	0.96	0.95	0.96	0.95	0.96	0.95	0.97	0.95	0.96	0.96
Specificity	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99
Precision	0.94	0.95	0.94	0.96	0.95	0.96	0.94	0.95	0.95	0.95	0.96	0.95	0.95	0.97	0.96	0.95	0.91	0.95	0.89	0.95
Recall	0.67	0.65	0.64	0.64	0.72	0.69	0.65	0.66	0.68	0.74	0.68	0.70	0.66	0.66	0.65	0.68	0.72	0.69	0.65	0.71
F1-score	0.78	0.77	0.76	0.77	0.82	0.80	0.77	0.78	0.79	0.84	0.79	0.81	0.78	0.79	0.77	0.79	0.81	0.80	0.75	0.81
AUC	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.97	0.98	0.98

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.88	0.91	0.89	0.90	0.89	0.86	0.90	0.88	0.88	0.88	0.89	0.87	0.90	0.87	0.89	0.87	0.90	0.87	0.90	0.88
Specificity	0.96	0.98	0.98	0.98	0.97	0.97	0.98	0.97	0.97	0.97	0.98	0.97	0.98	0.97	0.98	0.97	0.96	0.96	0.96	0.97
Precision	0.84	0.85	0.87	0.88	0.86	0.88	0.85	0.88	0.86	0.90	0.91	0.86	0.87	0.84	0.86	0.88	0.78	0.86	0.71	0.85
Recall	0.62	0.58	0.54	0.52	0.64	0.57	0.54	0.59	0.57	0.68	0.61	0.59	0.58	0.52	0.54	0.64	0.64	0.63	0.56	0.61
F1-score	0.72	0.69	0.67	0.65	0.74	0.69	0.66	0.71	0.69	0.78	0.73	0.70	0.69	0.64	0.67	0.74	0.71	0.73	0.63	0.71
AUC	0.92	0.92	0.91	0.91	0.93	0.92	0.91	0.92	0.92	0.94	0.93	0.92	0.92	0.91	0.91	0.93	0.93	0.92	0.91	0.93
Panel B – raw data for financial ratios
Accuracy	0.91	0.93	0.91	0.92	0.91	0.89	0.92	0.90	0.90	0.90	0.93	0.90	0.92	0.89	0.92	0.90	0.93	0.90	0.92	0.90
Specificity	0.98	0.98	0.98	0.99	0.98	0.98	0.98	0.98	0.98	0.97	0.99	0.97	0.98	0.98	0.98	0.97	0.98	0.97	0.98	0.98
Precision	0.88	0.88	0.87	0.89	0.89	0.90	0.87	0.89	0.88	0.90	0.93	0.89	0.89	0.88	0.89	0.90	0.88	0.90	0.89	0.89
Recall	0.67	0.67	0.62	0.60	0.68	0.64	0.61	0.64	0.63	0.71	0.67	0.65	0.63	0.56	0.62	0.68	0.66	0.68	0.64	0.65
F1-score	0.76	0.76	0.72	0.71	0.77	0.75	0.72	0.74	0.73	0.80	0.78	0.75	0.73	0.68	0.73	0.77	0.75	0.77	0.75	0.75
AUC	0.94	0.96	0.94	0.94	0.95	0.94	0.94	0.94	0.94	0.95	0.96	0.94	0.95	0.93	0.94	0.94	0.94	0.95	0.94	0.94
Panel C – full raw accounting data
Accuracy	0.95	0.97	0.96	0.96	0.96	0.95	0.96	0.95	0.96	0.96	0.96	0.95	0.96	0.95	0.96	0.95	0.97	0.95	0.96	0.96
Specificity	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99
Precision	0.94	0.95	0.94	0.96	0.95	0.96	0.94	0.95	0.95	0.95	0.96	0.95	0.95	0.97	0.96	0.95	0.91	0.95	0.89	0.95
Recall	0.67	0.65	0.64	0.64	0.72	0.69	0.65	0.66	0.68	0.74	0.68	0.70	0.66	0.66	0.65	0.68	0.72	0.69	0.65	0.71
F1-score	0.78	0.77	0.76	0.77	0.82	0.80	0.77	0.78	0.79	0.84	0.79	0.81	0.78	0.79	0.77	0.79	0.81	0.80	0.75	0.81
AUC	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.98	0.97	0.98	0.98

Table 8.

SMOTE

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.90	0.91	0.89	0.90	0.90	0.88	0.90	0.89	0.89	0.90	0.89	0.89	0.90	0.89	0.90	0.89	0.91	0.89	0.84	0.89
Specificity	0.95	0.95	0.95	0.98	0.95	0.92	0.97	0.94	0.94	0.93	0.91	0.94	0.97	0.94	0.97	0.94	0.95	0.93	0.85	0.95
Precision	0.86	0.77	0.84	0.90	0.90	0.85	0.90	0.84	0.88	0.89	0.77	0.86	0.89	0.83	0.90	0.89	0.81	0.85	0.56	0.88
Recall	0.76	0.76	0.71	0.68	0.79	0.81	0.70	0.76	0.80	0.86	0.83	0.77	0.72	0.75	0.69	0.83	0.76	0.81	0.80	0.78
F1-score	0.81	0.76	0.77	0.78	0.84	0.83	0.79	0.80	0.84	0.87	0.80	0.82	0.80	0.79	0.78	0.86	0.78	0.83	0.66	0.82
AUC	0.94	0.95	0.93	0.94	0.96	0.95	0.95	0.94	0.95	0.96	0.94	0.95	0.95	0.94	0.94	0.96	0.95	0.95	0.92	0.95
Panel B – raw data for financial ratios
Accuracy	0.92	0.90	0.91	0.93	0.92	0.90	0.92	0.91	0.91	0.91	0.87	0.91	0.93	0.91	0.93	0.91	0.92	0.90	0.86	0.91
Specificity	0.93	0.89	0.93	0.97	0.95	0.90	0.97	0.92	0.95	0.92	0.86	0.94	0.97	0.93	0.97	0.93	0.94	0.89	0.84	0.95
Precision	0.81	0.61	0.78	0.88	0.88	0.79	0.89	0.79	0.89	0.84	0.67	0.85	0.87	0.80	0.87	0.87	0.80	0.78	0.61	0.87
Recall	0.86	0.92	0.83	0.75	0.83	0.90	0.74	0.89	0.82	0.89	0.92	0.83	0.79	0.84	0.78	0.85	0.85	0.90	0.95	0.83
F1-score	0.83	0.73	0.80	0.81	0.85	0.84	0.81	0.84	0.86	0.86	0.78	0.84	0.83	0.82	0.82	0.86	0.83	0.84	0.74	0.85
AUC	0.97	0.97	0.96	0.96	0.97	0.96	0.96	0.97	0.96	0.97	0.96	0.96	0.96	0.96	0.96	0.96	0.97	0.96	0.97	0.97
Panel C – full raw accounting data
Accuracy	0.96	0.97	0.97	0.97	0.97	0.96	0.97	0.96	0.97	0.96	0.96	0.97	0.97	0.97	0.97	0.97	0.97	0.96	0.95	0.97
Specificity	0.97	0.98	0.97	0.98	0.98	0.96	0.99	0.97	0.98	0.96	0.97	0.97	0.98	0.98	0.98	0.97	0.97	0.96	0.96	0.98
Precision	0.87	0.85	0.88	0.93	0.92	0.89	0.94	0.89	0.95	0.90	0.85	0.92	0.91	0.91	0.92	0.92	0.83	0.88	0.76	0.93
Recall	0.94	0.95	0.93	0.92	0.95	0.96	0.92	0.94	0.95	0.96	0.93	0.95	0.94	0.94	0.93	0.95	0.95	0.95	0.91	0.96
F1-score	0.90	0.90	0.91	0.93	0.94	0.92	0.93	0.91	0.95	0.93	0.89	0.93	0.93	0.93	0.92	0.94	0.89	0.91	0.83	0.95
AUC	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	1.00	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	1.00

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.90	0.91	0.89	0.90	0.90	0.88	0.90	0.89	0.89	0.90	0.89	0.89	0.90	0.89	0.90	0.89	0.91	0.89	0.84	0.89
Specificity	0.95	0.95	0.95	0.98	0.95	0.92	0.97	0.94	0.94	0.93	0.91	0.94	0.97	0.94	0.97	0.94	0.95	0.93	0.85	0.95
Precision	0.86	0.77	0.84	0.90	0.90	0.85	0.90	0.84	0.88	0.89	0.77	0.86	0.89	0.83	0.90	0.89	0.81	0.85	0.56	0.88
Recall	0.76	0.76	0.71	0.68	0.79	0.81	0.70	0.76	0.80	0.86	0.83	0.77	0.72	0.75	0.69	0.83	0.76	0.81	0.80	0.78
F1-score	0.81	0.76	0.77	0.78	0.84	0.83	0.79	0.80	0.84	0.87	0.80	0.82	0.80	0.79	0.78	0.86	0.78	0.83	0.66	0.82
AUC	0.94	0.95	0.93	0.94	0.96	0.95	0.95	0.94	0.95	0.96	0.94	0.95	0.95	0.94	0.94	0.96	0.95	0.95	0.92	0.95
Panel B – raw data for financial ratios
Accuracy	0.92	0.90	0.91	0.93	0.92	0.90	0.92	0.91	0.91	0.91	0.87	0.91	0.93	0.91	0.93	0.91	0.92	0.90	0.86	0.91
Specificity	0.93	0.89	0.93	0.97	0.95	0.90	0.97	0.92	0.95	0.92	0.86	0.94	0.97	0.93	0.97	0.93	0.94	0.89	0.84	0.95
Precision	0.81	0.61	0.78	0.88	0.88	0.79	0.89	0.79	0.89	0.84	0.67	0.85	0.87	0.80	0.87	0.87	0.80	0.78	0.61	0.87
Recall	0.86	0.92	0.83	0.75	0.83	0.90	0.74	0.89	0.82	0.89	0.92	0.83	0.79	0.84	0.78	0.85	0.85	0.90	0.95	0.83
F1-score	0.83	0.73	0.80	0.81	0.85	0.84	0.81	0.84	0.86	0.86	0.78	0.84	0.83	0.82	0.82	0.86	0.83	0.84	0.74	0.85
AUC	0.97	0.97	0.96	0.96	0.97	0.96	0.96	0.97	0.96	0.97	0.96	0.96	0.96	0.96	0.96	0.96	0.97	0.96	0.97	0.97
Panel C – full raw accounting data
Accuracy	0.96	0.97	0.97	0.97	0.97	0.96	0.97	0.96	0.97	0.96	0.96	0.97	0.97	0.97	0.97	0.97	0.97	0.96	0.95	0.97
Specificity	0.97	0.98	0.97	0.98	0.98	0.96	0.99	0.97	0.98	0.96	0.97	0.97	0.98	0.98	0.98	0.97	0.97	0.96	0.96	0.98
Precision	0.87	0.85	0.88	0.93	0.92	0.89	0.94	0.89	0.95	0.90	0.85	0.92	0.91	0.91	0.92	0.92	0.83	0.88	0.76	0.93
Recall	0.94	0.95	0.93	0.92	0.95	0.96	0.92	0.94	0.95	0.96	0.93	0.95	0.94	0.94	0.93	0.95	0.95	0.95	0.91	0.96
F1-score	0.90	0.90	0.91	0.93	0.94	0.92	0.93	0.91	0.95	0.93	0.89	0.93	0.93	0.93	0.92	0.94	0.89	0.91	0.83	0.95
AUC	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	1.00	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	0.99	1.00

Table 9.

Random undersampling

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.83	0.83	0.83	0.83	0.84	0.83	0.82	0.82	0.83	0.85	0.81	0.83	0.83	0.84	0.82	0.84	0.84	0.83	0.84	0.83
Specificity	0.82	0.82	0.86	0.89	0.84	0.85	0.88	0.85	0.85	0.82	0.79	0.84	0.85	0.86	0.84	0.83	0.85	0.81	0.87	0.85
Precision	0.83	0.76	0.85	0.83	0.85	0.88	0.83	0.86	0.85	0.88	0.82	0.85	0.83	0.86	0.81	0.87	0.82	0.86	0.84	0.85
Recall	0.83	0.84	0.80	0.75	0.83	0.82	0.75	0.80	0.81	0.87	0.82	0.82	0.81	0.81	0.79	0.84	0.83	0.85	0.81	0.82
F1-score	0.83	0.80	0.82	0.79	0.84	0.85	0.79	0.83	0.83	0.88	0.82	0.84	0.82	0.83	0.80	0.85	0.83	0.85	0.82	0.84
AUC	0.91	0.91	0.91	0.90	0.92	0.91	0.90	0.91	0.91	0.93	0.91	0.91	0.91	0.91	0.90	0.92	0.92	0.91	0.93	0.92
Panel B – raw data for financial ratios
Accuracy	0.85	0.87	0.85	0.86	0.87	0.86	0.85	0.86	0.85	0.87	0.87	0.85	0.87	0.85	0.86	0.86	0.85	0.86	0.84	0.87
Specificity	0.85	0.90	0.88	0.90	0.88	0.88	0.89	0.89	0.87	0.85	0.90	0.86	0.90	0.90	0.89	0.86	0.87	0.85	0.90	0.88
Precision	0.85	0.86	0.85	0.86	0.89	0.91	0.85	0.90	0.88	0.90	0.90	0.88	0.88	0.88	0.85	0.89	0.84	0.89	0.85	0.89
Recall	0.84	0.83	0.82	0.81	0.86	0.83	0.81	0.84	0.83	0.88	0.85	0.84	0.83	0.80	0.82	0.86	0.84	0.86	0.76	0.85
F1-score	0.85	0.85	0.83	0.83	0.88	0.87	0.83	0.87	0.85	0.89	0.87	0.86	0.85	0.84	0.84	0.88	0.84	0.87	0.80	0.87
AUC	0.93	0.95	0.93	0.93	0.94	0.93	0.93	0.94	0.93	0.95	0.95	0.93	0.94	0.93	0.93	0.94	0.93	0.93	0.91	0.94
Panel C – full raw accounting data
Accuracy	0.91	0.92	0.91	0.91	0.92	0.92	0.91	0.91	0.91	0.91	0.92	0.91	0.91	0.90	0.90	0.91	0.90	0.90	0.86	0.91
Specificity	0.90	0.92	0.91	0.93	0.91	0.93	0.92	0.93	0.91	0.89	0.95	0.91	0.93	0.92	0.91	0.91	0.89	0.88	0.90	0.91
Precision	0.90	0.86	0.91	0.91	0.91	0.94	0.90	0.93	0.92	0.92	0.94	0.92	0.92	0.92	0.90	0.92	0.86	0.90	0.84	0.92
Recall	0.91	0.91	0.90	0.89	0.92	0.92	0.89	0.89	0.91	0.92	0.90	0.92	0.89	0.88	0.89	0.90	0.92	0.91	0.81	0.91
F1-score	0.91	0.88	0.91	0.90	0.92	0.93	0.90	0.91	0.91	0.92	0.92	0.92	0.91	0.90	0.90	0.91	0.89	0.91	0.83	0.92
AUC	0.97	0.98	0.97	0.97	0.98	0.98	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.96	0.94	0.97

Metrics	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14	R15	R16	R17	R18	R19	R20
Panel A – financial ratios
Accuracy	0.83	0.83	0.83	0.83	0.84	0.83	0.82	0.82	0.83	0.85	0.81	0.83	0.83	0.84	0.82	0.84	0.84	0.83	0.84	0.83
Specificity	0.82	0.82	0.86	0.89	0.84	0.85	0.88	0.85	0.85	0.82	0.79	0.84	0.85	0.86	0.84	0.83	0.85	0.81	0.87	0.85
Precision	0.83	0.76	0.85	0.83	0.85	0.88	0.83	0.86	0.85	0.88	0.82	0.85	0.83	0.86	0.81	0.87	0.82	0.86	0.84	0.85
Recall	0.83	0.84	0.80	0.75	0.83	0.82	0.75	0.80	0.81	0.87	0.82	0.82	0.81	0.81	0.79	0.84	0.83	0.85	0.81	0.82
F1-score	0.83	0.80	0.82	0.79	0.84	0.85	0.79	0.83	0.83	0.88	0.82	0.84	0.82	0.83	0.80	0.85	0.83	0.85	0.82	0.84
AUC	0.91	0.91	0.91	0.90	0.92	0.91	0.90	0.91	0.91	0.93	0.91	0.91	0.91	0.91	0.90	0.92	0.92	0.91	0.93	0.92
Panel B – raw data for financial ratios
Accuracy	0.85	0.87	0.85	0.86	0.87	0.86	0.85	0.86	0.85	0.87	0.87	0.85	0.87	0.85	0.86	0.86	0.85	0.86	0.84	0.87
Specificity	0.85	0.90	0.88	0.90	0.88	0.88	0.89	0.89	0.87	0.85	0.90	0.86	0.90	0.90	0.89	0.86	0.87	0.85	0.90	0.88
Precision	0.85	0.86	0.85	0.86	0.89	0.91	0.85	0.90	0.88	0.90	0.90	0.88	0.88	0.88	0.85	0.89	0.84	0.89	0.85	0.89
Recall	0.84	0.83	0.82	0.81	0.86	0.83	0.81	0.84	0.83	0.88	0.85	0.84	0.83	0.80	0.82	0.86	0.84	0.86	0.76	0.85
F1-score	0.85	0.85	0.83	0.83	0.88	0.87	0.83	0.87	0.85	0.89	0.87	0.86	0.85	0.84	0.84	0.88	0.84	0.87	0.80	0.87
AUC	0.93	0.95	0.93	0.93	0.94	0.93	0.93	0.94	0.93	0.95	0.95	0.93	0.94	0.93	0.93	0.94	0.93	0.93	0.91	0.94
Panel C – full raw accounting data
Accuracy	0.91	0.92	0.91	0.91	0.92	0.92	0.91	0.91	0.91	0.91	0.92	0.91	0.91	0.90	0.90	0.91	0.90	0.90	0.86	0.91
Specificity	0.90	0.92	0.91	0.93	0.91	0.93	0.92	0.93	0.91	0.89	0.95	0.91	0.93	0.92	0.91	0.91	0.89	0.88	0.90	0.91
Precision	0.90	0.86	0.91	0.91	0.91	0.94	0.90	0.93	0.92	0.92	0.94	0.92	0.92	0.92	0.90	0.92	0.86	0.90	0.84	0.92
Recall	0.91	0.91	0.90	0.89	0.92	0.92	0.89	0.89	0.91	0.92	0.90	0.92	0.89	0.88	0.89	0.90	0.92	0.91	0.81	0.91
F1-score	0.91	0.88	0.91	0.90	0.92	0.93	0.90	0.91	0.91	0.92	0.92	0.92	0.91	0.90	0.90	0.91	0.89	0.91	0.83	0.92
AUC	0.97	0.98	0.97	0.97	0.98	0.98	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.97	0.96	0.94	0.97

Table 10.

Results using different accounting reports

Metrics	P&L		BS		Full financial statement
Metrics	RF	GBT	RF	GBT	RF	GBT
Panel A – Unbalanced data set
Accuracy	0.93	0.91	0.95	0.93	0.98	0.97
Specificity	0.99	0.99	0.99	0.99	0.99	0.98
Precision	0.85	0.79	0.93	0.91	0.98	0.97
Recall	0.45	0.35	0.58	0.47	0.9	0.89
F1-score	0.59	0.49	0.71	0.62	0.94	0.92
AUC	0.91	0.88	0.96	0.93	0.99	0.99
MCC	0.58	0.49	0.71	0.63	0.88	0.88
Panel B – SMOTE oversampling
Accuracy	0.92	0.81	0.96	0.87	0.98	0.95
Specificity	0.92	0.83	0.96	0.88	0.98	0.94
Precision	0.92	0.83	0.96	0.88	0.98	0.95
Recall	0.93	0.79	0.96	0.85	0.99	0.94
F1-score	0.92	0.81	0.96	0.86	0.98	0.96
AUC	0.98	0.89	0.99	0.94	0.99	0.98
Panel C – Random undersampling
Accuracy	0.83	0.80	0.88	0.86	0.98	0.96
Specificity	0.85	0.83	0.88	0.87	0.98	0.97
Precision	0.84	0.82	0.88	0.86	0.98	0.97
Recall	0.81	0.77	0.88	0.85	0.99	0.95
F1-score	0.83	0.80	0.88	0.85	0.98	0.96
AUC	0.91	0.88	0.95	0.93	0.99	0.99

Metrics	P&L		BS		Full financial statement
Metrics	RF	GBT	RF	GBT	RF	GBT
Panel A – Unbalanced data set
Accuracy	0.93	0.91	0.95	0.93	0.98	0.97
Specificity	0.99	0.99	0.99	0.99	0.99	0.98
Precision	0.85	0.79	0.93	0.91	0.98	0.97
Recall	0.45	0.35	0.58	0.47	0.9	0.89
F1-score	0.59	0.49	0.71	0.62	0.94	0.92
AUC	0.91	0.88	0.96	0.93	0.99	0.99
MCC	0.58	0.49	0.71	0.63	0.88	0.88
Panel B – SMOTE oversampling
Accuracy	0.92	0.81	0.96	0.87	0.98	0.95
Specificity	0.92	0.83	0.96	0.88	0.98	0.94
Precision	0.92	0.83	0.96	0.88	0.98	0.95
Recall	0.93	0.79	0.96	0.85	0.99	0.94
F1-score	0.92	0.81	0.96	0.86	0.98	0.96
AUC	0.98	0.89	0.99	0.94	0.99	0.98
Panel C – Random undersampling
Accuracy	0.83	0.80	0.88	0.86	0.98	0.96
Specificity	0.85	0.83	0.88	0.87	0.98	0.97
Precision	0.84	0.82	0.88	0.86	0.98	0.97
Recall	0.81	0.77	0.88	0.85	0.99	0.95
F1-score	0.83	0.80	0.88	0.85	0.98	0.96
AUC	0.91	0.88	0.95	0.93	0.99	0.99

Table 11.

SMOTE Extension results (testing)

Method	Accuracy – test	Specificity – test	Precision – test	Recall – test	F1-score – test	AUC – test
SMOTE	0.98	0.98	0.98	0.99	0.98	0.99
Borderline SMOTE	0.99	0.98	0.98	0.99	0.98	0.99
ADASYN	0.97	0.96	0.97	0.98	0.97	0.99
KMeans SMOTE (K = 100)	0.97	0.97	0.99	0.94	0.97	0.99

Method	Accuracy – test	Specificity – test	Precision – test	Recall – test	F1-score – test	AUC – test
SMOTE	0.98	0.98	0.98	0.99	0.98	0.99
Borderline SMOTE	0.99	0.98	0.98	0.99	0.98	0.99
ADASYN	0.97	0.96	0.97	0.98	0.97	0.99
KMeans SMOTE (K = 100)	0.97	0.97	0.99	0.94	0.97	0.99

Table 12.

SMOTE Extension results (out-of-sample validation)

Method	Accuracy – validation	Specificity – validation	Precision – validation	Recall – validation	F1-score – validation	AUC – validation
SMOTE	0.93	0.95	0.94	0.91	0.92	0.98
Borderline SMOTE	0.93	0.95	0.95	0.91	0.93	0.98
ADASYN	0.93	0.95	0.95	0.91	0.93	0.98
K-means SMOTE (K = 100)	0.86	0.98	0.97	0.72	0.83	0.97

Method	Accuracy – validation	Specificity – validation	Precision – validation	Recall – validation	F1-score – validation	AUC – validation
SMOTE	0.93	0.95	0.94	0.91	0.92	0.98
Borderline SMOTE	0.93	0.95	0.95	0.91	0.93	0.98
ADASYN	0.93	0.95	0.95	0.91	0.93	0.98
K-means SMOTE (K = 100)	0.86	0.98	0.97	0.72	0.83	0.97

Table 13.

Out-of-time validation

Method	Accuracy	Specificity	Precision	Recall	F1-score	AUC
Random forest	0.89	0.98	0.91	0.60	0.72	0.95

Table 14.

Relative variable importance of the random forest model using the financial ratios-based data set

Input variable	Feature importance
Equity/total debt t – 1	100.00
Cash/current liabilities t – 1	81.25
Tax and social security debts/total asset t – 3	68.75
Tax and social security debts/total asset t – 1	50.00
Cash/current liabilities t – 3	43.75
EBIT/interest expenses t – 1	31.25
Equity/total debts t – 2	25.00
Average payment time t – 1	25.00
EBIT/interest expenses t – 2	18.75
Equity/total debts t – 3	18.75

Input variable	Feature importance
Equity/total debt t – 1	100.00
Cash/current liabilities t – 1	81.25
Tax and social security debts/total asset t – 3	68.75
Tax and social security debts/total asset t – 1	50.00
Cash/current liabilities t – 3	43.75
EBIT/interest expenses t – 1	31.25
Equity/total debts t – 2	25.00
Average payment time t – 1	25.00
EBIT/interest expenses t – 2	18.75
Equity/total debts t – 3	18.75

Table 15.

Relative variable importance of the random forest model using the raw accounting data set

Input variable	Feature importance
Total short debts t – 1	100.00
Total equity t – 1	44.44
Total shareholders fund t – 1	33.33
Revenues from sales and services t – 1	22.22
Total financial charges t – 2	22.22
Due to suppliers t – 3	22.22
Profit (loss) t – 1	22.22
Total financial charges t – 3	22.22
Due to suppliers t – 1	22.22
Total financial charges t – 1	22.22

Input variable	Feature importance
Total short debts t – 1	100.00
Total equity t – 1	44.44
Total shareholders fund t – 1	33.33
Revenues from sales and services t – 1	22.22
Total financial charges t – 2	22.22
Due to suppliers t – 3	22.22
Profit (loss) t – 1	22.22
Total financial charges t – 3	22.22
Due to suppliers t – 1	22.22
Total financial charges t – 1	22.22

Table A1.

Feature description

#	Financial ratio	Feature type (primary domain)	Cross-links
1	Accounts receivables/inventory	Working-capital composition	Liquidity; operating cycle
2	Average cashing time*	Efficiency	Liquidity; operating cycle
3	Average payment time*	Efficiency	Liquidity; operating cycle
4	Cash/current liabilities	Liquidity	Short-term solvency
5	Current assets/total assets	Liquidity	Working capital composition; solvency
6	Ebita/interest expenses	Coverage	Profitability; solvency
7	Ebitdaa/total debts	Coverage	Leverage
8	Net financial position/Ebitda	Leverage	Coverage
9	Net financial position/total equity	Leverage	Liquidity
10	Operating profit/total sales	Profitability	Operating efficiency
11	Tax and social security debts/total assets	Solvency	Liquidity structure
12	Total debt / (total debt + total equity) (Gearing)	Leverage	Solvency
13	Total equity/total debts	Leverage	Solvency
14	Total sales/total assets	Efficiency	Profitability
15	Working capital/total sales	Efficiency	Liquidity

#	Financial ratio	Feature type (primary domain)	Cross-links
1	Accounts receivables/inventory	Working-capital composition	Liquidity; operating cycle
2	Average cashing time*	Efficiency	Liquidity; operating cycle
3	Average payment time*	Efficiency	Liquidity; operating cycle
4	Cash/current liabilities	Liquidity	Short-term solvency
5	Current assets/total assets	Liquidity	Working capital composition; solvency
6	Ebita/interest expenses	Coverage	Profitability; solvency
7	Ebitdaa/total debts	Coverage	Leverage
8	Net financial position/Ebitda	Leverage	Coverage
9	Net financial position/total equity	Leverage	Liquidity
10	Operating profit/total sales	Profitability	Operating efficiency
11	Tax and social security debts/total assets	Solvency	Liquidity structure
12	Total debt / (total debt + total equity) (Gearing)	Leverage	Solvency
13	Total equity/total debts	Leverage	Solvency
14	Total sales/total assets	Efficiency	Profitability
15	Working capital/total sales	Efficiency	Liquidity

Note(s):

*Ebit = earnings before interest and taxes; Ebitda = earnings before interest, taxes, depreciation and amortization; average cashing time: accounts receivables *360 / net sales; average payment time: due to suppliers *360 / cost for purchase of goods and services

Table A2.

Descriptive statistics

Financial ratio	Mean	Min	25%	50%	75%	Max
Descriptive statistics for financial ratios
Accounts receivable/inventory	27.96	0	0	0	1.41	270
Average cashing time (days)	698.59	0	0	68.52	158.46	195.491
Average payment time (days)	141.78	1.40	46.14	117.44	247.14	756
Cash/current liabilities	6.69	0	0.01	0.09	0.41	113
Current assets/total assets	0.67	0	0.47	0.79	0.94	24
Ebit/interest expenses	5.63	−3	0	1.69	11.02	146
Ebitda*/total debts	0.89	−12	0.00	0.07	0.18	434
Net financial position/ebitda	17.93	−64	−1.44	0.12	4.38	155
Net financial position/total equity	4.36	−57	−0.44	0.10	2.11	71
Operating profit/total sales	7.36	−824	0	0.03	0.09	46
Tax and social security debts/total assets	1.19	5	0.00	0.03	0.09	337
Total debt/total debt + total equity (gearing)	0.01	−11	−0.05	0.47	0.89	96
Total equity/total debts	1.75	−721	0.05	0.21	0.79	192.173
Total sales/total assets	1.19	0	0.13	0.77	1.42	17
Working capital/net sales	1.86	−126	0.34	0.07	1.25	64
Descriptive statistics of main raw accounting data (financial data in thousand euros)
Number of employees	12	0	0	2	7	61,904
Total fixed assets	1.962	0	10	76	474	12,362,509
Total current assets	25	0	92	332	1,140	9,392,512
Trade accounts - beyond 12 months	15	0	0	0	0	419,365
Total liquid funds	270	0	4	18	79	1,622,776
Other provisions	51	0	0	0	0	1,183,545
Bonds	4	0	0	0	0	328,336
Bonds beyond 12 months	20	0	0	0	0	1,398,697
Due to shareholders for loans	94	0	0	0	0	4,317,942
Due to sharesholders for loans - beyond 12 months	131	0	0	0	0	3,673,494
Due to banks	411	0	0	0	0	1,329,994
Due to banks - beyond 12 months	388	0	0	0	45	1,748,214
Due to other lenders	29	0	0	0	0	756,154
Due to other lenders - beyond 12 Months	35	0	0	0	0	1,128,329
Due to suppliers	681	0	7	58	284	5,101,812
Due to suppliers - beyond 12 months	13	0	0	0	0	182,680
Tax payable	92	0	1	9	42	1,454,786
Tax payable beyond 12 months	15	0	0	0	0	118,928
Due to social security institutions	33	0	0	2	12	119,954
Due to social security institutions - beyond 12 months	3	0	0	0	0	43,747
Services	731	0	15	69	260	2,921,213
Total depreciation, amortization and writedowns	116	0	1	8	33	1,473,550
Provisions for risks and charges	10	0	0	0	0	294,428
Operating margin	92	−7759571	−1	13	55	1,004,223
Total financial charges	49	0	0	3	17	812,193
Profit (loss) group	28	−1399393	−5	2	19	847,752

Financial ratio	Mean	Min	25%	50%	75%	Max
Descriptive statistics for financial ratios
Accounts receivable/inventory	27.96	0	0	0	1.41	270
Average cashing time (days)	698.59	0	0	68.52	158.46	195.491
Average payment time (days)	141.78	1.40	46.14	117.44	247.14	756
Cash/current liabilities	6.69	0	0.01	0.09	0.41	113
Current assets/total assets	0.67	0	0.47	0.79	0.94	24
Ebit/interest expenses	5.63	−3	0	1.69	11.02	146
Ebitda*/total debts	0.89	−12	0.00	0.07	0.18	434
Net financial position/ebitda	17.93	−64	−1.44	0.12	4.38	155
Net financial position/total equity	4.36	−57	−0.44	0.10	2.11	71
Operating profit/total sales	7.36	−824	0	0.03	0.09	46
Tax and social security debts/total assets	1.19	5	0.00	0.03	0.09	337
Total debt/total debt + total equity (gearing)	0.01	−11	−0.05	0.47	0.89	96
Total equity/total debts	1.75	−721	0.05	0.21	0.79	192.173
Total sales/total assets	1.19	0	0.13	0.77	1.42	17
Working capital/net sales	1.86	−126	0.34	0.07	1.25	64
Descriptive statistics of main raw accounting data (financial data in thousand euros)
Number of employees	12	0	0	2	7	61,904
Total fixed assets	1.962	0	10	76	474	12,362,509
Total current assets	25	0	92	332	1,140	9,392,512
Trade accounts - beyond 12 months	15	0	0	0	0	419,365
Total liquid funds	270	0	4	18	79	1,622,776
Other provisions	51	0	0	0	0	1,183,545
Bonds	4	0	0	0	0	328,336
Bonds beyond 12 months	20	0	0	0	0	1,398,697
Due to shareholders for loans	94	0	0	0	0	4,317,942
Due to sharesholders for loans - beyond 12 months	131	0	0	0	0	3,673,494
Due to banks	411	0	0	0	0	1,329,994
Due to banks - beyond 12 months	388	0	0	0	45	1,748,214
Due to other lenders	29	0	0	0	0	756,154
Due to other lenders - beyond 12 Months	35	0	0	0	0	1,128,329
Due to suppliers	681	0	7	58	284	5,101,812
Due to suppliers - beyond 12 months	13	0	0	0	0	182,680
Tax payable	92	0	1	9	42	1,454,786
Tax payable beyond 12 months	15	0	0	0	0	118,928
Due to social security institutions	33	0	0	2	12	119,954
Due to social security institutions - beyond 12 months	3	0	0	0	0	43,747
Services	731	0	15	69	260	2,921,213
Total depreciation, amortization and writedowns	116	0	1	8	33	1,473,550
Provisions for risks and charges	10	0	0	0	0	294,428
Operating margin	92	−7759571	−1	13	55	1,004,223
Total financial charges	49	0	0	3	17	812,193
Profit (loss) group	28	−1399393	−5	2	19	847,752

Table A3.

The sample distribution based on the industrial sector

Industry classification according to NACE one-digit code	(%)
Industry	(%)
Agriculture	1.05
Manifacturing, mining and quarrying	24.11
Utilities	1.26
Construction	20.05
Wholesale and retail trade	26.09
Transportation and storage	5.58
Accommodation and food service activities	3.80
Real estate activities	4.80
Information and communication	2.42
Financial and insurance activities	0.62
Services	10.22
Total	100.00

Industry classification according to NACE one-digit code	(%)
Industry	(%)
Agriculture	1.05
Manifacturing, mining and quarrying	24.11
Utilities	1.26
Construction	20.05
Wholesale and retail trade	26.09
Transportation and storage	5.58
Accommodation and food service activities	3.80
Real estate activities	4.80
Information and communication	2.42
Financial and insurance activities	0.62
Services	10.22
Total	100.00

Table A4.

Comparative overview of classification methods

Method	Basic idea	Key assumpitons	Strengths	Limitations
Random forest	Ensemble of bootstrapped trees with feature randomness; votes aggregated	Weakly correlated, moderately strong trees; enough trees for stability	Strong out-of-box accuracy; robust to noise and outliers; little tuning; variable importance available	Less interpretable than single tree; slower with many trees; probability calibration sometimes needed
Decision tree	Greedy recursive splits to reduce impurity	None on distribution/scale; assumes meaningful splits exist	Interpretable rules; handles nonlinearity and interactions; invariant to monotone rescaling	Unstable; prone to overfit; lower accuracy vs ensembles
Gradient boosting	Sequentially adds small trees to fit residuals (boosts weak learners)	Additive tree model; learning rate and depth govern bias–variance tradeoff	State-of-the-art tabular accuracy; captures subtle interactions; flexible	More tuning sensitive; can overfit without early stopping; slower than RF
Support vector machine	Finds a maximum-margin boundary (linear or kernelized)	Margin separability in transformed space; appropriate kernel choice; scaled features	Strong on high-dimensional data; handles complex boundaries with kernels	Harder to tune; no native probabilities (needs calibration); slower on very large N; scaling required
Logistic regression	Fits a logistic link between features and the probability of bankruptcy	Linear log-odds; additivity; limited multicollinearity; well-specified features	Simple, fast, well-understood; baseline for odds ratios; calibrated probabilities	Misses nonlinearity and interactions unless engineered; sensitive to scaling/collinearity; underfits complex patterns

Method	Basic idea	Key assumpitons	Strengths	Limitations
Random forest	Ensemble of bootstrapped trees with feature randomness; votes aggregated	Weakly correlated, moderately strong trees; enough trees for stability	Strong out-of-box accuracy; robust to noise and outliers; little tuning; variable importance available	Less interpretable than single tree; slower with many trees; probability calibration sometimes needed
Decision tree	Greedy recursive splits to reduce impurity	None on distribution/scale; assumes meaningful splits exist	Interpretable rules; handles nonlinearity and interactions; invariant to monotone rescaling	Unstable; prone to overfit; lower accuracy vs ensembles
Gradient boosting	Sequentially adds small trees to fit residuals (boosts weak learners)	Additive tree model; learning rate and depth govern bias–variance tradeoff	State-of-the-art tabular accuracy; captures subtle interactions; flexible	More tuning sensitive; can overfit without early stopping; slower than RF
Support vector machine	Finds a maximum-margin boundary (linear or kernelized)	Margin separability in transformed space; appropriate kernel choice; scaled features	Strong on high-dimensional data; handles complex boundaries with kernels	Harder to tune; no native probabilities (needs calibration); slower on very large N; scaling required
Logistic regression	Fits a logistic link between features and the probability of bankruptcy	Linear log-odds; additivity; limited multicollinearity; well-specified features	Simple, fast, well-understood; baseline for odds ratios; calibrated probabilities	Misses nonlinearity and interactions unless engineered; sensitive to scaling/collinearity; underfits complex patterns

Table A5.

List of Italian regions

Code	Region
R1	Abruzzo
R2	Basilicata
R3	Calabria
R4	Campania
R5	Emilia-Romagna
R6	Friuli-Venezia Giulia
R7	Lazio
R8	Liguria
R9	Lombardia
R10	Marche
R11	Molise
R12	Piemonte
R13	Puglia
R14	Sardegna
R15	Sicilia
R16	Toscana
R17	Trentino-Alto Adige
R18	Umbria
R19	Val d’Aosta
R20	Veneto

Code	Region
R1	Abruzzo
R2	Basilicata
R3	Calabria
R4	Campania
R5	Emilia-Romagna
R6	Friuli-Venezia Giulia
R7	Lazio
R8	Liguria
R9	Lombardia
R10	Marche
R11	Molise
R12	Piemonte
R13	Puglia
R14	Sardegna
R15	Sicilia
R16	Toscana
R17	Trentino-Alto Adige
R18	Umbria
R19	Val d’Aosta
R20	Veneto

Alaka

H.A.

Oyedele

L.O.

Owolabi

H.A.

Kumar

Ajayi

S.O.

Akinade

O.O.

and

Bilal

(

2018

), “

Systematic review of bankruptcy prediction models: towards a framework for tool selection

”,

Expert Systems with Applications

, Vol.

, pp.

164

184

, doi:

https://doi.org/10.2307/2978933

Altman

E.I.

(

1968

), “

Financial ratios, discriminant analysis and the prediction of corporate bankruptcy

”,

The Journal of Finance

, Vol.

No.

, pp.

589

609

, doi:

https://doi.org/10.1111/j.1467-6281.2007.00234.x

Altman

E.I.

and

Sabato

(

2007

), “

Modelling credit risk for SMEs: evidence from the U.S. Market

”,

Abacus

, Vol.

No.

, pp.

332

357

, doi:

Ashraf

and

Ahmed

(

2020

), “

Machine learning shrewd approach for an imbalanced dataset conversion samples

”,

Journal of Engineering and Technology (JET)

, Vol.

No.

1 SE-Articles

, pp.

https://doi.org/10.1016/j.bar.2005.09.001

Balcaen

and

Ooghe

(

2006

), “

35 Years of studies on business failure: an overview of the classic statistical methodologies and their related problems

”,

The British Accounting Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1111/1475-679X.12292

Bao

Y.J.

and

Zhang

(

2020

), “

Detecting accounting fraud in publicly traded U.S. Firms using a machine learning approach

”,

Journal of Accounting Research

, Vol.

No.

, pp.

199

235

, doi:

https://doi.org/10.1016/j.eswa.2017.04.006

Barboza

Kimura

and

Altman

(

2017

), “

Machine learning models and bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

405

417

, doi:

https://doi.org/10.1177/014920639101700108

Barney

(

1991

), “

Firm resources and sustained competitive advantage

”,

Journal of Management

, Vol.

No.

, pp.

120

, doi:

https://doi.org/10.2307/2490171

Beaver

(

1966

), “

Financial ratios As predictors of failure

”,

Journal of Accounting Research

, Vol.

No.

1966

, pp.

111

, doi:

https://doi.org/10.1007/s11142-004-6341-9

Beaver

McNichols

and

Rhie

J.W.

(

2005

), “

Have financial statements become less informative? Evidence from the ability of financial ratios to predict bankruptcy

”,

Review of Accounting Studies

, Vol.

No.

, pp.

122

, doi:

https://doi.org/10.1016/S0378-4266(02)00319-9

Becchetti

and

Sierra

(

2003

), “

Bankruptcy risk and productive efficiency in manufacturing firms

”,

Journal of Banking and Finance

, Vol.

No.

, pp.

2099

2120

, doi:

https://doi.org/10.1016/j.procs.2016.06.016

Belavagi

M.C.

and

Muniyal

(

2016

), “

Performance evaluation of supervised machine learning algorithms for intrusion detection

”,

Procedia Computer Science

, Vol.

, pp.

117

123

, doi:

https://doi.org/10.1007/s11142-020-09554-9

Bertomeu

(

2020

), “

Machine learning improves accounting: Discussion, implementation and research opportunities

”,

Review of Accounting Studies

, Vol.

No.

, pp.

1135

1155

, doi:

https://doi.org/10.1007/s11142-020-09563-8

Bertomeu

Cheynel

Floyd

and

Pan

(

2021

), “

Using machine learning to detect misstatements

”,

Review of Accounting Studies

, Vol.

No.

, pp.

468

519

, doi:

https://doi.org/10.1016/j.eswa.2013.12.009

Booth

Gerding

and

Mcgroarty

(

2014

Expert Systems with Applications and Seasonality. Expert Systems with Applications

, Vol.

No.

, pp.

3651

3661

, doi:

https://doi.org/10.1007/s00191-011-0224-6

Bottazzi

Grazzi

Secchi

and

Tamagni

(

2011

), “

Financial and economic determinants of firm default

”,

Journal of Evolutionary Economics

, Vol.

No.

, pp.

373

406

, doi:

https://doi.org/10.2307/257138

Bourgeois

L.J.

(

1981

), “On the measurement of organizational slack”,

The Academy of Management Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/S0031-3203(96)00142-2

Bradley

A.E.

(

1997

), “

The use of the area under the ROC curve in the evaluation of machine learning algorithms

”,

Pattern Recognition

, Vol.

No.

, pp.

1145

1159

, doi:

https://doi.org/10.3390/risks8030083

Breiman

(

1996

), “

Bagging predictors

”,

Machine Learning

, Vol.

No.

, pp.

123

140

, doi:

https://doi.org/10.1023/A:1010933404324

Breiman

(

2001

), “

Random forests

”,

Machine Learning

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1111/j.1540-6261.2008.01416.x

Campbell

J.Y.

Hilscher

and

Szilagyi

(

2008

), “

In search of distress risk

”,

The Journal of Finance

, Vol.

No.

, pp.

2899

2939

, doi:

https://doi.org/10.1016/j.iref.2018.03.008

Carmona

Climent

and

Momparler

(

2019

), “

Predicting failure in the U.S. banking sector: an extreme gradient boosting approach

”,

International Review of Economics and Finance

, Vol.

, pp.

304

323

, doi:

https://doi.org/10.1080/0963818042000216811

Charitou

Neophytou

and

Charalambous

(

2004

), “

Predicting corporate failure: empirical evidence for the UK

”,

European Accounting Review

, Vol.

No.

, pp.

465

497

, doi:

https://doi.org/10.1613/jair.953

Chawla

Bowyer

K.W.

Hall

L.O.

and

Kegelmeyer

W.P.

(

2002

), “

SMOTE: synthetic minority over-sampling technique

”,

Journal of Artificial Intelligence Research

, Vol.

, pp.

321

357

, doi:

https://doi.org/10.1016/j.camwa.2011.10.030

Chen

M.Y.

(

2011

), “

Bankruptcy prediction in firms with statistical and intelligent techniques and a comparison of evolutionary computation approaches

”,

Computers and Mathematics with Applications

, Vol.

No.

, pp.

4514

4524

, doi:

https://doi.org/10.1007/s10462-015-9434-x

Chen

Ribeiro

and

Chen

(

2016

), “

Financial credit risk assessment: a recent review

”,

Artificial Intelligence Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.asoc.2017.03.014

Chou

C.H.

Hsieh

S.C.

and

Qiu

C.J.

(

2017

), “

Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction

”,

Applied Soft Computing

, Vol.

, pp.

298

316

, doi:

https://doi.org/10.1016/j.jbusres.2018.11.015

Climent

Momparler

and

Carmona

(

2019

), “

Anticipating bank distress in the eurozone: an extreme gradient boosting approach

”,

Journal of Business Research

, Vol.

101

, pp.

885

896

, doi:

https://doi.org/10.1007/BF00994018

Consiglio Nazionale dei Dottori Commercialisti e degli Esperti Contabili

(

2019

), “Crisi d’Impresa”,

Gli Indici Di Allerta

Cortes

and

Vapnik

(

1995

), “

Support-Vector networks

”,

Machine Learning

, Vol.

No.

, pp.

273

297

, doi:

https://doi.org/10.5465/256801

Daily

C.M.

and

Dalton

D.R.

(

1994

), “

Bankruptcy and corporate governance: the impact of board composition and structure

”,

Academy of Management Journal

, Vol.

No.

, pp.

1603

1617

, doi:

https://doi.org/10.1177/0148558X14560898

Darrat

A.F.

Gray

Park

J.C.

and

(

2016

), “

Corporate governance and bankruptcy risk

”,

Journal of Accounting, Auditing and Finance

, Vol.

No.

, pp.

163

202

, doi:

https://doi.org/10.2307/2490225

Deakin

E.B.

(

1972

), “

A discriminant analysis of predictors of business failure

”,

Journal of Accounting Research

, Vol.

No.

, p.

167

, doi:

https://doi.org/10.1016/j.accinf.2023.100617

Desai

Bucaro

A.C.

Kim

J.W.

Srivastava

and

Desai

(

2023

), “

Toward a better expert system for auditor going concern opinions using bayesian network inflation factors

”,

International Journal of Accounting Information Systems

, Vol.

, p.

100617

, doi:

https://doi.org/10.1016/j.neucom.2009.11.034

Du Jardin

(

2010

), “

Predicting bankruptcy using neural networks and other classification methods: the influence of variable selection techniques on model accuracy

”,

Neurocomputing

, Vol.

Nos

10-12

, pp.

2047

2060

, doi:

https://doi.org/10.1016/j.eswa.2017.01.016

Du Jardin

(

2017

), “

Dynamics of firm financial evolution and bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.dss.2018.01.003

Du Jardin

(

2018

), “

Failure pattern-based ensembles applied to bankruptcy forecasting

”,

Decision Support Systems

, Vol.

107

, pp.

, doi:

https://doi.org/10.1016/j.dss.2011.04.001

Du Jardin

and

Séverin

(

2011

), “

Predicting corporate bankruptcy using a self-organizing map: an empirical study to improve the forecasting horizon of a financial failure model

”,

Decision Support Systems

, Vol.

No.

, pp.

701

711

, doi:

https://doi.org/10.2307/2329929

Edmister

R.O.

(

1972

), “

An empirical test of financial ratio analysis for small business failure prediction

”,

The Journal of Financial and Quantitative Analysis

, Vol.

No.

, pp.

1477

1493

, doi:

https://doi.org/10.1007/s13748-019-00197-9

Faris

Abukhurma

Almanaseer

Saadeh

Mora

A.M.

Castillo

P.A.

and

Aljarah

(

2020

), “

Improving financial bankruptcy prediction in a highly imbalanced class distribution using oversampling and ensemble learning: a case from the spanish market

”,

Progress in Artificial Intelligence

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2013.07.032

Fedorova

Gilenko

and

Dovzhenko

(

2013

), “

Bankruptcy prediction for russian companies: application of combined classifiers

”,

Expert Systems with Applications

, Vol.

No.

, pp.

7285

7293

, doi:

https://doi.org/10.1613/jair.1.11192

Fernandez

Garcia

Herrera

and

Chawla

N.V.

(

2018

), “

SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

”,

Journal of Artificial Intelligence Research

, Vol.

, pp.

863

905

, doi:

https://doi.org/10.1016/j.ejor.2015.09.014

Fitzpatrick

and

Mues

(

2016

), “

An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market

”,

European Journal of Operational Research

, Vol.

249

No.

, pp.

427

439

, doi:

https://doi.org/10.1214/aos/1013203451

Friedman

J.H.

(

2001

), “

Greedy function approximation: a gradient boosting machine author

”,

The Annals of Statistics

, Vol.

No.

, pp.

1189

1232

, doi:

https://doi.org/10.1016/S0167-9473(01)00065-2

Friedman

J.H.

(

2002

), “

Stochastic gradient boosting

”,

Computational Statistics and Data Analysis

, Vol.

No.

, pp.

367

378

, doi:

https://doi.org/10.1016/j.mlwa.2022.100343

Garcia

(

2022

), “

Bankruptcy prediction using synthetic sampling

”,

Machine Learning with Applications

, Vol.

, p.

100343

, doi:

https://doi.org/10.1016/j.procs.2015.06.046

Gepp

and

Kumar

(

2015

), “

Predicting financial distress: a comparison of survival analysis and decision tree techniques

”,

Procedia Computer Science

, Vol.

, pp.

396

404

, doi:

Gloubos

and

Grammatikos

(

1988

), “

The success of bankruptcy prediction models in Greece

”,

Studies in Banking and Finance

, Vol.

, pp.

https://doi.org/10.1016/j.ijforecast.2018.01.009

Gogas

Papadimitriou

and

Agrapetidou

(

2018

), “

Forecasting bank failures and stress testing: a machine learning approach

”,

International Journal of Forecasting

, Vol.

No.

, pp.

440

455

, doi:

https://doi.org/10.1016/j.jobe.2019.100950

Gong

Bai

Qin

Wang

Yang

and

Wang

(

2020

), “

Gradient boosting machine for predicting return temperature of district heating system: a case study for residential buildings in Tianjin

”,

Journal of Building Engineering

, Vol.

, p.

100950

, doi:

https://doi.org/10.1111/acfi.12400

Habib

Costa

M.D.

Huang

H.J.

Bhuiyan

M.B.U.

and

Sun

(

2020

), “

Determinants and consequences of financial distress: Review of the empirical literature

”,

Accounting and Finance

, Vol.

No.

, pp.

1023

1075

, doi:

Hastie

Tibshirani

Friedman

J.H.

and

Friedman

J.H.

(

2009

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Springer

https://doi.org/10.1108/JAOC-03-2024-0105

Hazami-Ammar

(

2024

), “

Related party transactions and financial distress: Role of governance and audit attributes

”,

Journal of Accounting and Organizational Change

, doi:

https://doi.org/10.1109/IJCNN.2008.4633969

Bai

Garcia

E.A.

and

(

2008

), “

ADASYN: Adaptive synthetic sampling approach for imbalanced learning

”,

2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, 2008

, pp.

1322

1328

https://doi.org/10.1016/j.asoc.2014.08.009

Heo

and

Yang

J.Y.

(

2014

), “

AdaBoost based bankruptcy forecasting of korean construction companies

”,

Applied Soft Computing

, Vol.

, pp.

494

499

, doi:

https://doi.org/10.1023/B:RAST.0000013627.90884.b7

Hillegeist

S.A.

Keating

E.K.

Cram

D.P.

and

Lundstedt

K.G.

(

2004

), “

Assessing the probability of bankruptcy

”,

Review of Accounting Studies

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2018.09.039

Hosaka

(

2019

), “

Bankruptcy prediction using imaged financial ratios and convolutional neural networks

”,

Expert Systems with Applications

, Vol.

117

, pp.

287

299

, doi:

https://doi.org/10.1016/j.eswa.2006.05.006

Hua

Wang

Zhang

and

Liang

(

2007

), “

Predicting corporate financial distress based on integration of support vector machine and logistic regression

”,

Expert Systems with Applications

, Vol.

No.

, pp.

434

440

, doi:

https://doi.org/10.1016/j.techfore.2021.120658

Jabeur

S.B.

Gharib

Mefteh-Wali

and

Arfi

W.B.

(

2021

), “

CatBoost model and artificial intelligence techniques for corporate failure prediction

”,

Technological Forecasting and Social Change

, Vol.

166

No.

January

, doi:

https://doi.org/10.1016/j.bar.2013.06.009

Jackson

R.H.G.

and

Wood

(

2013

), “

The performance of insolvency prediction and credit risk models in the UK: a comparative study

”,

The British Accounting Review

, Vol.

No.

, pp.

183

202

, doi:

https://doi.org/10.1016/j.asoc.2018.04.033

Jadhav

and

Jenkins

(

2018

), “

Information gain directed genetic algorithm wrapper feature selection for credit rating

”,

Applied Soft Computing

, Vol.

, pp.

541

553

, doi:

https://doi.org/10.1007/s11142-017-9407-1

Jones

(

2017

), “

Corporate bankruptcy prediction: a high dimensional analysis

”,

Review of Accounting Studies

, Vol.

No.

, pp.

1366

1422

, doi:

https://doi.org/10.1016/j.bar.2006.12.003

Jones

and

Hensher

D.A.

(

2007

), “

Modelling corporate failure: a multinomial nested logit analysis for unordered outcomes

”,

The British Accounting Review

, Vol.

No.

, pp.

107

, doi:

https://doi.org/10.1016/j.jbankfin.2015.02.006

Jones

Johnstone

and

Wilson

(

2015

), “

An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes

”,

Journal of Banking and Finance

, Vol.

, pp.

, doi:

https://doi.org/10.1109/ICCONS.2018.8663128

Joshi

Ramesh

and

Tahsildar

(

2019

), “

A bankruptcy prediction model using random Forest

”,

Proceedings of the 2nd International Conference on Intelligent Computing and Control Systems, ICICCS 2018, Iciccs

, pp.

, doi:

https://doi.org/10.1016/0305-0483(90)90020-A

Keasey

McGuinness

and

Short

(

1990

), “

The failure of UK industrial firms for the period 1976–1984, logistic analysis and entropy measures

”,

Omega

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2016.04.027

Kim

H.J.

N.O.

and

Shin

K.S.

(

2016

), “

Optimization of cluster-based evolutionary undersampling for the artificial neural networks in corporate bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

226

234

, doi:

https://doi.org/10.1016/j.eswa.2009.10.012

Kim

and

Kang

D.K.

(

2010

), “

Ensemble with neural networks for bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

No.

, pp.

3373

3379

, doi:

https://doi.org/10.1007/s11156-011-0238-z

Kwak

Shi

and

Kou

(

2012

), “

Bankruptcy prediction for korean firms after the 1997 financial crisis: Using a multiple criteria linear programming data mining approach

”,

Review of Quantitative Finance and Accounting

, Vol.

No.

, pp.

441

453

, doi:

https://doi.org/10.1016/j.ijforecast.2016.02.002

Landry

Erlinger

T.P.

Patschke

and

Varrichio

(

2016

), “

Probabilistic gradient boosting machines for GEFCom2014 wind forecasting

”,

International Journal of Forecasting

, Vol.

No.

, pp.

1061

1066

, doi:

https://doi.org/10.1016/j.eswa.2005.01.004

Lee

Booth

and

Alam

(

2005

), “

A comparison of supervised and unsupervised neural networks in predicting bankruptcy of korean firms

”,

Expert Systems with Applications

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2012.12.009

Lee

and

Choi

W.S.

(

2013

), “

A multi-industry bankruptcy prediction model using back-propagation neural network and multivariate discriminant analysis

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2941

2946

, doi:

https://doi.org/10.1016/j.ejor.2016.01.012

Liang

C.C.

Tsai

C.F.

and

Shih

G.A.

(

2016

), “

Financial ratios and corporate governance indicators in bankruptcy prediction: a comprehensive study

”,

European Journal of Operational Research

, Vol.

252

No.

, pp.

561

572

, doi:

https://doi.org/10.1016/j.jbusres.2020.07.052

Liang

Tsai

C.F.

H.Y.R.

and

Chang

L.S.

(

2020

), “

Combining corporate governance indicators with stacking ensembles for financial distress prediction

”,

Journal of Business Research

, Vol.

120

, pp.

137

146

, doi:

https://doi.org/10.1016/j.knosys.2014.10.010

Liang

Tsai

C.F.

and

H.T.

(

2015

), “

The effect of feature selection on financial distress prediction

”,

Knowledge-Based Systems

, Vol.

No.

, pp.

289

297

, doi:

https://doi.org/10.1111/exsy.12335

Lin

Y.H.

and

Tsai

C.F.

(

2019

), “

Feature selection in single and ensemble learning-based bankruptcy prediction models

”,

Expert Systems

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1109/TSMCC.2011.2170420

Lin

W.Y.

Y.H.

and

Tsai

C.F.

(

2012

), “

Machine learning in financial crisis prediction: a survey

”,

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

, Vol.

No.

, pp.

421

436

, doi:

https://doi.org/10.1016/j.ins.2017.05.008

Lin

W.-C.

Tsai

C.-F.

Y.-H.

and

Jhang

J.-S.

(

2017

), “

Clustering-based undersampling in class-imbalanced data

”,

Information Sciences

, Vols

409-410

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2014.11.025

López Iturriaga

F.J.

and

Sanz

I.P.

(

2015

), “

Bankruptcy visualization and prediction using neural networks: a study of U.S. commercial banks

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2857

2869

, doi:

https://doi.org/10.1016/j.ejor.2018.10.024

Mai

Tian

Lee

and

(

2019

), “

Deep learning models for bankruptcy prediction using textual disclosures

”,

European Journal of Operational Research

, Vol.

274

No.

, pp.

743

758

, doi:

https://doi.org/10.1016/j.eswa.2015.11.024

Maione

De Paula

E.S.

Gallimberti

Batista

B.L.

Campiglia

A.D.

Barbosa

and

Barbosa

R.M.

(

2016

), “

Comparative study of data mining techniques for the authentication of organic grape juice based on ICP-MS analysis

”,

Expert Systems with Applications

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.bar.2019.04.002

Moll

and

Yigitbasioglu

(

2019

), “

The role of internet-related technologies in shaping the work of accountants: New directions for accounting research

”,

The British Accounting Review

, Vol.

No.

, doi:

https://doi.org/10.1111/j.1540-6288.1998.tb01367.x

Mossman

C.E.

Bell

G.G.

Swartz

L.M.

and

Turtle

(

1998

), “

An empirical comparison of bankruptcy models

”,

Financial Review

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1007/s11187-014-9616-y

Mueller

and

Stegmaier

(

2015

), “

Economic failure and the role of plant age and size

”,

Small Business Economics

, Vol.

No.

, pp.

621

638

, doi:

https://doi.org/10.1080/09638180600555016

Neves

J.C.

and

Vieira

(

2006

), “

Improving bankruptcy prediction with hidden layer learning vector quantization

”,

European Accounting Review

, Vol.

No.

, pp.

253

271

, doi:

https://doi.org/10.2307/2490395

Ohlson

J.A.

(

1980

), “

Financial ratios and the probabilistic prediction of bankruptcy

”,

Journal of Accounting Research

, Vol.

No.

, p.

109

, doi:

https://doi.org/10.1016/j.dss.2011.10.007

Olson

D.L.

and

Meng

(

2012

), “

Comparative analysis of data mining methods for bankruptcy prediction

”,

Decision Support Systems

, Vol.

No.

, pp.

464

473

, doi:

https://doi.org/10.1016/j.eswa.2013.09.004

Oreski

and

Oreski

(

2014

), “

Genetic algorithm-based heuristic for feature selection in credit risk assessment

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2052

2064

, doi:

https://doi.org/10.2308/ajpt-50009

Perols

(

2011

), “

Financial statement fraud detection: an analysis of statistical and machine learning algorithms

”,

AUDITING: A Journal of Practice and Theory

, Vol.

No.

, pp.

, doi:

https://doi.org/10.2308/accr-51562

Perols

J.L.

Bowen

R.M.

Zimmermann

and

Samba

(

2017

), “

Finding needles in a haystack: using data analytics to improve fraud prediction

”,

The Accounting Review

, Vol.

No.

, pp.

221

245

, doi:

https://doi.org/10.1016/j.ijforecast.2019.11.005

Petropoulos

Siakoulis

Stavroulakis

and

Vlachogiannakis

N.E.

(

2020

), “

Predicting bank insolvencies using machine learning techniques

”,

International Journal of Forecasting

, Vol.

No.

, pp.

1092

1113

, doi:

https://doi.org/10.1080/09638180.2022.2137221

Ranta

Ylinen

and

Järvenpää

(

2023

), “

Machine learning in management accounting research: literature review and pathways for the future

”,

European Accounting Review

, Vol.

No.

, pp.

607

636

, doi:

https://doi.org/10.1016/j.ejor.2006.08.043

Ravi Kumar

and

Ravi

(

2007

), “

Bankruptcy prediction in banks and firms via statistical and intelligent techniques—a review

”,

European Journal of Operational Research

, Vol.

180

No.

, pp.

, doi:

https://doi.org/10.1016/j.knosys.2010.05.007

Ravisankar

and

Ravi

(

2010

), “

Financial distress prediction in banks using group method of data handling neural network, counter propagation neural network and fuzzy ARTMAP

”,

Knowledge-Based Systems

, Vol.

No.

, pp.

823

831

, doi:

https://doi.org/10.1109/IWBIS.2018.8471718

Rustam

and

Saragih

G.S.

(

2018

), “

Predicting bank financial failures using random Forest

”, 2018

International Workshop on Big Data and Information Security, IWBIS

, pp.

. doi:

https://doi.org/10.1109/21.97458

Safavian

S.R.

and

Landgrebe

(

1991

), “

A survey of decision tree classifier methodology

”,

IEEE Transactions on Systems, Man, and Cybernetics

, Vol.

No.

, pp.

660

674

, doi:

https://doi.org/10.1023/A:1022648800760

Schapire

(

1990

), “

The strength of weak learnability

”,

Machine Learning

, Vol.

No.

, pp.

197

227

, doi:

https://doi.org/10.7551/mitpress/8291.003.0001

Schapire

and

Freund

(

2012

Boosting: Foundations and Algorithms

MIT Press

, doi:

https://doi.org/10.1016/j.asoc.2020.106852

Shen

Zhao

Kou

and

Alsaadi

F.E.

(

2021

), “

A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique

”,

Applied Soft Computing

, Vol.

, p.

106852

, doi:

https://doi.org/10.1086/209665

Shumway

(

2001

), “

Forecasting bankruptcy more accurately: a simple hazard model

”,

The Journal of Business

, Vol.

No.

, pp.

101

124

, doi:

https://doi.org/10.1007/s10796-020-10031-6

Smiti

and

Soui

(

2020

), “

Bankruptcy prediction using deep learning approach based on borderline SMOTE

”,

Information Systems Frontiers

, Vol.

No.

, pp.

1067

1083

, doi:

https://doi.org/10.1016/j.eswa.2019.07.033

Son

Hyun

Phan

and

Hwang

H.J.

(

2019

), “

Data analytic approach for bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

138

, p.

112816

, doi:

https://doi.org/10.1016/j.cogsys.2018.09.006

Song

Y.G.

Cao

Q. L.

and

Zhang

(

2018

), “

Towards a new approach to predict business performance using machine learning

”,

Cognitive Systems Research

, Vol.

, pp.

1004

1012

, doi:

https://doi.org/10.2307/2392337

Staw

B.M.

Sandelands

L.E.

and

Dutton

J.E.

(

1981

), “Threat rigidity effects in organizational behavior: a multilevel analysis”,

Administrative Science Quarterly

, Vol.

No.

, pp.

501

524

, doi:

https://doi.org/10.1002/for.2661

Tang

Tan

and

Shi

(

2020

), “

Incorporating textual and management factors into financial distress prediction: a comparative study of machine learning methods

”,

Journal of Forecasting

, Vol.

No.

, pp.

769

787

, doi:

https://doi.org/10.1016/j.aci.2018.08.003

Tharwat

(

2018

), “

Classification assessment methods

”,

Applied Computing and Informatics

, Vol.

No.

, pp.

168

192

, doi:

https://doi.org/10.1002/sam.11482

Tsai

C.F.

(

2020

), “

Two-stage hybrid learning techniques for bankruptcy prediction

”,

Statistical Analysis and Data Mining: The ASA Data Science Journal

, Vol.

No.

, pp.

565

572

, doi:

https://doi.org/10.1016/j.dss.2018.06.011

Veganzones

and

Séverin

(

2018

), “

An investigation of bankruptcy prediction in imbalanced datasets

”,

Decision Support Systems

, Vol.

112

No.

May

, pp.

111

124

, doi:

https://doi.org/10.1016/j.knosys.2011.06.020

Wang

Huang

and

(

2012

), “

Two credit scoring models based on dual strategy ensemble trees

”,

Knowledge-Based Systems

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2013.09.033

Wang

and

Yang

(

2014

), “

An improved boosting based on feature selection for corporate bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

No.

, pp.

2353

2361

, doi:

https://doi.org/10.2307/2392987

Weitzel

and

Jonsson

(

1989

), “

Decline in organizations: a literature integration and extension

”,

Administrative Science Quarterly

, Vol.

No.

, pp.

109

, doi:

https://doi.org/10.1016/j.ins.2013.07.011

Yeh

Chi

and

Lin

(

2014

), “

Going-concern prediction using hybrid random forests and rough set approach

”,

Information Sciences

, Vol.

254

, pp.

110

, doi:

https://doi.org/10.1111/j.1468-5957.1985.tb00077.x

Zavgren

C.V.

(

1985

), “

Assessing the vulnerability to failure of American industrial firms: a logistic analysis

”,

Journal of Business Finance and Accounting

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.eswa.2016.04.001

Ziȩba

Tomczak

S.K.

and

Tomczak

J.M.

(

2016

), “

Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction

”,

Expert Systems with Applications

, Vol.

, pp.

101

, doi:

https://doi.org/10.2307/2490859

Zmijewski

M.E.

(

1984

), “

Methodological issues related to the estimation of financial distress prediction models

”,

Journal of Accounting Research

, Vol.

, pp.

, doi:

https://doi.org/10.1016/j.engappai.2017.05.003

Wang

Chen

Cai

Zhao

Tong

and

(

2017

), “

Grey wolf optimization evolving kernel extreme learning machine: application to bankruptcy prediction

”,

Engineering Applications of Artificial Intelligence

, Vol.

, pp.

, doi: