When interpretable machine learning meets the beautiful game: a predictive analytics approach to soccer player valuation in the transfer market


2025

Yisheng Li, Anteneh Ayanso, Shuai Yuan, Martin Kusy and Shannon Kerwin

Figure 1

The diagram shows three boxes arranged horizontally in the center, connected by right arrows. They are labeled from left to right as “Assets (Players),” “Predictions,” “Explanations.” Above “Assets (Players),” two boxes are arranged horizontally. The top left rectangle is labeled “Human Capital Theory,” and the top right rectangle is labeled “Pricing Theories.” Both rectangles have downward arrows pointing to “Assets (Players).” Below “Assets (Players),” a vertically stacked rectangle is labeled “Data,” and points back to “Assets (Players),” with an upward arrow.

A framework for data-driven decision-making in sports organizations. Source: Created by the authors

Figure 2

A waterfall chart shows feature contributions from baseline E of f(x) equals 15.422 to final f(x) equals 18.162.

The figure shows a horizontal waterfall chart. The vertical axis lists the following features and their values: “83 equals team underscore rating,” “20 equals age underscore tm,” “83 equals movement underscore reactions,” “29 equals games,” “5 equals contract underscore remaining,” “2450 equals minutes,” “82 equals mentality underscore composure,” “85 equals attacking underscore heading underscore accuracy,” “3 equals international underscore reputation,” and “68 other features.” The horizontal axis ranges from 15.5 to 18.5 in increments of 0.5 units. Rightward arrowheads stacked horizontally display individual contributions to the prediction. At the start of the horizontal axis labels, the text reads “E of f(x) equals 15.422.” A vertical line is drawn on the graph at the horizontal axis value of 18.162, and reads “f(x) equals 18.162.” The values start from 15.422, and the values sum to the final value of 18.162. The value for each arrowhead from the graph is as follows: 83 equals team underscore rating: Plus 0.74. 20 equals age underscore tm: plus 0.67. 83 equals movement underscore reactions: Plus 0.46. 29 equals games: Plus 0.22. 5 equals contract underscore remaining: Plus 0.12. 2450 equals minutes: Plus 0.11. 82 equals mentality underscore composure: Plus 0.09. 85 equals attacking underscore heading underscore accuracy: Plus 0.09. 3 equals international underscore reputation: Plus 0.08. 68 other features: Plus 0.17.

Matthijs de Ligt transfer fee prediction – SHAP values. Source: Created by the authors
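The waterfall in Figure 2 relies on the additivity of SHAP values: the baseline E[f(x)] plus the per-feature contributions should reproduce the model output f(x). The following minimal sketch checks that arithmetic using only the rounded numbers transcribed from the figure description. The variable names and the rounding tolerance are illustrative assumptions, not part of the authors' pipeline.

```python
# Values transcribed from the Figure 2 description (rounded as printed there).
baseline = 15.422    # E[f(x)], the average model output
prediction = 18.162  # f(x), the model output for this player

# Per-feature SHAP contributions as listed in the figure description.
contributions = {
    "team_rating": 0.74,
    "age_tm": 0.67,
    "movement_reactions": 0.46,
    "games": 0.22,
    "contract_remaining": 0.12,
    "minutes": 0.11,
    "mentality_composure": 0.09,
    "attacking_heading_accuracy": 0.09,
    "international_reputation": 0.08,
    "68 other features": 0.17,
}

# Additivity: baseline plus all contributions should equal f(x).
reconstructed = baseline + sum(contributions.values())
gap = reconstructed - prediction

# Assumption: each printed contribution is rounded to two decimals, so each
# term can be off by up to 0.005 and the errors can accumulate.
tolerance = 0.005 * len(contributions)

print(f"baseline + contributions = {reconstructed:.3f}")
print(f"reported f(x)            = {prediction:.3f}")
print(f"gap                      = {gap:+.3f} (tolerance {tolerance:.3f})")
```

The rounded contributions reconstruct the reported output to within the rounding tolerance, which is what the figure description means when it says the values sum to the final value.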

Figure 3

A horizontal bar graph shows top 20 variables ranked by importance.

The horizontal axis of the horizontal bar graph ranges from 0 to 0.3 in increments of 0.05. The vertical axis shows 20 variables, labeled from top to bottom as “contract underscore remaining,” “team underscore rating,” “age underscore t m,” “mentality underscore composure,” “skill underscore ball underscore control,” “minutes,” “league underscore name underscore Premier League,” “movement underscore reactions,” “attacking underscore heading underscore accuracy,” “games,” “movement underscore sprint underscore speed,” “x g,” “defending underscore sliding underscore tackle,” “power underscore strength,” “power underscore long underscore shots,” “x a,” “mentality underscore positioning,” “mentality underscore aggression,” “skill underscore dribbling,” and “mentality underscore vision.” The data from the graph is as follows: contract underscore remaining: 0.317. team underscore rating: 0.304. age underscore t m: 0.196. mentality underscore composure: 0.110. skill underscore ball underscore control: 0.099. minutes: 0.090. league underscore name underscore Premier underscore League: 0.088. movement underscore reactions: 0.075. attacking underscore heading underscore accuracy: 0.071. games: 0.065. movement underscore sprint underscore speed: 0.053. x g: 0.044. defending underscore sliding underscore tackle: 0.044. power underscore strength: 0.044. power underscore long underscore shots: 0.036. x a: 0.034. mentality underscore positioning: 0.032. mentality underscore aggression: 0.027. skill underscore dribbling: 0.027. mentality underscore vision: 0.027. Note: All numerical values are approximated.

SHAP standard bar chart (Top 20 features). Source: Created by the authors
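A standard SHAP bar chart conventionally ranks features by their mean absolute SHAP value across all observations. The sketch below shows that computation in plain Python. It assumes Figure 3 follows this convention, which the excerpt does not state explicitly. The helper `rank_by_mean_abs` is hypothetical, and the small SHAP matrix is made up for illustration; it is not the authors' data.

```python
def rank_by_mean_abs(shap_rows, feature_names, top_k=20):
    """Rank features by mean absolute SHAP value (largest first)."""
    n_rows = len(shap_rows)
    importance = {
        name: sum(abs(row[j]) for row in shap_rows) / n_rows
        for j, name in enumerate(feature_names)
    }
    ranked = sorted(importance.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_k]


# Hypothetical SHAP values: one row per player, one column per feature.
feature_names = ["contract_remaining", "team_rating", "age_tm"]
shap_rows = [
    [-0.50, 0.40, 0.20],
    [0.30, -0.30, -0.10],
    [0.20, 0.20, 0.30],
]

ranking = rank_by_mean_abs(shap_rows, feature_names)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Taking absolute values before averaging keeps large positive and large negative contributions from cancelling, so a feature that strongly moves predictions in both directions still ranks as important.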

Figure 4

A scatter plot of 20 variables showing S H A P values, with color gradient from low (blue) to high (pink) feature values.

The horizontal axis of the scatter plot is labeled “S H A P value (impact on model output),” and ranges from negative 1 to 0.75 in increments of 0.25. The vertical axis shows 20 variables, labeled from top to bottom as “contract underscore remaining,” “team underscore rating,” “age underscore t m,” “mentality underscore composure,” “skill underscore ball underscore control,” “minutes,” “league underscore name underscore Premier League,” “movement underscore reactions,” “attacking underscore heading underscore accuracy,” “games,” “movement underscore sprint underscore speed,” “x g,” “defending underscore sliding underscore tackle,” “power underscore strength,” “power underscore long underscore shots,” “x a,” “mentality underscore positioning,” “mentality underscore aggression,” “skill underscore dribbling,” and “mentality underscore vision.” A vertical color bar on the right side is labeled “Feature value,” with “High” at the top in bright pink and “Low” at the bottom in blue. Points are colored according to this gradient, with pink indicating high feature values and blue indicating low feature values. The values from the graph are as follows: contract underscore remaining: Range: negative 0.937 to 0.597. team underscore rating: Range: negative 0.658 to 0.95. age underscore t m: Range: negative 0.437 to 0.364. mentality underscore composure: Range: negative 0.244 to 0.222. skill underscore ball underscore control: Range: negative 0.142 to 0.398. minutes: Range: negative 0.25 to 0.222. league underscore name underscore Premier underscore League: Range: negative 0.102 to 0.347. movement underscore reactions: Range: negative 0.108 to 0.505. attacking underscore heading underscore accuracy: Range: negative 0.114 to 0.199. games: Range: negative 0.341 to 0.108. movement underscore sprint underscore speed: Range: negative 0.176 to 0.227. x g: Range: negative 0.114 to 0.091. defending underscore sliding underscore tackle: Range: negative 0.051 to 0.284. power underscore strength: Range: negative 0.159 to 0.114. power underscore long underscore shots: Range: negative 0.051 to 0.176. x a: Range: negative 0.102 to 0.153. mentality underscore positioning: Range: negative 0.057 to 0.159. mentality underscore aggression: Range: negative 0.057 to 0.193. skill underscore dribbling: Range: negative 0.386 to 0.045. mentality underscore vision: Range: negative 0.148 to 0.068. Note: All numerical values are approximated.

SHAP values summary plot (Top 20 features). Source: Created by the authors

Figure 5

A scatter plot shows positive correlation between contract underscore remaining and S H A P value from −0.8 to 0.6.

The vertical axis of the scatter plot is labeled “S H A P value for contract underscore remaining” and ranges from negative 0.8 to 0.6 in increments of 0.2. The horizontal axis is labeled “contract underscore remaining” and ranges from 0 to 6 in increments of 1. Data points are plotted as vertical columns of dots. The overall data shows an increasing trend from lower left to upper right. The data ranges for the columns shown are as follows: The data for 0 ranges from (0, negative 0.907) to (0, negative 0.794). The data for 1 ranges from (1, negative 0.693) to (1, negative 0.391). The data for 2 ranges from (2, negative 0.394) to (2, negative 0.248). The data for 3 ranges from (3, negative 0.245) to (3, negative 0.051). The data for 4 ranges from (4, negative 0.194) to (4, negative 0.376). The data for 5 ranges from (5, negative 0.301) to (5, negative 0.615). The data for 6 is shown at (6, 0.454). Note: All numerical values are approximated.

Contract remaining SHAP dependence plot. Source: Created by the authors
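A dependence plot such as Figure 5 pairs each observation's feature value with its SHAP value for that feature. When the feature is discrete, the points stack into vertical columns that can be summarized by their minimum and maximum. The sketch below illustrates that summary; the helper `shap_range_by_value` and the sample numbers are hypothetical and are not taken from the authors' data.

```python
from collections import defaultdict


def shap_range_by_value(feature_values, shap_values):
    """Return {feature value: (min SHAP, max SHAP)}, sorted by feature value."""
    groups = defaultdict(list)
    for value, shap_value in zip(feature_values, shap_values):
        groups[value].append(shap_value)
    return {
        value: (min(column), max(column))
        for value, column in sorted(groups.items())
    }


# Hypothetical data: years of contract remaining and the matching SHAP values.
contract_remaining = [0, 0, 1, 1, 2, 3, 3]
shap_contract = [-0.90, -0.80, -0.65, -0.40, -0.30, -0.20, -0.05]

ranges = shap_range_by_value(contract_remaining, shap_contract)
for years, (low, high) in ranges.items():
    print(f"{years} year(s) remaining: SHAP from {low:+.2f} to {high:+.2f}")
```

Reading the per-value ranges in order shows whether the feature's effect rises or falls as the feature value increases, which is the trend a dependence plot is meant to reveal.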

Figure 6

A scatter and line plot shows the relationship between team underscore rating and its S H A P value.

The figure shows a combination of a line and a scatter chart. The vertical axis of the plot is labeled “S H A P value for team underscore rating” and ranges from negative 0.6 to 0.8 in increments of 0.2. The horizontal axis is labeled “team underscore rating” and ranges from 65 to 85 in increments of 2.5. The line starts at (65, negative 0.6), increases to (76.3, 0.133), and steadily keeps increasing to end at (84.4, 0.659). A few of the scattered dots lie at (72.16, 0.437), (75.85, 0.236), and (83.43, 0.653), along with others. Note: All numerical values are approximated.

Team rating SHAP dependence plot. Source: Created by the authors

Figure 7

A line and scatter plot showing the relationship between age underscore t m and its S H A P value.

The figure shows a combination of a line and a scatter chart. The vertical axis of the plot is labeled “S H A P value for age underscore t m” and ranges from negative 0.4 to 0.4 in increments of 0.2. The horizontal axis is labeled “age underscore t m” and ranges from 20 to 34 in increments of 2. The line starts at approximately (19, 0.28), descends gradually to around (24, 0.12), then sharply decreases to (27, negative 0.18), stays almost constant, and ends at (34, negative 0.28). Scattered dots are shown vertically at each age underscore t m value. The ranges of the dots at a few of these values are as follows: For age underscore t m 20: Range: 0.27 to 0.30. For age underscore t m 25: Range: 0.058 to 0.216. For age underscore t m 30: Range: negative 0.33 to negative 0.19. Note: All numerical values are approximated.

Age SHAP dependence plot. Source: Created by the authors

Figure 8

A line and scatter plot showing the relationship between mentality underscore composure, and its S H A P value.

The figure shows a combination of a line and a scatter chart. The vertical axis of the plot is labeled “S H A P value for mentality underscore composure” and ranges from negative 0.2 to 0.2 in increments of 0.1. The horizontal axis is labeled “mentality underscore composure” and ranges from 30 to 80 in increments of 10. The line starts at (24, negative 0.09), slightly decreases with a negative slope to (61, negative 0.13), then sharply increases to end at (88, 0.167), and further increases to end at (86, 0.185). The scattered dots lie around the line between 50 and 85 on the horizontal axis. A few of the points of the scattered dots are (59, negative 0.166), (69, negative 0.097), (73.6, 0.113), and (84, 0.22), along with others. Note: All numerical values are approximated.

Mentality composure SHAP dependence plot. Source: Created by the authors

Figure 9

A line and scatter plot showing the relationship between skill_ball_control and its SHAP value.

The figure shows a combined line and scatter chart. The vertical axis is labeled “SHAP value for skill_ball_control” and ranges from 0.0 to 0.6 in increments of 0.2. The horizontal axis is labeled “skill_ball_control” and ranges from 10 to 90 in increments of 10. The line starts at (10, -0.115), remains largely flat with a shallow positive slope until (77, 0.064), then increases sharply to end at (90, 0.70). The scattered dots lie closely around the line between 65 and 80 on the horizontal axis and include points at (22, -0.142), (34, -0.066), (79, -0.12), (90, 0.343), and (84, 0.355). Note: All numerical values are approximated.

Skill ball control SHAP dependence plot. Source: Created by the authors

Figure 10

A line and scatter plot showing the relationship between minutes played and the corresponding SHAP values.

The figure shows a combined line and scatter chart. The vertical axis is labeled “SHAP value for minutes” and ranges from -0.15 to 0.15 in increments of 0.05. The horizontal axis is labeled “minutes” and ranges from 0 to 3000 in increments of 500. The line starts at (42, -0.07) and rises with small fluctuations to (1391, -0.043). It then increases sharply to (1873, 0.061) and continues upward to end at (3312, 0.12). The scattered dots lie around the line along the horizontal axis and include points at (486, -0.052), (1571, -0.027), (1793, 0.128), and (2878, 0.125). Note: All numerical values are approximated.

Minutes SHAP dependence plot. Source: Created by the authors

Figure 11

A line and scatter plot showing the relationship between movement_reactions and the corresponding SHAP values.

The figure shows a combined line and scatter chart. The vertical axis is labeled “SHAP value for movement_reactions” and ranges from -0.1 to 0.5 in increments of 0.1. The horizontal axis is labeled “movement_reactions” and ranges from 50 to 85 in increments of 5. The line starts at (49, -0.059), is nearly flat with a small positive slope until (72, -0.025), and then rises steeply to end at (85, 0.435). The scattered dots follow the line, mostly clustered between 55 and 75, with some between 80 and 85 on the horizontal axis. They include points at (57, -0.037), (68, 0.008), (83, 0.181), and (84, 0.508). Note: All numerical values are approximated.

Movement reactions SHAP dependence plot. Source: Created by the authors

Figure 12

A line and scatter plot showing the relationship between games played and the corresponding SHAP values.

The figure shows a combined line and scatter chart. The vertical axis is labeled “SHAP value for games” and ranges from -0.3 to 0.1 in increments of 0.1. The horizontal axis is labeled “games” and ranges from 0 to 35 in increments of 5. The line starts at (0.77, -0.261), rises sharply to (10.1, -0.052), and then increases gradually to (30, 0.058). It dips and then increases again to end at (38, 0.081). The scattered dots are distributed around the line, mostly clustered between 25 and 40 on the horizontal axis, and include points at (6, -0.2), (12, -0.002), (24.8, 0.007), and (32.98, 0.097). Note: All numerical values are approximated.

Games SHAP dependence plot. Source: Created by the authors
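
Each dot in a SHAP dependence plot pairs one player's feature value with that feature's SHAP value for the same player. As a rough illustration of what sits on the vertical axis, the sketch below computes exact Shapley values for a toy three-feature model by enumerating feature coalitions against a single reference player. Everything in it is an assumption made for illustration: `toy_value_model`, its coefficients, the feature names and the reference values are hypothetical, and the brute-force enumeration stands in for whatever SHAP algorithm was applied to the fitted model.

```python
from itertools import combinations
from math import factorial

FEATURES = ["age", "minutes", "ball_control"]  # hypothetical feature names


def toy_value_model(x):
    """Hypothetical valuation model (NOT the paper's fitted model)."""
    bonus = 0.5 if x["ball_control"] > 77 else 0.0  # illustrative threshold effect
    return -0.05 * x["age"] + 0.0001 * x["minutes"] + bonus


def shapley_values(model, x, reference):
    """Exact Shapley values of model(x) relative to model(reference)."""
    n = len(FEATURES)
    phi = {}
    for f in FEATURES:
        others = [g for g in FEATURES if g != f]
        total = 0.0
        for k in range(len(others) + 1):
            # Shapley weight for coalitions of size k that exclude feature f.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                # Coalition features take the player's values; the rest take the reference's.
                without_f = {g: (x[g] if g in coalition else reference[g]) for g in FEATURES}
                with_f = dict(without_f, **{f: x[f]})
                total += weight * (model(with_f) - model(without_f))
        phi[f] = total
    return phi


reference = {"age": 26, "minutes": 1500, "ball_control": 70}  # hypothetical baseline player
players = [
    {"age": 20, "minutes": 2500, "ball_control": 85},
    {"age": 31, "minutes": 900, "ball_control": 72},
]

# One (feature value, SHAP value) pair per player corresponds to one dot on a dependence plot.
dependence_points = [
    (p["age"], shapley_values(toy_value_model, p, reference)["age"]) for p in players
]
```

By construction the contributions of all features sum to the difference between the player's prediction and the reference prediction, which is why a positive SHAP value can be read as pushing the predicted value above the baseline.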

Table 1

A typology of transfer mechanisms

Dimension	Characteristic
Transfer Types	Transfer fee
	Free transfer
	Player loans
	Swap
Contractual Clauses	Release clauses
	Sell-on percentages
	Buyback clauses
Intermediaries	Agents
Intermediaries	Club executives/sporting directors
Associated Financial Flows	Signing bonuses
	Agent commissions
	Performance-based bonuses

Source(s): Created by the authors

Table 2

Value driver classification and mapping

Level	Category	Observed feature
Individual human capital	Demographics	Age, nationality
	Popularity	International reputation
	Physiological	Reaction, speed, strength, etc.
	Psychological	Composure, positioning, aggression, etc.
	Seniority	Minutes/games played
	Soccer-Specific	Playing position, goals, expected goals, assists, expected assists, yellow/red cards, dribble, ball control, etc.
Market	Bargaining Power	Contract remaining, team rating
Market	League	League name

Source(s): Created by the authors

Table 3

Model testing results

Model	RMSE	R²
Decision Tree	1.002	0.404
XGBoost	0.717	0.695
Random Forest	0.884	0.536
SVR	0.797	0.623

Source(s): Created by the authors
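
For reference, the two metrics reported in Table 3 can be computed from held-out predictions as sketched below. The function names and the toy numbers are illustrative assumptions, not the paper's data, and the scale of the target variable is not restated here.

```python
from math import sqrt


def rmse(y_true, y_pred):
    """Root mean squared error; lower values indicate a better fit."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - residual sum of squares / total sum of squares."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Toy values for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]
toy_rmse = rmse(y_true, y_pred)
toy_r2 = r_squared(y_true, y_pred)
```

Read this way, the XGBoost row of Table 3 combines the lowest RMSE (0.717) with the highest R² (0.695) among the four models.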

Table A1

Complete feature set

Category	Feature	Data Type	Description
Demographics	Age	Numerical	The age of a player in a given season
Demographics	Nation group	Categorical	All countries are regrouped into 11 labels (France, Italy, England, Germany, Brazil, Argentina, Belgium, Spain, Netherlands, Portugal, other countries)
Market	Team rating	Numerical	The average overall rating of all players in each club
	League	Categorical	The league a club belongs to
	Contract remaining	Numerical	The number of remaining year(s) in each player’s contract
Popularity	International reputation	Numerical	The higher the rating, the more famous the player is
Seniority	Games	Numerical	The number of games in which a player appears
Seniority	Minutes	Numerical	The number of minutes a player plays
Physiological Attribute	Acceleration	Numerical	The higher the rating, the shorter the time needed to reach the maximum sprint speed
	Sprint speed	Numerical	The higher the rating, the faster the player runs at full speed
	Agility	Numerical	The higher the rating, the more agile the player is while moving or turning
	Reactions	Numerical	The higher the rating, the more quickly the player responds to situations around him
	Balance	Numerical	The higher the rating, the more easily the player is able to maintain balance when facing physical challenges
	Stamina	Numerical	A high stamina rating means the player can spend longer sprinting during a game and needs less recovery time
	Jumping	Numerical	The higher the rating, the higher the player can jump to win aerial balls
	Strength	Numerical	The higher the rating, the more physically strong the player is
	Injury risk	Categorical	The chance of a player being injured (e.g. low, medium, high)
Psychological Attribute	Aggression	Numerical	The higher the rating, the more successful tackles and the more fouls a player is likely to commit
	Composure	Numerical	The higher the rating, the better the player performs under pressure
	Vision	Numerical	The higher the rating, the greater the player’s awareness of the position of teammates and opponents is
	Positioning	Numerical	The higher the rating, the more likely a player is to occupy advantageous positions for receiving the ball and attacking the opponent’s goal
Soccer Performance Metrics and Technical Skills	Position category	Categorical	The general position category of a player
	Goals	Numerical	The number of goals a player scores
	xg	Numerical	The number of goals a player would have scored given the opportunities (i.e. expected goals)
	Assists	Numerical	The number of assists a player makes
	xa	Numerical	The number of assists a player would have made given the opportunities (i.e. expected assists)
	Red cards	Numerical	The number of red cards a player receives
	Yellow cards	Numerical	The number of yellow cards a player receives
	Tackles won	Numerical	The number of tackles a player wins
	Pressure regain	Numerical	The number of possessions a player regains after applying pressure
	Blocks	Numerical	The number of incoming shots a player stops
	Interceptions	Numerical	The number of the opposing team’s passes a player catches
	Clearance	Numerical	The number of kicks by a player to get the ball away from the danger area
	Fouls	Numerical	The number of fouls a player commits
	Fouled	Numerical	The number of fouls a player causes the opposing team to commit
	Long shots	Numerical	The higher the rating, the more accurate shots from outside the box are
	Shot power	Numerical	The higher the rating, the harder the player hits the ball while still keeping the shot accurate
	Penalties	Numerical	High penalties rating means the player is good at taking penalties
	Heading accuracy	Numerical	The higher the rating, the more accurate a headed pass or header at goal is going to be
	Volleys	Numerical	High volley rating means accurate shots taken while the ball is in air
	Free kick accuracy	Numerical	The higher the rating the better the accuracy of a direct free kick on goal
	Short passing	Numerical	The higher the rating, the faster and more accurate the short or ground pass will be
	Long passing	Numerical	The higher the rating, the faster and more accurate the long pass in the air will be
	Dribbling	Numerical	A high dribbling rating means the player will be able to keep better possession of the ball whilst dribbling
	Curve	Numerical	The higher the rating the more curl the player is capable of putting on the ball when passing and shooting
	Crossing	Numerical	High crossing rating means high probability of a medium or long-range pass from a wide area of the field towards the center of the opponent’s box finding the teammate and circumventing the opponents
	Ball control	Numerical	The higher the rating, the less likely the ball is to bounce away from the player after receiving it
	Standing tackle	Numerical	The higher the rating, the more likely the player is to perform a standing tackle without committing a foul
	Sliding tackle	Numerical	The higher the rating, the more likely the player is to perform a sliding tackle without committing a foul
	Marking	Numerical	The higher the rating, the more easily the player can track and defend an opposing player
	Weak foot	Numerical	Weak foot is defined as the player’s foot other than the preferred foot. High weak foot rating means higher shot power and better ball control for the weak foot of that player
	gk_kicking	Numerical	Goalkeeper’s ability to distribute long and accurate goal kicks, from out of the hands or on the ground
	gk_positioning	Numerical	Goalkeeper’s ability to position himself correctly when saving shots or reacting to crosses
	gk_reflexes	Numerical	Goalkeeper’s agility when making a save
	gk_diving	Numerical	Goalkeeper’s ability to make a save whilst diving through the air
	gk_handling	Numerical	Goalkeeper’s ability to catch the ball and hold onto it

Source(s): Created by the authors
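
Two of the features in Table A1 are derived rather than observed directly: nation group (countries collapsed into 11 labels) and team rating (the average overall rating of a club's players). A minimal sketch of both derivations follows. The record layout, the field names (`club`, `nationality`, `overall`), the label `"Other"` and the sample values are hypothetical and are not taken from the authors' data pipeline.

```python
from collections import defaultdict

# The ten named labels listed in Table A1; every other country maps to "Other".
NAMED_NATIONS = {
    "France", "Italy", "England", "Germany", "Brazil",
    "Argentina", "Belgium", "Spain", "Netherlands", "Portugal",
}


def nation_group(nationality):
    """Collapse a nationality into one of the 11 nation-group labels."""
    return nationality if nationality in NAMED_NATIONS else "Other"


def team_ratings(players):
    """Average overall rating of all players in each club."""
    ratings_by_club = defaultdict(list)
    for player in players:
        ratings_by_club[player["club"]].append(player["overall"])
    return {club: sum(r) / len(r) for club, r in ratings_by_club.items()}


# Hypothetical sample records.
players = [
    {"name": "A", "club": "Club X", "nationality": "Brazil", "overall": 80},
    {"name": "B", "club": "Club X", "nationality": "Norway", "overall": 76},
    {"name": "C", "club": "Club Y", "nationality": "Spain", "overall": 70},
]

ratings = team_ratings(players)
features = [
    {
        "name": p["name"],
        "nation_group": nation_group(p["nationality"]),
        "team_rating": ratings[p["club"]],
    }
    for p in players
]
```

Because team rating is computed at club level, every player at the same club receives the same value for that feature.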

Table A2

Tuning parameters for XGBoost

Tuning parameter	Description	Default value	Optimal value
Number of trees B	B is also known as the number of estimators. Unlike random forests, XGBoost can overfit if B is too large	A relatively small number of trees (e.g. 100 trees)	100
Learning rate λ	λ is a small positive number that controls the rate at which boosting learns. Unlike fitting a single large decision tree to the data, the boosting approach instead learns slowly	Typical values are 0.01 or 0.001	0.1
Max depth	The max depth is the maximum number of nodes allowed from the root to the farthest leaf of a tree. Deeper trees can model more complex relationships by adding more nodes, but sometimes end up following noise, causing the model to overfit	The default number of the max depth is 6	3
Min child weight	The min child weight is the minimum weight (or number of samples if all samples have a weight of 1) required in order to create a new node in the tree. A smaller min child weight allows the algorithm to create children that correspond to fewer samples, thus allowing for more complex trees, but again, more likely to overfit	The default number of the min child weight is 1	7

Source(s): Created by the authors
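
The tuned values in Table A2 map onto the keyword arguments of XGBoost's scikit-learn-style regressor roughly as sketched below. Only the values in `OPTIMAL_PARAMS` come from the table; the search grid is a hypothetical example, since the candidate values that were searched are not listed here. The regressor is constructed only if the `xgboost` package happens to be installed.

```python
from itertools import product

# Optimal values reported in Table A2.
OPTIMAL_PARAMS = {
    "n_estimators": 100,    # number of trees B
    "learning_rate": 0.1,   # learning rate (lambda in Table A2)
    "max_depth": 3,
    "min_child_weight": 7,
}

# Hypothetical search grid: these candidate values are assumptions for illustration.
PARAM_GRID = {
    "n_estimators": [50, 100, 200],
    "learning_rate": [0.01, 0.1, 0.3],
    "max_depth": [3, 6],
    "min_child_weight": [1, 7],
}


def grid_candidates(grid):
    """Yield every parameter combination in the grid as a dict."""
    names = list(grid)
    for values in product(*(grid[name] for name in names)):
        yield dict(zip(names, values))


candidates = list(grid_candidates(PARAM_GRID))

try:
    from xgboost import XGBRegressor

    # Unfitted model; training would call model.fit(X_train, y_train).
    model = XGBRegressor(**OPTIMAL_PARAMS)
except ImportError:
    model = None  # xgboost not installed; the parameter dicts above are still usable
```

In practice each candidate would be scored by cross-validation and the best-scoring combination retained; a shallower tree depth and a larger minimum child weight than the defaults both act to restrain overfitting, as the descriptions in Table A2 note.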


(

2022

), “

Econometric approach to assessing the transfer fees and values of professional football players

”,

Economies

, Vol.

No.

, p.

, doi:

https://doi.org/10.3389/fpsyg.2015.01672

Raab

and

Gigerenzer

(

2015

), “

The power of simplicity: a fast-and-frugal heuristics approach to performance science

”,

Frontiers in Psychology

, Vol.

, p.

1672

, doi:

https://doi.org/10.1177/1042258717732957

Radaelli

Dell’Era

Frattini

and

Messeni Petruzzelli

(

2018

), “

Entrepreneurship and human capital in professional sport: a longitudinal analysis of the Italian soccer league

”,

Entrepreneurship Theory and Practice

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/sbm-06-2020-0062

Rai

J.S.

Yousaf

Itani

M.N.

and

Singh

(

2021

), “

Sports celebrity personality and purchase intention: the role of endorser-brand congruence, brand credibility and brand image transfer

”,

Sport, Business and Management: An International Journal

, Vol.

No.

, pp.

340

361

, doi:

https://doi.org/10.14198/jhse.2017.12.proc2.05

Rathke

(

2017

), “

An examination of expected goals and shot efficiency in soccer

”,

Journal of Human Sport and Exercise

, Vol.

No.

, pp.

514

529

, doi:

https://doi.org/10.1080/02640410050120078

Reilly

Williams

A.M.

Nevill

and

Franks

(

2000

), “

A multidisciplinary approach to talent identification in soccer

”,

Journal of Sports Sciences

, Vol.

No.

, pp.

695

702

, doi:

https://doi.org/10.1186/s40064-016-3108-2

Rein

and

Memmert

(

2016

), “

Big data and tactical analysis in elite soccer: future challenges and opportunities for sports science

”,

SpringerPlus

, Vol.

, pp.

, doi:

https://doi.org/10.1086/260169

Rosen

(

1974

), “

Hedonic prices and implicit markets: product differentiation in pure competition

”,

Journal of Political Economy

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1177/152700250000100102

Rottenberg

(

2000

), “

Resource allocation and income distribution in professional team sports

”,

Journal of Sports Economics

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/jic-06-2020-0211

Rubio Martin

Manuel García

C.M.

Rodríguez-López

Á.

and

Gonzalez Sanchez

F.J.

(

2022

), “

Measuring football clubs’ human capital: analytical and dynamic models based on footballers’ life cycles

”,

Journal of Intellectual Capital

, Vol.

No.

, pp.

1107

1137

, doi:

https://doi.org/10.1038/s42256-019-0048-x

Rudin

(

2019

), “

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

”,

Nature Machine Intelligence

, Vol.

No.

, pp.

206

215

, doi:

https://doi.org/10.1177/1527002518808344

Serna Rodríguez

Ramírez Hassan

and

Coad

(

2019

), “

Uncovering value drivers of high performance soccer players

”,

Journal of Sports Economics

, Vol.

No.

, pp.

819

849

, doi:

https://doi.org/10.1080/16184742.2017.1329331

Shapiro

S.L.

DeSchriver

T.D.

and

Rascher

D.A.

(

2017

), “

The Beckham effect: examining the longitudinal impact of a star performer on league marketing, novelty, and scarcity

”,

European Sport Management Quarterly

, Vol.

No.

, pp.

610

634

, doi:

https://doi.org/10.2307/23042796

Shmueli

and

Koppius

O.R.

(

2011

), “

Predictive analytics in information systems research

”,

MIS Quarterly

, Vol.

No.

, pp.

553

572

, doi:

https://doi.org/10.1016/j.psychsport.2006.05.002

Stambulova

Stephan

and

Jäphag

(

2007

), “

Athletic retirement: a cross-national comparison of elite French and Swedish athletes

”,

Psychology of Sport and Exercise

, Vol.

No.

, pp.

101

118

, doi:

https://doi.org/10.1109/icdmw.2016.0031

Stanojevic

and

Gyarmati

(

2016

), “

Towards data-driven football player assessment

”,

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)

, pp.

167

172

, doi:

https://doi.org/10.1016/j.jsis.2023.101790

Sturm

Pumplun

Gerlach

J.P.

Kowalczyk

and

Buxmann

(

2023

), “

Machine learning advice in managerial decision-making: the overlooked role of decision makers’ advice utilization

”,

The Journal of Strategic Information Systems

, Vol.

No.

, 101790, doi:

Taylor

M.S.

and

Giannantonio

C.M.

(

1993

), “

Forming, adapting, and terminating the employment relationship: a review of the literature from individual, organizational, and interactionist perspectives

”,

Journal of Management

, Vol.

No.

, pp.

461

515

https://doi.org/10.1016/j.techfore.2022.122116

Toma

and

Campobasso

(

2023

), “

Using data analytics to capture the strategic and financial decision-making of Europe’s top football club

”,

Technological Forecasting and Social Change

, Vol.

186

, 122116, doi:

https://doi.org/10.1016/j.rfe.2004.11.002

Tunaru

Clark

and

Viney

(

2005

), “

An option pricing framework for valuation of football players

”,

Review of Financial Economics

, Vol.

Nos

3-4

, pp.

281

295

, doi:

https://doi.org/10.1016/0010-0285(73)90033-9

Tversky

and

Kahneman

(

1973

), “

Availability: a heuristic for judging frequency and probability

”,

Cognitive Psychology

, Vol.

No.

, pp.

207

232

, doi:

Vroonen

Decroos

Van Haaren

and

Davis

(

2017

), “

Predicting the potential of professional soccer players

”,

Proceedings of the 4th Workshop on Machine Learning and Data Mining for Sports Analytics

, Vol.

1971

, pp.

https://doi.org/10.3233/jsa-200554

Wakelam

Steuber

and

Wakelam

(

2022

), “

The collection, analysis and exploitation of footballer attributes: a systematic review

”,

Journal of Sports Analytics

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1123/jsm.2022-0026

Wanless

and

Naraine

M.L.

(

2023

), “

Analogous forecasting for predicting sport innovation diffusion: from business analytics to natural language processing

”,

Journal of Sport Management

, Vol.

No.

, pp.

191

202

, doi:

https://doi.org/10.1123/jsm.2021-0067

Watanabe

N.M.

Shapiro

and

Drayer

(

2021

), “

Big data and analytics in sport management

”,

Journal of Sport Management

, Vol.

No.

, pp.

197

202

, doi:

https://doi.org/10.1080/02640410050120113

Williams

A.M.

(

2000

), “

Perceptual skill in soccer: implications for talent identification and development

”,

Journal of Sports Sciences

, Vol.

No.

, pp.

737

750

, doi:

https://doi.org/10.2307/256620

Wright

P.M.

Smart

D.L.

and

McMahan

G.C.

(

1995

), “

Matches between human resources and strategy among NCAA basketball teams

”,

Academy of Management Journal

, Vol.

No.

, pp.

1052

1074

, doi:

https://doi.org/10.1080/16184742.2022.2153898

Yang

Koenigstorfer

and

Pawlowski

(

2024

), “

Predicting transfer fees in professional European football before and during COVID-19 using machine learning

”,

European Sport Management Quarterly

, Vol.

No.

, pp.

603

623

, doi:

https://doi.org/10.1080/02640414.2025.2518694

Zhou

Keogh

J.W.L.

Tong

R.K.Y.

Khan

A.R.

and

Jennings

N.R.

(

2025

), “

Artificial intelligence in sport: a narrative review of applications, challenges and future trends

”,

Journal of Sports Sciences

, pp.

, doi: