Research Article| June 25 2026

When upward feedback is not feasible: testing supervisors’ perspective-taking ability as a sustainable alternative

Glauco Cioffi;

Glauco Cioffi

Department of Psychology,

University of Bologna

, Bologna,

Italy

Corresponding author Glauco Cioffi glauco.cioffi2@unibo.it

Search for other works by this author on:

This Site

PubMed

Google Scholar

Cristian Balducci;

Cristian Balducci

Department of Psychology,

Università di Chieti-Pescara ‘G. d'Annunzio’

, Chieti,

Italy

Search for other works by this author on:

This Site

PubMed

Google Scholar

Stefano Toderi

Department of Psychology,

University of Bologna

, Bologna,

Italy

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Corresponding author Glauco Cioffi glauco.cioffi2@unibo.it

Publisher: Emerald Publishing

Received: November 10 2025

Revision Received: March 04 2026

Accepted: May 01 2026

Online ISSN: 2046-9020

Print ISSN: 2046-9012

Funding

Funding Group:

Award Group:
- Funder(s):
  NextGenerationEU
- Award Id(s):
  CUP J33C22001840002
Funding Statement(s):
The study was conducted within the Ph.D. research project of the first author G.C., funded by the European Union–NextGenerationEU, CUP: J33C22001840002.

2026

Glauco Cioffi, Cristian Balducci and Stefano Toderi

Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at Link to the terms of the CC BY 4.0 licenceLink to the terms of the CC BY 4.0 license.

European Journal of Training and Development (2026) 50 (10): 66–82.

https://doi.org/10.1108/EJTD-11-2025-0267

Purpose

Upward feedback (UF) is a key and evidence-based mechanism for leadership development. When supervisors are confronted with gaps between self-views and team-ratings, developmental needs are specified and concrete adjustments follow. However, organisational constraints (e.g. low maturity) or individual factors (e.g. limited feedback orientation) can hinder its feasibility, creating implementation gaps. Despite its importance, the literature has largely overlooked this issue, and viable alternatives to UF are lacking. The purpose of this study, building on the established role of meta-cognition in leadership development, is to propose one of its core sub-dimensions (i.e. perspective-taking ability) as a potential substitute when UF is not available. The authors hypothesised that supervisors, engaged in a structured prediction exercise, would accurately estimate how their teams perceive their management style.

Design/methodology/approach

In a field training context, 44 supervisors first self-assessed their management competencies and then predicted how their teams would rate such behaviours; self-perception and predictions were compared with team-aggregate ratings from 233 employees. Agreement was assessed using Bland–Altman analyses controlling for systematic disagreement and practical limits of agreement of ≈±1 Likert point

Findings

Conversely to self-assessed competencies, supervisor predictions showed no systematic disagreement compared to team ratings, and the limits of agreement overall fell within the ≈±1 threshold.

Originality/value

This study offers a practice-ready, low-cost proxy for upward feedback in feedback-constrained settings. Supervisors’ perspective-taking, operationalised through a structured prediction exercise, emerges as a sustainable alternative for identifying developmental needs and as a catalyst for supervisors’ reflective thinking.

1. Introduction

The feedback mechanism was considered the most notable innovation of the 1990s for leadership development (Atwater and Waldman, 1998; McCarthy and Garavan, 1999). After 25 years, feedback remains one of the most widely used tools for training supervisors, and it is a pivotal topic in organisational science (Day et al., 2014; Lacerenza et al., 2017; Vogel et al., 2021; Anseel and Sherf, 2025). Conceptually, feedback elicits affective responses and serves as a catalyst for enhancing self-awareness and guiding competency improvement (Braddy et al., 2013; Alansari et al., 2024). The core strength of such a practice lies in allowing supervisors to be confronted with gaps between their self-perceptions and others’ views (e.g. teams and peers) on specific content, thereby allowing the identification of development needs to focus on for growth (London and Smither, 1995; Day et al., 2014). An assumption also supported by a recent meta-analysis by Lacerenza et al. (2017), which found that supervisors’ training incorporating feedback outperforms training without feedback, with comparable benefits whether the feedback is multisource (e.g. colleagues and teams) or strictly upward (only from subordinates).

However, merely assuming that giving a supervisor feedback will result in a behavioural change and, ultimately, organisational performance improvement is overly simplistic. Notably, several barriers limit the potential of feedback delivery in practice. On one hand, contextual factors (key determinants of effective interventions; Pankkonen et al., 2025) such as low organisational maturity or cultural resistance often constrain the use of feedback (Atwater and Waldman, 1998; Day et al., 2014; McCarthy and Garavan, 2006; Fleenor, 2021; Toderi et al., 2025). Moreover, because anonymity strongly influences the sincerity of raters (e.g. subordinates) confidentiality is challenging in very small teams where individual responses may be easily inferred (Macey and Barbara, 2019). Aligned with this, although supervisor-focused interventions can be cost-effective (Kelloway and Barling, 2010), gathering team or peers’ ratings in advance to deliver upward or multisource feedback adds logistical burden and is vulnerable to low participation.

On the other hand, because feedback acceptance is the engine for behaviour change, individual attributes that shape receptivity matter for development outcomes (London and Smither, 1995; Bracken and Rose, 2011; Raemdonck and Strijbos, 2013; Katz et al., 2023). In particular, feedback orientation (i.e. the tendency to seek, value and use feedback) influences supervisors’ reactions and downstream training effects (Braddy et al., 2013; Ellison et al., 2022). A further concern is the subgroup of over-estimators’ supervisors, who are especially prone to adverse reactions when confronted with discrepant (negative) feedback, thereby undermining feedback acceptance (Brett and Atwater, 2001; Atwater et al., 2000).

This raises a key question for research and practice:

RQ1.

What alternatives exist when feedback mechanisms are unavailable or impractical?

Surprisingly, leadership literature has largely neglected this issue, leaving a significant gap in designing effective development interventions in feedback-constrained settings. Paradoxically, these are often the very contexts that embed the highest development needs (Nielsen and Randall, 2013).

One promising avenue forward is meta-cognition, a multifaceted psychological construct increasingly recognised as central to leadership development (Day et al., 2014) because of its ability to personalise learning as needed and trigger reflective thinking (Hannah and Avolio, 2010; Lauzier and Bilodeau Clarke, 2024). By definition, meta-cognition involves the capacity to monitor, evaluate and regulate one’s own cognitive processes, as well as to infer and understand others’ mental states (Pedone et al., 2017). These latter abilities are naturally involved in feedback processes, which require recipients to interpret discrepancies between self-assessments and other assessments and adjust their behaviour accordingly (Lacerenza et al., 2017). Among meta-cognitive abilities, perspective-taking, defined as the capacity to adopt another person’s viewpoint cognitively (Siyal and Peng, 2018; Pedone et al., 2017), is particularly relevant in feedback-constrained environments. Indeed, even though work-related literature studies on perspective-taking have primarily focused on employees’ viewpoints, with almost no attention paid to supervisors (Grutterink and Meister, 2022), perspective-taking may help supervisors anticipate team members’ perceptions and enhance self-awareness in the absence of upward feedback.

1.1 The present study

To our knowledge, no study has assessed whether supervisors’ predictions align with their teams’ actual perceptions of their behaviours. If proven accurate, then such predictions could serve as a cost-effective alternative to upward feedback, helping to design developmental initiatives in feedback-constrained environments. Against this backdrop, the current study examines the level of agreement between supervisors’ predictions (of their own teams) and teams’ actual scores on supervisors’ management competencies. Our goal was twofold:

to determine whether systematic bias exists in supervisors’ predictions (i.e. whether they consistently over- or under-estimate their teams’ perceptions); and
to evaluate whether prediction accuracy falls within a meaningful and practically acceptable range.

As an additional validity check, using the same sample and content, we also test whether supervisors’ self-assessments are systematically inflated relative to their teams’ evaluations, indicating an overrating bias.

To test this, in a field training context, 44 supervisors first self-assessed their management style (e.g. workload/conflict management and interpersonal competencies) and then were asked to predict how their teams would evaluate them. Specifically, the management competencies that were the subject of the prediction exercise capture every day, role-related managerial behaviours – how supervisors communicate, organise and design work – that indirectly support employee well-being by shaping the psychosocial work environment (Yarker et al., 2022; Cioffi et al., 2025a). These self-perceptions and predictions were compared to actual team ratings using the Bland–Altman method, the recommended approach for assessing agreement between two measurement methods (Bland and Altman, 1999; Haghayegh et al., 2020). Unlike association indices (e.g. Pearson’s r and intraclass correlation coefficients) or simple mean comparisons, which may appear “appropriate” because they show whether people differ or converge, these alternatives can mask systematic (dis)agreement and are not suitable for evaluating measurement interchangeability (Haghayegh et al., 2020). Agreement analysis with Bland–Altman instead quantifies how far apart two methods are likely to be for an individual case (e.g. supervisor-team), decomposing error into bias (mean difference between measurement methods) and limits of agreement in the original units, precisely the criterion needed when deciding whether two scores can be substituted in practice (Menghini et al., 2021).

By doing so, this study makes a significant contribution to both theory and practice in the field of leadership development. First, by theorising the supervisors’ prediction-based reflection as a workable pathway when upward feedback is infeasible, this study answers recent calls to align feedback research with the complex realities of organisational life (Anseel and Sherf, 2025). Second, by using Bland–Altman analysis, we shift the evaluation criterion in self–other research from convergence to interchangeability (agreement), providing the decision-relevant test for whether supervisor predictions can stand in for team ratings, something association indices cannot establish (Haghayegh, et al., 2020). Third, we specify metacognition, especially perspective-taking, as a mechanism for leader development that can approximate (and prepare for) upward feedback. This integrates metacognitive theory into leadership development accounts and operationalises the often-recommended but seldom implemented reflective components of leadership development (Day, et al., 2014; Hannah and Avolio, 2010) by also responding to recent calls to study the unexplored potential of supervisors’ perspective-taking ability (Grutterink and Meister, 2022; Taylor et al., 2012). Additionally, by embedding an accuracy-and-agreement test inside an intervention, we respond to reviews urging tighter theory–practice links in leadership development and more rigorous evaluation logic for development tools (Lacerenza et al., 2017). Finally, from a practical point, possibly the main contribution of the study, we investigate innovative leadership development methods under real-world constraints to inform future organisational interventions and prioritise activities based on effort gain-balance (von Thiele Schwarz et al., 2021). Therefore, collectively, these implications position prediction-based reflection as an evidence-informed, scalable option for feedback-constrained settings, while preserving the evaluative rigour needed to judge when and how supervisor predictions can responsibly stand in for team ratings.

1.2 The hypothesis of accuracy

Self-enhancement motives systematically inflate leaders’ self-ratings, especially among certain subgroups, such as males, older managers and those at higher levels, limiting the developmental value of self-reports as stand-alone inputs (Fleenor et al., 2010; Lee and Carpenter, 2018). When supervisors are asked instead to predict how their teams would evaluate their behaviours, these motives may be attenuated, because the focus shifts from protecting one’s self-image and reputation (crucial for leaders) to inferring others’ perceptions (Taylor et al., 2012), an exercise grounded in perspective-taking (Pedone et al., 2017). Indirect field evidence is consistent with this shift: the degree to which leaders’ predictions align with follower evaluations of interpersonal competencies relates more strongly to externally judged managerial effectiveness than does conventional self–other convergence, suggesting that predictions capture a distinct, developmentally relevant facet of awareness (Taylor et al., 2012). Accordingly, we hypothesise that, on average, supervisors will accurately predict their direct teams’ evaluations of their management competencies, reflecting the attenuation of self-enhancement motives under prediction:

H1.

Supervisors will accurately predict direct teams’ evaluations of their management competencies.

2. Method

2.1 Participants and procedure

The full sample included 233 employees (approximately 70% of the baseline sample of 340) and their 44 supervisors, working in three italian public administrations including both regional and local government bodies. Recruitment was initiated through a cooperation agreement between University of Bologna and the participating public administrations. Internal staff within each organisation supported the dissemination of study invitations. Participation was voluntary for both supervisors and employees. Because of missing responses, the analytic sample for the supervisors’ self-assessment versus team ratings dyad comprised 43 supervisors and 231 employees. For the supervisors’ prediction versus team ratings dyad, the full sample was retained for all competencies except AAIA and SIMI, for which data were available for 42 supervisors and 221 employees. For transparency, all analyses used all available observations.

Participants were mainly drawn from administrative positions, with routine office tasks predominating. Supervisors held positions at middle management levels. No demographic information was asked because of privacy concerns. On average, employees were nested within teams of 7.73 members (SD = 6.10). Data stems from a wide organisational intervention project to promote employees’ well-being via supervisors’ development. Specifically, the intervention consisted in enhancing key stress-preventive management competencies of supervisors to improve teams’ psychosocial work environment (Cioffi et al., 2025a) using self-reflection, development needs identification and group discussion activities (Toderi et al., 2025).

Before starting with training programs, supervisors and employees filled an online questionnaire (with an attention check) collecting self-perception and teams’ perception of supervisors’ management style. Then, one week after, supervisors completed a structured prediction exercise following three preparation phases in a single training session:

a two-hour seminar introducing the content of the exercise (i.e. management competencies);
exercise’s goals explanation, such as to foster self-awareness (i.e. identifying development needs) and provide individual and confidential report in a subsequent training session; and
task instructions, inviting supervisors to adopt the team’s perspective and respond as they believed the team, on average, would rate the items.

All participants provided informed consent before the collection of data. This study was approved by the Institutional Bioethics Committee of the University of Bologna (Protocol Number 0128009, May 8, 2024).

2.2 The measures

Two validated instruments were used for the predictive task. The first instrument was the Stress Management Competencies Indicator Tool (Toderi and Sarchielli, 2016; Yarker et al., 2008), which allow the evaluation of four core supervisory competencies. Respectful and Responsible (RR) captures integrity, emotional regulation and respectful behaviour (e.g. “Acts calmly in pressured situations”). Managing and Communicating Existing and Future Work (MCW) reflects skills in workload structuring, problem-solving and autonomy enhancement (e.g. “Deals with problems as soon as they arise”). Reasoning and Managing Difficult Situations (RDS) assesses the ability to constructively resolve conflicts (e.g. “Acts as a mediator in conflict situations”), while Managing the Individual within the Team (MIT) focuses on availability and empathy (e.g. “Tries to see things from my point of view”).

The second measure, the Digital Stress-Preventive Management Competencies Indicator Tool (Cioffi et al., 2025b), assesses supervisors’ behaviours in ICT-mediated contexts. It includes two dimensions: Supportive ICT-Mediated Interaction (SIMI), referring to clear communication and timely support (e.g. “Provides necessary feedback to carry out work when working remotely”), and Avoidance of Abusive ICT Adoption (AAIA), which addresses the prevention of excessive digital practices and the safeguarding of work-life boundaries (e.g. “Sends messages/emails with work-related requests outside of working hours”).

All items were rated on a five-point Likert scale ranging from 1 (strongly disagree) to 5 (strongly agree). Descriptive statistics and reliability coefficients (Cronbach’s alpha) for each scale are presented in Table 1.

Acknowledging the team-based data structure, we calculated the agreement indices to justify aggregating employee ratings to the team level. As similar studies, we used the mean uniform within-group agreement index [rWG(j)] and the Intraclass correlation coefficient ICC(1) (Cioffi et al., 2025b; Zhao et al., 2024). Suggested interpretation thresholds are rWG(j) above 0.60 and ICC(1) above 0.12 (James, 1982). For Respectful and Responsible (RR), agreement was strong [rWG(j) = 0.90] and between-team clustering was substantive [ICC(1) = 0.166; F(43,189) = 2.053; and p < 0.001]. Managing and Communicating Work (MCW) showed similarly high consensus [rWG(j) = 0.87] with meaningful clustering [ICC(1) = 0.153; F = 1.945; and p = 0.001]. For Reasoning and Managing Difficult Situations (RDS), consensus remained high [rWG(j) = 0.89], while clustering was modest but statistically reliable [ICC(1) = 0.078; F = 1.447; and p = 0.049]. Managing the Individual within the Team (MIT) showed very strong consensus [rWG(j) = 0.91] and clear clustering [ICC(1) = 0.121; F = 1.727; and p = 0.007]. The digital competencies behaved similarly: Supportive ICT–Mediated Interaction (SIMI) showed high consensus [rWG(j) = 0.85] and significant clustering [ICC(1) = 0.109; F = 1.647; and p = 0.013]; Avoidance of Abusive ICT Adoption (AAIA), despite strong consensus [rWG(j) = 0.87], exhibited negligible between-team variance [ICC(1) = 0.013; F = 1.074; and p = 0.34]. Taken together, these indices indicate that all constructs meet conventional within-team consensus thresholds, supporting the formation of team-level composites, even if, for AAIA, caution is needed given its low ICC(1).

2.3 Statistical analysis

Agreement between supervisor data and team scores was assessed using the Bland–Altman method (Bland and Altman, 1999), which estimates:

the bias (mean difference between supervisor and team score), tested for significance against zero to identify systematic difference between measurement methods;
the means (or magnitude; that is, the average between the two measurement methods); and
the limits of agreement (LoA), calculated as bias ± 1.96 × SD of the differences, representing the 95% confidence range within which most prediction errors are expected to fall.

Before Bland–Altman analysis, the assumptions of normality of the differences (with the Shapiro–Wilk test and Q–Q plots), constant bias (regressing differences on means) and homoscedasticity (regressing the absolute values of the residuals obtained from the constant bias test on the magnitude) were assessed (Menghini et al., 2021). In cases of assumption violations, we applied the regression-based approach, modelling bias and LoA as functions of the measurement magnitude, as recommended (Menghini et al., 2021; Haghayegh et al., 2020). Then the first step was to assess the existence of systematic difference between the dyads supervisors’ self-assessment vs teams’ score and supervisors’ prediction vs teams’ score. Following this, the Bland–Altman plots were generated for the dyad (2), displaying the mean of the two scores on the x-axis and their difference on the y-axis, along with lines for the bias and upper/lower LoA. In the regression-based case, the mean value of the x-axis (i.e. the mean of the measurement means) was then substituted to compute an interpretable LoA at a representative level.

To evaluate practical significance, we pre-specified a tolerance of ± 1.00 on the 1–5 Likert scales (≈20% of range) as our smallest effect size of interest for formative substitution and used ± 1.10/±1.15 as sensitivity bands. In line with the Bland–Altman framework, acceptability of limits must be defined by the intended use, not by statistical significance; limits are “acceptable” only relative to decision needs (e.g. development needs diagnosis) (Bland and Altman, 1999). On five-point ordinal instruments, a one-category shift is the smallest interpretable unit in practice, and thus, a reasonable, anchor-based margin for judging whether leader predictions are close enough to team ratings to support reflection and action planning in low-stakes, feedback-constrained settings. We note that this standard is conservative given the team-level nature and cognitive complexity of the prediction task. However, rather than consistently reaching their LoA extremes, the observed differences between scores are distributed within these limits.

3. Results

Supervisor self-ratings are consistently higher than team ratings on every competency (e.g. RR: 4.27 vs 4.07; and MCW: 4.14 vs 3.83). By contrast, predictions are much closer to team scores (e.g. RR: 4.05 vs 4.05; and SIMI: 4.00 vs 4.00), aligning with the idea that perspective-taking attenuates self-enhancement. Internal consistencies are for some variable modest in self/prediction forms (e.g. SIMI α = 0.63) likely because of small sample size (Cioffi et al., 2025b); by contrast, employee-rated reliabilities are high. Overall, Table 1 previews a pattern in which predictions track teams more closely than self-ratings.

For the self-assessment versus team dyad, supervisors’ self-ratings were systematically higher than team ratings for RR, MCW, RDS, MIT and SIMI: in each case, the bias was positive and the 95% CI excluded zero, indicating reliable overestimation (from 0.19 to 0.30). For the prediction vs team dyad, no systematic bias was detected for RR, MCW, MIT and SIMI: bias estimates were close to zero and 95% CIs spanned zero, showing that supervisors’ predictions tracked team scores on average. See Table 2 for the test of systematic bias.

Proportional bias (i.e. differences varying with the score magnitude) emerged for AAIA in both dyads, self versus team (B = 1.49 and p = 0.001) and prediction versus team (B = 0.49 and p = 0.03) with also clear departures from normality of the differences (Shapiro–Wilk p = 0.01; Q–Q deviations). RDS showed proportional bias in the prediction versus team dyad (B = 0.72 and p = 0.02). Because fixed (constant) bias is not meaningful under proportional bias (Menghini et al., 2021), we estimated and reported regression-based, mean-dependent LoA rather than constant LoA; the fitted functions are provided in Table 2, following Bland–Altman’s extension for trends with magnitude and current reporting guidance (Bland and Altman, 1999). Nevertheless, the results show a clear dissociation between dyads: self-assessments exhibit a robust overrating tendency, whereas predictions generally align with team evaluations.

Bland–Altman plots for all variables are shown in Figures 1–3. The solid line represents the estimated bias (constant or regression-based), while the dotted lines depict the upper and lower limits of agreement. Specifically, RR LoA were −0.99 to 0.99, meeting the strictest predefined standard (±1.00). Practically, most prediction errors should remain within one response category, consistent with interchangeability for formative use. This trend is illustrated in Figure 2 (left), where the agreement limits exceed ± 1, whereas in Figure 1 (left), the limits for RR closely align with the ± 1 benchmark. MCW LoA were −0.92–1.14. Although the upper limit slightly exceeds ± 1.00, the pair lies within ± 1.10, supporting qualified acceptability for low-stakes development. MIT LoA were −1.15–1.15, exceeding ± 1.10 but within ± 1.15; this suggests borderline but usable agreement when facilitators explicitly acknowledge the wider error band. SIMI LoA were −1.03–1.03, narrowly missing ± 1.00 yet within ± 1.10; predictions approximate team views closely enough for formative diagnostics, with occasional deviations slightly over one category. Because for AAIA and RDS the bias varies with the average of the two methods, for clarity, we evaluated the limits at a representative value: the mean of the measurement means (4.29 and 3.86). Substituting this value for AAIA yields limits of agreement of −1.03–1.19. Interpreted against our practical bands, the lower limit meets the ± 1.10 sensitivity margin, while the upper limit slightly exceeds ± 1.15, yet as shown in Figure 3 (right panel), the agreement lines remain close to the bias line, indicating a relatively narrow error band. Conversely, for RDS this produces limits of −1.18–1.49, which exceed the ≈±1 threshold. Accordingly, interchangeability is not acceptable. Despite RDS and AAIA, the study hypothesis is supported considering the unsystematic bias of the prediction-team difference and practical useful limit of agreement for RR, MCW, MIT and SIMI. All limits LoA are presented in Table 3.

4. Discussion

This study examined whether supervisors’ perspective-taking predictions can approximate their teams’ evaluations of day-to-day management competencies when upward feedback is difficult to obtain or implement. Using an agreement test rather than association metrics or mean comparison, we found that prediction–team discrepancies were, on average, systematically unbiased conversely to self-assessment that tends to over-estimation. Additionally, for most competencies, such predictions fell within practice-relevant limits of agreement overall supporting the study hypothesis. Indeed, predictions met the tolerance (≈±1.00) for Respectful and Responsible (RR), Managing and Communicating Work (MCW), Supportive ICT-Mediated Interaction (SIMI) and Managing the Individual within the Team (MIT). In contrast, agreement for Reasoning and Managing Difficult Situations (RDS) and Avoidance of Abusive ICT Adoption (AAIA) did not reach acceptable thresholds, suggesting that team data substitution with supervisor prediction should be judged cautiously.

Taken together, the results provide field evidence that a structured prediction exercise can yield practically usable information for several supervisory competencies, while also delineating boundaries where substitution is not advisable. This adds a complementary angle to the self–other literature, which documents systematic inflation in leaders’ self-ratings, especially on interpersonal domains and among certain subgroups and only moderate convergence with subordinate views (Fleenor et al., 2010; Lee and Carpenter, 2018). By shifting the leader’s task from self-evaluation to inferring team perceptions, the exercise appears to attenuate the tendencies that typically undermine self-other alignment (Taylor et al., 2012), at least for competencies that are frequently observed and behaviourally concrete (e.g. RR, MCW and SIMI). At the same time, the poorer agreement for RDS and AAIA suggests that competencies involving conflict handling and complex social judgment as techno-invasion may be more difficult to predict accurately, possibly because they are episodic, high-stakes and more susceptible to attributional ambiguities. Overall, these findings suggest that prediction-based reflection can serve as a low-burden, evidence-informed adjunct to leadership development when upward feedback is limited, while maintaining an evaluative standard aligned with development decisions.

4.1 Implications for theory and practice

Theoretically, these findings contribute to leadership development literature by supporting the view that perspective-taking is a key ability for supervisors and of high scientific potential (Grutterink and Meister, 2022; Taylor et al., 2012). Our results suggest that when leaders are asked to infer team views – rather than to rate themselves – prediction–other agreement can reach tolerances consistent with formative use on several day-to-day managerial behaviours (e.g. respectful conduct and workload/communication structuring), whereas more episodic or attributionally complex domains (e.g. conflict handling) remain harder to approximate. This competency-specific pattern extends prior reviews by specifying where supervisor–team incongruence is likely to persist and where perspective-taking judgments may close the gap, thus sharpening theory about rater source, content observability and the conditions under which leader-generated information is developmentally diagnostic (Lacerenza et al., 2017). This study also integrates metacognition into leadership development theorising in a concrete, testable way. By operationalising a structured prediction task, it connects reflective, perspective-taking processes – often recommended but under-operationalised in development programs (Hannah and Avolio, 2010) – to observable agreement outcomes with follower data. In doing so, it offers field evidence that perspective-taking can be examined as a proximate mechanism for building self-insight when formal upward ratings are constrained, complementing broader accounts of developmental readiness and reflective learning in leader growth (Day et al., 2014; Taylor et al., 2012). Collectively, these implications contribute a theoretically coherent and methodologically transparent template for studying leader-generated information under real constraints: situating prediction–other alignment within the self–other tradition, embedding metacognitive mechanisms in development theory and privileging agreement over association when the research question concerns responsible substitution in practice (Haghayegh et al., 2020).

From a practical standpoint, when upward feedback is difficult to obtain because of anonymity risks in small teams, survey fatigue, low organisational readiness or limited resources, a structured prediction-team data comparison can offer a pragmatic way to keep development moving without suspending reflective work. Additionally, where upward data are available, comparing predictions with team aggregates and self-perceptions can further inform supervisors on development needs and the complex social dynamics of teams. Thus, to reduce defensiveness and increase learning value, implementing a low-threat calibration loop within the training workflow can be useful. For instance, begin with a brief self-assessment to make supervisors’ self-views explicit; then, collect the team-prediction exercise to prompt perspective-taking. When available, add the team data for comparison. Finally, guide a structured reflection that ends with one or two concrete, time-bound development commitments.

Where survey fatigue is a concern, use the prediction exercise early in a program cycle to prioritise which competencies warrant investment; collect upward ratings later and only for those priorities. Maximise transfer by scaffolding receptivity and use. A brief pre-brief on how to interpret discrepancies (and why they occur) can help prevent defensive attributions, especially among individuals who are likely to overestimate. Where possible, integrate the exercise into existing performance or talent routines (e.g. quarterly check-ins) so that prediction, reflection and progress review are not one-off events but part of a cadence leaders already recognise.

4.2 Limitations and strengths

Several limitations warrant a cautious interpretation of the findings. First, this study was conducted with 44 supervisors and their teams within three Italian public administrations, predominantly in administrative roles. This sampling frame, together with and the absence of demographic covariates (e.g. leader gender, age and hierarchical level), constrains generalizability and prevents testing known moderators of self–other dynamics and prediction accuracy. Second, although we specified a priori agreement thresholds to judge practical interchangeability, these bands are pragmatic rather than normative. Different stakes, scales or organisational uses may justify tighter or looser criteria; accordingly, our acceptance decisions should be viewed as context-specific and replicable, rather than universal. Third, the temporal design is near-contemporaneous: employees rated supervisors one week before leaders completed the prediction task after a preparatory seminar. This strengthens ecological validity but does neither establish whether prediction-based reflection improves subsequent behaviour or calibrates leaders over time nor does it rule out short-term priming effects from the training content on prediction strategies. Fourth, although aggregation of employee ratings to the team level was justified by within-team consensus and clustering, between-team variance was negligible for AAIA, suggesting that for this dimension, the team signal is weak and substitution decisions should be particularly conservative.

Balanced against these constraints, this study has several strengths. It provides field-based, multisource evidence on a theoretically motivated alternative to upward feedback, testing supervisors’ perspective-taking predictions against aggregated team ratings in a real training context rather than in vignette or laboratory settings. Analytically, it deploys a decision-relevant agreement framework, using Bland–Altman logic to quantify bias and limits of agreement in the original metric; this aligns the evaluative criterion with the practical question of whether predictions can stand in for upward ratings for specific competencies and score ranges. The design is transparent and reproducible, with clear diagnostics for normality and proportional bias and competency-level reporting that distinguishes where substitution is defensible (e.g. RR, MCW, SIMI and MIT under specified bands) and where it is not (RDS and AAIA). Substantively, this study examines every day, behaviourally grounded competencies that shape the psychosocial work environment, including digitally mediated behaviours, thereby extending the conversation beyond generic leadership traits. More broadly, the work offers a practical template for organisations operating under feedback constraints: a lightweight predict–compare–reflect loop embedded in training, coupled with explicit thresholds and competency-specific decisions, which preserves confidentiality and reduces burden while maintaining an evaluative standard suited to formative development.

5. Future research

Future research should examine whether prediction-based reflection merely approximates team views at a single point in time or can be calibrated through repeated cycles to improve accuracy and, critically, to drive behavioural change. Therefore, longitudinal and experimental designs, embedding multiple predict–compare–reflect iterations over months, focused to tests of within-leader change, as well as downstream transfer (e.g. employee well-being) are warranted. Experimental trials could compare a prediction-only pathway against standard upward feedback and hybrid protocols, isolating the additive or substitutive value and clarifying the conditions under which predictions can be used responsibly in place of team ratings.

A second priority is to map boundary conditions more precisely. The present competency-specific pattern suggests that some domains (e.g. respectful conduct and work organisation) may be more predictable than others (e.g. conflict management). Future studies should test moderators that theory and practice deem relevant, such as leaders’ feedback orientation or team size and features of the feedback environment (e.g. psychological safety); and contextual factors (public vs private sector). Where possible, collecting minimal demographics will enable tests of known self–other asymmetries across subgroups without compromising confidentiality.

Finally, future studies should integrate implementation and ethics questions that address the real-world uptake of these findings. Cost–benefit analyses can quantify when prediction protocols are preferable to other feedback mechanisms, and process data and qualitative reflections can illuminate how leaders interpret discrepancies.

6. Conclusion

This study examined whether supervisors’ perspective-taking predictions can serve as a viable proxy for upward feedback under the real constraints that often limit multi-source processes. Findings revealed no systematic bias between predictions and team ratings, and for most competencies, the limits of agreement fell within practice-relevant bands. These results indicate that a structured prediction exercise can generate developmentally useful information. In theoretical terms, the findings situate prediction–other alignment as a tractable construct within the self–other tradition and offer a concrete, metacognitively grounded pathway for building supervisors’ self-insight when upward data are not immediately attainable. In practical terms, they provide an implementable, low-burden protocol – self-assess, predict, (when available) compare and reflect – that help practitioners implement formative coaching.

The contribution is necessarily circumscribed: our thresholds are pragmatic rather than universal; prediction accuracy varied by competency; and the design did not test downstream behavioural change. Yet these constraints underscore the value of the present evidence as a field-based starting point. By specifying accuracy as interchangeability in the original metric and demonstrating feasibility in a training context, this study offers a scalable method for leadership development in feedback-constrained environments. Future work should extend this approach with longitudinal intervention studies, sharper boundary-condition tests and stronger outcome linkages; in the meantime, organisations can deploy prediction-based reflection to sustain learning momentum, protect confidentiality where needed and concentrate scarce resources where they are most likely to yield developmental gains.

References

Alansari

Lim

P.W.

Ramani

and

Palaganas

J.C.

(

2024

), “

What do you think of when you hear the word ‘feedback’? A reflective thematic analysis study of interviews

”,

The Clinical Teacher

, Vol.

No.

, p.

e13696

, doi:

https://doi.org/10.1111/tct.13696

Google Scholar

Crossref

Anseel

and

Sherf

E.N.

(

2025

), “

A 25-year review of research on feedback in organizations: from simple rules to complex realities

”,

Annual Review of Organizational Psychology and Organizational Behavior

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1146/annurev-orgpsych-110622-031927

Google Scholar

Crossref

Atwater

and

Waldman

(

1998

), “

360 Degree feedback and leadership development

”,

The Leadership Quarterly

, Vol.

No.

, pp.

423

426

, doi:

https://doi.org/10.1016/S1048-9843(98)90009-1

Google Scholar

Crossref

Atwater

L.E.

Waldman

D.A.

Atwater

and

Cartier

(

2000

), “

An upward feedback field experiment: supervisors’ cynicism, reactions, and commitment to subordinates

”,

Personnel Psychology

, Vol.

No.

, pp.

275

297

, doi:

https://doi.org/10.1111/j.1744-6570.2000.tb00202.x

Google Scholar

Crossref

Bland

J.M.

and

Altman

D.G.

(

1999

), “

Measuring agreement in method comparison studies

”,

Statistical Methods in Medical Research

, Vol.

No.

, pp.

135

160

, doi:

https://doi.org/10.1177/096228029900800204

Google Scholar

Crossref

Bracken

D.W.

and

Rose

D.S.

(

2011

), “

When does 360-degree feedback create behavior change? And how would we know it when it does?

”,

Journal of Business and Psychology

, Vol.

No.

, pp.

183

192

, doi:

https://doi.org/10.1007/s10869-011-9218-5

Google Scholar

Crossref

Braddy

P.W.

Sturm

R.E.

Atwater

L.E.

Smither

J.W.

and

Fleenor

J.W.

(

2013

), “

Validating the feedback orientation scale in a leadership development context

”,

Group and Organization Management

, Vol.

No.

, pp.

690

716

, doi:

https://doi.org/10.1177/1059601113508432

Google Scholar

Crossref

Brett

J.F.

and

Atwater

L.E.

(

2001

), “

360° Feedback: accuracy, reactions, and perceptions of usefulness

”,

Journal of Applied Psychology

, Vol.

No.

, p.

930

, doi:

https://doi.org/10.1037/0021-9010.86.5.930

Google Scholar

Crossref

Cioffi

Balducci

and

Toderi

(

2025a

), “

Stress-preventive management competencies

”,

Encyclopedia

, Vol.

No.

, p.

, doi:

https://doi.org/10.3390/encyclopedia5020046

Google Scholar

Crossref

Cioffi

Balducci

and

Toderi

(

2025b

), “

Digital stress-preventive management competencies: definition, identification and tool development for research and practice

”,

International Journal of Environmental Research and Public Health

, Vol.

No.

, p.

267

, doi:

https://doi.org/10.3390/ijerph22020267

Google Scholar

Crossref

Day

D.V.

Fleenor

J.W.

Atwater

L.E.

Sturm

R.E.

and

McKee

R.A.

(

2014

), “

Advances in leader and leadership development: a review of 25 years of research and theory

”,

The Leadership Quarterly

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.leaqua.2013.11.004

Google Scholar

Crossref

Ellison

L.J.

Steelman

L.A.

Young

S.F.

and

Riordan

B.G.

(

2022

), “

Setting the stage: feedback environment improves outcomes for a 360-degree-feedback leader-development program

”,

Consulting Psychology Journal

, Vol.

No.

, p.

363

, doi:

https://doi.org/10.1037/cpb0000236

Google Scholar

Crossref

Fleenor

J.W.

(

2021

), “What can we learn from research on multisource feedback in organizations?”, in

Student Feedback on Teaching in Schools: Using Student Perceptions for the Development of Teaching and Teachers

, pp.

221

236

, doi:

https://doi.org/10.1007/978-3-030-75150-0_14

Google Scholar

Crossref

Fleenor

J.W.

Smither

J.W.

Atwater

L.E.

Braddy

P.W.

and

Sturm

R.E.

(

2010

), “

Self–other rating agreement in leadership: a review

”,

The Leadership Quarterly

, Vol.

No.

, pp.

1005

1034

, doi:

https://doi.org/10.1016/j.leaqua.2010.10.006

Google Scholar

Crossref

Grutterink

and

Meister

(

2022

), “

Thinking of you thinking of me: an integrative review of meta-perception in the workplace

”,

Journal of Organizational Behavior

, Vol.

No.

, pp.

327

341

, doi:

https://doi.org/10.1002/job.2516

Google Scholar

Crossref

Haghayegh

Kang

H.A.

Khoshnevis

Smolensky

M.H.

and

Diller

K.R.

(

2020

), “

A comprehensive guideline for Bland–Altman and intraclass correlation calculations to properly compare two methods of measurement and interpret findings

”,

Physiological Measurement

, Vol.

No.

, p.

055012

, doi:

https://doi.org/10.1088/1361-6579/ab86d6

Google Scholar

Crossref

Hannah

S.T.

and

Avolio

B.J.

(

2010

), “

Ready or not: how do we accelerate the developmental readiness of leaders?

”,

Journal of Organizational Behavior

, Vol.

No.

, pp.

1181

1187

, doi:

https://doi.org/10.1002/job.675

Google Scholar

Crossref

James

L.R.R.

(

1982

), “

Aggregation bias in estimates of perceptual agreement

”,

Journal of Applied Psychology

, Vol.

No.

, pp.

219

229

, doi:

https://doi.org/10.1037/0021-9010.67.2.219

Google Scholar

Crossref

Katz

I.M.

Moughan

C.M.

and

Rudolph

C.W.

(

2023

), “

Feedback orientation: a meta-analysis

”,

Human Resource Management Review

, Vol.

No.

, p.

100986

, doi:

https://doi.org/10.1016/j.hrmr.2023.100986

Google Scholar

Crossref

Kelloway

E.K.

and

Barling

(

2010

), “

Leadership development as an intervention in occupational health psychology

”,

Work and Stress

, Vol.

No.

, pp.

260

279

, doi:

https://doi.org/10.1080/02678373.2010.518441

Google Scholar

Crossref

Lacerenza

C.N.

Reyes

D.L.

Marlow

S.L.

Joseph

D.L.

and

Salas

(

2017

), “

Leadership training design, delivery, and implementation: a meta-analysis

”,

Journal of Applied Psychology

, Vol.

102

No.

, p.

1686

, doi:

https://doi.org/10.1037/apl0000241

Google Scholar

Crossref

Lauzier

and

Bilodeau Clarke

(

2024

), “

Linking learning goal orientation to learning from error: the mediating role of motivation to learn and metacognition

”,

European Journal of Training and Development

, Vol.

Nos

5-6

, pp.

485

500

, doi:

https://doi.org/10.1108/EJTD-11-2022-0127

Google Scholar

Crossref

Lee

and

Carpenter

N.C.

(

2018

), “

Seeing eye to eye: a meta-analysis of self–other agreement of leadership

”,

The Leadership Quarterly

, Vol.

No.

, pp.

253

275

, doi:

https://doi.org/10.1016/j.leaqua.2017.06.002

Google Scholar

Crossref

London

and

Smither

J.W.

(

1995

), “

Can multi-source feedback change perceptions of goal accomplishment, self-evaluations, and performance-related outcomes? Theory-based applications and directions for research

”,

Personnel Psychology

, Vol.

No.

, pp.

803

839

, doi:

https://doi.org/10.1111/j.1744-6570.1995.tb01782.x

Google Scholar

Crossref

Macey

W.H.

and

Barbara

K.M.

(

2019

), “The ethical context of 360 feedback”, in

Church

A.H.

Bracken

D.W.

Fleenor

J.W.

and

Rose

D.S.

(Eds),

The Handbook of Strategic 360 Feedback

Oxford University Press

, pp.

461

478

, doi:

https://doi.org/10.1093/oso/9780190879860.001.0001

Google Scholar

Crossref

McCarthy

and

Garavan

(

2006

), “

Postfeedback development perceptions: applying the theory of planned behavior

”,

Human Resource Development Quarterly

, Vol.

No.

, pp.

245

267

, doi:

https://doi.org/10.1002/hrdq.1173

Google Scholar

Crossref

McCarthy

A.M.

and

Garavan

T.N.

(

1999

), “

Developing self-awareness in the managerial career development process: the value of 360-degree feedback and the MBTI

”,

Journal of European Industrial Training

, Vol.

No.

, pp.

437

445

, doi:

https://doi.org/10.1108/03090599910302613

Google Scholar

Crossref

Menghini

Cellini

Goldstone

Baker

F.C.

and

de Zambotti

(

2021

), “

A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code

”,

Sleep

, Vol.

No.

, p.

zsaa170

, doi:

https://doi.org/10.1093/sleep/zsaa170

Google Scholar

Crossref

Nielsen

and

Randall

(

2013

), “

Opening the black box: presenting a model for evaluating organizational-level interventions

”,

European Journal of Work and Organizational Psychology

, Vol.

No.

, pp.

601

617

, doi:

https://doi.org/10.1080/1359432X.2012.690556

Google Scholar

Crossref

Pankkonen

Moazami-Goodarzi

and

Vuori

(

2025

), “

Effects of a competence development intervention on informal learning and work engagement – a randomized controlled trial

”,

European Journal of Training and Development

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/EJTD-01-2025-0011

Google Scholar

Crossref

Pedone

Semerari

Riccardi

Procacci

Nicolò

and

Carcione

(

2017

), “

Development of a self-report measure of metacognition: the metacognition Self-Assessment scale (MSAS). Instrument description and factor structure

”,

Clinical Neuropsychiatry

, Vol.

No.

, pp.

185

194

, available at Development of a self-report measure of metacognition: the metacognition Self-Assessment scale (MSAS). Instrument description and factor structureLink to the cited article (

accessed

7 November 2025).

Google Scholar

Raemdonck

and

Strijbos

J.W.

(

2013

), “

Feedback perceptions and attribution by secretarial employees: effects of feedback-content and sender characteristics

”,

European Journal of Training and Development

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/03090591311293275

Google Scholar

Crossref

Siyal

and

Peng

(

2018

), “

Does leadership lessen turnover? The moderated mediation effect of leader–member exchange and perspective taking on public servants

”,

Journal of Public Affairs

, Vol.

No.

, p.

e1830

, doi:

https://doi.org/10.1002/pa.1830

Google Scholar

Crossref

Taylor

S.N.

Wang

and

Zhan

(

2012

), “

Going beyond self–other rating comparison to measure leader self-awareness

”,

Journal of Leadership Studies

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1002/jls.21235

Google Scholar

Crossref

Toderi

and

Sarchielli

(

2016

), “

Psychometric properties of a 36-item version of the ‘stress management competency indicator tool

”,

International Journal of Environmental Research and Public Health

, Vol.

, p.

1086

, doi:

https://doi.org/10.3390/ijerph13111086

Google Scholar

Crossref

Toderi

Cioffi

Houdmont

Yarker

Lewis

and

Balducci

(

2025

), “

Developing stress-preventive management competencies: an evaluation of the mechanism and the process in a training experience

”,

International Journal of Workplace Health Management

, Vol.

No.

, pp.

314

330

, doi:

https://doi.org/10.1108/IJWHM-05-2024-0101

Google Scholar

Crossref

Vogel

Reichard

R.J.

Batistič

and

Černe

(

2021

), “

A bibliometric review of the leadership development field: how we got here, where we are, and where we are headed

”,

The Leadership Quarterly

, Vol.

No.

, p.

101381

, doi:

https://doi.org/10.1016/j.leaqua.2020.101381

Google Scholar

Crossref

von Thiele Schwarz

Nielsen

Edwards

Hasson

Ipsen

Savage

and

Reed

J.E.

(

2021

), “

How to design, implement and evaluate organizational interventions for maximum impact: the Sigtuna principles

”,

European Journal of Work and Organizational Psychology

, Vol.

No.

, pp.

415

427

, doi:

https://doi.org/10.1080/1359432X.2020.1803960

Google Scholar

Crossref

Yarker

Donaldson-Feilder

and

Lewis

(

2022

), “Management competencies for health and wellbeing”, in

Handbook on Management and Employment Practices

Springer International Publishing

Cham

, pp.

available at:

Management competencies for health and wellbeing”Link to a pdf of the cited article (

accessed

21 October 2025).

Google Scholar

Yarker

Lewis

and

Donaldson-Feilder

(

2008

), “

Management competencies for preventing and reducing stress at work: Identifying the management behaviours necessary to implement the management standards: Phase two

”,

available at:

Management competencies for preventing and reducing stress at work: Identifying the management behaviours necessary to implement the management standards: Phase twoLink to a pdf of the cited article (

accessed

21 October 2025).

Google Scholar

Zhao

and

Freeney

(

2024

), “

Being at one with each other: leader–follower (in)congruence in transformational leadership and team performance

”,

Leadership and Organization Development Journal

, Vol.

No.

, pp.

190

207

, doi:

https://doi.org/10.1108/LODJ-05-2022-0240

Google Scholar

Crossref

2026

Glauco Cioffi, Cristian Balducci and Stefano Toderi

Figure 1.

View large Download slide

The set of 2 Bland–Altman plots compares supervisor prediction and team rating values for R R and M C W. In both plots, the x-axis represents the mean of supervisor prediction and team rating values ranging from approximately 1 to 5, while the y-axis represents the difference between supervisor prediction and team rating values ranging from negative 3 to positive 3. Circular data points are clustered mainly between mean values of 3 and 4.5, and difference values between negative 1 and positive 1. Each plot contains a solid horizontal reference line at zero difference and dashed horizontal agreement limits near positive 1 and negative 1. The R R plot shows tightly grouped points around the zero difference line, while the M C W plot includes one higher point above the upper agreement limit, indicating slightly greater variation.

Bland–Altman plots for Respectful and Responsible (RR) (left) and for Managing and Communicating Work (MCW) (right)

Source(s): Authors’ own work

Figure 2.

A set of 2 Bland–Altman plots compares supervisor prediction and team rating values for R D S and M I T.

View large Download slide

The set of 2 Bland–Altman plots compares supervisor prediction and team rating values for R D S and M I T. The x-axis in both plots shows the mean of supervisor prediction and team rating values ranging from approximately 1 to 5, and the y-axis shows the difference between supervisor prediction and team rating values ranging from negative 3 to positive 3. The R D S plot contains an upward-sloping solid trend line with dashed diagonal agreement limits, indicating increasing differences as mean values rise. Circular data points are spread between mean values of 3 and 4.5, with difference values from approximately negative 1.5 to positive 2. The M I T plot includes a solid horizontal reference line at zero difference with dashed horizontal agreement limits near positive 1 and negative 1. Most data points in the M I T plot remain clustered close to the zero difference line, suggesting stable agreement.

Bland–Altman plots for Reasoning Difficult Situations (RDS) (left) and for Managing the Individual Within the Team (MIT) (right)

Source(s): Authors’ own work

Figure 3.

A set of 2 Bland–Altman plots compares supervisor prediction and team rating values for S I M I and A A I A.

View large Download slide

The set of 2 Bland–Altman plots compares supervisor prediction and team rating values for S I M I and A A I A. In both plots, the x-axis displays the mean of supervisor prediction and team rating values ranging from approximately 1 to 5. The S I M I plot has a y-axis ranging from negative 3 to positive 3, while the A A I A plot ranges from negative 4 to positive 4. The S I M I plot contains a solid horizontal reference line at zero difference with dashed horizontal agreement limits near positive 1 and negative 1. Data points cluster closely around the zero difference line. The A A I A plot contains an upward-sloping solid trend line with dashed diagonal agreement limits, showing a gradual increase in differences as mean values rise. Circular data points in both plots are concentrated mainly between the mean values of 3 and 5.

Bland–Altman plots for Supportive ICTs-mediated Interaction (SIMI) (left) and for Avoidance of Abusive ICTs-adoption (AAIA) (right)

Source(s): Authors’ own work

Table 1.

Descriptive statistics of study variables

		Supervisor self-assessment		Teams’ score			Supervisors’ prediction		Teams’ score
Variable	N	Mean (SD)	α	Mean (SD)	α	N	Mean (SD)	α	Mean (SD)	α
RR	43	4.27 (0.37)	0.67	4.07 (0.40)	0.89	44	4.05 (0.31)	0.66	4.05 (0.41)	0.89
MCW	43	4.14 (0.43)	0.83	3.83 (0.42)	0.90	44	3.94 (0.40)	0.82	3.83 (0.41)	0.90
RDS	43	4.05 (0.41)	0.76	3.80 (0.37)	0.95	44	3.94 (0.54)	0.90	3.78 (0.37)	0.95
MIT	43	4.22 (0.40)	0.77	3.95 (0.42)	0.92	44	3.96 (0.48)	0.83	3.95 (0.41)	0.92
SIMI	43	4.20 (0.38)	0.65	4.01 (0.40)	0.87	42	4.00 (0.47)	0.63	4.00 (0.41)	0.87
AAIA	43	4.33 (0.59)	0.69	4.30 (0.32)	0.84	42	4.33 (0.52)	0.67	4.26 (0.37)	0.84

		Supervisor self-assessment		Teams’ score			Supervisors’ prediction		Teams’ score
Variable	N	Mean (SD)	α	Mean (SD)	α	N	Mean (SD)	α	Mean (SD)	α
RR	43	4.27 (0.37)	0.67	4.07 (0.40)	0.89	44	4.05 (0.31)	0.66	4.05 (0.41)	0.89
MCW	43	4.14 (0.43)	0.83	3.83 (0.42)	0.90	44	3.94 (0.40)	0.82	3.83 (0.41)	0.90
RDS	43	4.05 (0.41)	0.76	3.80 (0.37)	0.95	44	3.94 (0.54)	0.90	3.78 (0.37)	0.95
MIT	43	4.22 (0.40)	0.77	3.95 (0.42)	0.92	44	3.96 (0.48)	0.83	3.95 (0.41)	0.92
SIMI	43	4.20 (0.38)	0.65	4.01 (0.40)	0.87	42	4.00 (0.47)	0.63	4.00 (0.41)	0.87
AAIA	43	4.33 (0.59)	0.69	4.30 (0.32)	0.84	42	4.33 (0.52)	0.67	4.26 (0.37)	0.84

Note(s):

RR = respectful and responsible; MCW = managing and communicating work; RDS = reasoning and managing difficult situations; MIT = managing the individual within the team; SIMI = supportive ICTs mediated interaction; AAIA = avoidance of abusive ICTs adoption; and α = cronbach’s alpha

Source(s): Authors’ own work

Table 2.

Test of the systematic bias between the dyads supervisor self-assessment versus teams’ score and supervisors’ prediction versus teams’ score

Variable	N	Supervisor self-assessment versus Teams’ score			Variable	N	Supervisors’ prediction versus Teams’ score
Variable	N	Bias ± SD	−95% CI of Bias	+95% CI of Bias	Variable	N	Bias ± SD	−95% CI of Bias	+95% CI of Bias
RR	43	0.21 ± 0.56	0.04	0.38	RR	44	0.00 ± 0.51	−0.16	0.15
MCW	43	0.30 ± 0.60	0.11	0.48	MCW	44	0.11 ± 0.53	−0.05	0.27
RDS	43	0.26 ± 0.53	0.10	0.42	RDS*	44	−2.63 + 0.72x	NA	NA
MIT	43	0.27 ± 0.55	0.10	0.44	MIT	44	0.00 ± 0.59	−0.17	0.19
SIMI	43	0.19 ± 0.55	0.02	0.36	SIMI	42	0.00 ± 0.53	−0.16	0.16
AAIA*	43	−5.51 + 1.28x	NA	NA	AAIA*	42	−2.03 + 0.49x	NA	NA

Variable	N	Supervisor self-assessment versus Teams’ score			Variable	N	Supervisors’ prediction versus Teams’ score
Variable	N	Bias ± SD	−95% CI of Bias	+95% CI of Bias	Variable	N	Bias ± SD	−95% CI of Bias	+95% CI of Bias
RR	43	0.21 ± 0.56	0.04	0.38	RR	44	0.00 ± 0.51	−0.16	0.15
MCW	43	0.30 ± 0.60	0.11	0.48	MCW	44	0.11 ± 0.53	−0.05	0.27
RDS	43	0.26 ± 0.53	0.10	0.42	RDS*	44	−2.63 + 0.72x	NA	NA
MIT	43	0.27 ± 0.55	0.10	0.44	MIT	44	0.00 ± 0.59	−0.17	0.19
SIMI	43	0.19 ± 0.55	0.02	0.36	SIMI	42	0.00 ± 0.53	−0.16	0.16
AAIA*	43	−5.51 + 1.28x	NA	NA	AAIA*	42	−2.03 + 0.49x	NA	NA

Note(s):

RR = respectful and responsible; MCW = managing and communicating work; RDS = reasoning and managing difficult situations; MIT = Managing the Individual Within the Team; SIMI = supportive ICTs mediated interaction; AAIA = avoidance of abusive ICTs adoption; * = Variable with normality of the differences and/or constant bias assumptions violated; x = the means between measurement methods; and NA: Not applicable

Source(s): Authors’ own work

Table 3.

Upper and lower agreement limits for Bland–Altman plots between supervisors’ predictions versus team ratings

Supervisors’ prediction versus teams’ score
Variable	N	Lower agreement limit	Upper agreement limit
RR	44	−0.99	0.99
MCW	44	−0.92	1.14
RDS*	44	−3.96 + 0.72x	−1.29 + 0.72x
MIT	44	−1.15	1.15
SIMI	42	−1.03	1.03
AAIA*	42	−3.14 + 0.49x	−0.91 + 0.49x

Supervisors’ prediction versus teams’ score
Variable	N	Lower agreement limit	Upper agreement limit
RR	44	−0.99	0.99
MCW	44	−0.92	1.14
RDS*	44	−3.96 + 0.72x	−1.29 + 0.72x
MIT	44	−1.15	1.15
SIMI	42	−1.03	1.03
AAIA*	42	−3.14 + 0.49x	−0.91 + 0.49x

Note(s):

RR = respectful and responsible; MCW = managing and communicating work; RDS = reasoning and managing difficult situations; MIT = managing the individual within the team; SIMI = supportive ICTs mediated interaction; AAIA = Avoidance of abusive ICTs adoption; * = variable with normality and/or constant bias violated; and x = the means between measurement methods

Source(s): Authors’ own work

Alansari

Lim

P.W.

Ramani

and

Palaganas

J.C.

(

2024

), “

What do you think of when you hear the word ‘feedback’? A reflective thematic analysis study of interviews

”,

The Clinical Teacher

, Vol.

No.

, p.

e13696

, doi:

https://doi.org/10.1111/tct.13696

Google Scholar

Crossref

Anseel

and

Sherf

E.N.

(

2025

), “

A 25-year review of research on feedback in organizations: from simple rules to complex realities

”,

Annual Review of Organizational Psychology and Organizational Behavior

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1146/annurev-orgpsych-110622-031927

Google Scholar

Crossref

Atwater

and

Waldman

(

1998

), “

360 Degree feedback and leadership development

”,

The Leadership Quarterly

, Vol.

No.

, pp.

423

426

, doi:

https://doi.org/10.1016/S1048-9843(98)90009-1

Google Scholar

Crossref

Atwater

L.E.

Waldman

D.A.

Atwater

and

Cartier

(

2000

), “

An upward feedback field experiment: supervisors’ cynicism, reactions, and commitment to subordinates

”,

Personnel Psychology

, Vol.

No.

, pp.

275

297

, doi:

https://doi.org/10.1111/j.1744-6570.2000.tb00202.x

Google Scholar

Crossref

Bland

J.M.

and

Altman

D.G.

(

1999

), “

Measuring agreement in method comparison studies

”,

Statistical Methods in Medical Research

, Vol.

No.

, pp.

135

160

, doi:

https://doi.org/10.1177/096228029900800204

Google Scholar

Crossref

Bracken

D.W.

and

Rose

D.S.

(

2011

), “

When does 360-degree feedback create behavior change? And how would we know it when it does?

”,

Journal of Business and Psychology

, Vol.

No.

, pp.

183

192

, doi:

https://doi.org/10.1007/s10869-011-9218-5

Google Scholar

Crossref

Braddy

P.W.

Sturm

R.E.

Atwater

L.E.

Smither

J.W.

and

Fleenor

J.W.

(

2013

), “

Validating the feedback orientation scale in a leadership development context

”,

Group and Organization Management

, Vol.

No.

, pp.

690

716

, doi:

https://doi.org/10.1177/1059601113508432

Google Scholar

Crossref

Brett

J.F.

and

Atwater

L.E.

(

2001

), “

360° Feedback: accuracy, reactions, and perceptions of usefulness

”,

Journal of Applied Psychology

, Vol.

No.

, p.

930

, doi:

https://doi.org/10.1037/0021-9010.86.5.930

Google Scholar

Crossref

Cioffi

Balducci

and

Toderi

(

2025a

), “

Stress-preventive management competencies

”,

Encyclopedia

, Vol.

No.

, p.

, doi:

https://doi.org/10.3390/encyclopedia5020046

Google Scholar

Crossref

Cioffi

Balducci

and

Toderi

(

2025b

), “

Digital stress-preventive management competencies: definition, identification and tool development for research and practice

”,

International Journal of Environmental Research and Public Health

, Vol.

No.

, p.

267

, doi:

https://doi.org/10.3390/ijerph22020267

Google Scholar

Crossref

Day

D.V.

Fleenor

J.W.

Atwater

L.E.

Sturm

R.E.

and

McKee

R.A.

(

2014

), “

Advances in leader and leadership development: a review of 25 years of research and theory

”,

The Leadership Quarterly

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1016/j.leaqua.2013.11.004

Google Scholar

Crossref

Ellison

L.J.

Steelman

L.A.

Young

S.F.

and

Riordan

B.G.

(

2022

), “

Setting the stage: feedback environment improves outcomes for a 360-degree-feedback leader-development program

”,

Consulting Psychology Journal

, Vol.

No.

, p.

363

, doi:

https://doi.org/10.1037/cpb0000236

Google Scholar

Crossref

Fleenor

J.W.

(

2021

), “What can we learn from research on multisource feedback in organizations?”, in

Student Feedback on Teaching in Schools: Using Student Perceptions for the Development of Teaching and Teachers

, pp.

221

236

, doi:

https://doi.org/10.1007/978-3-030-75150-0_14

Google Scholar

Crossref

Fleenor

J.W.

Smither

J.W.

Atwater

L.E.

Braddy

P.W.

and

Sturm

R.E.

(

2010

), “

Self–other rating agreement in leadership: a review

”,

The Leadership Quarterly

, Vol.

No.

, pp.

1005

1034

, doi:

https://doi.org/10.1016/j.leaqua.2010.10.006

Google Scholar

Crossref

Grutterink

and

Meister

(

2022

), “

Thinking of you thinking of me: an integrative review of meta-perception in the workplace

”,

Journal of Organizational Behavior

, Vol.

No.

, pp.

327

341

, doi:

https://doi.org/10.1002/job.2516

Google Scholar

Crossref

Haghayegh

Kang

H.A.

Khoshnevis

Smolensky

M.H.

and

Diller

K.R.

(

2020

), “

A comprehensive guideline for Bland–Altman and intraclass correlation calculations to properly compare two methods of measurement and interpret findings

”,

Physiological Measurement

, Vol.

No.

, p.

055012

, doi:

https://doi.org/10.1088/1361-6579/ab86d6

Google Scholar

Crossref

Hannah

S.T.

and

Avolio

B.J.

(

2010

), “

Ready or not: how do we accelerate the developmental readiness of leaders?

”,

Journal of Organizational Behavior

, Vol.

No.

, pp.

1181

1187

, doi:

https://doi.org/10.1002/job.675

Google Scholar

Crossref

James

L.R.R.

(

1982

), “

Aggregation bias in estimates of perceptual agreement

”,

Journal of Applied Psychology

, Vol.

No.

, pp.

219

229

, doi:

https://doi.org/10.1037/0021-9010.67.2.219

Google Scholar

Crossref

Katz

I.M.

Moughan

C.M.

and

Rudolph

C.W.

(

2023

), “

Feedback orientation: a meta-analysis

”,

Human Resource Management Review

, Vol.

No.

, p.

100986

, doi:

https://doi.org/10.1016/j.hrmr.2023.100986

Google Scholar

Crossref

Kelloway

E.K.

and

Barling

(

2010

), “

Leadership development as an intervention in occupational health psychology

”,

Work and Stress

, Vol.

No.

, pp.

260

279

, doi:

https://doi.org/10.1080/02678373.2010.518441

Google Scholar

Crossref

Lacerenza

C.N.

Reyes

D.L.

Marlow

S.L.

Joseph

D.L.

and

Salas

(

2017

), “

Leadership training design, delivery, and implementation: a meta-analysis

”,

Journal of Applied Psychology

, Vol.

102

No.

, p.

1686

, doi:

https://doi.org/10.1037/apl0000241

Google Scholar

Crossref

Lauzier

and

Bilodeau Clarke

(

2024

), “

Linking learning goal orientation to learning from error: the mediating role of motivation to learn and metacognition

”,

European Journal of Training and Development

, Vol.

Nos

5-6

, pp.

485

500

, doi:

https://doi.org/10.1108/EJTD-11-2022-0127

Google Scholar

Crossref

Lee

and

Carpenter

N.C.

(

2018

), “

Seeing eye to eye: a meta-analysis of self–other agreement of leadership

”,

The Leadership Quarterly

, Vol.

No.

, pp.

253

275

, doi:

https://doi.org/10.1016/j.leaqua.2017.06.002

Google Scholar

Crossref

London

and

Smither

J.W.

(

1995

), “

Can multi-source feedback change perceptions of goal accomplishment, self-evaluations, and performance-related outcomes? Theory-based applications and directions for research

”,

Personnel Psychology

, Vol.

No.

, pp.

803

839

, doi:

https://doi.org/10.1111/j.1744-6570.1995.tb01782.x

Google Scholar

Crossref

Macey

W.H.

and

Barbara

K.M.

(

2019

), “The ethical context of 360 feedback”, in

Church

A.H.

Bracken

D.W.

Fleenor

J.W.

and

Rose

D.S.

(Eds),

The Handbook of Strategic 360 Feedback

Oxford University Press

, pp.

461

478

, doi:

https://doi.org/10.1093/oso/9780190879860.001.0001

Google Scholar

Crossref

McCarthy

and

Garavan

(

2006

), “

Postfeedback development perceptions: applying the theory of planned behavior

”,

Human Resource Development Quarterly

, Vol.

No.

, pp.

245

267

, doi:

https://doi.org/10.1002/hrdq.1173

Google Scholar

Crossref

McCarthy

A.M.

and

Garavan

T.N.

(

1999

), “

Developing self-awareness in the managerial career development process: the value of 360-degree feedback and the MBTI

”,

Journal of European Industrial Training

, Vol.

No.

, pp.

437

445

, doi:

https://doi.org/10.1108/03090599910302613

Google Scholar

Crossref

Menghini

Cellini

Goldstone

Baker

F.C.

and

de Zambotti

(

2021

), “

A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code

”,

Sleep

, Vol.

No.

, p.

zsaa170

, doi:

https://doi.org/10.1093/sleep/zsaa170

Google Scholar

Crossref

Nielsen

and

Randall

(

2013

), “

Opening the black box: presenting a model for evaluating organizational-level interventions

”,

European Journal of Work and Organizational Psychology

, Vol.

No.

, pp.

601

617

, doi:

https://doi.org/10.1080/1359432X.2012.690556

Google Scholar

Crossref

Pankkonen

Moazami-Goodarzi

and

Vuori

(

2025

), “

Effects of a competence development intervention on informal learning and work engagement – a randomized controlled trial

”,

European Journal of Training and Development

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/EJTD-01-2025-0011

Google Scholar

Crossref

Pedone

Semerari

Riccardi

Procacci

Nicolò

and

Carcione

(

2017

), “

Development of a self-report measure of metacognition: the metacognition Self-Assessment scale (MSAS). Instrument description and factor structure

”,

Clinical Neuropsychiatry

, Vol.

No.

, pp.

185

194

, available at Development of a self-report measure of metacognition: the metacognition Self-Assessment scale (MSAS). Instrument description and factor structureLink to the cited article (

accessed

7 November 2025).

Google Scholar

Raemdonck

and

Strijbos

J.W.

(

2013

), “

Feedback perceptions and attribution by secretarial employees: effects of feedback-content and sender characteristics

”,

European Journal of Training and Development

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1108/03090591311293275

Google Scholar

Crossref

Siyal

and

Peng

(

2018

), “

Does leadership lessen turnover? The moderated mediation effect of leader–member exchange and perspective taking on public servants

”,

Journal of Public Affairs

, Vol.

No.

, p.

e1830

, doi:

https://doi.org/10.1002/pa.1830

Google Scholar

Crossref

Taylor

S.N.

Wang

and

Zhan

(

2012

), “

Going beyond self–other rating comparison to measure leader self-awareness

”,

Journal of Leadership Studies

, Vol.

No.

, pp.

, doi:

https://doi.org/10.1002/jls.21235

Google Scholar

Crossref

Toderi

and

Sarchielli

(

2016

), “

Psychometric properties of a 36-item version of the ‘stress management competency indicator tool

”,

International Journal of Environmental Research and Public Health

, Vol.

, p.

1086

, doi:

https://doi.org/10.3390/ijerph13111086

Google Scholar

Crossref

Toderi

Cioffi

Houdmont

Yarker

Lewis

and

Balducci

(

2025

), “

Developing stress-preventive management competencies: an evaluation of the mechanism and the process in a training experience

”,

International Journal of Workplace Health Management

, Vol.

No.

, pp.

314

330

, doi:

https://doi.org/10.1108/IJWHM-05-2024-0101

Google Scholar

Crossref

Vogel

Reichard

R.J.

Batistič

and

Černe

(

2021

), “

A bibliometric review of the leadership development field: how we got here, where we are, and where we are headed

”,

The Leadership Quarterly

, Vol.

No.

, p.

101381

, doi:

https://doi.org/10.1016/j.leaqua.2020.101381

Google Scholar

Crossref

von Thiele Schwarz

Nielsen

Edwards

Hasson

Ipsen

Savage

and

Reed

J.E.

(

2021

), “

How to design, implement and evaluate organizational interventions for maximum impact: the Sigtuna principles

”,

European Journal of Work and Organizational Psychology

, Vol.

No.

, pp.

415

427

, doi:

https://doi.org/10.1080/1359432X.2020.1803960

Google Scholar

Crossref

Yarker

Donaldson-Feilder

and

Lewis

(

2022

), “Management competencies for health and wellbeing”, in

Handbook on Management and Employment Practices

Springer International Publishing

Cham

, pp.

available at:

Management competencies for health and wellbeing”Link to a pdf of the cited article (

accessed

21 October 2025).

Google Scholar

Yarker

Lewis

and

Donaldson-Feilder

(

2008

), “

Management competencies for preventing and reducing stress at work: Identifying the management behaviours necessary to implement the management standards: Phase two

”,

available at:

accessed

21 October 2025).

Google Scholar

Zhao

and

Freeney

(

2024

), “

Being at one with each other: leader–follower (in)congruence in transformational leadership and team performance

”,

Leadership and Organization Development Journal

, Vol.

No.

, pp.

190

207

, doi:

https://doi.org/10.1108/LODJ-05-2022-0240

Google Scholar

Crossref

When upward feedback is not feasible: testing supervisors’ perspective-taking ability as a sustainable alternative

1. Introduction

1.1 The present study

1.2 The hypothesis of accuracy

2. Method

2.1 Participants and procedure

2.2 The measures

2.3 Statistical analysis

3. Results

4. Discussion

4.1 Implications for theory and practice

4.2 Limitations and strengths

5. Future research

6. Conclusion

References

Data & Figures

Contents

Supplements

References

Email Alerts

Cited By

Languages

When upward feedback is not feasible: testing supervisors’ perspective-taking ability as a sustainable alternative

1. Introduction

1.1 The present study

1.2 The hypothesis of accuracy

2. Method

2.1 Participants and procedure

2.2 The measures

2.3 Statistical analysis

3. Results

4. Discussion

4.1 Implications for theory and practice

4.2 Limitations and strengths

5. Future research

6. Conclusion

References

Data & Figures

Contents

Supplements

References

Related

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Languages

Sharing Unavailable