Skip to Main Content
Purpose

This study aims to systematically identify the drivers of corporate greenwashing through the institution, market, organization and cognition (IMOC) framework. Grounded in institutional theory, resource-based view, agency theory and upper echelons theory, this research examines how factors across these four dimensions drive greenwashing through the dual mechanisms of institutional pressures and organizational opportunities, while establishing an effective prediction framework using machine learning techniques.

Design/methodology/approach

Employing a comparative ML approach, we comprehensively identify the factors affecting corporate greenwashing based on the pressure and opportunity dimensions and then explore how these factors influence corporate greenwashing by a sample of A-share listed firms in China from 2011 to 2022. The LightGBM, XGBoost and random forest algorithms are benchmarked against traditional logistic regression, with SHapley Additive exPlanations (SHAP) values used for feature interpretation.

Findings

The paper finds that (1) LightGBM outperforms other models in greenwashing prediction. (2) Institutional ownership concentration emerges as the strongest positive predictor, while moderate digital transformation and market competition exhibit inhibitory effects. (3) Digital economy development amplifies media monitoring efficacy and reduces principal-agent conflicts’ impact on greenwashing.

Practical implications

Regulators should prioritize institutional investors’ governance roles and establish digital transformation thresholds for environmental, social and governance (ESG) compliance. Investors can utilize our ML framework to assess greenwashing risks in portfolio companies.

Originality/value

This research pioneers the integration of institutional theory with explainable artificial intelligence in greenwashing detection, revealing the non-linear impacts of digital transformation. The proposed SHAP-empowered framework enables dynamic monitoring of emerging ESG risks.

Growing ecological challenges and frequent extreme weather events have made sustainable development a global priority. In September 2020, China formally announced its “dual carbon” goals (carbon peaking and neutrality), later reinforced by the 20th National Congress of the Communist Party of China, which emphasized green economic transformation and harmonious coexistence between humans and nature (Niu & Wang, 2024; Fang, Nolan, & Linggui, 2019). However, green transformation faces significant barriers, including high capital demands, long payback periods and investment risks. Under regulatory pressure, firms may engage in greenwashing – misleading environmental claims that appear more sustainable than they are (Du, 2015). Such practices harm consumer rights, market competition and policy effectiveness (Zeng, 2024), posing a significant barrier to sustainable development.

Academic research considers greenwashing as inherently fraudulent, with scholars predominantly applying fraud theory frameworks (triangular/four-factor models) to investigate its determinants. Empirical evidence identifies multi-level drivers encompassing organizational characteristics (firm lifecycle and leadership style) and external pressures (regulatory gaps, market competition and consumer expectations) (Zhang, Yang, & Wang, 2024; Zhang et al., 2024). While prior research examines greenwashing drivers (e.g. regulatory pressures and firm traits; Zhang et al., 2024), gaps remain in predictive modeling, especially amid evolving environmental, social and governance (ESG) complexities (e.g. supply chain or strategic greenwashing).

Contemporary corporate green governance confronts evolving contextual challenges, necessitating systematic identification of critical determinants of environmental conduct and their predictive mechanisms to advance market-oriented ecological regulation. In multifaceted decision-making landscapes, stakeholders increasingly prioritize developing predictive models for greenwashing risks – a strategic imperative enabling cost-efficient preventive measures and market-based internalization of sustainability risks.

This study addresses these gaps by employing machine learning (ML) to predict greenwashing behavior among Chinese A-share firms (2011–2022). Unlike traditional methods prioritizing unbiasedness, ML optimizes predictive accuracy and generalizability (Tiffin, 2019). To systematically disentangle these complexities, we propose the institution, market, organization and cognition (IMOC) framework, integrating institutional theory, the resource-based view, agency theory and upper echelons theory. This framework categorizes drivers into four hierarchical levels: institution, market, organization and cognition. Within this framework, we identify 27 key factors and employ ML to reveal how they drive greenwashing through the dual mechanisms of institutional pressures and organizational opportunities.

While emerging studies have begun to apply ML to predict greenwashing (e.g. Zeng, Wang, & Zeng, 2025; Li, He, & Li, 2025), our research differentiates itself in three pivotal aspects. First, we develop and empirically validate the Institution, Market, Organization, and Cognition (IMOC) framework, which synthesizes institutional, market, organizational and cognitive drivers into a coherent theoretical model. This move beyond identifying isolated predictors to offering a holistic understanding of the greenwashing ecosystem. Second, we incorporate a temporal dimension by examining how the predictive importance of key factors (e.g. media attention and agency costs) shifted after China's “Digital China” initiative (2015), providing dynamic insights into the evolving governance landscape. Third, our analytical focus extends beyond predictive accuracy to interpretability. By leveraging SHapley Additive exPlanations (SHAP), we uncover non-linear relationships and threshold effects (e.g. the conditional impact of digital transformation), thereby “opening the black box” of ML models to yield actionable, theory-informed insights.

Our contributions are threefold.

  1. Theoretical: A unified framework of greenwashing drivers, expanding corporate social responsibility literature;

  2. Methodological: Demonstrated superiority of ML in greenwashing prediction and

  3. Practical: Insights for firms, investors and policymakers to mitigate risks and refine governance.

The paper proceeds as follows: Section 2 reviews greenwashing and its drivers; Sections 35 detail methods, results and analysis; Section 6 concludes with implications.

Greenwashing constitutes strategic information distortion across disciplinary contexts. (1) As symbolic impression management in business ethics, involving decoupled symbolic-substantive actions to accumulate moral capital (Effron, O'Connor, Leroy, & Lucas, 2018); (2) As selective disclosure in economic analysis, characterized by asymmetric environmental reporting (Pizzetti, Gatti, & Seele, 2021); (3) As hyperbolic marketing in management studies, manifesting through unsubstantiated eco-claims (Parguel, Benoît-Moreau, & Larceneux, 2011). This behavioral divergence stems from perceived green performance gaps, wherein firms employ environmental speculation tactics – from information manipulation to outright deception – when actual sustainability outcomes lag expectations (Wang, Zeng, & Li, 2022). Synthesizing fraud theory frameworks, we posit greenwashing as institutionally enabled fraud driven by pressure-opportunity dynamics, where organizational pressures catalyze deception and structural opportunities facilitate its execution.

Environmental performance is a key element for modern corporations to manage their public impression (Yue & Li, 2023). Greenwashing can improve a firm's image, meet regulatory requirements and reduce financial costs by attracting government subsidies. Such “advantages” may encourage more conduction of greenwashing and decrease very green and sustainable behaviors (Zhang, 2022). In addition, pressure has become a behavioral motivation for firms' implementation of corporate greenwashing and drives the corporates' direct interest.

Pressure can be categorized as exogenous and endogenous. First, from the outside, market pressure may lead to greenwashing (Kim & Lyon, 2015; Delmas, Nairn-Birch, & Lim, 2015). As competition intensifies, companies increasingly strive to “demonstrate their commitment to the environment and comply with external environmental regulations, such as public scrutiny, by cultivating a positive environmental corporate image” (Li, Li, Seppänen, & Koivumäki, 2023). Nevertheless, Lyon and Montgomery (2013) find that the external pressure generated by companies' public participation and social media involvement can curb their greenwashing behavior. Second, financial precarity and agency conflicts constitute primary internal catalysts for greenwashing. Resource-constrained firms, particularly in environmentally sensitive industries, frequently adopt low-cost sustainability signaling to mitigate financing pressures and regulatory compliance costs (Zhang, 2022, 2023; Hu, Yu, & Han, 2023b). This fiscal maneuvering manifests through two channels: (1) principal-agent divergence where shareholders prioritize short-term gains via symbolic environmental commitments (Ferrón-Vílchez et al., 2021) and (2) executive myopia incentivizing superficial compliance over substantive green investments (Kim & Lyon, 2015; Li, Wang, & He, 2023). The confluence of fiscal vulnerability and governance deficiencies thus creates institutionalized pathways for environmental deception.

Opportunities are the internal and external conditions under which enterprises can transform their greenwashing motives into actual actions. First, from an external perspective, weak enforcement regimes create permissive environments for symbolic compliance (Bernini, Giuliani, & La Rosa, 2023). Zhang (2022) demonstrates that financial constraints arising from the financial environment drive companies to engage in greenwashing, making these constraints key determinants of such behavior. However, it remains uncertain whether fintech has a positive or negative effect on corporate ethical behavior (Li, Cao, & Wang, 2024a; Li, Miao, & Xu, 2024b; Xie, Chen, Liu, & Wang, 2023). Zhou, Jin, Li, & Tao (2024) show that if companies are politically associated, their tendency to greenwash is even more severe. Therefore, the higher the greenwashing in cohort firms, the higher the greenwashing in target firms (Chen & Dagestani, 2023).

Second, greenwashing behavior is closely relevant to a firm's size. Compared with smaller ones, large firms are more likely to overcome regulatory controls (Bernini et al., 2023). They are more inclined to greenwash to meet stakeholder demands (Testa, Boiral, & Iraldo, 2018). In pursuit of profits, companies may select the positive information regarding their green development to report instead of negative information (Hu, Wang, & Du, 2023c). Management characteristics can also lead to greenwashing. Blome, Foerstl and Schleper (2017) found that authoritative leadership styles influence greenwashing, which is inhibited by moral incentives. Moreover, executives’ ages, risk preferences and experience can affect greenwashing (Xia, Chen, Yang, Li, & Zhang, 2023; Zhang, Qin, & Zhang, 2023). Additionally, customers become more conscious about climate change and environmental protection (Lin & Niu, 2018; Kumar, Manrai, & Manrai, 2017). Consequently, corporations may resort to wash their environmental performance to attract this kind of customers and enhance their social good image (Szabo & Webster, 2021; Ioannou, Kassinis, & Papagiannakis, 2023).

Recent methodological advances have seen the application of ML to greenwashing prediction. For instance, Zeng et al. (2025) employed optimized ML algorithms to identify key predictors of ESG greenwashing, while Li et al. (2025) developed a predictive model focusing on heavy-pollution industries (HPIs). These studies underscore the value of ML in handling complex, non-linear relationships. However, they primarily focus on predictive performance and feature ranking, leaving room for a more theoretically integrated framework that explains why and under what conditions these factors matter across different levels of analysis. Furthermore, the temporal evolution of these drivers in the context of rapid digitalization remains underexplored.

Through a systematic review of existing literature, this study finds that corporate greenwashing is influenced by a complex array of factors across multiple levels. In order to establish a clear analytical structure for this research, we have developed an integrated theoretical framework that categorizes the driving factors into four interrelated theoretical dimensions: the institutional level, market level, internal organizational level and individual cognitive level. This framework moves beyond the traditional “pressure-opportunity” dichotomy by offering a more nuanced classification system to clarify the theoretical rationale for including each variable.

  1. Institutional level. Grounded in institutional theory, this dimension emphasizes the role of the external institutional environment in shaping corporate behavior. It includes regulatory pressure, normative pressure and cognitive mimicry. Specifically, regulatory pressure stems from the intensity of government environmental regulations and policies, leading firms to engage in symbolic compliance to gain legitimacy. Normative pressure arises from societal expectations, such as those from the media and the public, prompting companies to manage environmental impressions to protect their reputation. Cognitive mimicry refers to the tendency of firms, under conditions of uncertainty, to imitate the behaviors (including greenwashing) of other firms in the same industry, resulting in isomorphic practices within the sector.

  2. Market level. Derived from industrial organization theory and resource dependence theory, this dimension focuses on the competitive and resource environments in which firms operate. It encompasses competitive pressure and financing pressure. Competitive pressure typically arises from intense market competition and performance expectation gaps, driving firms to use green claims for differentiation or to alleviate performance pressures. Financing pressure refers to the motivation for firms to cultivate a positive green image to attract green investments or improve financing conditions.

  3. Internal organizational level. Primarily based on agency theory, this dimension examines internal governance, resources and managerial characteristics. The framework further refines this level by highlighting governance deficiencies: weak corporate governance mechanisms create opportunities for managerial opportunism. In terms of resources and capabilities, if a firm lacks the resources or technical capacity to implement substantive green innovations, it is more likely to opt for low-cost symbolic strategies. Regarding managerial cognition and motivation, executives’ personal backgrounds, characteristics and incentive structures significantly influence their preferences and judgments in environmental decision-making.

  4. Individual cognitive level. Drawing on behavioral ethics, this dimension delves into the psychological motivations of individual decision-makers. Although variables at this level are often difficult to observe directly, their underlying mechanisms are critical. For instance, myopic management, characterized by an excessive focus on short-term goals, may lead to the sacrifice of long-term environmental value. Similarly, moral disengagement allows decision-makers to rationalize unethical behaviors, such as greenwashing, through cognitive restructuring mechanisms like shifting responsibility or minimizing consequences.

In summary, this framework not only provides a clear theoretical basis and justification for the predictor variables selected in this study, ensuring systematic and comprehensive variable selection, but also establishes a solid theoretical foundation for interpreting the predictive results of ML in subsequent sections. We posit that greenwashing is not driven by a single factor but rather results from the interplay and accumulation of factors across these four levels.

Based on the available time period of the ESG rating data, we take all the listed companies from 2011 to 2022 in the A-share market as the initial sample. Referring to conventional practice in the existing literature, we exclude some special samples as follows: (1) companies with abnormal status, including ST, *ST and PT (particular transfer firms), (2) companies in the financial field and (3) observations with missing research variables. After that, we get a final sample containing 5,326 observations. We winsorize all continuous variables at 1% and 99%. We collect the ESG performance rating data from the “Morgan Stanley Capital International ESG ratings” in the Wind database, the ESG disclosure ratings data from the Bloomberg database and all other data from the China Stock Market & Accounting Research (CSMAR) database. Table 1 presents the annual and industrial distributions of the sample.

Table 1

Distribution of the sample by a year and industry

YearSample sizeSample proportion (%)IndustrySample sizeSample proportion (%)
20123787.097A591.108
20134047.585B2935.501
20143997.492C3,67068.907
20154798.994D2073.887
20164798.994E1623.042
20175129.613F1432.685
201854210.176G1432.685
201963711.960H120.225
202063411.904I3095.802
202174313.950K1001.878
20221192.234L500.939
   M280.526
   N500.939
   Q330.620
   R520.976
   S150.282
Total5,326100 5,326100

The annual distribution shows an increasing trend year by year, especially after 2017, and the annual sample that can be used to calculate greenwashing is more than 500. This reflects that, on the one hand, China has attached great importance to ecological issues in recent years, and the capital market further echoes the relevant policy orientation. On the other hand, in January 2017, Hong Kong Exchanges and Clearing Limited completed the revision of its ESG Guidelines, which made it mandatory for both A-share and H-share listed companies to disclose their ESG-related reports (Liao, Sun, & Xu, 2023). This act of disclosure also serves as a demonstration for A-share listed companies only.

Regarding industry distribution, C (manufacturing) has the largest sample size, accounting for nearly 70% of the total, which rightly reflects the fact that China has many listed companies in the manufacturing industry. Four industries – that is, H (accommodation and food service), M (scientific research and technical services), Q (health and social work industry) and S (general) – have a smaller sample size, accounting for a combined share of about 1.65%. This is because the number of listed companies in these four industries is already relatively low (Chen, Du, Wu, & Zhou, 2017), and the main businesses of these industries have a relatively small negative impact on the environment and society (Baudry, Bukowski, & Lament, 2024); thus, they may pay less attention to ESG practices.

3.2.1 Dependent variable

We use a calculation formula to measure the extent of corporate greenwashing as follows:

where ERdisi,t denotes the environmental disclosure score of firm i in year t and is measured using the ESG rating score from the Bloomberg database; ER̅disi,t denotes the mean of the environmental disclosure scores of the same industry in the same year; σdis is the standard deviation of the firms' environmental disclosure scores in the current year; ERperi,t denotes the actual score of the environmental performance; ER̅peri,t is the mean of the actual environmental performance scores for the same industry in the same year and σper is the standard deviation of the actual scores of the firms. When the GW value is positive, firms tend to conceal their poor environmental performance by releasing misleading environmental information.

3.2.2 Independent variables

In line with ML practices, where all input variables are treated as features for prediction, we select a comprehensive set of potential predictors for corporate greenwashing based on established literature. These features can be broadly categorized based on the pressure-opportunity framework and other key firm characteristics identified in prior studies. However, it is important to note that all features are treated collectively and their influence is assessed simultaneously by the models. The selected features are defined as follows.

  1. Industry competition: Herfindahl–Hirschman Index (HHI) is used to measure industry competition according to the usual practice. The lower the value of this index, the higher the extent of competition in the industry where the firm is involved.

  2. External attention: We use two indicators to measure the intensity of external attention towards the environment: governmental attention (GOV) and media attention (MEDIA). Specifically, GOV is measured by the magnitude of the low-carbon policy intensity in prefecture-level cities (Dong, Wang, Zhang, Zhang, & Xia, 2024), while MEDIA is measured by the total number of news relative to the company in the year (Hu & Liu, 2023; Yue & Li, 2023).

  3. Environmental uncertainty: We use the environmental uncertainty (EU) index (Kim, Fairclough, & Dibrell, 2017) to measure EU. The larger the value, the higher the EU extent.

  4. Corporate finance: A company's financial status and capabilities usually affect its strategic decision-making. We use leverage and return on assets return on assets as the representatives to measure the company's financial need and profitability (Xia et al., 2023).

  5. Company type: Heavily polluting firms have more incentives to greenwash (Zhang, Qin, & Zhang, 2023). We define the HPIs and code them based on the “Guidelines on Industry Classification of Listed Companies revised by the China Securities Regulatory Commission in 2012” (Guo, 2022).

  6. Agency problem. The two types of agency costs are introduced to measure the firm's agency problem. The first type (AC1) is calculated by the ratio of expenses on sales and administration to operational revenues, while the second type (AC2) is calculated by the ratio of other receivables to total assets (Zhao & Wang, 2024; Zhang, Wang, & Jiang, 2023). The higher the agency cost, the more serious the agency problem.

  7. Green investor: Green investor (GI) plays an active role in the ESG-driving market. We use “the Napierian Logarithm of the number of GIs plus one” to measure this variable (Feng and Yuan, 2024).

  8. Bankruptcy risk: Bankruptcy risk is a crucial force to challenge a firm's ethic. We use the Z-score index (ZSCORE) [1] to measure the corporate bankruptcy risk (Zhang, 2023).

Second, the following variables are used for opportunities.

  1. Financial technology: Referring to Wen, Wang and He (2024), we use the provincial aggregate index of “the Peking University Digital Financial Inclusion Index[2] to measure the level of digital financial technology development (DFIIC).

  2. Digital transformation: We use “the Digital Transformation Index” from the CSMAR Database to measure a firm's digital transformation level (DT). The comprehensive index integrates the strategy-driven, technology-enabled, organization-enabled, environment-enabled, digital achievement and digital application scores.

  3. Analyst following: According to Zhang (2022), we use “the Napierian logarithm of the number of analysts following plus one” to measure analyst following (ANA). The higher the ANA, the lower is the level of information asymmetry.

  4. Economic policy uncertainty: According to Baker, Bloom and Davis (2016), “the Economic Uncertainty index[3] is chosen to measure economic policy uncertainty.

  5. Corporate governance: We use indicators such as institutional ownership (INSTITUTION), internal control (IC) and the proportion of independent directors (INDBOARD) (Nadeesha, 2022) to measure corporate governance characteristics.

  6. Management characteristics: Studies show that management characteristics can significantly influence firms' greenwashing behavior (Yue & Li, 2023; Zhang, Qin, & Zhang, 2023). We consider the chief executive officer's (CEO's) age (CEO_AGE), gender (GENDER), academic background (ACADEMIC_BACK), financial background (FINANCE_BACK), green background (ENV_BACK) and diversity of functional experience (COMP_BACK) as variables predicting greenwashing behavior.

  7. Political affiliation: We mainly use the CEOs’ government background to measure their corporate political affiliation (GOV_BACK). Specifically, the value is taken as 1 if the CEO has served in a government department and 0 otherwise.

  8. Family-owned company: We take family-owned company as a dummy variable to measure whether the company is controlled by family members. It takes the value of 1 if the company is a family business and 0 otherwise (Habbash, 2013). A family business is one in which at least one family member with kinship ties – besides the actual controller – holds shares in, manages or controls the listed or controlling shareholder company (Kim et al., 2017).

  9. Technology innovation: We adopt R&D intensity (RD) to measure a firm's technological innovation capability. Specifically, “the ratio of expenditure on research and development to operational revenue” is used to calculate the RD (Liu, Liu, Hu, & Xie, 2020). The higher the RD, the higher the firm's technological innovation ability.

  10. Firm-characteristic features: Based on the literature on ESG and corporate governance, we also include several fundamental firm-level characteristics as features. These include firm size (SIZE), measured as the natural logarithm of total assets; firm age (AGE), calculated as the natural logarithm of the current year minus the year of incorporation plus one; total asset turnover (TAT), which is operating revenue divided by total assets, and management shareholding (MHOLD), the proportion of management's shareholding to total share capital. We also incorporate Tobin's Q (TobinQ), the ratio of market value to total assets; whether the firm is a state-owned enterprise (SOE), a dummy variable equal to 1 for SOEs and 0 otherwise and CEO-chairman duality (DUAL), which is 1 if the chairman and CEO are the same person and 0 otherwise. The detailed definitions and descriptive statistics of all variables are presented in Appendix Table A1.

We selected three machine learning models – Random Forest, XGBoost and LightGBM – which are currently widely used in the field. Moreover, we use a multiple linear regression model as a benchmark for cross-reference to compare the predictive performance of the ML and traditional econometric models.

The selection of these established ML algorithms is motivated not by algorithmic novelty but by their proven efficacy in capturing complex, non-linear patterns in high-dimensional data – a task for which traditional linear models are often ill-suited. While the absolute improvement in predictive metrics (e.g. R2) over linear regression may appear modest (see Table 2), the primary value of our ML approach lies in its explanatory power. By combining these algorithms with SHAP interpretation, we shift the focus from mere prediction to understanding the nuanced, conditional and non-linear influences of various drivers on greenwashing – insights that are typically obscured in conventional linear models.

In the ML domain, regression tasks are extensively utilized to establish a connection between the continuous target variable and one or more independent variables. To assess the performance of ML algorithms in regression tasks, four commonly used evaluation metrics are applied (Jabeur, Mefteh-Wali, & Viviani, 2024): root mean squared error, mean squared error, mean absolute error and R-squared (R2). These metrics provide quantitative insights into the accuracy and generalizability of the trained models.

SHAP is a method for interpreting the predictions of ML models. It can provide insights into how individual features contribute to specific predictions. This interpretability is crucial for building trust in ML models and understanding their decision-making processes.

4.3.1 SHAP value

SHAP values are based on Shapley values, which were originally used in cooperative game theory to fairly distribute the “payout” among players based on their contributions. The SHAP value for the feature is given by

where ϕi representsthe SHAP value for feature i, N is the set of all features, S is a subset of features excluding feature i and f(S) is the prediction function when applied to the subset S. This formula computes the average marginal contribution of a feature over all possible combinations of features, ensuring a fair allocation of the prediction across all features.

4.3.2 SHAP feature importance

SHAP feature importance is determined by averaging the absolute SHAP values for each feature across all samples. This provides a global view of the most influential features of the model:

where M denotes the number of samples, and ϕij is the SHAP value for feature i in sample j. This formula aggregates the SHAP values to measure the overall importance of each feature in the model.

The dataset is randomly split into a training set and a test set at an 80:20 ratio. To mitigate the impact of randomness and obtain a more robust assessment of model performance, a five-fold cross-validation is implemented on the training set. This involves iteratively dividing the training set into five folds: four folds for training and one for evaluation. Each algorithm's performance is averaged across five folds to provide a more reliable estimate of its generalization ability. To further enhance the ML algorithms’ performance, hyperparameter optimization was conducted using Optuna [4], as shown in Figure 1.

Figure 1
A flow diagram shows dataset splitting into training, validation folds, and test set with hyperparameter tuning.At the top, a long box labeled “Whole dataset” spans the width. Below it, the dataset is divided into two sections: a larger box on the left labeled “Training set” and a smaller box on the right labeled “Test set”. Under the “Training set”, there are five rows, each containing five boxes arranged in columns. Each row includes “Fold 1”, “Fold 2”, “Fold 3”, “Fold 4”, and “Fold 5”. The diagonal cells are highlighted and represent Validation sets. To the right of these rows is a vertical section labeled “Validation set”, containing five stacked boxes labeled “Fold 1”, “Fold 2”, “Fold 3”, “Fold 4”, and “Fold 5”. A curly bracket groups these five validation folds. Further to the right, a box labeled “Hyperparameter tuning” is shown. An upward arrow labeled “Final evaluation” connects from the “Hyperparameter tuning” box to the “Test set”. At the bottom right corner, a small circular logo icon appears next to the text “OPTUNA”.

Workflow of K-fold cross-validation with hyperparameter tuning

Figure 1
A flow diagram shows dataset splitting into training, validation folds, and test set with hyperparameter tuning.At the top, a long box labeled “Whole dataset” spans the width. Below it, the dataset is divided into two sections: a larger box on the left labeled “Training set” and a smaller box on the right labeled “Test set”. Under the “Training set”, there are five rows, each containing five boxes arranged in columns. Each row includes “Fold 1”, “Fold 2”, “Fold 3”, “Fold 4”, and “Fold 5”. The diagonal cells are highlighted and represent Validation sets. To the right of these rows is a vertical section labeled “Validation set”, containing five stacked boxes labeled “Fold 1”, “Fold 2”, “Fold 3”, “Fold 4”, and “Fold 5”. A curly bracket groups these five validation folds. Further to the right, a box labeled “Hyperparameter tuning” is shown. An upward arrow labeled “Final evaluation” connects from the “Hyperparameter tuning” box to the “Test set”. At the bottom right corner, a small circular logo icon appears next to the text “OPTUNA”.

Workflow of K-fold cross-validation with hyperparameter tuning

Close modal

Table 2 summarizes the prediction performance achieved by each ML algorithm on the testing set. These scores reflect the ability of each algorithm to accurately predict the target variable for unseen data points. LightGBM outperforms the other algorithms in all metrics.

Table 2

Comparison of the prediction performance

ModelRMSEMSEMAER2
LinearRegression1.2771.6301.0007.179%
RandomForest1.1031.2170.86030.698%
XGboost1.0821.1710.82833.352%
LightGBM1.0741.1550.82734.274%

Note(s): RMSE = root mean squared error, MSE = mean squared error and MAE = mean absolute error

To ensure the robustness of our findings, we conducted multiple supplementary analyses. First, we transformed the continuous greenwashing measure into a binary indicator (GW_Dummy) and reframed the task as a classification problem. As detailed in  Appendix 2, the LightGBM model maintained superior performance (AUC = 0.763), and the relative importance of key predictors remained consistent. Second, we performed subsample analyses by (1) industry type (manufacturing vs. non-manufacturing) and (2) ownership (state-owned vs. non-state-owned). The core findings regarding the top predictors (INSTITUTION, DT and HHI) were qualitatively unchanged across subsamples. Third, we varied the train-test split ratios (70:30, 85:15) and obtained similar model performance rankings. These tests collectively affirm the reliability and generalizability of our primary results.

To investigate the difference in predictive power between opportunities and pressures, we calculate the feature importance of the opportunities, pressures and baseline variables sets using SHAP feature importance analysis. The sets’ results are 0.529, 0.571 and 0.523, respectively. This indicates that there is no significant difference between the two groups. We then observe the status of each group.

As shown in Figure 2, among the opportunities, institutional ownership is the most important in predicting corporate greenwashing, followed by the digital transformation and regional fintech. The results indicate that external shareholders, digital strategy and regional financial facilities have a more predictive power than other factors such as managerial characteristics.

Figure 2
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.12 in increments of 0.02. The vertical axis lists features from top to bottom: “INSTITUTION”, “DT”, “DFIIC”, “RD”, “ANA”, “CEO underscore AGE”, “IC”, “IND BOARD”, “EPU”, “COMP_BACK”, “GOV_BACK”, “FAM”, “ACADEMIC_BACK”, “ENV_BACK”, “GENDER”, and “FINANCE_BACK”. The data for the bars are as follows: “INSTITUTION”: 0.121. “DT”: 0.095. “DFIIC”: 0.075. “RD”: 0.065. “ANA”: 0.048. “CEO _AGE”: 0.043. “IC”: 0.036. “INDBOARD”: 0.024. “EPU”: 0.022. “COMP_BACK”: 0.013. “GOV_BACK”: 0.011. “FAM”: 0.007. “ACADEMIC_BACK”: 0.004. “ENV_BACK”: 0.003. “GENDER”: 0.003. “FINANCE_BACK”: 0.0027. Note: All numerical data values are approximated.

Feature importance of opportunities

Figure 2
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.12 in increments of 0.02. The vertical axis lists features from top to bottom: “INSTITUTION”, “DT”, “DFIIC”, “RD”, “ANA”, “CEO underscore AGE”, “IC”, “IND BOARD”, “EPU”, “COMP_BACK”, “GOV_BACK”, “FAM”, “ACADEMIC_BACK”, “ENV_BACK”, “GENDER”, and “FINANCE_BACK”. The data for the bars are as follows: “INSTITUTION”: 0.121. “DT”: 0.095. “DFIIC”: 0.075. “RD”: 0.065. “ANA”: 0.048. “CEO _AGE”: 0.043. “IC”: 0.036. “INDBOARD”: 0.024. “EPU”: 0.022. “COMP_BACK”: 0.013. “GOV_BACK”: 0.011. “FAM”: 0.007. “ACADEMIC_BACK”: 0.004. “ENV_BACK”: 0.003. “GENDER”: 0.003. “FINANCE_BACK”: 0.0027. Note: All numerical data values are approximated.

Feature importance of opportunities

Close modal

As shown in Figure 3, among the pressures, market competition is the most important feature in predicting corporate greenwashing, followed by type-II agency problems and the debt-to-assets ratio. The results indicate that market competitive pressures, agency cost pressures and debt risk pressures are critical motivators for greenwashing because they have more predictive power than other factors such as external concerns, pollution status and firm profitability.

Figure 3
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.08 in increments of 0.01. The vertical axis lists features from top to bottom: “HHI”, “AC2”, “LEV”, “MEDIA”, “AC1”, “EU”, “ROA”, “Z SCORE”, “HPI”, “GOV”, and “GI”. The data for the bars are as follows: “HHI”: 0.082. “AC2”: 0.066. “LEV”: 0.065. “MEDIA”: 0.06. “AC1”: 0.054. “EU”: 0.047. “ROA”: 0.047. “ZSCORE”: 0.041. “HPI”: 0.026. “GOV”: 0.023. “GI”: 0.019. Note: All numerical data values are approximated.

Feature importance of pressures

Figure 3
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.08 in increments of 0.01. The vertical axis lists features from top to bottom: “HHI”, “AC2”, “LEV”, “MEDIA”, “AC1”, “EU”, “ROA”, “Z SCORE”, “HPI”, “GOV”, and “GI”. The data for the bars are as follows: “HHI”: 0.082. “AC2”: 0.066. “LEV”: 0.065. “MEDIA”: 0.06. “AC1”: 0.054. “EU”: 0.047. “ROA”: 0.047. “ZSCORE”: 0.041. “HPI”: 0.026. “GOV”: 0.023. “GI”: 0.019. Note: All numerical data values are approximated.

Feature importance of pressures

Close modal

We further employ SHAP to compare the importance of the variables individually. As shown in Figure 4, among all the opportunities and pressures, institutional ownership, a firm's digital transformation and market competition have the best predictive power for corporate greenwashing.

Figure 4
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.12 in increments of 0.02. The vertical axis lists features from top to bottom: “MHOLD”, “INSTITUTION”, “SIZE”, “DT”, “HHI”, “DFIIC”, “AC2”, “TAT”, “LEV”, “RD”, “GROWTH”, “AGE”, “MEDIA”, “AC1”, “ANA”, “EU”, “ROA”, “CEO_AGE”, “ZSCORE”, “TOBINQ”, “IC”, “SOE”, “HPI”, “INDBOARD”, “DUAL”, “GOV”, “EPU”, “GI”, “COMP_BACK”, “GOV_BACK”, “FAM”, “ACADEMIC_BACK”, “ENV_BACK”, “GENDER”, and “FINANCE_BACK”. The data for the bars are as follows: “M HOLD”: 0.122. “INSTITUTION”: 0.121. “SIZE”: 0.115. “DT”: 0.095. “HHI”: 0.083. “DFIIC”: 0.074. “AC2”: 0.066. “TAT”: 0.066. “LEV”: 0.065. “RD”: 0.064. “GROWTH”: 0.063. “AGE”: 0.060. “MEDIA”: 0.060. “AC1”: 0.054. “ANA”: 0.048. “EU”: 0.046. “ROA”: 0.046. “CEO_AGE”: 0.043. “ZSCORE”: 0.041. “TOBINQ”: 0.041. “IC”: 0.037. “SOE”: 0.031. “HPI”: 0.026. “INDBOARD”: 0.023. “DUAL”: 0.022. “GOV”: 0.022. “EPU”: 0.020. “GI”: 0.019. “COMP_BACK”: 0.012. “GOV_BACK”: 0.011. “FAM”: 0.007. “ACADEMIC_BACK”: 0.004. “ENV_BACK”: 0.003. “GENDER”: 0.003. “FINANCE _BACK”: 0.002. Note: All numerical data values are approximated.

Feature importance of variables

Figure 4
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.12 in increments of 0.02. The vertical axis lists features from top to bottom: “MHOLD”, “INSTITUTION”, “SIZE”, “DT”, “HHI”, “DFIIC”, “AC2”, “TAT”, “LEV”, “RD”, “GROWTH”, “AGE”, “MEDIA”, “AC1”, “ANA”, “EU”, “ROA”, “CEO_AGE”, “ZSCORE”, “TOBINQ”, “IC”, “SOE”, “HPI”, “INDBOARD”, “DUAL”, “GOV”, “EPU”, “GI”, “COMP_BACK”, “GOV_BACK”, “FAM”, “ACADEMIC_BACK”, “ENV_BACK”, “GENDER”, and “FINANCE_BACK”. The data for the bars are as follows: “M HOLD”: 0.122. “INSTITUTION”: 0.121. “SIZE”: 0.115. “DT”: 0.095. “HHI”: 0.083. “DFIIC”: 0.074. “AC2”: 0.066. “TAT”: 0.066. “LEV”: 0.065. “RD”: 0.064. “GROWTH”: 0.063. “AGE”: 0.060. “MEDIA”: 0.060. “AC1”: 0.054. “ANA”: 0.048. “EU”: 0.046. “ROA”: 0.046. “CEO_AGE”: 0.043. “ZSCORE”: 0.041. “TOBINQ”: 0.041. “IC”: 0.037. “SOE”: 0.031. “HPI”: 0.026. “INDBOARD”: 0.023. “DUAL”: 0.022. “GOV”: 0.022. “EPU”: 0.020. “GI”: 0.019. “COMP_BACK”: 0.012. “GOV_BACK”: 0.011. “FAM”: 0.007. “ACADEMIC_BACK”: 0.004. “ENV_BACK”: 0.003. “GENDER”: 0.003. “FINANCE _BACK”: 0.002. Note: All numerical data values are approximated.

Feature importance of variables

Close modal

The above analysis indicates that institutional ownership, digital transformation and market competition are key forces with the strongest capacity to predict corporate greenwashing. However, we still do not know exactly how these factors matter. Therefore, we combine the SHAP value interpretation to further explain the predictive pattern.

5.5.1 Institutional ownership

As shown in Figure 5, the SHAP value of institutional ownership increases as the shareholding proportion increases, indicating that institutional ownership may increase the risk of greenwashing. Specifically, from 0% to 40%, we can see a negative influence on greenwashing, which verifies the existing conclusion that institutional investors are sufficiently motivated and capable of playing the role of a monitor or governor against greenwashing (Crane, Koch, & Sebastien, 2019). However, when the proportion is over 40%, the negative influence decreases gradually, and the positive influence is mixed from 40% to 80%. This finding calls back the argument that whether institutional investors are watchdogs or profit grabbers in corporate social responsibility decision-making depends on the stability or dynamics of the market. Jiang and Kim (2015) suggest that in emerging countries, the capital market is not well-matured and institutional investment is less stable, which may easily lead to myopia.

Figure 5
A scatter plot shows values for institutional ownership across different ownership levels.The horizontal axis represents “Institutional ownership” ranging from 0 to 80 in increments of 20. The vertical axis represents “S H A P value for Institutional ownership” ranging from negative 0.5 to 1.0 in increments of 0.5. The plot displays a dense distribution of grey data points. From 0 to 70 on the horizontal axis, most points are clustered slightly below and near 0, generally between negative 0.4 and 0.2. From 70 to 80, the points begin to shift upward toward positive values. Beyond 80, the spread increases significantly, with many points rising above 0.5 and reaching up to 1.0, while some points still extend below 0. Note: All numerical data values are approximated.

SHAP dependence plot of institutional ownership

Figure 5
A scatter plot shows values for institutional ownership across different ownership levels.The horizontal axis represents “Institutional ownership” ranging from 0 to 80 in increments of 20. The vertical axis represents “S H A P value for Institutional ownership” ranging from negative 0.5 to 1.0 in increments of 0.5. The plot displays a dense distribution of grey data points. From 0 to 70 on the horizontal axis, most points are clustered slightly below and near 0, generally between negative 0.4 and 0.2. From 70 to 80, the points begin to shift upward toward positive values. Beyond 80, the spread increases significantly, with many points rising above 0.5 and reaching up to 1.0, while some points still extend below 0. Note: All numerical data values are approximated.

SHAP dependence plot of institutional ownership

Close modal

In addition, we observe further dramatic results. As shown in Figure 5, above 80%, the influence of institutional shareholdings is positive and increases in step with the proportion. This means that the more ownership is concentrated among institutional investors, the more likely they are to engage in greenwashing. According to the agency theory, when ownership is over-concentrated, large shareholders are likely to use their absolute right to control manager behavior. Additionally, most Chinese firms are in a period of transition as well as face financial constraints. Thus, instead of developing real green innovations, firms may greenwash their environmental performance as a low-cost strategy to gain the benefit of a good reputation (Wang, Lin, & Xie, 2022).

5.5.2 Firm’s digital transformation

Figure 6 shows the trend wherein the SHAP value decreases as the firm's digital transformation intensifies, which indicates that high digitalization may inhibit its greenwashing behavior. Note that the negative influence becomes significant from the score of 50, before which the influence is steadily positive. This result suggests that the digital technology empowerment effect inhibits greenwashing only when digital transformation reaches a certain level. Otherwise, when the firm is in the primary stage of digital exploration, transformational costs and challenges aggravate the burden of capital and other resources, which will likely crowd out the investment in green activities. Under pressure from both institutions and markets, a firm may greenwash to attract more investment and facilitate transformation.

Figure 6
A scatter plot shows values for firm’s digital transformation across different levels.The horizontal axis represents “Firm’s digital transformation” ranging from 30 to 70 in increments of 10. The vertical axis represents “SHAP value for Firm’s digital transformation” ranging from negative 1.5 to 0.25 in increments of 0.25. The plot displays a dense distribution of grey data points. From 25 to 40 on the horizontal axis, most points are clustered slightly above and near 0, generally between negative 0.1 and 0.2. From 40 to 50, the points shift downward toward negative values, mostly between negative 0.4 and 0.1. From 50 to 60, the distribution continues downward with increased spread, with many points between negative 0.5 and 0.1. Beyond 60, the spread increases significantly, with many points extending downward to negative 1.5 and a few points near 0. Note: All numerical data values are approximated.

SHAP dependence plot for the digital transformation (DT) score

Figure 6
A scatter plot shows values for firm’s digital transformation across different levels.The horizontal axis represents “Firm’s digital transformation” ranging from 30 to 70 in increments of 10. The vertical axis represents “SHAP value for Firm’s digital transformation” ranging from negative 1.5 to 0.25 in increments of 0.25. The plot displays a dense distribution of grey data points. From 25 to 40 on the horizontal axis, most points are clustered slightly above and near 0, generally between negative 0.1 and 0.2. From 40 to 50, the points shift downward toward negative values, mostly between negative 0.4 and 0.1. From 50 to 60, the distribution continues downward with increased spread, with many points between negative 0.5 and 0.1. Beyond 60, the spread increases significantly, with many points extending downward to negative 1.5 and a few points near 0. Note: All numerical data values are approximated.

SHAP dependence plot for the digital transformation (DT) score

Close modal

An advanced level of digitalization, where digital technologies are deeply integrated with the operations, increases resource efficiency, reduces the cost of fulfilling environmental responsibilities and provides incentives to undertake substantive green activities. Artificial intelligence (AI)-enabled resource optimization reduces environmental compliance costs while improving profitability, thereby alleviating financial pressures that traditionally incentivize symbolic sustainability claims (Cao, Li, Hu, Wan, & Wang, 2023). The resultant standardized data streams foster unprecedented environmental disclosure transparency, enabling real-time public verification through advanced analytical tools that heighten greenwashing detection risks. Concurrently, digitally transformed organizations become hyper-visible entities subjected to intensified multilateral scrutiny from media, investors and industry watchdogs – a phenomenon amplified by machine-readable sustainability reporting formats that enable automated compliance audits (Hu, Han, & Zhong, 2023a; Martínez-Peláez et al., 2023). This technological triad systematically reconfigures the cost-benefit calculus, making substantive green investments strategically preferable to deceptive environmental posturing.

5.5.3 Market competition

As Figure 7 shows, changes in the SHAP value of market competition have a U-shaped impact on corporate greenwashing. This indicates that both extreme competition and monopoly stimulate corporate greenwashing, while market competition within an appropriate range (HHI = 0.3–0.9) effectively inhibits greenwashing.

Figure 7
A scatter plot shows values for market competition across different levels.The horizontal axis represents “Market competition” ranging from 0.2 to 1.0 in increments of 0.1. The vertical axis represents “SHAP value for Market competition” ranging from negative 0.6 to 0.6 in increments of 0.2. The plot displays a dense distribution of grey data points. From 0.2 to 0.4 on the horizontal axis, points are spread above and below 0, generally between negative 0.3 and 0.5. From 0.4 to 0.7, most points are clustered slightly below 0, generally between negative 0.3 and 0.1. From 0.7 to 0.9, points gradually shift upward toward 0, with a range between negative 0.1 and 0.2. At 1.0, there is a vertical concentration of points with values ranging from negative 0.2 to 0.4. Note: All numerical data values are approximated.

SHAP dependence plot for market competition

Figure 7
A scatter plot shows values for market competition across different levels.The horizontal axis represents “Market competition” ranging from 0.2 to 1.0 in increments of 0.1. The vertical axis represents “SHAP value for Market competition” ranging from negative 0.6 to 0.6 in increments of 0.2. The plot displays a dense distribution of grey data points. From 0.2 to 0.4 on the horizontal axis, points are spread above and below 0, generally between negative 0.3 and 0.5. From 0.4 to 0.7, most points are clustered slightly below 0, generally between negative 0.3 and 0.1. From 0.7 to 0.9, points gradually shift upward toward 0, with a range between negative 0.1 and 0.2. At 1.0, there is a vertical concentration of points with values ranging from negative 0.2 to 0.4. Note: All numerical data values are approximated.

SHAP dependence plot for market competition

Close modal

First, a more competitive market allows a firm to have less pricing power. In this case, it is difficult for a firm to transfer the costs of its environmental activities to consumers. High prices trigger product market risks such as consumer shifting or substitution, which harm a firm's competitiveness and market share (Hu, Yu, & Han, 2023). To appeal to the consumer market and investor demand for green products, firms may use fake advertisements, exaggerated ESG performance and other greenwashing approaches to enhance their image and earnings (Nardi, 2022). In contrast, a few large corporations in a monopolized industry have the advantage of controlling the price and supply, which makes it difficult for consumers to bargain for green claims and thus drive innovation.

The global ascendancy of digital economies, now constituting core national competitiveness, finds particular manifestation in China's strategic “Digital China” initiative. Since its 2015 elevation to national policy, this technological paradigm shift has propelled China's digital economy to RMB 56.1 trillion (44% of gross domestic product) by 2023, fundamentally reconfiguring industrial operations and governance frameworks. Our temporal analysis of techno-economic paradigm shifts (pre-/post-2015 bifurcation) reveals evolving greenwashing predictors: institutional ownership and corporate digitalization demonstrate growing explanatory power, whereas media influence (MEDIA) and principal-agent conflicts (AC2) exhibit non-linear feature importance trajectories. This empirical evidence underscores digitalization's dual role as both an economic accelerator and a corporate accountability modulator within transitional economies.

5.6.1 Media attention

Empirical analysis reveals media attention's predictive importance for greenwashing ascended from 16th to 4th position post-2015 (Figures 8 and 9), surpassing market competition and trailing only institutional ownership and digital transformation. This growing influence reflects media's reconfigured role in digital economies, where monitoring mechanisms have evolved from traditional reportage to AI-enhanced investigative analytics (Sun, Wang, Sun, & Zhang, 2023; Lungu, Georgescu, & Juravle, 2024). Technological convergence enables three transformative capabilities: (1) blockchain-enabled supply chain forensics ensuring commitment veracity (Jedynak, 2024); (2) cross-domain data integration generating multidimensional sustainability assessments and (3) algorithm-driven public engagement platforms amplifying grassroots environmental oversight. These advancements establish the media as both technological auditors and social accountability amplifiers, compelling enhanced disclosure transparency while democratizing green governance through crowdsourced vigilance.

Figure 8
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.14 in increments of 0.02. The vertical axis lists features from top to bottom: “SIZE”, “MHOLD”, “HHI”, “AC2”, “TAT”, “ZSCORE”, “ROA”, “DFIIC”, “GROWTH”, “RD”, “INSTITUTION”, “LEV”, “AC1”, “TOBINQ”, “ANA”, “MEDIA”, “IC”, “DT”, “EU”, “AGE”, “INDBOARD”, “CEO_AGE”, “HPI”, “GOV”, “GI”, “COMP_BACK”, “SOE”, “ACADEMIC_BACK”, “FAM”, “GOV_BACK”, “EPU”, “FINANCE_BACK”, “DUAL”, “GENDER”, and “ENV_BACK”. The data for the bars are as follows: “SIZE”: 0.15. “MHOLD”: 0.135. “HHI”: 0.13. “AC2”: 0.11. “TAT”: 0.075. “ZSCORE”: 0.072. “ROA”: 0.067. “DFIIC”: 0.064. “GROWTH”: 0.063. “RD”: 0.061. “INSTITUTION”: 0.058. “LEV”: 0.057. “AC1”: 0.052. “TOBINQ”: 0.049. “ANA”: 0.048. “MEDIA”: 0.047. “IC”: 0.046. “DT”: 0.044. “EU”: 0.043. “AGE”: 0.04. “INDBOARD”: 0.034. “CEO_AGE”: 0.029. “HPI”: 0.026. “GOV”: 0.023. “GI”: 0.020. “COMP_BACK”: 0.018. “SOE”: 0.016. “ACADEMIC_BACK”: 0.012. “FAM”: 0.011. “GOV_BACK”: 0.009. “EPU”: 0.006. “FINANCE_BACK”: 0.005. “DUAL”: 0.004. “GENDER”: 0.003. “ENV_BACK”: 0.002. Note: All numerical data values are approximated.

Feature importance (2012–2014)

Figure 8
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.14 in increments of 0.02. The vertical axis lists features from top to bottom: “SIZE”, “MHOLD”, “HHI”, “AC2”, “TAT”, “ZSCORE”, “ROA”, “DFIIC”, “GROWTH”, “RD”, “INSTITUTION”, “LEV”, “AC1”, “TOBINQ”, “ANA”, “MEDIA”, “IC”, “DT”, “EU”, “AGE”, “INDBOARD”, “CEO_AGE”, “HPI”, “GOV”, “GI”, “COMP_BACK”, “SOE”, “ACADEMIC_BACK”, “FAM”, “GOV_BACK”, “EPU”, “FINANCE_BACK”, “DUAL”, “GENDER”, and “ENV_BACK”. The data for the bars are as follows: “SIZE”: 0.15. “MHOLD”: 0.135. “HHI”: 0.13. “AC2”: 0.11. “TAT”: 0.075. “ZSCORE”: 0.072. “ROA”: 0.067. “DFIIC”: 0.064. “GROWTH”: 0.063. “RD”: 0.061. “INSTITUTION”: 0.058. “LEV”: 0.057. “AC1”: 0.052. “TOBINQ”: 0.049. “ANA”: 0.048. “MEDIA”: 0.047. “IC”: 0.046. “DT”: 0.044. “EU”: 0.043. “AGE”: 0.04. “INDBOARD”: 0.034. “CEO_AGE”: 0.029. “HPI”: 0.026. “GOV”: 0.023. “GI”: 0.020. “COMP_BACK”: 0.018. “SOE”: 0.016. “ACADEMIC_BACK”: 0.012. “FAM”: 0.011. “GOV_BACK”: 0.009. “EPU”: 0.006. “FINANCE_BACK”: 0.005. “DUAL”: 0.004. “GENDER”: 0.003. “ENV_BACK”: 0.002. Note: All numerical data values are approximated.

Feature importance (2012–2014)

Close modal
Figure 9
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.14 in increments of 0.02. The vertical axis lists features from top to bottom: “INSTITUTION”, “M HOLD”, “D T”, “MEDIA”, “HHI”, “TAT”, “SIZE”, “RD”, “AGE”, “ANA”, “AC1”, “AC2”, “TOBINQ”, “CEO_AGE”, “GROWTH”, “DFIIC”, “LEV”, “EU”, “ZSCORE”, “IC”, “ROA”, “DUAL”, “INDBOARD”, “SOE”, “HPI”, “GOV”, “COMP_BACK”, “GI”, “GOV_BACK”, “FAM”, “EPU”, “GENDER”, “ACADEMIC_BACK”, “ENV_BACK”, and “FINANCE_BACK”. The data for the bars are as follows: “INSTITUTION”: 0.145. “MHOLD”: 0.11. “DT”: 0.095. “MEDIA”: 0.082. “HHI”: 0.079. “TAT”: 0.074. “SIZE”: 0.073. “RD”: 0.073. “AGE”: 0.063. “ANA”: 0.058. “AC1”: 0.056. “AC2”: 0.057. “TOBINQ”: 0.055. “CEO_AGE”: 0.053. “GROWTH”: 0.053. “DFIIC”: 0.052. “LEV”: 0.051. “EU”: 0.05. “ZSCORE”: 0.05. “IC”: 0.05. “ROA”: 0.049. “DUAL”: 0.030. “INDBOARD”: 0.024. “SOE”: 0.023. “HPI”: 0.023. “GOV”: 0.022. “COMP_BACK”: 0.021. “GI”: 0.020. “GOV_BACK”: 0.018. “FAM”: 0.016. “EPU”: 0.015. “GENDER”: 0.004. “ACADEMIC_BACK”: 0.003. “ENV_BACK”: 0.002. “FINANCE_BACK”: 0.001. Note: All numerical data values are approximated.

Feature importance (2015–2022)

Figure 9
A horizontal bar chart shows feature importance values for different features.The horizontal axis represents “Feature Importance” ranging from 0 to 0.14 in increments of 0.02. The vertical axis lists features from top to bottom: “INSTITUTION”, “M HOLD”, “D T”, “MEDIA”, “HHI”, “TAT”, “SIZE”, “RD”, “AGE”, “ANA”, “AC1”, “AC2”, “TOBINQ”, “CEO_AGE”, “GROWTH”, “DFIIC”, “LEV”, “EU”, “ZSCORE”, “IC”, “ROA”, “DUAL”, “INDBOARD”, “SOE”, “HPI”, “GOV”, “COMP_BACK”, “GI”, “GOV_BACK”, “FAM”, “EPU”, “GENDER”, “ACADEMIC_BACK”, “ENV_BACK”, and “FINANCE_BACK”. The data for the bars are as follows: “INSTITUTION”: 0.145. “MHOLD”: 0.11. “DT”: 0.095. “MEDIA”: 0.082. “HHI”: 0.079. “TAT”: 0.074. “SIZE”: 0.073. “RD”: 0.073. “AGE”: 0.063. “ANA”: 0.058. “AC1”: 0.056. “AC2”: 0.057. “TOBINQ”: 0.055. “CEO_AGE”: 0.053. “GROWTH”: 0.053. “DFIIC”: 0.052. “LEV”: 0.051. “EU”: 0.05. “ZSCORE”: 0.05. “IC”: 0.05. “ROA”: 0.049. “DUAL”: 0.030. “INDBOARD”: 0.024. “SOE”: 0.023. “HPI”: 0.023. “GOV”: 0.022. “COMP_BACK”: 0.021. “GI”: 0.020. “GOV_BACK”: 0.018. “FAM”: 0.016. “EPU”: 0.015. “GENDER”: 0.004. “ACADEMIC_BACK”: 0.003. “ENV_BACK”: 0.002. “FINANCE_BACK”: 0.001. Note: All numerical data values are approximated.

Feature importance (2015–2022)

Close modal

5.6.2 Type-II agency problem

Empirical evidence demonstrates type-II agency problems' diminishing predictive power for greenwashing post-2015 (Figures 8 and 9), reflecting digital economies' capacity to counterbalance controlling shareholders' expropriation tendencies. In concentrated ownership structures typical of emerging markets, majority shareholders traditionally exploit informational asymmetries to prioritize short-term tunneling over sustainable investments (La Porta, Lopez-de-Silanes, & Shleifer, 1999; Zhang, Meng, & Zhang, 2023). Digital transformation introduces three disciplinary mechanisms: (1) algorithmic auditing of historical decision patterns enables real-time tunneling detection; (2) ML-enhanced text analysis deciphers hidden expropriation signals (Lafarre and Van der Elst, 2018) and (3) investor portals like “Hudongyi” of the Shenzhen Stock Exchange and “Yihudong” of the Shanghai Stock Exchange create multilateral oversight through crowdsourced governance (Wang & Tang, 2024). These technological safeguards increase tunneling risks while empowering minority shareholders through enhanced information parity, thereby reorienting dominant shareholders' calculus toward long-term environmental commitments over immediate rent extraction.

Although ESG investment is booming in China's A-share market, a report released by Standard & Poor's [5] shows that investors are concerned about ESG investment, with more than 44% of investors believing that greenwashing is the biggest concern. This study systematically analyzes the drivers of corporate greenwashing by considering two dimensions: opportunity and pressure. Based on a sample of A-share listed companies in China from 2012 to 2022, we use ML approaches to comprehensively examine and compare the predictive effects of 27 specific factors on corporate greenwashing.

First, the results show no significant difference in the overall performance of the two categories of factors – opportunity and pressure – in predicting firms' greenwashing. Among the opportunity factors, the feature importance of the institutional investors' shareholdings, digital transformation and regional fintech where the firms are located are ranked first, second and third, respectively, and are important conditions for the occurrence of greenwashing behavior. Among the pressure factors, the feature importance of market competition, type-II agency problem and asset-liability ratio ranked first, second and third, respectively, and constituted the key motives for greenwashing behavior.

Second, among the many influencing factors, three variables – institutional investor shareholdings, digital transformation and market competition – are the best predictors of corporate greenwashing. Specifically, institutional investor ownership positively predicts corporate greenwashing, which means that greenwashing behavior is more likely to be conducted as the institutional ownership is concentrated, and a certain level of digital transformation has a negative predictive effect on enterprises' greenwashing behavior. However, a U-shaped relationship exists between market competition intensity and greenwashing behavior. It demonstrates that over-dispersed and over-concentrated industries are prone to greenwashing, whereas moderate market competition inhibits it.

Third, the fast development of the digital economy profoundly changes the factors and modes of production and significantly impacts the strategic decision-making of corporate sustainable development. In particular, social media's governance effect was significantly enhanced, and internal agency problems’ stimulus effect on greenwashing behavior was mitigated after 2015 as the digital transformation became a national strategy in China. Finally, by comparing the models, this study confirms that ML algorithms significantly outperform traditional regression models in terms of event prediction. In addition, we find that LightGBM is the most effective model for predicting corporate greenwashing issues.

First, unlike the existing research on the motivational effect of greenwashing, this study makes no harsh assumptions. Instead, it adopts the advanced ML integration algorithm to “learn” from the disordered information, develops the optimal function model, predicts enterprises' greenwashing behavior and obtains more systematic empirical conclusions.

Second, we advance the literature by constructing the IMOC framework, which integrates institutional theory, resource-based view, agency theory and upper echelons theory. Unlike prior studies that isolate specific drivers, our model systematically categorizes 27 factors into institution, market, organization and cognition levels, offering a holistic view of the greenwashing ecosystem. The model expands the understanding and analysis of the dynamics of greenwashing and enriches the literature on ESG and ESG greenwashing behavior.

Third, we move the nascent literature on ML applications in greenwashing (e.g. Zeng et al., 2025; Li et al., 2025) from a primarily predictive focus to an explanatory and diagnostic one. By integrating SHAP interpretability with our IMOC framework, we decode the “black box” of ML predictions. We demonstrate, for instance, that organizational factors like ownership structure primarily function as opportunity modulators, while market factors like competition intensity act as structural pressure generators. This nuanced understanding refines the application of fraud theory in environmental contexts and provides a template for future explainable AI research in sustainability governance.

Fourth, this study obtains several interesting findings through an in-depth investigation of the predictive mechanisms of these key factors. While most existing studies emphasize the monitoring and governance roles of institutional investors, our study finds the opposite: The risk of greenwashing gradually increases as institutional ownership increases. The literature emphasizes market competition as an important pressure factor for firms to manipulate disclosures for impression management (Bagnoli & Watts, 2010; Markarian & Santalo, 2014). However, our study finds that the relationship between market competition intensity and corporate greenwashing is not a simple linear relationship but rather a U-shaped relationship. Our study further supports the condition that digital transformation can effectively restrain greenwashing behavior. We further suggest that this effect becomes apparent only after achieving a certain level of digital transformation.

In addition, our analysis of the time dynamics reveals that the rapid development of the digital economy has important implications for firms' internal and external governance environments. The governance effect of media supervision is significantly enhanced, and the stimulus of greenwashing behavior by intra-firm agency problems is mitigated.

The findings of this study provide the following practical insights. First, we should pay attention to institutional investors' governance and jointly discourage greenwashing behavior. The results suggest that the proportion of institutional investor shareholdings is the most important factor affecting greenwashing behavior. In particular, greenwashing risk increases as shareholding increases. Therefore, companies should take the initiative to disclose detailed environmental performance data to institutional investors, including information on carbon emissions, energy consumption and pollution control, all of which are important for improving corporate performance. This will increase the transparency of information and help firms realize that green development also has economic benefits. In addition, companies should formulate clear long-term green development strategies and integrate them into their overall development plans. By working with institutional investors to develop and implement these strategies, they can effectively advance green governance objectives.

Second, moderate industry competition guarantees green development. Our research shows that dynamic changes in market competition have a complex, non-linear impact on green behavior. Accordingly, the government should perform macro-control functions to guide market competition and prudently optimize resource allocation. Specifically, the government should adopt anti-monopoly policies, laws and regulations for industries with excessive market concentration. This will help prevent enterprises from forming monopolies and enhance market competition. Through policy incentives, large enterprises can be guided to assume more environmental responsibility and enhance their green governance. Moreover, the government can promote the optimization and upgrading of industrial structures and support the development of green and low-carbon industries through industrial policies and economic planning. Moderately regulating and transforming high-pollution and high-energy-consumption industries can reduce adverse impacts on the environment and promote industry-wide green transformation.

Third, enterprises' digital transformation is an important governance mechanism that can inhibit greenwashing behavior. The descriptive statistics indicate that the mean value of digital transformation is 38.369, which is still low. This study's results show that the deep integration of digital technology and enterprise operation and management can effectively exert their enabling effects only when the digital transformation score is higher than 40. Digital transformation can improve the quality of IC, reduce information asymmetry, improve enterprise productivity and profitability, alleviate financing constraints, inhibit enterprise greenwashing and promote substantive green performance.

Fourth, media has become an important subject of external governance with the development of the digital economy. The results show that the media's importance in predicting corporate greenwashing increased from the 16th to the fourth position after 2015, confirming that digital economy development further enhances the media's role in monitoring corporate greenwashing. This dynamic is vividly illustrated by real-world events; for example, the exposure of several carbon emission certificate frauds by Chinese authorities in 2022 [6]. Such incidents, once disclosed, are rapidly amplified by the digital media, leading to heightened public scrutiny and significant reputational damage for the firms involved. This reinforces the media's critical function as a powerful external governance mechanism in the digital era. Therefore, media should fully use digital technologies such as big data analytics, AI and blockchain to enhance the depth and breadth of corporate environmental reporting. Specifically, through digital technology, media can track enterprises' environmental performance more accurately, expose greenwashing behavior and provide authoritative analyses and evaluations. This can effectively curb corporate greenwashing, encourage enterprises to enhance green practices and promote the society's sustainable development.

This study has some limitations. First, our sample only contains data on listed companies in China. Because institutional and cultural differences affect corporate ethical behavior, our findings may lack global generalizability. In the future, we can expand the sample to conduct empirical research and incorporate institutional and cultural differences into the research framework for examination and comparison. Second, ML methods emphasize out-of-sample generalization, do not require prior assumptions and are characterized by a “black box” that lacks the explanatory advantages of the influence mechanism. However, we refer to the existing scholarly research and explain the prediction mechanism of key variables by combining ML methods with human theoretical knowledge. Moreover, the principle of SHAP scatter dependence may be challenging to understand owing to its difference from traditional measurement logic. In future research, a hybrid method combining traditional measurement and ML methods can be attempted to elucidate and interpret the influence mechanism.

All errors are our own.

Table A1

Variable definitions and descriptive statistics

VariableDefinitionMeanMaxSDMedianMin
GWCorporate greenwashing behavior, model (1) for details of the calculation−0.4405.6911.310−0.492−4.708
HHIHerfindahl-Hirschman Index0.7821.0320.2510.9150.221
GOVLow-carbon policy intensity in prefecture-level cities2.85721.5004.3840.0000.000
MEDIALn (Company-related news +1)3.7935.7430.8253.9511.609
EUStandard deviation of sales revenues (industry-adjusted) over the five years1.2406.6431.0960.9480.123
ROANet Income/Total Assets0.0550.2290.0550.046−0.106
HPI1 if firm is in heavy polluting industries and 0 otherwise0.2951.0000.4560.0000.000
AC1(Administration expenses + selling expenses)/Operating revenue0.1410.5760.1160.1060.013
AC2Other accounts receivable/Operating revenue0.0130.1150.0180.0070.000
GILn(The sum of green investor+1)1.1583.3670.9421.0990.000
ZSCOREAltman Z index4.82344.5786.2722.8630.273
DFIICProvincial level financial inclusion index242.763351.53265.717250.62491.000
DTDigital transformation index38.36970.20411.37635.99723.411
ANALn(Analyst followings+1)2.3854.0070.9332.4850.693
EPUEconomic Policy Uncertainty index from Baker et al. (2016) 436.948791.874232.109460.470113.897
INSTITUTIONThe institutional investors' shareholdings/The total number of shares57.06095.38322.53560.6333.563
ICInternal control index of Shenzhen Dibo database667.059880.920121.721683.5250.000
INDBOARDIndependent directors/Total directors0.3750.5710.0550.3640.333
CEO_AGECEO Age51.22265.0005.79151.00036.000
GENDER1 if CEO is males and 0 for females0.9361.0000.2441.0000.000
ACADEMIC_BACK1 if the CEO has worked in a university research organization and 0 otherwise0.1841.0000.3870.0000.000
FINANCE_BACK1 if CEO has worked in banks, securities, or fund companies and 0 otherwise0.0441.0000.2060.0000.000
ENV_BACK1 if CEO's resume includes environment related experience and 0 otherwise0.0481.0000.2130.0000.000
COMP_BACKNumber of CEOs with professional backgrounds2.0384.0000.9012.0001.000
GOV_BACK1 if the CEO has served in the government and 0 otherwise0.1031.0000.3030.0000.000
FAM1 if the listed company is a family business and 0 otherwise0.3231.0000.4680.0000.000
RDR&D expenditures/Operating revenue0.0400.2810.0460.0300.000
SIZELn (Total assets)23.51227.2941.33723.39520.806
LEVTotal debts/Total assets0.4680.8490.1870.4820.075
AGELn(Current year – year of incorporation + 1)18.95933.0005.71919.0006.000
TATOperating revenue/Total assets0.6692.3500.3990.5840.115
MHOLDManagement shareholding/Total share capital0.0750.6420.1500.0010.000
TOBINQMarket value/Total assets2.0799.1671.5081.5480.818
SOE1 if firm is state-owned enterprise and 0 if it is a private enterprise0.4861.0000.5000.0000.000
DUAL1 if the chairman and CEO are the same, and 0 otherwise0.2331.0000.4230.0000.000
GROWTH(Sales revenue for the year t/sales revenue for the year t-1)-10.1782.0890.3390.121−0.392

Note(s): SD = standard deviation, CEO = chief executive officer and R&D = research and development

To address the inherent challenges in quantifying greenwashing and to ensure the robustness of our findings, we conducted an additional analysis using an alternative specification for our dependent variable. While our main analysis uses a continuous variable to capture the magnitude of the disclosure-performance gap, for this check, we transformed it into a binary variable to test the model's ability to predict the occurrence of greenwashing.

We defined a dummy variable, GW_Dummy, which takes a value of 1 if the original greenwashing score GW is greater than zero and 0 otherwise. A value of 1 indicates that a firm's standardized environmental disclosure outranks its substantive environmental performance, signifying a state of greenwashing. This transformed our task from a regression problem into a binary classification problem.

We then retrained the LightGBM, XGBoost and random forest models on this new target variable. We evaluated their performance using standard classification metrics: accuracy, precision, recall, F1-score and the area under the receiver operating characteristic curve (AUC). The results are presented in  Appendix Table A2. The results show that all machine learning models demonstrate strong predictive capability, significantly outperforming a baseline logistic regression. Consistent with our primary findings, LightGBM achieved the best performance. The consistency of these results confirms that our model and the identified key features are robust predictors of greenwashing behavior.

Table A2

Performance of models for binary classification task

ModelAccuracyPrecisionRecallF1-scoreAUC
Logistic regression0.6410.6400.0410.0770.599
Random forest0.7080.8320.2540.3890.748
XGBoost0.7270.7440.3870.5090.759
LightGBM0.7330.7310.4260.5380.763
1.

The data is from CSMAR database.

2.

The data information is from https://idf.pku.edu.cn/yjcg/zsbg/513800.htm

3.

According to Baker et al. (2016), the index is conducted by newspaper-based indices of policy uncertainty in China.

4.

Optuna is a Bayesian optimization framework that efficiently explores the hyperparameter space to identify the combination that yields the best performance. The optimized hyperparameters are then applied to the trained models on the testing set.

6.

“China uncovers fraud scheme in carbon emission reporting,” Upstream Online, March 25, 2022. https://www.upstreamonline.com/energy-transition/china-uncovers-fraud-scheme-in-carbon-emission-reporting/2-1-1184461

Bagnoli
,
M.
, &
Watts
,
G. S.
(
2010
).
Oligopoly, disclosure and earnings management
.
The Accounting Review
,
85
(
4
),
1191
1214
. doi: .
Baker
,
S. R.
,
Bloom
,
N.
, &
Davis
,
S. J.
(
2016
).
Measuring economic policy uncertainty
.
Quarterly Journal of Economics
,
131
(
4
),
1593
1636
. doi: .
Baudry
,
M.
,
Bukowski
,
S.
, &
Lament
,
M. B.
(
2024
).
Financial stability, economic growth and sustainable development
.
London
:
Routledge
.
Bernini
,
F.
,
Giuliani
,
M.
, &
La Rosa
,
F.
(
2023
).
Measuring greenwashing: A systematic methodological literature review
.
Business Ethics, the Environment & Responsibility
,
1
(
4
),
1
19
. doi: .
Blome
,
C.
,
Foerstl
,
K.
, &
Schleper
,
M. C.
(
2017
).
Antecedents of green supplier championing and greenwashing: An empirical study on leadership and ethical incentives
.
Journal of Cleaner Production
,
152
,
339
350
. doi: .
Cao
,
Y.
,
Li
,
X.
,
Hu
,
H. L.
,
Wan
,
G. Y.
, &
Wang
,
S. Y.
(
2023
).
How does digitalization drive the green transformation in manufacturing companies? An exploratory case study from the perspective of resource orchestration theory
.
Journal of Management World
,
39
(
3
),
96
112+126+113
. doi: .
Chen
,
P.
, &
Dagestani
,
A. A.
(
2023
).
Greenwashing behavior and firm value–From the perspective of board characteristics
.
Corporate Social Responsibility and Environmental Management
,
30
(
5
),
2330
2343
. doi: .
Chen
,
Y.
,
Du
,
Y.
,
Wu
,
Y.
, &
Zhou
,
H.
(
2017
).
The study on zombie firms in China using modified FN-CHK method
. In
2017 4th International Conference on Industrial Economics System and Industrial Security Engineering (IEIS)
(pp. 
1
7
).
IEEE
. doi: .
Crane
,
A. D.
,
Koch
,
A.
, &
Sebastien
,
M.
(
2019
).
Institutional investor cliques and governance
.
Journal of Financial Economics
,
133
(
11
),
175
197
. doi: .
Delmas
,
M. A.
,
Nairn-Birch
,
N.
, &
Lim
,
J.
(
2015
).
Dynamics of environmental and financial performance: The case of greenhouse gas emissions
.
Organization & Environment
,
28
(
4
),
374
393
. doi: .
Dong
,
X.
,
Wang
,
C.
,
Zhang
,
F.
,
Zhang
,
H.
, &
Xia
,
C.
(
2024
).
China’s low-carbon policy intensity dataset from national-to prefecture-level over 2007–2022
.
Scientific Data
,
11
(
1
),
213
. doi: .
Du
,
X.
(
2015
).
How the market values greenwashing? Evidence from China
.
Journal of Business Ethics
,
128
(
3
),
547
574
. doi: .
Effron
,
D. A.
,
O'Connor
,
K.
,
Leroy
,
H.
, &
Lucas
,
B. J.
(
2018
).
From inconsistency to hypocrisy: When does ‘saying one thing but doing another’ invite condemnation
.
Research in Organizational Behavior
,
38
,
61
75
. doi: .
Fang
,
C.
,
Nolan
,
P.
, &
Linggui
,
W.
(
2019
).
Routledge handbook of the belt and road
(pp. 
213
217
).
London
:
Routledge
.
Feng
,
J.
, &
Yuan
,
Y.
(
2024
).
Green investors and corporate ESG performance: Evidence from China
.
Finance Research Letters
,
60
, 104892. doi: .
Ferrón‐Vílchez
,
V.
,
Valero‐Gil
,
J.
, &
Suárez‐Perales
,
I.
(
2021
).
How does greenwashing influence managers' decision‐making? An experimental approach under stakeholder view
.
Corporate Social Responsibility and Environmental Management
,
28
(
2
),
860
880
. doi: .
Guo
,
X.
(
2022
).
Environmental regulation and green innovation of enterprises
.
BCP Business and Management
,
25
,
593
604
. doi: .
Habbash
,
M.
(
2013
).
Earnings management, audit committee effectiveness and the role of blockholders ownership: Evidence from UK large firms
.
International Journal of Business Governance and Ethics
,
8
(
2
),
155
180
. doi: .
Hu
,
Y.
, &
Liu
,
Q.
(
2023
).
Local digital economy and corporate social responsibility
.
Sustainability
,
15
(
11
),
8487
. doi: .
Hu
,
J.
,
Han
,
Y. M.
, &
Zhong
,
Y.
(
2023a
).
How corporate digital transformation affects corporate ESG performance
.
Review of Industrial Economics
,
1
,
105
123
. doi: .
Hu
,
J.
,
Yu
,
X. R.
, &
Han
,
Y. M.
(
2023b
).
Can ESG rating promote green transformation of enterprises?
.
Journal of Quantitative and Technological Economics
,
40
(
7
),
90
111
. doi: .
Hu
,
S.
,
Wang
,
A.
, &
Du
,
K.
(
2023c
).
Environmental tax reform and greenwashing: Evidence from Chinese listed companies
.
Energy Economics
,
124
, 106873. doi: .
Ioannou
,
I.
,
Kassinis
,
G.
, &
Papagiannakis
,
G.
(
2023
).
The impact of perceived greenwashing on customer satisfaction and the contingent role of capability reputation
.
Journal of Business Ethics
,
185
(
2
),
333
347
. doi: .
Jabeur
,
S. B.
,
Mefteh-Wali
,
S.
, &
Viviani
,
J. L.
(
2024
).
Forecasting gold price with the XGBoost algorithm and SHAP interaction values
.
Annals of Operations Research
,
334
(
1
),
679
699
. doi: .
Jedynak
,
M.
(
2024
).
Corporate social responsibility and the supply chain: CSR collaboration with Suppliers
.
New York
:
Routledge
.
Jiang
,
F.
, &
Kim
,
K. A.
(
2015
).
Corporate governance in China: A modern perspective
.
Journal of Corporate Finance
,
32
,
190
216
.
Kim
,
E. H.
, &
Lyon
,
T. P.
(
2015
).
Greenwash vs. brownwash: Exaggeration and undue modesty in corporate sustainability disclosure
.
Organization Science
,
26
(
3
),
705
723
. doi: .
Kim
,
J.
,
Fairclough
,
S.
, &
Dibrell
,
C.
(
2017
).
Attention, action, and greenwash in family-influenced firms? Evidence from polluting industries
.
Organization & Environment
,
30
(
4
),
304
323
. doi: .
Kumar
,
B.
,
Manrai
,
A. K.
, &
Manrai
,
L. A.
(
2017
).
Purchasing behaviour for environmentally sustainable products: A conceptual framework and empirical study
.
Journal of Retailing and Consumer Services
,
34
,
1
9
. doi: .
Lafarre
,
A.
, &
Van der Elst
 
C.
(
2018
).
Blockchain technology for corporate governance and shareholder activism
.
European Corporate Governance Institute-Law Working Paper, (390)
,
1
-
26
. doi: .
La Porta
,
R.
,
Lopez-de-Silanes
,
F.
, &
Shleifer
,
A.
(
1999
).
Corporate ownership around the world
.
The Journal of Finance
,
54
(
2
),
471
517
. doi: .
Li
,
Q.
,
Wang
,
R.
, &
He
,
Z. C.
(
2023a
).
Can the exit threat of institutional investors play green governance? Based on the greenwashing behavior of enterprises
.
Economy and Management
,
37
(
04
),
72
82
.
Available from:
 https://link.cnki.net/urlid/13.1032.F.20230613.1349.004
Li
,
W.
,
Li
,
W.
,
Seppänen
,
V.
, &
Koivumäki
,
T.
(
2023b
).
Effects of greenwashing on financial performance: Moderation through local environmental regulation and media coverage
.
Business Strategy and the Environment
,
32
(
1
),
820
841
. doi: .
Li
,
Y.
,
Cao
,
X.
, &
Wang
,
M.
(
2024a
).
Can the cumulative effect of technological resources promote green technology collaborative innovation in resource-based regions?
.
Journal of Cleaner Production
,
461
, 142589. doi: .
Li
,
Z.
,
Miao
,
S.
, &
Xu
,
L.
(
2024b
).
Digital transformation and environmental, social, and governance greenwashing: Evidence from China
.
Journal of Environmental Management
,
365
, 121460. doi: .
Li
,
Q.
,
He
,
Z.
, &
Li
,
H.
(
2025
).
Uncovering corporate greenwashing: A predictive model based on Chinese heavy-pollution industries
.
Sustainability Accounting, Management and Policy Journal
,
16
(
1
),
137
167
. doi: .
Liao
,
F.
,
Sun
,
Y.
, &
Xu
,
S.
(
2023
).
Financial report comment letters and greenwashing in environmental, social and governance disclosures: Evidence from China
.
Energy Economics
,
127
, 107122. doi: .
Lin
,
S. T.
, &
Niu
,
H. J.
(
2018
).
Green consumption: E nvironmental knowledge, environmental consciousness, social norms, and purchasing behavior
.
Business Strategy and the Environment
,
27
(
8
),
1679
1688
. doi: .
Liu
,
T.
,
Liu
,
M.
,
Hu
,
X.
, &
Xie
,
B.
(
2020
).
The effect of environmental regulations on innovation in heavy-polluting and resource-based enterprises: Quasi-natural experimental evidence from China
.
PLoS One
,
15
(
12
), e0239549. doi: .
Lungu
,
A. E.
,
Georgescu
,
M. R.
, &
Juravle
,
D.
(
2024
).
A bibliometric analysis of digital entrepreneurship
.
Journal of the Knowledge Economy
,
15
(
4
),
1
29
. doi: .
Lyon
,
T. P.
, &
Montgomery
,
A. W.
(
2013
).
Tweetjacked: The impact of social media on corporate greenwash
.
Journal of Business Ethics
,
118
(
4
),
747
757
. doi: .
Markarian
,
G.
, &
Santalo
,
J.
(
2014
).
Product market competition, information and earnings management
.
Journal of Business Finance & Accounting
,
4
(
5-6
),
572
599
. doi: .
Martínez-Peláez
,
R.
,
Ochoa-Brust
,
A.
,
Rivera
,
S.
,
Félix
,
V. G.
,
Ostos
,
R.
,
Brito
,
H.
, &
Mena
,
L. J.
(
2023
).
Role of digital transformation for achieving sustainability: Mediated role of stakeholders, key capabilities, and technology
.
Sustainability
,
15
(
14
), 11221. doi: .
Nadeesha
,
D. H. D.
(
2022
).
The effectiveness of board diversity reforms and corporate practices: The moderating role of a familial culture
.
Prevalence of Family Businesses and Family Ownership
. (Doctoral dissertation,
The University of Liverpool, United Kingdom
).
Nardi
,
L.
(
2022
).
The corporate social responsibility price premium as an enabler of substantive CSR
.
Academy of Management Review
,
2
,
282
308
. doi: .
Niu
,
D.
, &
Wang
,
Z.
(
2024
).
Can ESG ratings promote green total factor productivity? Empirical evidence from Chinese listed companies
.
Heliyon
,
10
(
7
), e29307. doi: .
Parguel
,
B.
,
Benoît-Moreau
,
F.
, &
Larceneux
,
F.
(
2011
).
How sustainability ratings might deter ‘greenwashing’: A closer look at ethical corporate communication
.
Journal of Business Ethics
,
102
(
1
),
15
28
. doi: .
Pizzetti
,
M.
,
Gatti
,
L.
, &
Seele
,
P.
(
2021
).
Firms talk, suppliers walk: Analyzing the locus of greenwashing in the blame game and introducing ‘vicarious greenwashing’
.
Journal of Business Ethics
,
170
(
1
),
21
38
. doi: .
Sun
,
Z. Y.
,
Wang
,
W.
,
Sun
,
M. X.
, &
Zhang
,
Y.
(
2023
).
Impact of media coverage on corporate greenwashing—mediating role of executive characteristics and internal supervision
.
Journal of Beijing Institute of Technology (Social Sciences Edition)
,
25
(
1
),
67
79
. doi: .
Szabo
,
S.
, &
Webster
,
J.
(
2021
).
Perceived greenwashing: The effects of green marketing on environmental and product perceptions
.
Journal of Business Ethics
,
171
(
4
),
719
739
. doi: .
Testa
,
F.
,
Boiral
,
O.
, &
Iraldo
,
F.
(
2018
).
Internalization of environmental practices and institutional complexity: Can stakeholders pressures encourage greenwashing?
.
Journal of Business Ethics
,
147
(
2
),
287
307
. doi: .
Tiffin
,
A. J.
(
2019
).
Machine learning and causality: the impact of financial crises on growth
.
Washington, DC
:
International Monetary Fund
.
Wang
,
Z.
, &
Tang
,
P.
(
2024
).
Substantive digital innovation or symbolic digital innovation: Which type of digital innovation is more conducive to corporate ESG performance?
.
International Review of Economics and Finance
,
93
,
1212
1228
. doi: .
Wang
,
L.
,
Zeng
,
T.
, &
Li
,
C.
(
2022a
).
Behavior decision of top management team and enterprise green technology innovation
.
Journal of Cleaner Production
,
367
, 133120. doi: .
Wang
,
H.
,
Lin
,
W. F.
, &
Xie
,
R.
(
2022b
).
Environmental backgrounds of executives and green investor entry
.
Journal of Quantitative and Technological Economics
,
39
(
12
),
173
194
. doi: .
Wen
,
Y. D.
,
Wang
,
R.
, &
He
,
Z. C.
(
2024
).
Digital finance and enterprise ESG performance: Effects, mechanisms and ‘greenwash’ test
.
Statistics and Decision
,
40
(
1
),
142
147
. doi: .
Xia
,
F.
,
Chen
,
J.
,
Yang
,
X.
,
Li
,
X.
, &
Zhang
,
B.
(
2023
).
Financial constraints and corporate greenwashing Rong strategies in China
.
Corporate Social Responsibility and Environmental Management
,
30
(
4
),
1770
1781
. doi: .
Xie
,
J.
,
Chen
,
L.
,
Liu
,
Y.
, &
Wang
,
S.
(
2023
).
Does fintech inhibit corporate greenwashing behavior?-Evidence from China
.
Finance Research Letters
,
55
, 104002. doi: .
Yue
,
J.
, &
Li
,
Y.
(
2023
).
Media attention and corporate greenwashing behavior: Evidence from China
.
Finance Research Letters
,
55
, 104016. doi: .
Zeng
,
W. Q.
(
2024
).
A review of corporate ‘greenwashing’ research: Behavioral identification, motivational analysis and economic effects
.
Finance and Accounting Monthly
,
45
(
10
),
64
70
. doi: .
Zeng
,
F. L.
,
Wang
,
J. T.
, &
Zeng
,
C. Y.
(
2025
).
An optimized machine learning framework for predicting and interpreting corporate ESG greenwashing behaviour
.
PLoS One
,
20
(
3
), e0316287. doi: .
Zhang
,
D.
(
2022
).
Green financial system regulation shock and greenwashing behaviors: Evidence from Chinese firms
.
Energy Economics
,
111
, 106064. doi: .
Zhang
,
D.
(
2023
).
Subsidy expiration and greenwashing decision: Is there a role of bankruptcy risk?
.
Energy Economics
,
118
, 106530. doi: .
Zhang
,
D.
,
Meng
,
L.
, &
Zhang
,
J.
(
2023a
).
Environmental subsidy disruption, skill premiums and ESG performance
.
International Review of Financial Analysis
,
90
, 102862. doi: .
Zhang
,
J. W.
,
Wang
,
R. Z.
, &
Jiang
,
Y.
(
2023b
).
Digital economy and tunneling behavior of major shareholders: A new perspective of technological governance
.
Business and Management Journal
,
45
(
04
),
64
81
. doi: .
Zhang
,
W.
,
Qin
,
C.
, &
Zhang
,
W.
(
2023c
).
Top management team characteristics, technological innovation and firm's greenwashing: Evidence from China's heavy-polluting industries
.
Technological Forecasting and Social Change
,
191
, 122522. doi: .
Zhang
,
C. J.
,
Yang
,
Y.
, &
Wang
,
W. T.
(
2024
).
A review of corporate greenwash research: Drivers, consequences and governance
.
Finance and Accounting Monthly
,
45
(
04
),
28
33
. doi: .
Zhao
,
T.
, &
Wang
,
H.
(
2024
).
The industry peer effect of enterprise ESG performance: The moderating effect of customer concentration
.
International Review of Economics & Finance
,
92
,
1499
1525
. doi: .
Zhou
,
K.
,
Jin
,
X.
,
Li
,
X.
, &
Tao
,
Y.
(
2024
).
Enhancing sustainable development through effective disclosure: Corporate environmental performance and readability
.
Corporate Social Responsibility and Environmental Management
,
31
(
1
),
274
291
. doi: .
Athey
,
S.
, &
Imbens
,
G. W.
(
2019
).
Machine learning methods that economists should know about
.
Annual Review of Economics
,
11
(
1
),
685
725
. doi: .
Castree
,
N.
,
Hulme
,
M.
, &
Proctor
,
J. D.
(
2018
).
Companion to environmental studies
.
London
:
Routledge
.
Choudhury
,
P.
,
Starr
,
E.
, &
Agarwal
,
R.
(
2020
).
Machine learning and human capital complementarities: Experimental evidence on bias mitigation
.
Strategic Management Journal
,
41
(
8
),
1381
1411
. doi: .
Cicchiello
,
A. F.
,
Cotugno
,
M.
, &
Foroni
,
C.
(
2023
).
Does competition affect ESG controversies? Evidence from the banking industry
.
Finance Research Letters
,
55
, 103972. doi: .
Du
,
X.
,
Jian
,
W.
,
Zeng
,
Q.
, &
Chang
,
Y.
(
2018
).
Do auditors applaud corporate environmental performance? Evidence from China
.
Journal of Business Ethics
,
151
(
4
),
1049
1080
. doi: .
Grimmer
,
J.
,
Roberts
,
M. E.
, &
Stewart
,
B. M.
(
2021
).
Machine learning for social science: An agnostic approach
.
Annual Review of Political Science
,
24
(
24
),
395
419
. doi: .
Hafezi
,
M.
, &
Zolfagharinia
,
H.
(
2018
).
Green product development and environmental performance: Investigating the role of government regulations
.
International Journal of Production Economics
,
204
,
395
410
. doi: .
Hastie
,
T. J.
, &
Pregibon
,
D.
(
2017
). Generalized linear models. In
Statistical models in S
(pp. 
195
247
).
Routledge
.
He
,
F.
,
Guo
,
X.
, &
Yue
,
P.
(
2024
).
Media coverage and corporate ESG performance: Evidence from China
.
International Review of Financial Analysis
,
91
, 103003. doi: .
Hu
,
J.
,
Fang
,
Q.
, &
Long
,
W. B.
(
2023d
).
Carbon emission regulation, corporate emission reduction incentive and total factor productivity: A natural experiment based on China’s carbon emission trading system
.
Economic Research Journal
,
58
(
4
),
77
94
. doi: .
Mullainathan
,
S.
, &
Spiess
,
J.
(
2017
).
Machine learning: An applied econometric approach
.
The Journal of Economic Perspectives
,
31
(
2
),
87
106
. doi: .
Tan
,
J.
, &
Tang
,
Y.
(
2016
).
Donate money, but whose? An empirical study of ultimate control rights, agency problems, and corporate philanthropy in China
.
Journal of Business Ethics
,
134
(
4
),
593
610
. doi: .
Tidhar
,
R.
, &
Eisenhardt
,
K. M.
(
2020
).
Get rich or die trying… finding revenue model fit using machine learning and multiple cases
.
Strategic Management Journal
,
41
(
7
),
1245
1273
. doi: .
Valizade
,
D.
,
Schulz
,
F.
, &
Nicoara
,
C.
(
2024
).
Towards a paradigm shift: How can machine learning extend the boundaries of quantitative management scholarship?
.
British Journal of Management
,
35
(
1
),
99
114
. doi: .
Xiang
,
X.
,
Liu
,
C.
, &
Yang
,
M.
(
2022
).
Who is financing corporate green innovation?
.
International Review of Economics and Finance
,
78
,
321
337
. doi: .
Published in China Accounting and Finance Review. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at Link to the terms of the CC BY 4.0 licence.

or Create an Account

Close Modal
Close Modal