A novel method for calculating privacy leakage probability in text generated of social network users

Yang, Ruixian; Du, Yifan; Bi, Chongwu; Li, Xian

doi:10.1108/AJIM-03-2025-0120

Article navigation

Research Article| August 26 2025

A novel method for calculating privacy leakage probability in text generated of social network users

Ruixian Yang;

Ruixian Yang

School of Information Management, Zhengzhou University

, Zhengzhou,

China

Data Governance Research Center of Henan Province

, Zhengzhou,

China

Zhengzhou Data Science Research Center

, Zhengzhou,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Yifan Du

0009-0006-3333-2660

;

Yifan Du

School of Information Management, Zhengzhou University

, Zhengzhou,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Chongwu Bi

0000-0001-7874-258X

;

Chongwu Bi

School of Information Management, Zhengzhou University

, Zhengzhou,

China

Data Governance Research Center of Henan Province

, Zhengzhou,

China

Zhengzhou Data Science Research Center

, Zhengzhou,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Xian Li

0000-0002-3402-0275

Xian Li

Department of Otolaryngology, Chongqing General Hospital, School of Medicine, Chongqing University

, Chongqing,

China

College of Computer and Information Science, Southwest University

, Chongqing,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Xian Li can be contacted at: lx2009yet@126.com

Publisher: Emerald Publishing

Received: March 04 2025

Revision Received: March 26 2025

Revision Received: July 02 2025

Accepted: July 26 2025

Online ISSN: 2050-3814

Print ISSN: 2050-3806

Funding

Funding Group:

Award Group:
- Funder(s):
  National Social Science Fund of China
- Award Id(s):
  22BTQ072
- Principal Award Recipient(s):
Award Group:
- Funder(s):
  Support Plan for Innovative Philosophical and Social Science Teams in Higher Education Institutions of Henan Province
- Award Id(s):
  2024-CXTD-01
Funding Statement(s):
Funding: This study is supported by National Social Science Foundation of China (Grant Number 22BTQ072) and the Support Plan for Innovative Philosophical and Social Science Teams in Higher Education Institutions of Henan Province (Grant Number 2024-CXTD-01). The sponsor only provided financial support and did not participate in research design, data analysis or thesis writing.

2025

Emerald Publishing Limited

Licensed re-use rights only

Aslib Journal of Information Management 1–23.

https://doi.org/10.1108/AJIM-03-2025-0120

Purpose

The increasing popularity of social media increases the risk of user data privacy leakage. This study introduces a method called BERT-based Privacy Risk Assessment and Probability Calculation (B-PRAPC) to help users identify and quantify the risk of privacy leakage on social networks.

Design/methodology/approach

From the user’s perspective, this study focuses on the generation and storage stage of data life cycle, builds a privacy lexicon and a risk identification model of privacy leakage, and calculates the privacy leakage probability by combining the frequency of risk-related words and the amount of privacy contained in the user’s text.

Findings

Compared to baseline models, B-PRAPC achieves the highest accuracy (0.9264) and F1 score (0.9253) in identifying the risk of users’ text privacy leakage. The results show that personal location, medical, identity and work education information are more prone to disclosure, while users demonstrate a strong awareness of protecting their personal property, network identity and health-related privacy.

Originality/value

These findings highlight the effectiveness of B-PRAPC in enhancing user privacy protection on social media, and provide insights for social media platforms and users on how to protect the privacy of personal data.

2025

Emerald Publishing Limited

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

A novel method for calculating privacy leakage probability in text generated of social network users

New and popular articles

Email Alerts

Cited By

A novel method for calculating privacy leakage probability in text generated of social network users

Sign in

Client Account

ICE Member Sign In

New and popular articles

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable