This study aims to investigate the transformative impact of Generative Artificial Intelligence (Gen-AI), particularly ChatGPT, on education. Through comprehensive bibliometric and content analysis, this study maps publication trends, identifies key research themes, uncovers gaps in the literature and explores future directions for effectively integrating AI into educational systems.
A systematic review of 817 articles published between 2021 and 2024 was conducted to explore the evolving landscape of Gen-AI in education. Using bibliometric and content analysis, this study used coword analysis, thematic mapping, cluster analysis and bibliometric coupling to identify trends, gaps and the structural and conceptual frameworks underlying the integration of Gen-AI in educational settings.
The analysis identified four key thematic clusters: Gen-AI as a driver of educational transformation, its impact on curriculum and pedagogy, ethical and integrity considerations in higher education and its role in enhancing creativity and knowledge. This study proposes a conceptual framework with 10 propositions and 12 research inquiries, emphasizing personalized learning approaches and robust ethical safeguards.
This research offers actionable insights for educators, policymakers and AI developers, providing a roadmap for the responsible integration of Gen-AI into educational strategies. It emphasizes fostering innovation while addressing concerns about ethical, social and academic integrity.
This study offers a comprehensive synthesis of the rapidly expanding research on Gen-AI in education, with a particular emphasis on ChatGPT. Analyzing 817 peer-reviewed articles from 2021 to 2024, this study combines bibliometric, thematic and content analyses to identify four research clusters and track their evolution. Aligned with UNESCO’s AI Ethics Recommendation, it provides a conceptual framework, 10 theoretical propositions and 12 research questions, delivering practical insights and a strategic agenda for researchers, educators and policymakers.
1. Introduction
The recent developments in Generative Artificial Intelligence (Gen-AI) signify a substantial shift with far-reaching educational consequences. As we navigate this technological revolution, it is crucial to map the most recent research, understand the current state of knowledge, identify gaps and pave the way for future research. Activities previously considered beyond automation, especially those involving creative processes, are now being redefined by AI models like ChatGPT (Burger et al., 2023), leading to discussions about their impact on creativity and innovation management (Marr, 2023). The emergence of ChatGPT, known for its ability to create new content independently, has sparked increasing interest across various fields (Larsen and Narayan, 2023), including education (e.g. Baidoo-Anu and Ansah, 2023; Grassini, 2023; Lo, 2023).
However, the unprecedented rise of Gen-AI tools, such as ChatGPT, has transformed the educational landscape; yet, existing research remains fragmented, often lacking a cohesive synthesis of conceptual trends, ethical considerations and practical applications. Many reviews offer narrow analyses that fail to integrate structural and thematic bibliometric insights. This fragmentation hinders educators, policymakers and researchers from fully understanding how Gen-AI reshapes pedagogy, assessment and creativity in education. Addressing this gap, the present study systematically analyzes 817 peer-reviewed articles using bibliometric and content analysis to identify dominant research clusters, map intellectual trends and propose a framework and future research agenda for the responsible integration of AI in education.
A comprehensive literature review is essential to grasp the evolving research landscape of Gen-AI, particularly ChatGPT, in education. Examining existing knowledge can uncover trends, methodologies and new insights in this rapidly changing field. Moreover, systematically reviewing the literature allows us to understand the increased interest among researchers since the launch of ChatGPT and identify areas for future exploration. This paper aims to present the findings of a systematic literature review (SLR) on Gen-AI, explicitly focusing on the influence of ChatGPT in the education sector. In this review, our purpose is to add to the ongoing discussion about how ChatGPT-powered Gen-AI is changing the landscape of the education sector. By looking at existing literature, we hope to provide valuable insights for educators, researchers and policymakers who want to use ChatGPT in the education sector. We aim to answer below four research questions (RQ) in this study:
What are the publication trends in using Generative Artificial Intelligence (Gen-AI) in the education sector (Gen-AI_ES)?
Which significant journals, authors, publications and emerging themes shape the discourse on Gen-AI in the education sector?
What conceptual framework underpins the integration of Gen-AI in the education sector, and how can methodologies such as thematic mapping, coword analysis and publication network evolution be used to explore this?
What gaps exist in the current research, and what directions should future investigations take to explore using Gen-AI in enhancing creativity and innovation in educational practices?
This paper systematically reviews the use of Gen-AI in the education sector, focusing on ChatGPT, examining publication trends, significant themes and its impact on educational practices. ChatGPT is a generative AI model based on the Generative Pretrained Transformer (GPT) architecture (Yenduri et al., 2024). GPT is a deep learning architecture that uses self-attention mechanisms to understand textual data relationships, enabling the model to generate new content based on input. It is designed to create human-like text by predicting the next word sequentially. This allows the generation of new content, such as responses or stories. Unlike other AI models that primarily classify or retrieve information, ChatGPT uses a transformer-based approach to process long-range dependencies in text, producing coherent and contextually relevant outputs (Brown, 2020; Radford et al., 2019). This study addresses the calls for diverse data utilization and theoretical development by Ansari et al. (2023) and Preiksaitis and Rose (2023) by using bibliometric review and exploring literature through comprehensive search strategies. It maps out educational AI’s structural and conceptual frameworks, identifying research gaps and suggesting directions for future studies to enhance creativity and innovation.
In addition, it assesses the effects of AI tools on student engagement and learning outcomes, underscoring the potential of AI to personalize and improve the educational experience. Therefore, through this study, we aim to offer scholars, educators and practitioners a detailed understanding of the impact of ChatGPT and Gen-AI on the education sector. By spotlighting topics of interest and identifying areas requiring further research, we aim to empower stakeholders to make informed decisions as we navigate the ChatGPT era.
2. Literature review
2.1 Artificial intelligence in education: a brief history
The exploration of artificial intelligence (AI) in education can be traced back to seminal works like Pinker’s (1979) study, which combined theoretical linguistics with computational models. Pinker demonstrated how early AI techniques were applied to decipher complex cognitive tasks such as language learning, illustrating the merger of human cognition and machine processing from an early stage. Building on this, Buchanan’s (2005) work provided a historical analysis of AI’s evolution from philosophical and fictional origins, influenced by thinkers like Descartes and Leibniz and science fiction authors such as Verne and Asimov. Buchanan detailed how the post-Second World War boom in computer technology enabled the practical application of AI theories, marking an era where robotics evolved from simple mechanical devices to complex systems capable of sophisticated tasks. Continuing this exploration, Burkhardt and Rieder’s (2024) discussion focuses on deep learning’s impact and how models like OpenAI’s GPT use extensive data sets and computational power for various applications. They emphasize the role of users as codevelopers in refining AI applications and discuss the centralization of AI capabilities among large tech companies, raising issues of bias, opacity and energy demands. This narrative reflects AI’s intellectual and technological evolution, with significant socioeconomic implications. Thus, the historical narrative of AI, from its theoretical origins to its status as a powerful practical tool in education, reflects a continuum of intellectual and technological evolution with profound socioeconomic implications.
Based on training data, Generative AI (Gen-AI) techniques create new data or content, like images, text, music or videos. These models can be used for creative purposes like art or deepfake videos but raise ethical concerns about potential misuse for misinformation or media manipulation (Ali et al., 2021). Gen-AI has existed for several decades, as mentioned by Buchanan (2005), who wrote that early milestones in AI include knowledge representation, inference and problem-solving, with roots in natural language processing. Generative AI started being used in education around the early 2000s. The specific term “Gen-AI” may not have a precise starting point. However, generative AI as a field of AI research has been around since at least the early 2010s and has gained prominence with advancements in deep learning and neural network techniques. Its applications include image generation, text generation, music composition and video synthesis (Kaswan et al., 2023).
2.2 Systematic reviews of generative artificial intelligence in education
While many reviews have explored generative AI and ChatGPT in education, comprehensive analyses of 817 articles (2021–2024) using bibliometric and content methods to examine AI’s transformative potential, identify clusters and frameworks and address ethical, methodological and practical challenges remain scarce, as emphasized by Bond et al. (2024), who stress gaps in ethical considerations, methodological diversity, stakeholder inclusivity, geographical representation, interdisciplinary focus and longitudinal studies on AI’s long-term impacts in education. The literature indicates that AI has the potential to improve learning outcomes, engage students and assist educators, but there is a need for empirical research to address challenges effectively. Discussions suggest a promising future where AI, integrated thoughtfully into educational methods, could significantly enhance personalized and engaging learning experiences. Table 1 summarizes the various approaches to generative AI in each review and the connection between AI and education. Given the existence of over 31 SLRs on generative AI in education at the time of this study, this research prioritized selecting the most cited reviews and those published in the highest impact journals to ensure quality and relevance. This approach focuses on influential works representing key insights and trends in the field while maintaining a manageable scope. The criteria for selection ensured methodological rigor and the inclusion of reviews that have significantly shaped academic discourse.
Systematic literature papers in literature
| Author/s | Aim of review | Target population | Type of review | Search sources | Types of studies reviewed | No. of studies included |
|---|---|---|---|---|---|---|
| Wu and Yu (2024) | Investigate the impact of AI chatbots on students’ learning outcomes | Students in educational settings | Meta-analysis | Academic databases, journals | Effects of AI chatbots on learning outcomes | 24 |
| Chiu (2024) | Explore the impact of generative AI on higher education | Students in undergraduate and postgraduate programs | Systematic literature review | ERIC, ProQuest, Web of Science, SCOPUS | AI-based tools in education | 92 |
| Park and Doo (2024) | Synthesize research on AI applications in blended learning | AI applications in blended learning studies | Systematic literature review | Databases (not specified) | AI applications in blended learning | 30 |
| Ansari et al. (2023) | Explore the global evidence of ChatGPT use in higher education | Studies on ChatGPT in education | Systematic scoping review | ScienceDirect, Google Scholar, PubMed and others | Empirical and nonempirical studies | 69 |
| Wang et al. (2024) | Explore AI technologies in classroom discourse | Studies on AI in educational discourse | Systematic review | SCI-E, SSCI, Webweb of Science, conferences | Classroom discourse with AI technologies | 68 |
| Baber et al. (2023) | Investigate research on ChatGPT and its trends | Articles on ChatGPT | Systematic literature review | Scopus, Google Scholar | Bibliometric analysis and literature review | 328 (34 for in-depth review) |
| Preiksaitis and Rose (2023) | Explore generative AI in medical education | Individuals in medical education | Scoping review | PubMed, Web of Science, Google Scholar | Opinion pieces and original research | 41 |
| Imran and Almusharraf (2023) | Explore ChatGPT as a writing assistant in higher education | Academic and scientific writers | Systematic review | Scopus, ScienceDirect, PubMed, Web of Science | AI tools for writing tasks | 30 |
| Bahroun et al. (2023) | Analyze the integration of Gen-AI into education | Researchers, educators, policymakers | Systematic literature review | Scopus | Various research articles and publications | 217 |
| Fahd et al. (2022) | A systematic analysis of research that applied machine learning in higher education, with a focus on student performance, at-risk student identification and attrition prediction | Researchers, educators, policymakers and stakeholders involved in the field of higher education and artificial intelligence | Systematic review and a meta-analysis | Academic databases, journals, conference proceedings and relevant publications in the field of artificial intelligence in higher education | A combination of primary studies, systematic reviews and possibly meta-analyses related to the applications of AI in higher education | 89 |
| Chen et al. (2020) | To assess the impact of artificial intelligence on education, specifically focusing on its effects on administration, instruction and learning | Educational institutions | Qualitative research design, incorporating qualitative content and thematic analysis | EBSCOhost, ProQuest, Web of Science and Google Scholar | Journal articles, professional publications and government and institutional reports | 30 |
| Vargas-Murillo et al. (2023) | To investigate the impact of ChatGPT in higher education by exploring its applications, risks, challenges and overall effects on teaching and learning outcomes | Anyone interested in the integration of AI like ChatGPT in education | Structured systematic literature review | Scopus, ScienceDirect, ProQuest, IEEE Xplore, and ACM Digital Library | Studies discussing ChatGPT use in education focusing on its applications, risks, challenges, opportunities and overall impact on teaching and learning | 16 (Scopus: 5, ScienceDirect: 4, ProQuest: 4, IEEE Xplore: 2, ACM Digital Library: 1) |
| Lo et al. (2024) | To synthesize existing research on ChatGPT’s impact on student engagement | University and K-12 students | Systematic review | Academic Search Ultimate, Education Research Complete, ERIC, Scopus, Web of Science | Empirical studies, systematic reviews | 72 |
| Song and Wang (2020) | To perform a bibliometric analysis of the global development of educational AI research over the past 20 years, examining trends, cooperation and evolution in the field | Researchers and academics in the field of educational AI | Bibliometric review | Scopus database | Research articles on educational AI | 8,660 |
| Chen et al. (2024) | To systematically explore the current state of research in AIGC’s educational application | Educators, researchers and policymakers | Systematic literature review | EBSCO, Ei Compendex, Scopus, Web of Science | Original research, review articles, editorials, perspectives, letters, notes | 134 |
| Chen et al. (2022) | To analyze the growth and trends of AI in education | Researchers and educators | Bibliometric review | Web of Science, Scopus, ERIC, ICAIED, IJAIED | Research articles, conference papers | 4,519 |
| Chaka, C. (2023) | To review applications, prospects and challenges of AI, robotics and blockchain in HEIs between 2013 and 2019 | Higher education institutions (HEIs) | Systematic review | Google Scholar, Semantic Scholar, ERIC, ScienceDirect, SpringerLink, Scopus, ResearchGate, Academia.edu | Quasi-experimental, overview, exploratory mixed-methods, pilot, multiple descriptive case, implementation, quantitative single-case, prospective comparative studies | 26 |
| Zawacki-Richter et al. (2019) | Provide an overview of research on AI applications in higher education and explore the involvement of educators. Analyze how AI is conceptualized and address the ethical implications and challenges | Educators and researchers in higher education | Systematic review | EBSCO Education Source, Web of Science, Scopus | Empirical and descriptive studies | 146 |
| Sapci and Sapci (2020) | To evaluate the current state of AI training and use of AI tools in medical and health informatics education, enhancing the learning experience and assessing AI education practices | Medical and health informatics students and educators | Systematic review | PubMed, IEEE Xplore, CINAHL Plus, ScienceDirect, ProQuest Central | Peer-reviewed research articles, review papers, conference papers | 26 |
| Park and Doo (2024) | To synthesize research findings on AI applications in blended learning | K-12, higher education, teacher education, lifelong learning | Systematic literature review | Various academic databases (not specified) | Empirical studies focusing on AI in blended learning | 30 |
| Cheung et al. (2024) | To explore the epistemic insights into the relationships between science and AI in education | K-12 students | Systematic review | Web of Science, Scopus, ERIC | Research articles, book chapters and peer-reviewed articles | 15 |
| Ivanova et al. (2024) | To conduct a bibliometric analysis of scientific production in AI in teaching | Researchers and educators | Bibliometric analysis | Scopus, Web of Science | Articles related to AI in teaching | 6,010 (Scopus), 500 (WoS) |
| Author/s | Aim of review | Target population | Type of review | Search sources | Types of studies reviewed | No. of studies included |
|---|---|---|---|---|---|---|
| Investigate the impact of | Students in educational settings | Meta-analysis | Academic databases, journals | Effects of | 24 | |
| Explore the impact of generative | Students in undergraduate and postgraduate programs | Systematic literature review | ERIC, ProQuest, Web of Science, | AI-based tools in education | 92 | |
| Synthesize research on | Systematic literature review | Databases (not specified) | 30 | |||
| Explore the global evidence of ChatGPT use in higher education | Studies on ChatGPT in education | Systematic scoping review | ScienceDirect, Google Scholar, PubMed and others | Empirical and nonempirical studies | 69 | |
| Explore | Studies on | Systematic review | SCI-E, SSCI, Webweb of Science, conferences | Classroom discourse with | 68 | |
| Investigate research on ChatGPT and its trends | Articles on ChatGPT | Systematic literature review | Scopus, Google Scholar | Bibliometric analysis and literature review | 328 (34 for in-depth review) | |
| Explore generative | Individuals in medical education | Scoping review | PubMed, Web of Science, Google Scholar | Opinion pieces and original research | 41 | |
| Explore ChatGPT as a writing assistant in higher education | Academic and scientific writers | Systematic review | Scopus, ScienceDirect, PubMed, Web of Science | 30 | ||
| Analyze the integration of | Researchers, educators, policymakers | Systematic literature review | Scopus | Various research articles and publications | 217 | |
| A systematic analysis of research that applied machine learning in higher education, with a focus on student performance, at-risk student identification and attrition prediction | Researchers, educators, policymakers and stakeholders involved in the field of higher education and artificial intelligence | Systematic review and a meta-analysis | Academic databases, journals, conference proceedings and relevant publications in the field of artificial intelligence in higher education | A combination of primary studies, systematic reviews and possibly meta-analyses related to the applications of | 89 | |
| To assess the impact of artificial intelligence on education, specifically focusing on its effects on administration, instruction and learning | Educational institutions | Qualitative research design, incorporating qualitative content and thematic analysis | EBSCOhost, ProQuest, Web of Science and Google Scholar | Journal articles, professional publications and government and institutional reports | 30 | |
| To investigate the impact of ChatGPT in higher education by exploring its applications, risks, challenges and overall effects on teaching and learning outcomes | Anyone interested in the integration of | Structured systematic literature review | Scopus, ScienceDirect, ProQuest, | Studies discussing ChatGPT use in education focusing on its applications, risks, challenges, opportunities and overall impact on teaching and learning | 16 (Scopus: 5, ScienceDirect: 4, ProQuest: 4, | |
| To synthesize existing research on ChatGPT’s impact on student engagement | University and K-12 students | Systematic review | Academic Search Ultimate, Education Research Complete, ERIC, Scopus, Web of Science | Empirical studies, systematic reviews | 72 | |
| To perform a bibliometric analysis of the global development of educational | Researchers and academics in the field of educational | Bibliometric review | Scopus database | Research articles on educational | 8,660 | |
| To systematically explore the current state of research in AIGC’s educational application | Educators, researchers and policymakers | Systematic literature review | EBSCO, Ei Compendex, Scopus, Web of Science | Original research, review articles, editorials, perspectives, letters, notes | 134 | |
| To analyze the growth and trends of | Researchers and educators | Bibliometric review | Web of Science, Scopus, ERIC, ICAIED, | Research articles, conference papers | 4,519 | |
| To review applications, prospects and challenges of AI, robotics and blockchain in HEIs between 2013 and 2019 | Higher education institutions (HEIs) | Systematic review | Google Scholar, Semantic Scholar, ERIC, ScienceDirect, SpringerLink, Scopus, ResearchGate, Academia.edu | Quasi-experimental, overview, exploratory mixed-methods, pilot, multiple descriptive case, implementation, quantitative single-case, prospective comparative studies | 26 | |
| Provide an overview of research on | Educators and researchers in higher education | Systematic review | Empirical and descriptive studies | 146 | ||
| To evaluate the current state of | Medical and health informatics students and educators | Systematic review | PubMed, | Peer-reviewed research articles, review papers, conference papers | 26 | |
| To synthesize research findings on | K-12, higher education, teacher education, lifelong learning | Systematic literature review | Various academic databases (not specified) | Empirical studies focusing on | 30 | |
| To explore the epistemic insights into the relationships between science and | K-12 students | Systematic review | Web of Science, Scopus, | Research articles, book chapters and peer-reviewed articles | 15 | |
| To conduct a bibliometric analysis of scientific production in | Researchers and educators | Bibliometric analysis | Scopus, Web of Science | Articles related to | 6,010 (Scopus), 500 (WoS) |
Three reviews before ChatGPT correctly foresaw AI’s role in facilitating personalized learning, tailored feedback and predictive analytics, as well as assisting educators by automating tasks and providing insights into student learning patterns. These reviews – by Zawacki-Richter et al. (2019), Sapci and Sapci (2020) and Song and Wang (2020) – also predicted AI’s continued ability to manage large data sets and improve learning outcomes, especially in virtual learning environments used in medical and health education for student profiling and performance predictions.
The reviews adopt various methodological approaches, from meta-analyses and SLRs to bibliometric analyses (Imran and Almusharraf, 2023; Ansari et al., 2023). Research also underlined the positive effects of AI on students’ learning outcomes and engagement. For instance, Wu and Yu’s meta-analysis (2024) strongly suggests that AI chatbots significantly enhance learning outcomes, mainly through short-term interventions. Similarly, Chiu’s systematic review (2024) emphasizes the potential of generative AI tools to improve student motivation, engagement and academic performance, urging further research from students’ perspectives to better understand AI’s implications in education. Also, Park and Doo (2024) focused on AI applications in blended learning, identifying trends in AI applications that predict learners’ status and achievement, indicating a promising role for AI in personalizing learning experiences. Systematic reviews revealed the diverse applications of AI in educational settings, from classroom discourse to writing assistance. For example, Wang et al.’s (2024) review sheds light on AI’s role in analyzing classroom discourse, enhancing both teaching and learning experiences. Alternatively, Baber et al.’s study (2023) provided an early-stage analysis of ChatGPT research, highlighting its use in natural language processing, dialogue systems and response generation.
While AI’s benefits are widely acknowledged, reviews also highlight the need for cautious optimism. For example, Preiksaitis and Rose (2023) discussed the speculative nature of generative AI’s benefits in medical education, stressing the importance of empirical studies to evaluate its real-world impacts. This sentiment echoes across the reviews, emphasizing the need for balanced approaches to leveraging AI technologies in education. Yet, a review by Bahroun et al. (2023) showcased the interdisciplinary nature of generative AI research in education. Their analysis found that education research spanned computer science, engineering and medical fields, but this emphasized the need for collaboration to tackle complex challenges and applications. Their study found generative AI is revolutionizing education by offering personalized learning experiences. Generative AI was also found to adapt educational content to individual needs, transforming education by creating practical, inclusive and engaging learning environments in the digital age.
Recent SLRs have highlighted several essential research gaps that need addressing to advance our understanding of generative AI’s role in education. First, there is a critical need to evaluate the role and impact of generative AI in blended learning and higher education settings. Researchers like Park and Doo (2024) emphasized the importance of identifying emerging trends and gaps to integrate these technologies better. Future reviews are encouraged to address ethical and privacy concerns related to AI applications in educational contexts. These concerns are not only academic but also aligned with international standards. For instance, UNESCO’s Recommendation on the Ethics of UNESCO (2021) emphasizes the importance of transparency, accountability, inclusiveness and the protection of human rights in the development and deployment of AI. This global ethical framework provides a critical lens through which the integration of AI tools in education – such as ChatGPT – should be evaluated, ensuring that innovation does not come at the cost of fairness, privacy or academic integrity. According to Baber et al. (2023) and Imran and Almusharraf (2023), gathering user feedback and ensuring responsible use are essential to addressing these critical ethical challenges.
Furthermore, Ansari et al. (2023) and Preiksaitis and Rose (2023) advocated using diverse data sources and comprehensive search strategies. They suggested developing theoretical frameworks for domains such as classroom discourse analysis and medical education, which could significantly optimize AI integration and enhance educational outcomes. Finally, the long-term effects of AI tools and their sustainability within academic environments, including K-12 settings, require thorough investigation. This would involve assessing the effectiveness of these tools in enhancing teaching, learning and research processes to ensure that their integration delivers sustained benefits. Fahd et al. (2022) provided a systematic analysis of research applying machine learning in higher education, focusing on student performance, at-risk student identification and attrition prediction, drawing from a combination of primary studies, systematic reviews and meta-analyses totaling 89 relevant publications.
In addition, Chen et al. (2020) assessed the impact of AI on education administration, instruction and learning using a qualitative research design and analyzing 30 sources, including journal articles and professional publications. Vargas-Murillo et al. (2023) investigated the impact of ChatGPT in higher education, exploring its applications, risks, challenges and overall effects on teaching and learning outcomes through a structured, SLR, consulting 16 studies across platforms like Scopus, ScienceDirect and others. This would involve assessing the effectiveness of these tools in enhancing teaching, learning and research processes to ensure that their integration delivers sustained benefits.
Finally, in their meta-SLR, Bond et al. (2024) analyzed secondary studies, including systematic reviews, meta-analyses and scoping reviews, to provide insights and recommendations for advancing AI in education. The review highlights the importance of adhering to rigorous frameworks like preferred reporting items for systematic reviews and meta-analyses, diversifying data sources and using advanced synthesis tools such as EPPI Reviewer to enhance SLRs. It emphasizes the need for diverse data collection methods, including qualitative, mixed-methods and design-based approaches, to ensure broad applicability across different populations. The authors advocate for expanding AI applications beyond STEM fields and into areas such as admissions, course scheduling and student support services, while addressing ethical concerns like data privacy and bias mitigation. In addition, they call for interdisciplinary collaboration, innovative methodologies like bibliometric analysis and focused exploration of emerging AI technologies to advance knowledge and practical applications in education. Finally, Baig and Yadegaridehkordi (2024) conducted a systematic review of 57 studies, offering a comprehensive analysis of ChatGPT’s role in higher education. Their research is notable for encompassing diverse stakeholder perspectives, including students, academic staff, researchers and nonacademic personnel, thereby providing a holistic understanding of ChatGPT’s applications, adoption measures and limitations within the educational context.
While the reviews in Table 1 focus on broader AI applications or specific educational tools, the current bibliometric review provides a deeper analysis of Gen-AI and ChatGPT’s integration into educational practices. It offers a detailed overview of the current state and future directions, advocating for frameworks for the responsible use of Gen-AI in education and setting a clear direction for future studies.
3. Methodology
3.1 Research design
This study uses bibliometric analysis to investigate the landscape of AI in education, leveraging systematic methodologies to explore the existing literature comprehensively. Drawing on Donthu et al. (2021) and Linnenluecke et al. (2020) framework for conducting bibliometric analyses, our research integrates co-citation analysis and text mining to clarify AI’s structural, conceptual and theoretical foundations within academia. Paul and Criado (2020) stated that a bibliometric review is systematic. Still, it focuses on the quantitative analysis of publication data rather than the qualitative synthesis of research findings. It uses statistical tools to analyze trends, citations and co-citations in a large body of published research. Still, it focuses more on metrics like authors and affiliations than the theories, methods and constructs typically emphasized in other systematic reviews.
Bibliometric reviews help summarize extensive bibliographic data to reveal the intellectual structure and emerging research topic or field trends. It aids in identifying prolific contributors and shows the field’s bibliometric and philosophical structure, enabling high-impact research (Donthu et al., 2021). Typically, bibliometric analyses use statistical techniques to scrutinize vast quantities of published studies on a particular theme, shedding light on patterns, citations, co-citations and variables like authorship, publication dates or journals. In contrast to other review types, bibliometric reviews emphasize the roles of authors, their affiliations, countries and citation networks rather than focusing primarily on the theories, methodologies and concepts involved. Our approach, depicted in Figure 1, consists of selecting samples, gathering data, conducting analyses and presenting findings. We undertake an SLR that adopts a science mapping strategy using Scopus and Web of Science for analytical purposes. This method facilitates exploring connections among the contributors to research and investigates themes via coword analysis in the chosen articles (Donthu et al., 2021; Linnenluecke et al., 2020).
The image presents a flowchart outlining a research process on the impact of generative artificial intelligence in education. It begins with keywords related to generative A I and its applications in education, including variations involving Chat G P T. The data is sourced from two databases, Scopus and Web of Science, with an initial total search result of 1,364 documents. From these, 951 papers are selected for analysis, followed by the removal of reoccurring papers, leading to a final count of 817. The process includes defined inclusion criteria, such as publication years, document types, language, and research areas. Exclusion criteria specify unrelated articles to the focus of A I and education. The flowchart concludes with four themes identified, including topics such as generative A I in transforming education, A I in learning and teaching, integrity in higher education, and the adoption of Chat G P T to enhance creativity and knowledge. The structure uses a hierarchical format with arrows indicating the flow of the research process.Science map of the systematic literature review
Source(s): Figure by authors
The image presents a flowchart outlining a research process on the impact of generative artificial intelligence in education. It begins with keywords related to generative A I and its applications in education, including variations involving Chat G P T. The data is sourced from two databases, Scopus and Web of Science, with an initial total search result of 1,364 documents. From these, 951 papers are selected for analysis, followed by the removal of reoccurring papers, leading to a final count of 817. The process includes defined inclusion criteria, such as publication years, document types, language, and research areas. Exclusion criteria specify unrelated articles to the focus of A I and education. The flowchart concludes with four themes identified, including topics such as generative A I in transforming education, A I in learning and teaching, integrity in higher education, and the adoption of Chat G P T to enhance creativity and knowledge. The structure uses a hierarchical format with arrows indicating the flow of the research process.Science map of the systematic literature review
Source(s): Figure by authors
This study adopts the approach of Aguinis et al. (2023) to enhance the rigor and utility of its review. It uses a broad spectrum of relevant studies sourced from Scopus and Web of Science databases through a Boolean search string (e.g. “generative ai” AND “educ*”). A transparent screening process was applied, resulting in 817 final documents analyzed using bibliometric tools like VOSviewer and RStudio. These tools facilitated the visualization of the intellectual structure, identification of study relationships and clustering through co-occurrence mapping. The clusters were further examined through content analysis to define themes and research streams, providing a comprehensive overview of the field (Tables 2 and 3 in the manuscript).
Methodological process – steps, databases or applications and further justification
| Steps | Used databases or programs | Justification necessary toward the taxonomy and recommendations for future research |
|---|---|---|
| 1 Data collection | Scopus and WoS: 1,364 core publication articles | Search period through 2024 with search terms (“generative ai”) AND (educ*) OR (generative ai”) AND (teach*) OR (“generative ai”) AND (learn*) OR (“generative ai”) AND (student*); |
| Search terms were interchanged with (ChatGPT) AND (educ*) OR (ChatGPT) AND (teach*) OR (ChatGPT) AND (learn*) OR (ChatGPT) AND (student*) | ||
| 2 Quality checks | Scopus and Web of Science | To guarantee that documents are related to generative AI in Education, all abstracts were read: 951 final documents |
| 3 SMS analysis and cluster identification | VOSviewer and RStudio | Network map of 817 documents based on keywords co-occurrence; four core clusters were automatically identified. The applied techniques enable the illumination of large research literature (Waltman et al., 2010) and have already been used in educational papers (e.g. Kosmützky and Krücken, 2014; Tseng et al., 2013) |
| 4 Further analysis and maps | Scopus and WoS, VOSviewer and RStudio | Co-occurrence of keywords and Scopus and WoS research areas; development of the generative AI in education research literature. These additional maps provide further detail to the taxonomy scheme and a cross-check of the results |
| 5 Cluster interpretation | Content analysis and qualitative interpretations | A content analysis was conducted to name and define the four clusters of generative AI in education. This qualitative interpretation involved reviewing 817 articles, focusing on their titles, authorship and data sources to identify distinct research streams, in line with methodologies proposed by prior studies (e.g. Gaur and Kumar, 2018) |
| 6 Taxonomy scheme | The analyses were grouped into a taxonomy scheme, with each cluster analyzed and discussed to identify future research recommendations |
| Steps | Used databases or programs | Justification necessary toward the taxonomy and recommendations for future research |
|---|---|---|
| 1 Data collection | Scopus and WoS: 1,364 core publication articles | Search period through 2024 with search terms (“generative ai”) |
| Search terms were interchanged with (ChatGPT) | ||
| 2 Quality checks | Scopus and Web of Science | To guarantee that documents are related to generative |
| 3 | VOSviewer and RStudio | Network map of 817 documents based on keywords co-occurrence; four core clusters were automatically identified. The applied techniques enable the illumination of large research literature ( |
| 4 Further analysis and maps | Scopus and WoS, VOSviewer and RStudio | Co-occurrence of keywords and Scopus and WoS research areas; development of the generative |
| 5 Cluster interpretation | Content analysis and qualitative interpretations | A content analysis was conducted to name and define the four clusters of generative |
| 6 Taxonomy scheme | The analyses were grouped into a taxonomy scheme, with each cluster analyzed and discussed to identify future research recommendations |
Publication metrics
| Description | Results |
|---|---|
| Main information about data | |
| Timespan | 2021:2024 |
| Sources (journals, books, etc.) | 337 |
| Documents | 817 |
| Annual growth rate % | 1,884.94 |
| Document average age | 0.38 |
| Average citations per doc | 12.61 |
| References | 7,900 |
| Document contents | |
| Keywords plus (ID) | 1,117 |
| Author’s keywords (DE) | 1,729 |
| Authors | |
| Authors | 2,150 |
| Authors of single-authored docs | 147 |
| Authors collaboration | |
| Single-authored docs | 152 |
| Coauthors per doc | 3.69 |
| International coauthorships % | 24.92 |
| Document types | |
| Article | 817 |
| Description | Results |
|---|---|
| Main information about data | |
| Timespan | 2021:2024 |
| Sources (journals, books, etc.) | 337 |
| Documents | 817 |
| Annual growth rate % | 1,884.94 |
| Document average age | 0.38 |
| Average citations per doc | 12.61 |
| References | 7,900 |
| Document contents | |
| Keywords plus ( | 1,117 |
| Author’s keywords ( | 1,729 |
| Authors | |
| Authors | 2,150 |
| Authors of single-authored docs | 147 |
| Authors collaboration | |
| Single-authored docs | 152 |
| Coauthors per doc | 3.69 |
| International coauthorships % | 24.92 |
| Document types | |
| Article | 817 |
3.2 Sample selection and data collection
This study conducted a comprehensive literature search using Scopus and Web of Science to identify peer-reviewed journal articles on generative AI in education, covering the period from 2021 to 2024. A Boolean search strategy was used, combining terms such as “generative AI” and “ChatGPT” with education-related keywords (e.g. “educ,” “teach,” “learn,” “student”) to ensure comprehensive coverage of relevant studies.
The initial search retrieved 1,364 records. After applying inclusion criteria – focusing on English-language, peer-reviewed journal articles discussing AI applications in educational contexts – and removing irrelevant or duplicate entries, a final data set of 817 articles was obtained. These articles were sourced from journals across multiple disciplines, including social sciences, decision sciences, arts and humanities, business, psychology, economics and multidisciplinary research.
Metadata (e.g. titles, abstracts, authors, keywords, publication years and journal names) were exported in BibTeX format and analyzed using Bibliometrix (R package) and VOSviewer. Manual checks ensured accurate keyword normalization and cluster identification. Two researchers independently conducted content analysis following established guidelines (Gaur and Kumar, 2018; Salipante et al., 1982), focusing on key article features such as theoretical frameworks, research questions, data sources, variables and main findings.
Leveraging content analysis, mainly through unstructured ontological discovery, enabled extracting and identifying concepts, themes and patterns within the textual data of AI in education literature. This approach verified the conceptual structure and thematic progression and complemented the bibliometric review by offering a nuanced interpretation of the content within articles. Unlike statistical and network analysis, content analysis focuses on qualitative aspects of research, including the development of concepts, theoretical debates and methodological approaches. It enhances the understanding of thematic clusters identified by bibliometric techniques like coword analysis, providing a comprehensive perspective (Donthu et al., 2021).
4. Results
In this section, we have conducted a bibliographic compilation using R-software and a biblioshiny package. Subsequently, we identified the trends observed in publications (RQ1) and identified noteworthy journals, articles and authors (RQ2) per the performance analysis. Furthermore, we have characterized the research domain by identifying themes and clusters (RQ3).
4.1 Descriptive bibliometric results
Table 3 presents a synopsis of the data encompassing document contents, authors, author collaborations and document types. Over the years 2021–2024, 817 articles were generated (note: A negative annual growth is because it has just been a quarter of 2024, and the articles for 2023 are for a complete year, which might be higher than those in the first quarter of 2024). The articles demonstrated citation rates, averaging around six citations per article and accumulating over 18,650 citations collectively in a short span. Collaboration among authors was prevalent, with most papers coauthored by three to four individuals, while sole authors authored 105 articles (22.2%). However, most research activities were concentrated within specific countries, with only around 23% of articles originating from international scholars.
Clustering by bibliographic coupling was used to identify and categorize research streams within the Gen-AI_ES domain. This technique groups articles based on shared references if publications citing similar sources are intellectually related. The data set, comprising 817 articles from Scopus and Web of Science, was analyzed using VOSviewer, a bibliometric mapping tool that applies modularity-based clustering algorithms to visualize the relationships between publications. Each publication’s reference list was extracted and processed to compute bibliographic coupling strengths, with the algorithm grouping documents into distinct clusters based on the density of their shared references to ensure clusters were both internally homogeneous and distinct. Following this, the clusters were refined using content analysis, where the titles, abstracts and keywords of the constituent articles were reviewed to define overarching themes. This combined approach ensured the cluster titles accurately represented the intellectual and thematic priorities of the research domain, providing a robust foundation for mapping intellectual structures and uncovering key thematic areas in the literature, as adapted from methodologies such as Rafols et al. (2012) and Waltman et al. (2010).
4.1.1 Scientific production over time (RQ1).
The trend in scientific production over time is a key indicator of the field’s development (Bornmann et al., 2021). Figure 2 depicts the annual growth of publications in the Gen-AI_ES domain. A significant surge in output occurred in 2023 and the first quarter of 2024, coinciding with the introduction of ChatGPT. This growth highlights the increasing engagement of researchers with generative AI technologies in education. Such patterns underscore the rising importance of Gen-AI_ES as a research area.
The image presents a bar graph titled Number of Published Articles, illustrating the number of articles published each year from 2021 to 2024. The horizontal axis lists the years, 2021, 2022, 2023, and 2024, while the vertical axis quantifies the number of articles, ranging from zero to 600. The bars depict the following values, 2 articles for 2021, 1 for 2022, 296 for 2023, and 518 for 2024. The layout flows vertically from the bottom to the top, with data points systematically represented to show growth in publication over the designated years.Annual scientific production
Source(s): Figure by authors
The image presents a bar graph titled Number of Published Articles, illustrating the number of articles published each year from 2021 to 2024. The horizontal axis lists the years, 2021, 2022, 2023, and 2024, while the vertical axis quantifies the number of articles, ranging from zero to 600. The bars depict the following values, 2 articles for 2021, 1 for 2022, 296 for 2023, and 518 for 2024. The layout flows vertically from the bottom to the top, with data points systematically represented to show growth in publication over the designated years.Annual scientific production
Source(s): Figure by authors
4.1.2 Prominent journals in the Gen-AI_ES domain (RO2).
Using bibliometrics is helpful to get insights into journal prominence (Donthu et al., 2021). This section highlights journals that have contributed to the Gen-AI_ES domain. The Scimago Journal Rank (SJR) journal ranking systems were consulted, offering distinct methodologies in journal assessment. SJR uses a citation-based quartile ranking system, with Quartile 1 (Q1) representing the highest impact factor (Mason and Singh, 2022). Expert-based rankings involve rigorous peer review by scientific community members to evaluate journal quality, whereas citation-based rankings reflect community preference based on citation patterns (Walters and Zheng, 2023).
Table 4 outlines the most prolific journals in the Gen-AI_ES domain, detailing the number of articles published and their respective rankings according to SJR. The tabulation features the top ten productive journals, with the Computers and Education: Artificial and Education and Information Technologies journals leading with 38 articles each, trailed by the Medical Education with 31 articles and Computers and Education: Artificial Intelligence with 24 articles. Several other journals in the list have published eight or more articles. Notably, most of these journals predominantly fall within the Q1 or Q2 categories in SJR rankings. Furthermore, most journals in the list are indexed in fields such as AI and education, aligning with the focus of this study’s research domain.
Most influential journals
| Journals | Total publications | Scimago journal ranking |
|---|---|---|
| Education and Information Technologies | 38 | Q1 (SJR 1.25) |
| JMIR Medical Education | 31 | Q1 (0.84) |
| Computers and Education: Artificial Intelligence | 24 | Q1 (SJR 1.7) |
| Education Sciences | 19 | Q2 (SJR 0.61) |
| Interactive Learning Environments | 16 | Q1 (SRR 1.31) |
| Journal of University Teaching and Learning Practice | 15 | Q2 (SJR 0.58) |
| Techtrends | 15 | Q1 (SJR 0.87) |
| Journal of Applied Learning and Teaching | 14 | Q2 (SJR 0.57) |
| IEEE Transactions on Learning Technologies | 13 | Q1 (SJR 1.49) |
| International Journal of Educational Technology in Higher Education | 13 | Q1 (SJR 2.05) |
| Journals | Total publications | Scimago journal ranking |
|---|---|---|
| Education and Information Technologies | 38 | Q1 ( |
| 31 | Q1 (0.84) | |
| Computers and Education: Artificial Intelligence | 24 | Q1 ( |
| Education Sciences | 19 | Q2 ( |
| Interactive Learning Environments | 16 | Q1 ( |
| Journal of University Teaching and Learning Practice | 15 | Q2 ( |
| Techtrends | 15 | Q1 ( |
| Journal of Applied Learning and Teaching | 14 | Q2 ( |
| 13 | Q1 ( | |
| International Journal of Educational Technology in Higher Education | 13 | Q1 ( |
4.1.3 Leading authors in the Gen-AI_ES domain (RO2).
This section outlines the top ten authors distinguished by their productivity within the Gen-AI_ES domain and their publication output (Tables 5 and 6) by author and country. Pack A emerged as the most prolific author, contributing seven publications. Pack A is closely followed by Xu X, Chan Cky and Cowling M, with six, five and five publications, respectively. Notably, all the top 1 authors have contributed at least three papers within the Gen-AI_ES domain. Figure 6 illustrates the most prolific authors based on publication volume, with authors such as Tan S., Wang C. and Rudolph J. leading in the number of published documents. In contrast, Table 6 highlights the most influential authors based on total link strength in the bibliographic coupling network, which reflects scholarly impact rather than productivity. This distinction accounts for the differences in author names across the two outputs, as some authors may have a greater influence through citation networks despite publishing fewer articles.
Top 10 countries in research domain
| Country | Articles | SCP | MCP | Frequency | MCP ratio |
|---|---|---|---|---|---|
| USA | 165 | 158 | 7 | 0.014 | 0.042 |
| China | 107 | 96 | 11 | 0.022 | 0.103 |
| UK | 50 | 42 | 8 | 0.016 | 0.16 |
| Australia | 49 | 42 | 7 | 0.014 | 0.143 |
| Korea | 31 | 27 | 4 | 0.008 | 0.130 |
| India | 27 | 25 | 2 | 0.004 | 0.074 |
| Germany | 26 | 24 | 2 | 0.004 | 0.077 |
| Malaysia | 18 | 13 | 5 | 0.010 | 0.278 |
| Saudi Arabia | 18 | 16 | 2 | 0.004 | 0.111 |
| Canada | 17 | 16 | 1 | 0.002 | 0.059 |
| Country | Articles | Frequency | |||
|---|---|---|---|---|---|
| 165 | 158 | 7 | 0.014 | 0.042 | |
| China | 107 | 96 | 11 | 0.022 | 0.103 |
| 50 | 42 | 8 | 0.016 | 0.16 | |
| Australia | 49 | 42 | 7 | 0.014 | 0.143 |
| Korea | 31 | 27 | 4 | 0.008 | 0.130 |
| India | 27 | 25 | 2 | 0.004 | 0.074 |
| Germany | 26 | 24 | 2 | 0.004 | 0.077 |
| Malaysia | 18 | 13 | 5 | 0.010 | 0.278 |
| Saudi Arabia | 18 | 16 | 2 | 0.004 | 0.111 |
| Canada | 17 | 16 | 1 | 0.002 | 0.059 |
SCP: single country publication; MCP: multiple country publication
Most locally and globally cited documents
| Document | Title | Local citations (LC) | Global citations (GC) | LC/GC ratio (%) |
|---|---|---|---|---|
| Chan (2023) | The AI generation gap: Are Gen Z students more interested in adopting generative AI such as ChatGPT in teaching and learning than their Gen X and millennial generation teachers? | 8 | 2 | 400 |
| de Winter (2023) | Can ChatGPT pass high school exams on English Language Comprehension? | 5 | 5 | 100 |
| Chan (2023) | Students’ voices on generative AI: Perceptions, benefits, and challenges in higher education | 4 | 24 | 16.67 |
| Chan (2023) | A comprehensive AI policy education framework for university teaching and learning | 3 | 32 | 9.38 |
| Pursnani et al. (2023) | Performance of ChatGPT on the US fundamentals of engineering exam: Comprehensive assessment of proficiency and potential implications for professional environmental engineering practice | 2 | 1 | 200 |
| Impact of ChatGPT on learners in an L2 writing practicum: An exploratory investigation | 1 | 46 | 2.17 | |
| Herbold et al. (2023) | A large-scale comparison of human-written versus ChatGPT-generated essays | 1 | 2 | 50 |
| Jeon and Lee (2023) | Large language models in education: a focus on the complementary relationship between human teachers and ChatGPT | 1 | 38 | 2.63 |
| Rudolph (2023a, 2023b) | War of the chatbots: Bard, bing chat, ChatGPT, ernie and beyond. The new AI gold rush and its impact on higher education | 1 | 116 | 0.86 |
| Document | Title | Local citations ( | Global citations ( | LC/GC ratio (%) |
|---|---|---|---|---|
| The | 8 | 2 | 400 | |
| Can ChatGPT pass high school exams on English Language Comprehension? | 5 | 5 | 100 | |
| Students’ voices on generative AI: Perceptions, benefits, and challenges in higher education | 4 | 24 | 16.67 | |
| A comprehensive | 3 | 32 | 9.38 | |
| Performance of ChatGPT on the | 2 | 1 | 200 | |
| Impact of ChatGPT on learners in an L2 writing practicum: An exploratory investigation | 1 | 46 | 2.17 | |
| A large-scale comparison of human-written versus ChatGPT-generated essays | 1 | 2 | 50 | |
| Large language models in education: a focus on the complementary relationship between human teachers and ChatGPT | 1 | 38 | 2.63 | |
| War of the chatbots: Bard, bing chat, ChatGPT, ernie and beyond. The new | 1 | 116 | 0.86 |
4.1.4 Prominent publications in the Gen-AI_ES domain (RO2).
Global citations (GCs), derived from databases such as Scopus and Web of Science, measure a publication’s worldwide impact and visibility. In contrast, Local citations (LCs) represent the number of citations a document receives within the analyzed collection (Apriliyanti and Alon, 2017). Combined, GCs and LCs help contextualize the influence of records within the reviewed data set. This section highlights the most locally cited documents in the Gen-AI_ES domain, emphasizing the significance of citations in advancing knowledge (Bornmann et al., 2021). Table 6 lists the top ten most locally cited documents and their corresponding LC, GC and LC/GC ratios.
The most locally cited documents are Chan (2023), with eight LCs; de Winter (2023), with five, and Chan (2023), with four, as presented in Table 5. While Chan (2023) leads in LC count, Rudolph et al. (2023a) appear more influential in terms of overall network connectivity and thematic relevance despite receiving fewer LCs. This broader influence is reflected in Table 6, which considers bibliographic coupling and link strength. Collectively, four articles authored by Chan exhibit significant local impact, positioning them as key contributions to the Gen-AI_ES discourse.
4.1.5 Prominent topics in the Gen-AI_ES domain (RO2).
Figure 3 illustrates the prevailing trend topics identified through authors’ keywords within the Gen-AI_ES domain. To ensure clarity and coherence, synonymous terms were consolidated to prevent redundancy (e.g. artificial intelligence, ai, generative ai, etc.). The top five recent and most significant topics (with a frequency exceeding 100) encompass ChatGPT (528), artificial intelligence (349), education (137), generative AI (120) and higher education (105). ChatGPT and AI have gained substantial attention in studies from 2023 and 2024. In addition, topics related to education (not exclusively higher education), academic integrity and large language models were identified as key themes in this domain. These terms are recurrently depicted in the subsequent thematic maps and evolution diagram, illustrating their growing significance within the Gen-AI_ES domain.
The image shows a complex network diagram with interconnected keywords that revolve around themes of artificial intelligence and education. The central node is Chat G P T, represented by a large green circle, surrounded by various other nodes connected by lines, illustrating relationships among terms. Keywords include higher education, students, educational technology, and language model chatbots, displayed in varying sizes that suggest importance or relevance. The diagram features multiple bubbles in different colours, indicating categories such as academic topics, educational systems, and specific technologies. Connections between terms suggest how they relate to each other within the context of their discussions. The layout allows for easy navigation through the keywords as they cluster around common themes, showing both dense and isolated networks.Bibliometric mapping of research clusters – network map of keywords co-occurrence
Source(s): Figure by authors
The image shows a complex network diagram with interconnected keywords that revolve around themes of artificial intelligence and education. The central node is Chat G P T, represented by a large green circle, surrounded by various other nodes connected by lines, illustrating relationships among terms. Keywords include higher education, students, educational technology, and language model chatbots, displayed in varying sizes that suggest importance or relevance. The diagram features multiple bubbles in different colours, indicating categories such as academic topics, educational systems, and specific technologies. Connections between terms suggest how they relate to each other within the context of their discussions. The layout allows for easy navigation through the keywords as they cluster around common themes, showing both dense and isolated networks.Bibliometric mapping of research clusters – network map of keywords co-occurrence
Source(s): Figure by authors
4.2 Thematic cluster analysis (RQ3)
Understanding the interrelationships between terms and identifying underlying clusters is essential in bibliometrics analysis. Clustering by coupling, introduced by Kessler (1960), involves grouping related items based on their co-occurrence patterns, revealing underlying structures and themes within data sets. Coupling frequently appearing words helps identify cohesive clusters, providing valuable insights into complex information landscapes (Gao et al., 2020). In our study, we explore the utility of clustering by coupling authors’ keywords to shed light on clusters and facilitate knowledge discovery within scholarly literature (Figure 4).
A horizontal bar chart titled Prevailing Topics shows the number of articles associated with specific keywords. The keywords are listed vertically on the left: Chat G P T, artificial intelligence, education, generative A I, higher education, students, A I, learning, chatbots, and generative artificial. The bars extend to the right, with values labelled inside each bar: Chat G P T with 528 articles, artificial intelligence with 349, education with 137, generative A I with 120, higher education with 105, students with 91, A I with 76, learning with 65, chatbots with 55, and generative artificial with 49.Most frequent keywords used in the literature
Source(s): Figure by authors
A horizontal bar chart titled Prevailing Topics shows the number of articles associated with specific keywords. The keywords are listed vertically on the left: Chat G P T, artificial intelligence, education, generative A I, higher education, students, A I, learning, chatbots, and generative artificial. The bars extend to the right, with values labelled inside each bar: Chat G P T with 528 articles, artificial intelligence with 349, education with 137, generative A I with 120, higher education with 105, students with 91, A I with 76, learning with 65, chatbots with 55, and generative artificial with 49.Most frequent keywords used in the literature
Source(s): Figure by authors
We identified four interconnected research streams: (Cluster 1 red) – Generative AI in transforming education (Generative AI in Advancing Education), (Cluster 2 green) – AI in learning and teaching: transforming curriculum (OpenAI and its impact on pedagogy), (Cluster 3 blue) – Integrity and AI in higher education (AI in Higher Education: Integrity and Assessment), (Cluster 4 yellow) – Adopting ChatGPT: enhancing creativity and knowledge. Cluster 1 represents “The transformative impact of AI on education, highlighting its role in enhancing teaching methodologies, reshaping learning experiences and offering insights into cutting-edge technology integration.” The first cluster points to AI’s educational potential by introducing innovative teaching and assessment methods and reshaping educational settings and outcomes. Cluster 2 represents “The influence of AI technology on language processing, focusing on large language models, and its implications in the field of AI.” The second cluster stresses AI’s potential to improve information literacy, transform learning experiences and enhance educational outcomes, highlighting its potential for future teachers and students. Cluster 3 represents “The ethical use of ChatGPT in education assessment, focusing on its impact on academic integrity and assessment practices in the age of advanced AI.” This cluster also features the role of AI in enhancing creativity in education, promoting innovative teaching methods, personalizing learning experiences and reshaping educational practices to foster innovation. Furthermore, cluster 4 explores the integration of ChatGPT in education, focusing on creativity, knowledge adoption and technology acceptance models (TAM). It focuses on how AI technologies enhance learning experiences, provide personalized content and support knowledge adoption through instant access to information and customized learning paths.
4.2.1 Cluster 1: Generative AI in transforming education (Red).
The first cluster comprises keywords such as “artificial intelligence,” “chatgpt,” “higher education,” “academic integrity,” “large language models,” “plagiarism,” “academia” and “academic advising.” Cluster 1 highlights how AI tools, such as ChatGPT, are reshaping education by personalizing learning, enhancing teaching efficiency and transforming instructional practices.
While ChatGPT introduces powerful new tools for learning and interaction, its use raises concerns about academic integrity. Its ability to generate text and simulate human responses has prompted institutions to reevaluate existing policies. For example, Alkhaaldi et al. (2023) assessed medical students’ perceptions of ChatGPT, noting its potential to enhance education efficiency. Another instance involves Šedlbauer et al. (2024), who examine the interactions of undergraduate students with ChatGPT, highlighting its advantages for learning and critical thinking support.
Papers on this theme primarily focus on how AI technologies like ChatGPT influence creative processes, idea generation and the enhancement of creative skills among students and educators. For instance, Alshahrani (2023) suggested modifying learning objectives to enhance students’ creativity and critical thinking skills. In another example, Hallal et al. (2023) examined the potential of AI chatbots in organic chemistry education, suggesting the development of innovative teaching and learning methods by improving chatbots’ performance to foster engagement.
Most studies in Cluster 1 do not apply formal theoretical frameworks. Instead, they focus on applied concepts from AI, machine learning and educational technology to explore ChatGPT’s influence on teaching and learning. Karakose et al. (2023) discussed ChatGPT in education through the lens of Kranzberg’s (1986) view on technological neutrality, Henriksen’s (2016) ideas on transformative teaching and Luo et al.’s (2024) concept of intelligence augmentation (IA). Gilson et al. (2023) emphasized ChatGPT’s role as an interactive tool in medical education, underscoring the importance of AI literacy. They highlighted its ability to support assessment and deliver personalized, contextually relevant feedback. Similarly, Dwivedi et al. (2023) examined ChatGPT’s potential to enhance learning and teaching efficiency while also raising concerns about plagiarism and distinguishing between AI-generated and human-authored content. They also proposed future research into AI-related skills, ethical risks, human–AI collaboration and algorithmic bias. The key studies from Cluster 1 are summarized in Table 7. This cluster highlights how Gen-AI, particularly ChatGPT, is transforming pedagogy, personalized learning and academic processes. It helps resolve the fragmented understanding in the field by offering a clear synthesis of Gen-AI’s educational impact, supporting educators with insights into its innovative potential.
Summary of influential articles (from Cluster 1)
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research method | AI’s impact on creativity and innovation |
|---|---|---|---|---|---|---|
| Rejeb et al. (2024) | Explore ChatGPT’s impact on education | Insights into ChatGPT’s integration into education | Examine ChatGPT’s long-term effects | NLP techniques, ML algorithms | Web mining, NLP, ML analysis | Enhances writing skills, fosters critical thinking |
| Šedlbauer et al. (2024) | Analyze undergraduate students’ interaction with ChatGPT | Students developed skills in formulating queries | Explore AI’s personalization in education | General inductive approach | Qualitative analysis of essays | Provides learning benefits, supports critical thinking |
| Bringula (2024) | Explore ChatGPT’s use in programming courses | ChatGPT assists in class material creation and learning | Investigate ChatGPT’s capabilities and limitations further | Not mentioned | Qualitative approach | Enhances teaching and learning efficiency |
| Alshahrani (2023) | Investigate ChatGPT in blended learning | ChatGPT can personalize learning experiences | Explore ChatGPT’s functionality in specific subjects | Educational technology, HCI | Qualitative case study | Modifies learning objectives, enhances creativity |
| Zekaj (2023) | Investigate AI in higher education | AI can enhance instructional support | Explore new educational contexts for AI integration | Not mentioned | Systematic literature review | Enhances creative processes, fosters innovation |
| Iwasawa et al. (2024) | Assess pharmacy students’ knowledge of AI | Students with AI knowledge understand ChatGPT’s use | Conduct objective knowledge tests on AI | Not mentioned | Survey with statistical analysis | Focuses on responsible AI usage education |
| Watrianthos et al. (2023) | Overview of ChatGPT-education research | Identification of critical trends and topics in ChatGPT research | Longitudinal impact studies explore K-12 education | Not mentioned | Bibliometric analysis | Improves personalized learning, fosters creativity |
| Hung and Chen (2023) | Investigate ChatGPT’s use in Chinese academia | Polarized opinions on ChatGPT’s use in academia | A systematic review of AI’s benefits and risks | Not mentioned | Content analysis | Enhances academic outputs, supports creative tasks |
| Javaid et al. (2023) | Explore ChatGPT’s potential in education | ChatGPT can transform teaching and learning | Further, explore ChatGPT’s impact on learning outcomes | AI, NLP, educational technology | Literature review, case studies | Foster’s creativity supports research and writing |
| Alkhaaldi et al. (2023) | Assess medical students’ perceptions of ChatGPT | Positive perceptions but unclear role in training | Structured curricula for AI in medicine | Not mentioned | Cross-sectional web-based survey | Enhances education efficiency, supports creative tasks |
| Karakose et al. (2023) | Explore ChatGPT’s facilitation in teaching | ChatGPT offers personalized learning support | Create digital-friendly environments in schools | Kranzberg’s technology neutrality, IA concept | Argumentative writing based on literature review | Enhances creativity, supports innovative teaching |
| Hallal et al. (2023) | Assess ChatGPT and Bard in organic chemistry | ChatGPT showed better accuracy than Bard | Enhance chatbots’ performance in chemistry | AI, NLP, educational technology | Detailed inspection study of responses | Provides personalized learning, fosters engagement |
| Dwivedi et al. (2023) | To provide a multidisciplinary perspective on the opportunities, challenges and implications of generative conversational AI for research, practice and policy | Positive impacts such as enhanced productivity in various industries and challenges including ethical, legal issues, privacy concerns, misinformation and biases. Opinions are split on whether its use should be restricted or legislated | – Identifying skills, resources, and capabilities needed to handle generative AI. Examining biases of generative AI attributable to training datasets and processes. Exploring business and societal contexts best suited for generative AI implementation. Determining optimal human-AI task combinations | Utilitarianism, diffusion of innovations theory and theorization framework | Expert opinion-based synthesis | AI has the potential to significantly enhance productivity in various industries by automating routine tasks and aiding in complex problem-solving. However, it also poses risks of misinformation, biases and ethical dilemmas that need to be carefully managed |
| Gilson et al. (2023) | To evaluate the performance of ChatGPT on medical education and knowledge assessment | ChatGPT outperforming InstructGPT and GPT-3. It showed logical reasoning in responses and used internal information effectively | Further studies are needed to evaluate ChatGPT for simulating small group education and other use cases in medical education | No specific theories mentioned | Two sets of multiple-choice questions were used: AMBOSS question bank and NBME free 120 questions | ChatGPT can provide personalized feedback, simulate small group learning and support problem-solving, indicating potential for enhancing creativity and innovation in educational practices |
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research method | AI’s impact on creativity and innovation |
|---|---|---|---|---|---|---|
| Explore ChatGPT’s impact on education | Insights into ChatGPT’s integration into education | Examine ChatGPT’s long-term effects | Web mining, NLP, | Enhances writing skills, fosters critical thinking | ||
| Analyze undergraduate students’ interaction with ChatGPT | Students developed skills in formulating queries | Explore AI’s personalization in education | General inductive approach | Qualitative analysis of essays | Provides learning benefits, supports critical thinking | |
| Explore ChatGPT’s use in programming courses | ChatGPT assists in class material creation and learning | Investigate ChatGPT’s capabilities and limitations further | Not mentioned | Qualitative approach | Enhances teaching and learning efficiency | |
| Investigate ChatGPT in blended learning | ChatGPT can personalize learning experiences | Explore ChatGPT’s functionality in specific subjects | Educational technology, | Qualitative case study | Modifies learning objectives, enhances creativity | |
| Investigate | Explore new educational contexts for | Not mentioned | Systematic literature review | Enhances creative processes, fosters innovation | ||
| Assess pharmacy students’ knowledge of | Students with | Conduct objective knowledge tests on | Not mentioned | Survey with statistical analysis | Focuses on responsible | |
| Overview of ChatGPT-education research | Identification of critical trends and topics in ChatGPT research | Longitudinal impact studies explore K-12 education | Not mentioned | Bibliometric analysis | Improves personalized learning, fosters creativity | |
| Investigate ChatGPT’s use in Chinese academia | Polarized opinions on ChatGPT’s use in academia | A systematic review of AI’s benefits and risks | Not mentioned | Content analysis | Enhances academic outputs, supports creative tasks | |
| Explore ChatGPT’s potential in education | ChatGPT can transform teaching and learning | Further, explore ChatGPT’s impact on learning outcomes | AI, NLP, educational technology | Literature review, case studies | Foster’s creativity supports research and writing | |
| Assess medical students’ perceptions of ChatGPT | Positive perceptions but unclear role in training | Structured curricula for | Not mentioned | Cross-sectional web-based survey | Enhances education efficiency, supports creative tasks | |
| Explore ChatGPT’s facilitation in teaching | ChatGPT offers personalized learning support | Create digital-friendly environments in schools | Kranzberg’s technology neutrality, | Argumentative writing based on literature review | Enhances creativity, supports innovative teaching | |
| Assess ChatGPT and Bard in organic chemistry | ChatGPT showed better accuracy than Bard | Enhance chatbots’ performance in chemistry | AI, NLP, educational technology | Detailed inspection study of responses | Provides personalized learning, fosters engagement | |
| To provide a multidisciplinary perspective on the opportunities, challenges and implications of generative conversational | Positive impacts such as enhanced productivity in various industries and challenges including ethical, legal issues, privacy concerns, misinformation and biases. Opinions are split on whether its use should be restricted or legislated | – Identifying skills, resources, and capabilities needed to handle generative | Utilitarianism, diffusion of innovations theory and theorization framework | Expert opinion-based synthesis | ||
| To evaluate the performance of ChatGPT on medical education and knowledge assessment | ChatGPT outperforming InstructGPT and GPT-3. It showed logical reasoning in responses and used internal information effectively | Further studies are needed to evaluate ChatGPT for simulating small group education and other use cases in medical education | No specific theories mentioned | Two sets of multiple-choice questions were used: | ChatGPT can provide personalized feedback, simulate small group learning and support problem-solving, indicating potential for enhancing creativity and innovation in educational practices |
4.2.2 Cluster 2: AI in learning and teaching: transforming curriculum (Green).
The second cluster includes keywords such as “AI,” “chatbots,” “artificial intelligence,” “generative AI,” “ChatGPT,” “academic libraries,” “GPT-3” and “academic assignments.” OpenAI technologies are reshaping education by influencing pedagogical strategies and altering student learning perceptions, prompting renewed interest in models like the TAM. Studies in this cluster focus on AI policy development, the integration of tools like ChatGPT to enhance information literacy and understanding the affordances, challenges and student perceptions of AI. For instance, Chan (2023) proposed AI policy frameworks for higher education, while Johnson et al. (2024a, 2024b) examined the role of ChatGPT in library instruction to strengthen student information literacy. Gao et al. (2024) emphasized ChatGPT’s potential to transform learning experiences and Crompton and Burke (2024) discussed its educational benefits and limitations.
The second theme examines how AI enhances creativity in education by improving teaching and learning, facilitating the generation of creative content, enabling human–AI co-creation and addressing challenges such as misinformation. Amaro et al. (2023) examined the use of ChatGPT in creative tasks, such as summarization and programming, while raising concerns about the dissemination of fake information. This cluster also highlights AI’s transformative role in reimagining educational practices, including its impact on mathematics instruction, the use of custom GPTs for evaluating design tasks and shifts in teaching and assessment strategies. For example, Bower et al. (2024) analyzed how AI is reshaping assessment and instructional methods to foster innovation across learning environments.
Most Cluster 2 studies do not explicitly apply theoretical frameworks to examine the educational impact of ChatGPT, although notable exceptions exist. Johnson et al. (2024a, 2024b) used Universal Design for Learning (UDL) and constructivist theories to design an activity for librarians based on the ACRL Framework, promoting adaptable and hands-on learning for information literacy. UDL supports inclusive learning environments, while constructivism emphasizes interactive learning. Similarly, Bower et al. (2024) adopted the Unified Theory of Acceptance and Use of Technology (UTAUT2) model (Venkatesh et al., 2012) to investigate educators’ motivations for adapting their teaching and assessment practices. The relevant studies are summarized in Table 8. This cluster examines ChatGPT’s role in shaping curriculum and assessment, highlighting its influence on evolving teaching practices. It addresses the need for practical guidance by illustrating how Gen-AI supports educators in adapting instruction to AI-enhanced learning environments.
Summary of influential articles (from Cluster 2)
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact of AI on creativity and innovation |
|---|---|---|---|---|---|---|
| Chan (2023) | Investigate perceptions of ChatGPT in higher education | Insights on planning AI policies for teaching and learning | Informed policy and institutional support | Survey and thematic analysis based on UNESCO’s recommendations | Survey design with quantitative and qualitative data | Enhances creativity and innovation by automating tasks and facilitating content creation |
| Lozano and Blanco Fontao (2023) | Evaluate perceptions of ChatGPT use in education | Potential for enhancing the teaching-learning process | Technological and literacy needs in AI use | Nonexperimental descriptive cross-sectional design | Questionnaire and Delphi method | Fosters creativity and offers new opportunities for personalized learning |
| Pack and Maloney (2023) | To explore how generative AI tools like ChatGPT can assist language education researchers | Demonstrates various uses of ChatGPT in research processes, from compiling information to assisting in research design and data analysis. Discusses ethical considerations | Further work on best practices and ethical guidelines for using AI in language education research | Complex dynamic systems theory in language education and motivation | Case demonstrations of ChatGPT usage | AI tools like ChatGPT can streamline research processes, though ethical considerations are crucial |
| Walczak and Cellary (2023) | To examine the challenges and potential of AI in higher education | Highlights AI’s potential in transforming higher education through enhanced learning experiences and personalized education. Discusses challenges like authorship, originality and detecting AI-generated content | Investigate further into the ethical implications and practical applications of AI in higher education | Not explicitly stated | Survey and literature review | AI can enhance creativity in education but raises issues of originality and authorship that need addressing |
| Kohnke (2024) | To understand AI’s role in enhancing educational practices and outcomes | Identifies AI tools’ effectiveness in providing personalized feedback and supporting educators in curriculum development. Discusses the integration of AI in teaching and its impact on student engagement and learning outcomes | Explore AI’s long-term impact on student learning and teacher practices | Technology acceptance model (TAM) | Mixed-methods including surveys and case studies | AI supports creative teaching methods and personalized learning experiences, boosting educational outcomes |
| Wang et al. (2023) | To assess ChatGPT’s feedback accuracy in argumentation teaching | Finds ChatGPT’s feedback accuracy affected by argument length and number of discourse markers. Suggests combining AI feedback with human oversight for more comprehensive feedback | Develop automated feedback tools tailored for specific educational contexts and needs | Discourse theory, feedback theory and argumentation theory | Experimental analysis of ChatGPT feedback | AI can provide meaningful feedback but requires human oversight to ensure accuracy and relevance, enhancing the feedback process |
| Rodríguez (2023) | To evaluate the effectiveness of AI-driven educational technologies in academic settings | Discusses AI’s role in improving academic integrity through plagiarism detection and personalized learning aids. Highlights the need for ethical guidelines and best practices in AI application | Investigate AI’s broader impact on academic integrity and educational fairness | Active learning approaches, project-based learning, cooperative learning and problem-based learning | Literature review and analysis | AI-driven tools enhance educational integrity and support innovation but require ethical frameworks to manage potential misuse |
| Yildiz (2023) | To explore AI’s role in fostering creativity and innovation in educational environments | Highlights AI’s potential in facilitating creative problem-solving and innovative teaching practices. Discusses the balance between AI assistance and human creativity | Further research on AI’s role in diverse educational settings and its impact on different subjects | Nation’s multidimensional aspects of vocabulary knowledge | Case studies and surveys | AI fosters creativity by providing diverse perspectives and innovative solutions, enhancing both teaching and learning experiences |
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact of |
|---|---|---|---|---|---|---|
| Investigate perceptions of ChatGPT in higher education | Insights on planning | Informed policy and institutional support | Survey and thematic analysis based on UNESCO’s recommendations | Survey design with quantitative and qualitative data | Enhances creativity and innovation by automating tasks and facilitating content creation | |
| Evaluate perceptions of ChatGPT use in education | Potential for enhancing the teaching-learning process | Technological and literacy needs in | Nonexperimental descriptive cross-sectional design | Questionnaire and Delphi method | Fosters creativity and offers new opportunities for personalized learning | |
| To explore how generative | Demonstrates various uses of ChatGPT in research processes, from compiling information to assisting in research design and data analysis. Discusses ethical considerations | Further work on best practices and ethical guidelines for using | Complex dynamic systems theory in language education and motivation | Case demonstrations of ChatGPT usage | ||
| To examine the challenges and potential of | Highlights AI’s potential in transforming higher education through enhanced learning experiences and personalized education. Discusses challenges like authorship, originality and detecting AI-generated content | Investigate further into the ethical implications and practical applications of | Not explicitly stated | Survey and literature review | ||
| To understand AI’s role in enhancing educational practices and outcomes | Identifies | Explore AI’s long-term impact on student learning and teacher practices | Technology acceptance model ( | Mixed-methods including surveys and case studies | ||
| To assess ChatGPT’s feedback accuracy in argumentation teaching | Finds ChatGPT’s feedback accuracy affected by argument length and number of discourse markers. Suggests combining | Develop automated feedback tools tailored for specific educational contexts and needs | Discourse theory, feedback theory and argumentation theory | Experimental analysis of ChatGPT feedback | ||
| To evaluate the effectiveness of AI-driven educational technologies in academic settings | Discusses AI’s role in improving academic integrity through plagiarism detection and personalized learning aids. Highlights the need for ethical guidelines and best practices in | Investigate AI’s broader impact on academic integrity and educational fairness | Active learning approaches, project-based learning, cooperative learning and problem-based learning | Literature review and analysis | AI-driven tools enhance educational integrity and support innovation but require ethical frameworks to manage potential misuse | |
| To explore AI’s role in fostering creativity and innovation in educational environments | Highlights AI’s potential in facilitating creative problem-solving and innovative teaching practices. Discusses the balance between | Further research on AI’s role in diverse educational settings and its impact on different subjects | Nation’s multidimensional aspects of vocabulary knowledge | Case studies and surveys |
4.2.3 Cluster 3: Integrity and AI in higher education (Blue).
Cluster 3 addresses the ethical challenges arising from the use of generative AI in higher education, including issues related to plagiarism, cheating, authorship and fairness in assessment. Several studies have examined the implications of students using tools like ChatGPT to generate assignments or write exam responses without proper acknowledgment, raising complex questions about originality and intellectual ownership (Baidoo-Anu and Ansah, 2023; McGee, 2023).
Rather than outright bans, some researchers advocate for policy reform, emphasizing the importance of clear guidelines and AI literacy. For instance, Nguyen (2025) suggested repositioning generative AI as a pedagogical tool under ethical supervision. Similarly, Osman et al. (2024) proposed that educators develop transparent assessment criteria that clarify acceptable AI use.
This cluster directly addresses the research problem of the lack of ethical clarity in AI integration. It aligns with global frameworks, such as UNESCO’s Recommendation on the Ethics of Artificial Intelligence (UNESCO, 2021). These findings underscore the importance of institutions establishing comprehensive governance strategies and fostering ethical awareness among students and faculty (Table 9).
Summary of influential articles (from Cluster 3)
| Primary author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact on creativity, innovation, and education |
|---|---|---|---|---|---|---|
| Kavadella et al. (2024) | Evaluate the implementation of ChatGPT in undergraduate dental education | ChatGPT enhanced the learning experience with positive feedback from students | Explore educational affordances of ChatGPT; address plagiarism and academic integrity | Educational theories on technology integration | Mixed-methods: quantitative and qualitative analysis | AI, like ChatGPT, enhances creative thinking and offers innovative educational solutions |
| Johnston et al. (2024) | Investigate student perspectives on Gen-AI technologies in higher education | Students are aware but cautious of Gen-AI for academic integrity support for grammar help | Research on Gen-AI’s impact on teaching methods and academic integrity policies | Implicit references to academic integrity and technology-enhanced learning theories | Surveys and focus groups | Gen-AI technologies enhance creativity and support tasks that foster innovation |
| Fuchs and Aguilos (2023) | Investigate student perceptions of ChatGPT’s usefulness in studies | Potential to enhance learning, but ethical concerns exist | Quantitatively test hypotheses on ChatGPT’s adoption | Technology acceptance model (TAM) | Exploratory thematic analysis of interviews | Enhances students’ creative processes and supports personalized learning |
| Lancaster (2023) | Investigate misuse of text generation tools in academia | Ease of cheating; challenges in detecting generated content | Explore detection techniques and secure assessment methods | Concepts of educational integrity | Experiments on academic misuse of AI tools | Calls for responsible AI use to foster creativity while ensuring integrity |
| Akiba and Fraboni (2023) | Evaluate ChatGPT’s advice on academic advising | ChatGPT provided high-quality, accurate advice | Rigorous assessment of ChatGPT’s answers; evaluate other AI models | Narrative analysis approach | Narrative analysis of ChatGPT advice | Highlights AI’s potential in reshaping academic advising practices |
| Bin-Nashwan et al. (2023) | Investigate factors influencing ChatGPT use in academia | Academic integrity is crucial in moderating ChatGPT use | Explore ASNSs, expand theoretical frameworks, and investigate additional factors | Social cognitive theory (SCT) | Quantitative web-based survey | AI enhances productivity and fosters innovation in academia |
| Geerling et al. (2023) | Evaluate ChatGPT’s performance on economics tests | ChatGPT excelled, suggesting rethinking assessment strategies | Incorporate AI technologies in assessments of innovative approaches | Not explicitly mentioned | Testing ChatGPT on economics discipline questions | Advocates for innovative and technology-driven assessment in education |
| Kelly and Sullivan (2023) | Analyze media coverage of ChatGPT in higher education | Highlighted opportunities and risks; academic integrity concerns | Expand search methodology; investigate student and staff perspectives | Literature review and analysis of media coverage | Systematic search and content analysis | Suggests ChatGPT can support innovation but raises ethical concerns |
| Črček and Patekar (2023) | Investigate ChatGPT’s use among university students | Used for idea generation, paraphrasing and writing assistance | Guidance on ethical use; impact on cognitive processes | Not explicitly mentioned | Online questionnaire and nonprobability sampling | AI tools like ChatGPT offer opportunities to enhance education and foster innovation |
| Vargas-Murillo et al. (2023) | Explore ChatGPT’s applications and challenges in education | Various uses and impacts; benefits and ethical considerations | Ethical considerations; specific impacts on educational domains | Systematic literature review approach | Systematic literature review (SLR) | AI models revolutionize education by providing tools for creativity and knowledge integration |
| Gamage et al. (2023) | Explore ChatGPT in higher education assessments | Limitations in semantic and factual questions; importance of human input | Autonomy in answering questions; AI’s impact on academic dishonesty | References to various theories on AI in education | Literature review on ChatGPT’s implications | Enhances learning, encourages innovation and delivers personalized feedback |
| Steele (2023) | Examine ChatGPT’s challenges and opportunities in education | ChatGPT helps generate content but presents issues with plagiarism and distinguishing AI-generated content. It emphasizes critical thinking | Explore AI tools’ integration into curricula to promote critical thinking and academic integrity | Educational technology, information literacy, critical thinking, academic integrity | Qualitative analysis of benefits/challenges of ChatGPT in education | AI, like ChatGPT, fosters critical thinking and creativity, reshapes education by making tasks more accessible and potentially reduces social disparities |
| Netto (2023) | Explore ChatGPT’s impact on social work assessment writing | ChatGPT poses a low cheating risk in knowledge applications but lacks depth in responses | Further, investigates AI’s impact on learning and critical thinking across disciplines | Educational theories, Ethical Principle Screen (EPS) | Testing AI-generated responses in social work education | This raises questions about technology’s role in creativity and emphasizes the need for AI to complement human skills in fostering innovation and problem-solving |
| Crawford et al. (2023) | Emphasize ethical leadership in ChatGPT’s educational use | ChatGPT should assist with tasks, not author academic papers. Leadership and ethical considerations are crucial | Research ChatGPT’s potential for essay plans, grammar checking and content suggestions | Theories related to AI ethics, educational technology and leadership might be involved, although not explicitly mentioned | Literature review and possibly qualitative or quantitative methods on ChatGPT implications | Suggests AI, with ethical use and leadership, can enhance creativity, innovation and learning outcomes, acknowledging misuse concerns |
| Ansari et al. (2023) | Analyze global evidence on ChatGPT’s use in higher education | ChatGPT aids in academic tasks, teaching and learning, with most studies from high-income countries. Concerns over accuracy and academic integrity | Suggests experimental studies, RCTs on ChatGPT’s impacts and ethnographic studies on its benefits in knowledge acquisition | Systematic scoping review methodology was used, with descriptive and thematic analysis to identify trends and themes | Highlights ChatGPT’s role in enhancing educational processes, needing further research to maximize its potential for creativity and innovation. Ethical usage and reliability concerns must be addressed to ensure effective integration without compromising creativity | |
| Jürgen Rudolph, Samson Tan, Shannon Tan | To explore the ChatGPT’s impact on traditional assessments and academic integrity | ChatGPT can generate human-like text, raising significant concerns about plagiarism and the validity of traditional assessment methods | Further research is needed to develop effective strategies to integrate AI tools in education while maintaining academic integrity while exploring new assessment methods | Constructive alignment and student-centric pedagogy | Extensive literature review and experiments with ChatGPT to evaluate its performance and implications for education | ChatGPT can potentially enhance creativity and innovation by providing new ways of engaging with educational content, though it also necessitates a rethinking of assessment strategies to prevent misuse |
| Primary author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact on creativity, innovation, and education |
|---|---|---|---|---|---|---|
| Evaluate the implementation of ChatGPT in undergraduate dental education | ChatGPT enhanced the learning experience with positive feedback from students | Explore educational affordances of ChatGPT; address plagiarism and academic integrity | Educational theories on technology integration | Mixed-methods: quantitative and qualitative analysis | AI, like ChatGPT, enhances creative thinking and offers innovative educational solutions | |
| Johnston et al. (2024) | Investigate student perspectives on | Students are aware but cautious of | Research on Gen-AI’s impact on teaching methods and academic integrity policies | Implicit references to academic integrity and technology-enhanced learning theories | Surveys and focus groups | |
| Investigate student perceptions of ChatGPT’s usefulness in studies | Potential to enhance learning, but ethical concerns exist | Quantitatively test hypotheses on ChatGPT’s adoption | Technology acceptance model ( | Exploratory thematic analysis of interviews | Enhances students’ creative processes and supports personalized learning | |
| Investigate misuse of text generation tools in academia | Ease of cheating; challenges in detecting generated content | Explore detection techniques and secure assessment methods | Concepts of educational integrity | Experiments on academic misuse of | Calls for responsible | |
| Evaluate ChatGPT’s advice on academic advising | ChatGPT provided high-quality, accurate advice | Rigorous assessment of ChatGPT’s answers; evaluate other | Narrative analysis approach | Narrative analysis of ChatGPT advice | Highlights AI’s potential in reshaping academic advising practices | |
| Investigate factors influencing ChatGPT use in academia | Academic integrity is crucial in moderating ChatGPT use | Explore ASNSs, expand theoretical frameworks, and investigate additional factors | Social cognitive theory ( | Quantitative web-based survey | ||
| Evaluate ChatGPT’s performance on economics tests | ChatGPT excelled, suggesting rethinking assessment strategies | Incorporate | Not explicitly mentioned | Testing ChatGPT on economics discipline questions | Advocates for innovative and technology-driven assessment in education | |
| Analyze media coverage of ChatGPT in higher education | Highlighted opportunities and risks; academic integrity concerns | Expand search methodology; investigate student and staff perspectives | Literature review and analysis of media coverage | Systematic search and content analysis | Suggests ChatGPT can support innovation but raises ethical concerns | |
| Investigate ChatGPT’s use among university students | Used for idea generation, paraphrasing and writing assistance | Guidance on ethical use; impact on cognitive processes | Not explicitly mentioned | Online questionnaire and nonprobability sampling | ||
| Explore ChatGPT’s applications and challenges in education | Various uses and impacts; benefits and ethical considerations | Ethical considerations; specific impacts on educational domains | Systematic literature review approach | Systematic literature review ( | ||
| Explore ChatGPT in higher education assessments | Limitations in semantic and factual questions; importance of human input | Autonomy in answering questions; AI’s impact on academic dishonesty | References to various theories on | Literature review on ChatGPT’s implications | Enhances learning, encourages innovation and delivers personalized feedback | |
| Examine ChatGPT’s challenges and opportunities in education | ChatGPT helps generate content but presents issues with plagiarism and distinguishing AI-generated content. It emphasizes critical thinking | Explore | Educational technology, information literacy, critical thinking, academic integrity | Qualitative analysis of benefits/challenges of ChatGPT in education | AI, like ChatGPT, fosters critical thinking and creativity, reshapes education by making tasks more accessible and potentially reduces social disparities | |
| Explore ChatGPT’s impact on social work assessment writing | ChatGPT poses a low cheating risk in knowledge applications but lacks depth in responses | Further, investigates AI’s impact on learning and critical thinking across disciplines | Educational theories, Ethical Principle Screen ( | Testing AI-generated responses in social work education | This raises questions about technology’s role in creativity and emphasizes the need for | |
| Emphasize ethical leadership in ChatGPT’s educational use | ChatGPT should assist with tasks, not author academic papers. Leadership and ethical considerations are crucial | Research ChatGPT’s potential for essay plans, grammar checking and content suggestions | Theories related to | Literature review and possibly qualitative or quantitative methods on ChatGPT implications | Suggests AI, with ethical use and leadership, can enhance creativity, innovation and learning outcomes, acknowledging misuse concerns | |
| Analyze global evidence on ChatGPT’s use in higher education | ChatGPT aids in academic tasks, teaching and learning, with most studies from high-income countries. Concerns over accuracy and academic integrity | Suggests experimental studies, RCTs on ChatGPT’s impacts and ethnographic studies on its benefits in knowledge acquisition | Systematic scoping review methodology was used, with descriptive and thematic analysis to identify trends and themes | Highlights ChatGPT’s role in enhancing educational processes, needing further research to maximize its potential for creativity and innovation. Ethical usage and reliability concerns must be addressed to ensure effective integration without compromising creativity | ||
| Jürgen Rudolph, Samson Tan, Shannon Tan | To explore the ChatGPT’s impact on traditional assessments and academic integrity | ChatGPT can generate human-like text, raising significant concerns about plagiarism and the validity of traditional assessment methods | Further research is needed to develop effective strategies to integrate | Constructive alignment and student-centric pedagogy | Extensive literature review and experiments with ChatGPT to evaluate its performance and implications for education | ChatGPT can potentially enhance creativity and innovation by providing new ways of engaging with educational content, though it also necessitates a rethinking of assessment strategies to prevent misuse |
4.2.4 Cluster 4: Adopting ChatGPT: enhancing creativity and knowledge (Yellow).
Cluster 4 highlights the role of generative AI in fostering creativity and supporting personalized knowledge construction. Several studies report how tools like ChatGPT enable learners to brainstorm, refine writing and receive iterative feedback – functions that can stimulate higher-order thinking and self-directed learning (Lo, 2023; Zhang et al., 2023).
Generative AI is seen not only as a writing assistant but also as a partner in cognitive exploration. For example, Huang and Lin (2024) documented how students use ChatGPT to generate novel ideas and enhance their understanding of complex concepts. Similarly, Zhao et al. (2023) demonstrated that learners value the tool’s ability to scaffold individual learning paths and adapt to their unique needs and proficiency levels.
This cluster addresses the gap in understanding how Gen-AI supports educational creativity and learner agency, reinforcing the potential of AI as a driver of personalized and inquiry-based learning. However, the authors also caution that uncritical reliance on these tools may inhibit original thought, underscoring the need for guided integration into instructional practice (Table 10).
Summary of influential articles (from Cluster 4)
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact of AI on creativity and innovation |
|---|---|---|---|---|---|---|
| Johnson et al. (2024a, 2024b) | Explore the integration of ChatGPT into library instruction | ChatGPT can enhance research skills and critical evaluation | Adapting and implementing ChatGPT in library sessions | Universal design for learning (UDL), constructivist theory | Lesson plan with hands-on activities | Potential to enhance human critical thinking and creativity |
| Gao et al. (2024) | Impact of ChatGPT on educational practices | Importance of user experience in adopting ChatGPT | Longitudinal data and dynamic analysis of interactions | Stereotype content model, BIAS map | Regression models | Enhances creativity by providing personalized experiences and supporting educators |
| Crompton and Burke (2024) | Use ChatGPT in education across all learner levels | Benefits for teaching, support and professional development | Explore the potential and limitations of ChatGPT | Configurative systematic review, PRISMA protocol | Systematic review methodology | Supports creativity and innovation among educators and students |
| Volante et al. (2023) | Guide students in improving AI-generated text | ICE model promotes critical and creative thinking | Impact of AI on student learning outcomes | ICE model for formative assessment | A qualitative approach with student guidance | Spur innovations in thinking and promote critical and creative skills |
| Perkins (2023) | Implications of AI LLMs on academic integrity | Need for updated academic integrity policies | Concerns related to LLM use in student work | Literature review on AI and academic integrity | Literature review and analysis | Supports creative expression and collaboration while posing ethical challenges |
| Amaro et al. (2023) | Impact of ChatGPT’s fake information on trust | Users continue to trust ChatGPT despite fake information | Factors influencing user trust in AI tools | User study on trust and satisfaction | User study with tasks and statistical tests | Highlights potential uses of ChatGPT for educational activities |
| Bower et al. (2024) | Understand educators’ views on AI’s impact on teaching | Recognized need for AI understanding and critical thinking | Broader implications of AI in education | Unified Theory of Acceptance and Use of Technology (UTAUT2) | Mixed-methods with surveys and thematic analysis | Encourages holistic approach fostering creativity and innovation |
| Eager and Brunton (2023) | Integrate AI tools in higher education | AI enhances assessment design and teaching materials | Long-term impact of AI on education | Case study approach | Case study on AI implementation | Augments creativity and efficiency in educational practices |
| Wardat et al. (2023) | Perceptions of ChatGPT in teaching mathematics | Encourages reading and writing abilities | Incorporate ChatGPT into teaching methods | Instrumental case study design, social network analysis | Qualitative case study approach | Provides personalized content and promotes innovative teaching methods |
| Almasre (2024) | Effectiveness of GPT-4 in assessing typography course designs | AI ensures consistency and objectivity in the assessment | The potential of AI in education and assessment practices | Training evaluators and analyzing performance | Training and evaluation sessions | Enhances assessment in creative fields and supports objective evaluation |
| Author(s) | Purpose | Key findings | Future research suggestions | Theories used | Research methodology | Impact of |
|---|---|---|---|---|---|---|
| Explore the integration of ChatGPT into library instruction | ChatGPT can enhance research skills and critical evaluation | Adapting and implementing ChatGPT in library sessions | Universal design for learning ( | Lesson plan with hands-on activities | Potential to enhance human critical thinking and creativity | |
| Impact of ChatGPT on educational practices | Importance of user experience in adopting ChatGPT | Longitudinal data and dynamic analysis of interactions | Stereotype content model, | Regression models | Enhances creativity by providing personalized experiences and supporting educators | |
| Use ChatGPT in education across all learner levels | Benefits for teaching, support and professional development | Explore the potential and limitations of ChatGPT | Configurative systematic review, | Systematic review methodology | Supports creativity and innovation among educators and students | |
| Guide students in improving AI-generated text | Impact of | A qualitative approach with student guidance | Spur innovations in thinking and promote critical and creative skills | |||
| Implications of | Need for updated academic integrity policies | Concerns related to | Literature review on | Literature review and analysis | Supports creative expression and collaboration while posing ethical challenges | |
| Impact of ChatGPT’s fake information on trust | Users continue to trust ChatGPT despite fake information | Factors influencing user trust in | User study on trust and satisfaction | User study with tasks and statistical tests | Highlights potential uses of ChatGPT for educational activities | |
| Understand educators’ views on AI’s impact on teaching | Recognized need for | Broader implications of | Unified Theory of Acceptance and Use of Technology (UTAUT2) | Mixed-methods with surveys and thematic analysis | Encourages holistic approach fostering creativity and innovation | |
| Integrate | Long-term impact of | Case study approach | Case study on | Augments creativity and efficiency in educational practices | ||
| Perceptions of ChatGPT in teaching mathematics | Encourages reading and writing abilities | Incorporate ChatGPT into teaching methods | Instrumental case study design, social network analysis | Qualitative case study approach | Provides personalized content and promotes innovative teaching methods | |
| Effectiveness of GPT-4 in assessing typography course designs | The potential of | Training evaluators and analyzing performance | Training and evaluation sessions | Enhances assessment in creative fields and supports objective evaluation |
4.2.5 Cross-cluster insights.
Synthesizing insights across the four clusters reveals a dynamic interplay of pedagogical innovation, ethical considerations and student-centered learning. The focus on instructional redesign in Cluster 1 converges with curriculum and assessment strategies in Cluster 2. Ethical dilemmas identified in Cluster 3 are deeply tied to the practical use of AI tools in learning environments, as described in Cluster 4.
Several studies (e.g. Kohnke et al., 2023; Rodrigues and Rodrigues, 2023) point to the need for integrated frameworks that consider both technological affordances and human values. This cross-cluster narrative indicates that effective AI adoption in education must go beyond tool deployment, encompassing institutional culture, pedagogical intentionality and ethical foresight.
Overall, these interconnections highlight that Gen-AI’s influence on education is multifaceted. As AI tools continue to evolve, a holistic approach – combining ethics, creativity and pedagogy – is necessary to guide their responsible and impactful integration.
4.3 Thematic evolution of the Gen-AI_ES domain (RQ4)
To explore the thematic evolution of the Gen-AI_ES domain, the data set was divided into four annual time slices (2021–2024). For each period, a thematic keyword co-occurrence map was generated using the Bibliometrix software. Figure 5 presents the thematic structure for the first time slice (2021), showing clusters of terms based on their co-occurrence frequency, density and centrality. A thematic map categorizes research themes into four quadrants – motor, basic, niche and emerging or declining – based on their centrality (x-axis) and density (y-axis) metrics (Cobo et al., 2011). Centrality reflects a theme’s importance and its connections to other themes within the research field, while density indicates the theme’s internal development and cohesion. Motor themes, located in the upper-right quadrant, are both well-developed and central, signifying influential research areas. In contrast, themes in the lower-left quadrant are less developed and peripheral, representing either emerging topics or areas of declining interest. This mapping approach provides a comprehensive overview of the research domain’s structure and evolution, facilitating the identification of key areas of growth and emerging significance.
The image shows a scatter plot with two axes, the horizontal axis represents the relevance degree, centrality, while the vertical axis indicates the development degree, density. Different themes are positioned based on these two metrics, with various educational themes like education, learning, and medical education located in the upper left quadrant, classified as niche themes. In the upper right quadrant, representing motor themes, entries such as large language model, language model, and prompt engineering are visible. Further along the horizontal axis, technology, technology acceptance model, and information technology appear, denoting central themes. The bottom section includes themes like active learning, automated writing evaluation, and instruction, categorised as emerging or declining themes. The image includes visual elements like dashed lines that separate the quadrants and various font sizes to denote thematic significance.Thematic keyword map for Time Slice 1 (2021–2024), generated using Bibliometrix
Note(s): The map displays clusters based on co-occurrence patterns of author keywords, illustrating emerging research themes during the early phase of Gen-AI in education
Source(s): Figure by authors
The image shows a scatter plot with two axes, the horizontal axis represents the relevance degree, centrality, while the vertical axis indicates the development degree, density. Different themes are positioned based on these two metrics, with various educational themes like education, learning, and medical education located in the upper left quadrant, classified as niche themes. In the upper right quadrant, representing motor themes, entries such as large language model, language model, and prompt engineering are visible. Further along the horizontal axis, technology, technology acceptance model, and information technology appear, denoting central themes. The bottom section includes themes like active learning, automated writing evaluation, and instruction, categorised as emerging or declining themes. The image includes visual elements like dashed lines that separate the quadrants and various font sizes to denote thematic significance.Thematic keyword map for Time Slice 1 (2021–2024), generated using Bibliometrix
Note(s): The map displays clusters based on co-occurrence patterns of author keywords, illustrating emerging research themes during the early phase of Gen-AI in education
Source(s): Figure by authors
The thematic map categorizes research themes into four quadrants based on centrality (x-axis) and density (y-axis) metrics (Cobo et al., 2011). Motor themes (upper-right quadrant) are both highly developed and central, indicating robust and influential areas of research. Basic themes (lower-right quadrant) are essential but less developed, representing foundational topics with potential for further exploration. Emerging or declining themes (lower-left quadrant) are underdeveloped and peripheral, possibly indicating nascent or waning areas of interest. Niche themes (upper-left quadrant) are well-developed but isolated, reflecting specialized topics with limited connections to other themes.
To complement our thematic analysis, we adopt the approach of Železnik et al. (2017), Kim and So (2022), which involves examining the top 10 most cited articles within each quadrant for each time scale (TS1 and TS2). Figures 6 and 7 outline the four primary themes of research that encompass most keywords within the collection. Based on the figures for TS1, there is one motor theme, two primary themes, one at the intersection of basic and emerging/declining themes, and one niche theme centered on language education.
A horizontal bar chart titled Number of Documents displays the number of documents authored by ten individuals. The vertical axis lists the names: Kou X, Chiu T K F, Crawford J, Chen X, Zhang Y, Xu X, Rudolph J, Wardat Y, Wang C, and Tan S. The horizontal axis ranges from 0 to 8. The chart shows that Kou X and Chiu T K F have 4 documents each, Crawford J and Chen X have 5, Zhang Y, Xu X, Rudolph J, and Wardat Y have 6, and both Wang C and Tan S have 7 documents each. The lengths of the bars represent the number of documents, with Wang C and Tan S having the longest bars.Publications by top ten authors
Source(s): Figure by authors
A horizontal bar chart titled Number of Documents displays the number of documents authored by ten individuals. The vertical axis lists the names: Kou X, Chiu T K F, Crawford J, Chen X, Zhang Y, Xu X, Rudolph J, Wardat Y, Wang C, and Tan S. The horizontal axis ranges from 0 to 8. The chart shows that Kou X and Chiu T K F have 4 documents each, Crawford J and Chen X have 5, Zhang Y, Xu X, Rudolph J, and Wardat Y have 6, and both Wang C and Tan S have 7 documents each. The lengths of the bars represent the number of documents, with Wang C and Tan S having the longest bars.Publications by top ten authors
Source(s): Figure by authors
The image is a flow diagram representing the relationships between concepts across two time periods, 2021 to 2023 and 2024 to 2024. On the left, the concepts include technology adoption, A I literacy, cheating, automation, authentic assessment, Chat G P T, students, technology acceptance model, human, and internet, presented in a vertical column from top to bottom. On the right, the concepts for the later period include Chat G P T, education, technology, and large language model, arranged in a similar fashion. Lines connecting the concepts illustrate the relationships and flow of ideas between these periods, with varying thickness to indicate the strength or frequency of these connections.Thematic evolution of keywords in domain 2021–2024
Source(s): Figure by authors
The image is a flow diagram representing the relationships between concepts across two time periods, 2021 to 2023 and 2024 to 2024. On the left, the concepts include technology adoption, A I literacy, cheating, automation, authentic assessment, Chat G P T, students, technology acceptance model, human, and internet, presented in a vertical column from top to bottom. On the right, the concepts for the later period include Chat G P T, education, technology, and large language model, arranged in a similar fashion. Lines connecting the concepts illustrate the relationships and flow of ideas between these periods, with varying thickness to indicate the strength or frequency of these connections.Thematic evolution of keywords in domain 2021–2024
Source(s): Figure by authors
Figures 5–7 illustrate the thematic evolution and scholarly contributions within the domain of Gen-AI_ES. Figure 5 categorizes themes into motor, basic, niche and emerging/declining categories, highlighting areas such as “large language models” and “prompt engineering” as central, while foundational themes, including “ChatGPT” and “artificial intelligence,” dominate the discourse. Figure 6 identifies prolific contributors, such as Wardat et al. (2023) and Wang et al. (2023), emphasizing the interdisciplinary nature of the field. Figure 7 visualizes the progression of keywords from 2021 to 2024, showing how terms like “human,” “internet” and “ChatGPT” evolved into themes such as “education.” This shift reflects the increasing application of generative AI in pedagogy, focusing on integrating technologies to enhance teaching, learning and student engagement while addressing ethical and practical implications.
A key concern in integrating generative AI into education is “hallucinations,” where models confidently generate false or fabricated information (Sakib et al., 2024). AI hallucinations pose significant risks in education, where students and educators may unknowingly rely on inaccurate content. Ahmad et al. (2023) emphasized the need for safeguards to ensure the accuracy of AI-generated materials, while Weise et al. (2023) pointed to the limitations of these models, which generate content based on patterns rather than factual understanding. These challenges impact the overall reliability and trust in AI systems (MIT n.d.). Strategies such as user warnings and detection frameworks show promise, but ongoing research and consideration of ethical implications are crucial to strengthen AI robustness further and limit misinformation (Ji et al., 2023).
4.3.1 Thematic evolution of the Gen-AI_ES domain for first time slice (2021–2023).
The motor theme explores the articles around internet Web-based learning and curriculum. Two primary themes have been identified in this research domain. The first basic theme, with the most significant number of articles, explores issues related to AI and academic integrity in education. The second basic theme includes articles exploring ChatGPT and assessment issues. Papers on generative AI emerge as an intersection between basic and emerging/declining themes, while language education emerges as a niche theme.
4.3.2 Thematic evolution of the Gen-AI_ES domain for a second-time slice (2024).
The motor theme explores the articles around large language models, creative thinking and undergraduate students. There are three basic themes in the second-time slice. The first basic theme, with the most significant number of articles, explores issues related to AI and integration in education. The second basic theme includes articles that explore the issues around Chatbots and critical thinking. The final basic theme explores the article on academic dishonesty. One emerging theme is prompt engineering, possibly due to the emerging ChatGPT. One niche theme focuses on self-regulated learning, academic writing and digital multimodal composing.
Hence, discernible shifts in focus and thematic priorities are evident throughout the thematic evolution within the Gen-AI_ES domain from 2021 to 2024. Initially, between 2021 and 2023, the predominant themes revolved around internet Web-based learning, curriculum development and the intersection of AI with academic integrity in educational contexts. This period saw a concentration of research articles exploring the implications of AI technologies, particularly in maintaining academic honesty and assessing students through platforms like ChatGPT. However, by 2024, the thematic landscape had undergone a notable transformation. The discourse shifted toward a broader exploration of the applications of large language models, emphasizing creative thinking and their integration into educational settings.
The emergence of themes surrounding chatbots and critical thinking signaled a deeper inquiry into the pedagogical potential of AI, expanding beyond mere assessment concerns. Furthermore, identifying prompt engineering as an emerging theme underscored a growing recognition of the importance of tailored interactions in optimizing AI-driven educational experiences. Concurrently, niche themes such as self-regulated learning, academic writing and digital multimodal composing came to the fore, highlighting specific areas of interest within the evolving landscape of AI-enhanced education. These thematic evolutions reflect the maturation of research within the Gen-AI_ES domain and a dynamic response to the advancing capabilities and emerging challenges posed by AI in educational contexts. Table 11 aligns the thematic clusters with the derived propositions, offering a consolidated view that enhances the coherence and accessibility of our analysis.
Propositions and cluster alignment
| Linked cluster | Justification |
|---|---|
| Cluster 1: Transformation | |
| P1: Gen-AI enhances personalized learning pathways | Aligned with studies on adaptive learning (e.g. Lim et al., 2023) |
| P2: Gen-AI reshapes instructional delivery and feedback | Draws on feedback loops in digital pedagogy (Yadav et al., 2025) |
| P3:AI tools foster reflective and adaptive teaching practices | Links to self-regulated learning theories (Kong and Yang, 2024; Zimmerman, 2002) |
| Cluster 2: Curriculum and Pedagogy | |
| P4:AI integration drives curriculum redesign | Reflects AI-driven curriculum innovations (Chen et al., 2024; Rizvi et al., 2025) |
| P5: Educator roles evolve with AI-supported pedagogy | Based on teacher role transformation with AI (Taufikin et al., 2024) |
| Cluster 3: Ethics | |
| P6: Ethical guidelines are crucial for fair assessment | Echoes UNESCO’s AI ethics guidance (UNESCO, 2021) |
| P7: Institutions must address academic integrity risks | Supported by concerns on AI misuse (Xie et al., 2023) |
| Cluster 4: Creativity | |
| P8: Gen-AI stimulates creativity and innovation in learning | Backed by work on creative AI use (Manditereza and Chamboko-Mpotaringa, 2024) |
| P9: Personalized AI tools support learner autonomy | Grounded in autonomy-supportive learning (Kukreja et al., 2025; Markauskaite et al., 2022) |
| P10:AI enhances knowledge construction and exploration | Connected to constructivist models of learning (Imran et al., 2024; Zawacki-Richter et al., 2019) |
| Linked cluster | Justification |
|---|---|
| Cluster 1: Transformation | |
| P1: Gen-AI enhances personalized learning pathways | Aligned with studies on adaptive learning (e.g. |
| P2: Gen-AI reshapes instructional delivery and feedback | Draws on feedback loops in digital pedagogy ( |
| P3: | Links to self-regulated learning theories ( |
| Cluster 2: Curriculum and Pedagogy | |
| P4: | Reflects AI-driven curriculum innovations ( |
| P5: Educator roles evolve with AI-supported pedagogy | Based on teacher role transformation with |
| Cluster 3: Ethics | |
| P6: Ethical guidelines are crucial for fair assessment | Echoes UNESCO’s |
| P7: Institutions must address academic integrity risks | Supported by concerns on |
| Cluster 4: Creativity | |
| P8: Gen-AI stimulates creativity and innovation in learning | Backed by work on creative |
| P9: Personalized | Grounded in autonomy-supportive learning ( |
| P10: | Connected to constructivist models of learning ( |
5. Discussion
This paper systematically reviews and synthesizes the growing body of literature on generative AI in education, with a focus on ChatGPT. Analyzing 817 peer-reviewed articles from 2021 to 2024, it combines bibliometric analysis, thematic mapping and content analysis to uncover key publication trends, thematic developments and the integration of Gen-AI into educational practice. The findings highlight a surge in research activity, growing interest in personalized learning and improvements in student engagement and outcomes. By identifying four evolving research clusters and aligning them with global ethical standards such as UNESCO’s AI Ethics Recommendation, the study offers a conceptual framework, ten theoretical propositions and a forward-looking research agenda – contributing both scholarly insight and practical guidance for educators, policymakers and AI developers. Table 12 summarizes how each identified research cluster aligns with the study’s problem statement and contributes to addressing key gaps in the literature.
Linking research clusters to identified gaps in the literature
| Cluster | Key focus | Problem addressed | Contribution to study purpose |
|---|---|---|---|
| Cluster 1: Gen-AI transforming education | Teaching innovation, personalized learning, academic workflows | Fragmented understanding of Gen-AI’s educational impact | Synthesizes how Gen-AI reshapes pedagogy, offering clarity to educators and researchers |
| Cluster 2: Curriculum and pedagogical reform | Curriculum design, AI-integrated assessment, educator roles | Lack of practical guidance for instructional adaptation | Provides examples of instructional shifts and models for AI-enhanced teaching strategies |
| Cluster 3: Ethics and integrity in higher education | Academic dishonesty, AI misuse, assessment fairness | Lack of ethical clarity and policy direction | Frames key ethical issues and aligns with UNESCO’s AI ethics guidelines |
| Cluster 4: Creativity and knowledge personalization | Creativity, idea generation, personalized learning | Scattered insights on AI’s role in fostering innovation | Integrates psychological and pedagogical perspectives to support creative, AI-driven learning |
| Cluster | Key focus | Problem addressed | Contribution to study purpose |
|---|---|---|---|
| Cluster 1: Gen-AI transforming education | Teaching innovation, personalized learning, academic workflows | Fragmented understanding of Gen-AI’s educational impact | Synthesizes how Gen-AI reshapes pedagogy, offering clarity to educators and researchers |
| Cluster 2: Curriculum and pedagogical reform | Curriculum design, AI-integrated assessment, educator roles | Lack of practical guidance for instructional adaptation | Provides examples of instructional shifts and models for AI-enhanced teaching strategies |
| Cluster 3: Ethics and integrity in higher education | Academic dishonesty, | Lack of ethical clarity and policy direction | Frames key ethical issues and aligns with UNESCO’s |
| Cluster 4: Creativity and knowledge personalization | Creativity, idea generation, personalized learning | Scattered insights on AI’s role in fostering innovation | Integrates psychological and pedagogical perspectives to support creative, AI-driven learning |
5.1 Managerial and theoretical contributions and implications
5.1.1 Managerial contributions.
The findings emphasize the importance of strategically embedding Gen-AI tools, such as ChatGPT and large language models, into educational frameworks to enhance personalized learning, automate assessments and foster innovation in pedagogy. This integration can significantly improve student engagement and learning outcomes (Dwivedi et al., 2023; Gao et al., 2024). For instance, ChatGPT’s ability to provide tailored feedback and generate creative prompts supports active learning and critical thinking (Crompton and Burke, 2024). Institutions should also prioritize the development of AI-driven ethical frameworks to address challenges such as plagiarism, data privacy, algorithmic bias and misinformation, ensuring responsible AI use by establishing robust guidelines and protocols that balance innovation with ethical integrity.
Addressing ethical concerns such as plagiarism, misinformation and biases is paramount. This study highlights the need for robust ethical guidelines and frameworks to ensure responsible AI use in education. For example, Rudolph et al. (2023a) discussed the potential misuse of AI for academic dishonesty and emphasize the importance of implementing AI-driven plagiarism detection and monitoring systems. Similarly, Bin-Nashwan et al. (2023) called for comprehensive frameworks to mitigate ethical challenges.
Educators and students must be equipped with AI literacy to maximize the benefits of Gen-AI tools while understanding their limitations and ethical implications. Training programs can empower stakeholders to use tools like ChatGPT effectively and responsibly (Fuchs and Aguilos, 2023). Johnson et al. (2024a, 2024b) also advocated for hands-on activities to improve educators’ and students’ understanding of AI’s role in information literacy.
The integration of Gen-AI requires collaboration between educators, AI developers, psychologists and policymakers. Studies such as Jeon and Lee (2023) underscored the importance of designing AI tools that address diverse pedagogical and cognitive needs. Collaboration can ensure that tools like ChatGPT are inclusive and adaptive to various educational contexts.
Policymakers should promote scalable AI adoption strategies for underprivileged schools and underserved regions. Subsidized programs and infrastructure development can bridge the digital divide, ensuring equitable access to AI-driven education. Alshahrani (2023) emphasized the potential of AI to democratize access to quality education, especially in blended learning environments.
Incorporating Gen-AI into curricula is essential to prepare students for the demands of an AI-driven workforce. AI tools can foster critical thinking, creativity and problem-solving skills, as discussed by Amaro et al. (2023) and Bower et al. (2024). Tailoring teaching strategies to integrate AI technologies will help educators optimize learning outcomes.
5.1.2 Theoretical contributions.
This research contributes to advancing theoretical frameworks in AI-integrated education by building on established models such as the TAM and Social Cognitive Theory (SCT). It provides insights into how factors like perceived ease of use, usefulness and cognitive aspects influence the adoption and efficacy of Gen-AI tools in educational contexts (Fuchs and Aguilos, 2023; Bin-Nashwan et al., 2023). In addition, the study maps the thematic evolution of AI education research, offering a deeper understanding of shifting research priorities and identifying emerging themes such as prompt engineering and niche areas like self-regulated learning.
By mapping thematic evolution, this study illustrates the shift in research priorities over time. Emerging themes like prompt engineering and niche areas like self-regulated learning reflect evolving focus areas in the Gen-AI_ES domain (Cobo et al., 2011). The thematic maps presented in Figures 5 and 7 highlight how foundational themes like “ChatGPT” and “artificial intelligence” have matured into applied contexts like education and critical thinking (Gao et al., 2024).
This research provides a foundation for developing normative frameworks addressing ethical challenges such as bias, academic dishonesty and data privacy. For example, Rudolph et al. (2023a) and Steele (2023) discussed the implications of AI use in assessment and propose solutions for maintaining academic integrity while leveraging AI.
The study explores how Gen-AI tools like ChatGPT influence creativity and innovation in education, providing a theoretical lens to examine their interplay with pedagogy. Studies such as Crompton and Burke (2024) and Johnston et al. (2024) demonstrated how AI tools facilitate creative thinking and collaborative learning, offering new opportunities for educational innovation.
The study identifies gaps and proposes areas for further exploration, such as longitudinal studies on the sustained impact of Gen-AI, tailored prompt optimization and AI literacy (Bringula, 2024). The propositions outlined provide a roadmap for advancing theoretical inquiries into AI’s integration in education.
5.1.3 Broader implications.
This study underscores the dynamic nature of the Gen-AI_ES domain, with shifts in thematic priorities from foundational topics to applied and ethical considerations. Key themes such as academic dishonesty, critical thinking and self-regulated learning reflect the evolving discourse in AI-enhanced education. Insights from this research can inform innovative teaching strategies, promote stakeholder activation and inspire ethical frameworks for responsible AI adoption (Bin-Nashwan et al., 2023; Gao et al., 2024). By addressing research gaps and fostering interdisciplinary collaboration, this study enriches the knowledge base on Gen-AI in education, inspiring stakeholders to integrate these technologies thoughtfully and effectively into educational practices while maintaining ethical standards and enhancing learning experiences.
6. Conclusion and future research
6.1 Summary of findings
To ensure rigor and replicability, the study followed a four-step methodology to develop a future research agenda:
Identification of influential articles: A bibliographic coupling map was used to identify 40 highly cited articles that represent core contributions to the Gen-AI_ES literature.
Content analysis: These articles were qualitatively analyzed to extract key themes, research gaps and emerging trends shaping the intellectual landscape of the field.
Development of research propositions: Drawing on the analysis, a set of targeted research questions and propositions was formulated to address underexplored areas and guide future inquiry.
Validation of agenda: The proposed agenda was cross-checked against existing literature to ensure novelty, remove redundancies and prioritize innovative research directions.
This review addressed four key research questions. First (RQ1), the study examined the evolution of scientific production on generative AI in education. The bibliometric analysis revealed a significant surge in scholarly output starting in 2022, indicating a rapid growth in interest in the topic. Second (RQ2), the review identified the dominant thematic clusters within the literature, which include pedagogical transformation, curriculum and teaching strategies, ethics and academic integrity and creativity and knowledge personalization. Third (RQ3), the study explored how these themes have evolved, revealing a shift from initial concerns about AI adoption toward more nuanced discussions on pedagogical redesign, assessment practices and institutional preparedness. Finally, based on the analysis of influential articles, a forward-looking research agenda was proposed (RQ4). This agenda highlights several underexplored areas, such as the integration of ethical frameworks, cross-disciplinary theoretical perspectives and comparative evaluations of different generative AI tools in educational settings.
6.2 Contributions to the field
This structured methodology establishes a transparent and replicable approach for identifying future research directions. The proposed research questions and areas are distinct from the study’s results, as they address unexplored dimensions, leverage emerging technologies and prioritize practical and theoretical advancements in AI-integrated education.
The literature on generative AI in education is expanding rapidly across various research areas. However, certain aspects of generative AI in education warrant further exploration. To set recommendations for future research endeavors, we adopted a four-step methodology. Initially, we identified 40 influential articles through a bibliometric coupling map for all four clusters. Subsequently, we conducted content analysis on these influential articles to ascertain a prospective research agenda. Third, we transformed the potential research agenda into research questions and propositions. At last, we cross-checked and eliminated identified research questions that scholars have previously addressed. This process yielded 12 future research questions identified by various authors (refer to Table 13). To contribute further to the literature, using thematic analysis maps, we identified seven areas of future research in our study and suggested propositions to address them.
Future research questions
| Research streams | Author(s) | Future research questions | |
|---|---|---|---|
| Generative AI in advancing education | Rejeb et al. (2024); Bringula (2024); Zekaj (2023); Javaid et al. (2023); Johnston et al. (2024) | 1 | Integrating AI in education: How does integrating AI technologies affect teaching practices, learning experiences and student outcomes in higher education, and what are the ethical considerations involved? |
| Šedlbauer et al. (2024), Alshahrani (2023), Iwasawa et al. (2024), Watrianthos et al. (2023), Vargas-Murillo et al. (2023) | 2 | AI for personalization, critical thinking and literacy: How does AI enhance personalization, critical thinking and AI literacy across diverse educational settings and student groups, and what are the ethical implications of AI integration in education? | |
| Dwivedi et al. (2023), Gilson et al. (2023) | 3 | Generative AI for knowledge projects and digital transformation: How can generative AI models like ChatGPT enhance knowledge transparency, ethics and efficiency in knowledge-intensive projects while addressing accuracy, biases, and risk management? In addition, how can ChatGPT facilitate digital transformation, create new business models and generate innovations in industries, considering their societal and organizational implications? | |
| Gilson et al. (2023) | 4 | AI in medical and educational sectors: How can future research enhance medical question-answering models, integrate ChatGPT into medical education and assess its long-term impact on learning and practice? In addition, how effective are chatbots in enhancing educational outcomes across various sectors, and what are their long-term impacts? | |
| OpenAI and its impact on pedagogy | Chan, 2023; Ansari et al., 2023 | 5 | AI experience and policy development: How does the level of experience with AI among students and teachers influence AI policy development in educational institutions? |
| Pack and Maloney (2023); Yildiz (2023) | 6 | Generative AI in language education: What are the ethical implications and practical applications of generative AI tools in language education research? | |
| Yildiz (2023); Wang et al. (2023) | 7 | Chatbots in language education: How effective are chatbots in enhancing vocabulary acquisition, learner engagement, personalization, integration with other tools, long-term retention and across diverse cultural contexts in language education? | |
| Wang et al. (2023) | 8 | Optimizing ChatGPT feedback: How can prompt design, diverse data sets, accuracy factors and perceptions of ChatGPT’s feedback be optimized and evaluated in educational settings? | |
| Integrity and ChatGPT in higher education | Kavadella et al., 2024; Fuchs and Aguilos, 2023; Lancaster, 2023; Akiba and Fraboni, 2023; Bin-Nashwan et al., 2023 | 9 | Ethical AI use and academic integrity: How can AI tools be ethically integrated into educational practices to support academic integrity and address plagiarism, and what are the long-term ethical implications? |
| Adopting ChatGPT: Enhancing creativity and knowledge | Gao et al., 2024 | 10 | AI in diverse sectors: How does the use of ChatGPT in sectors like marketing, health care and software development inform trends and impact these domains over time? |
| Wardat et al. (2023), Eager and Brunton (2023) | 11 | Chatbots and human tutors collaboration: How can chatbots be effectively and responsibly integrated into education, in collaboration with human tutors, to explore potential evolutionary changes, using larger sample sizes, qualitative and quantitative analyses, multiple languages and extended time periods for social network analysis? | |
| Eager and Brunton (2023), Crompton and Burke (2024), Bower et al. (2024) | 12 | Leveraging AI in higher education: How can AI be leveraged in higher education to align teaching practices with learning objectives, enhance student engagement, create inclusive materials, address ethical implications and support faculty development programs? |
| Research streams | Author(s) | Future research questions | |
|---|---|---|---|
| Generative | 1 | Integrating | |
| 2 | |||
| 3 | Generative | ||
| 4 | |||
| OpenAI and its impact on pedagogy | 5 | ||
| 6 | Generative | ||
| 7 | Chatbots in language education: How effective are chatbots in enhancing vocabulary acquisition, learner engagement, personalization, integration with other tools, long-term retention and across diverse cultural contexts in language education? | ||
| 8 | Optimizing ChatGPT feedback: How can prompt design, diverse data sets, accuracy factors and perceptions of ChatGPT’s feedback be optimized and evaluated in educational settings? | ||
| Integrity and ChatGPT in higher education | 9 | Ethical | |
| Adopting ChatGPT: Enhancing creativity and knowledge | 10 | ||
| 11 | Chatbots and human tutors collaboration: How can chatbots be effectively and responsibly integrated into education, in collaboration with human tutors, to explore potential evolutionary changes, using larger sample sizes, qualitative and quantitative analyses, multiple languages and extended time periods for social network analysis? | ||
| 12 | Leveraging |
Our review thus revealed several gaps in our knowledge about generative AI in education related to the context (Alshahrani, 2023), methods (Bin-Nashwan et al., 2023; Rejeb et al., 2024), theoretical frameworks (Su and Yang, 2023; Tiwari et al., 2024) and measurements (Iwasawa et al., 2024). We discuss these research gaps and our proposals for dealing with them below. We propose various topics and propositions for future research from our thorough review of each cluster’s papers’ research questions (as in Table 5) and thematic progression in the Gen-AI_ES domain from 2021 to 2024. These future avenues offer scholars opportunities to enrich knowledge and comprehension in AI-integrated education, tackling emerging challenges and leveraging opportunities to enhance student learning experiences.
1. The long-term effects of AI integration in education: While there is growing research on the immediate impact of AI in education, there is a notable absence of studies examining its sustained effects over time. Longitudinal studies are essential to track how AI influences pedagogical practices, student learning outcomes and the overall educational landscape (Bringula, 2024; Zekaj, 2023). Therefore, we propose that:
Conduct longitudinal studies to track the sustained impact of AI integration on teaching practices, student learning outcomes and overall educational effectiveness over extended periods.
Explore the potential evolutionary changes in pedagogical practices and learning environments resulting from continuous AI integration.
2. The impact of AI on specific subjects within blended learning environments: Given the shift toward exploring the integration of large language models and chatbots into educational settings, future research could investigate more deeply the pedagogical approaches, such as blended learning, that leverage AI technologies. This includes examining how these tools can be effectively incorporated into teaching practices to enhance student learning outcomes and foster critical thinking skills. Blended learning environments combine traditional classroom instruction with online learning, presenting unique opportunities and challenges. However, there is a lack of research on how AI tools like ChatGPT affect learning outcomes in specific subjects within such environments, hindering our understanding of their efficacy and applicability (Alshahrani, 2023). Therefore, we propose that:
Conduct empirical studies to investigate the effectiveness of AI tools like ChatGPT in enhancing learning outcomes across various subjects within blended learning settings.
Implement AI-driven interventions in specific subject areas to assess their impact on student engagement, comprehension and academic performance.
3. Ethical considerations and academic integrity: As AI tools become increasingly prevalent in educational environments, addressing ethical concerns and maintaining academic integrity are paramount. Current research highlights these issues but lacks a thorough exploration of systematic management approaches. Therefore, it is proposed to conduct interdisciplinary research to develop comprehensive frameworks and guidelines for ethical AI usage in educational settings (Vargas-Murillo et al., 2023). This research would focus on establishing standards to ensure data privacy, reduce bias, enhance algorithmic transparency and prevent academic dishonesty, ultimately supporting AI’s responsible and ethically sound integration in education. Therefore, we propose that:
Conduct interdisciplinary studies to examine the ethical considerations surrounding AI use in education, including issues of data privacy, bias and algorithmic transparency.
4. Tailored prompt engineering: The emergence of prompt engineering as an area of interest suggests a growing recognition of the importance of tailored interactions in educational AI systems (Denny et al., 2023). Future research could explore the design and optimization of prompts to enhance student engagement, motivation and learning outcomes across various educational contexts. Therefore, we propose that:
Investigate the design and optimization of tailored prompts to enhance student engagement, motivation and learning outcomes.
5. Niche themes exploration: Niche themes such as self-regulated learning, academic writing, language education and digital multimodal composing represent areas of specific interest within the Gen-AI_ES domain. Future research could explore these topics deeper, examining how AI technologies can support and enhance these aspects of the learning process and address any challenges or limitations. As AI becomes increasingly integrated into education, ensuring students possess the necessary literacy skills to navigate and use these technologies effectively is crucial. However, there is a dearth of research on students’ comprehension of AI literacy and their ability to use AI tools responsibly, highlighting the need for empirical studies in this area (Iwasawa et al., 2024; Rudolph et al., 2023a, 2023b). Therefore, we propose that:
Develop and administer objective knowledge tests focused on AI literacy to assess students’ understanding and proficiency in using AI tools responsibly.
Explore the factors influencing students’ perceptions of AI and their attitudes toward its use in educational contexts.
6. Longitudinal studies: Despite emerging evidence on the short-term benefits of AI in education, there is a significant gap in understanding its long-term impacts. Research has focused on immediate outcomes, neglecting how these technologies affect teaching practices, student learning and ethical considerations over extended periods. Therefore, it is crucial to initiate longitudinal studies to systematically evaluate the sustained impacts of AI on educational effectiveness (Tiwari et al., 2024). These studies would track changes in pedagogical approaches, student engagement and academic performance over time, providing a comprehensive understanding of AI’s long-term implications in educational settings. Therefore, we propose that:
Conduct longitudinal studies to track the impact of AI integration on educational practices and outcomes over time.
7. Cross-disciplinary collaboration: Given the interdisciplinary nature of AI in education, future research could benefit from cross-disciplinary collaboration between researchers in education, computer science, psychology and other relevant fields (Bahroun et al., 2023). de Neufville and Baum (2021) emphasized the importance of collective action in developing and regulating AI in education. This is also consistent with our findings, which state that deploying AI technologies could have significant implications for teaching methodologies, student engagement and educational outcomes. This collaborative approach can foster a deeper understanding of the complex interactions between AI technologies and educational practices.
The intellectual structure of generative AI in education reveals contributions from education, computer science and psychology, forming distinct research clusters (Shafique, 2013). For example, Yildiz (2023) showed that chatbots in vocabulary teaching integrate these disciplines by enhancing teaching methods, engaging learners and improving vocabulary retention through natural language processing and machine learning. Psychology contributes by understanding learner behavior and motivation. Similarly, Dwivedi et al. (2023) discussed the multidisciplinary nature of AI in education, focusing on personalized learning and diverse needs, with computer science providing the technical foundation and psychology examining motivation and engagement. Rudolph et al. (2023a, 2023b) emphasized the importance of pedagogical strategies, student engagement and assessment practices, with computer science and psychology exploring AI’s capabilities and learning impacts. Johnson et al. (2024a, 2024 b) highlight ChatGPT’s application in library instruction, where education focuses on pedagogy, computer science on AI tools and psychology on cognitive and emotional learning factors. Therefore, we propose that:
Foster cross-disciplinary collaboration to advance understanding of AI in education.
6.3 Limitations of the study
Despite the significance of our findings within the Gen-AI_ES domain, this study is subject to several limitations. First, our analysis is limited by relying exclusively on research papers indexed in the Scopus and Web of Science databases. Future investigations could expand by incorporating papers from additional databases such as Google Scholar, IEEE Xplore and EBSCOhost. Second, there is potential for deeper analysis by using alternative software tools like Gephi and CiteSpace. In addition to these limitations, it is essential to recognize that our study primarily focused on academic literature, potentially overlooking insights from gray literature, industry reports and other nontraditional sources. Incorporating a broader range of sources could provide a more holistic understanding of Gen-AI in the education landscape.
Furthermore, while our analysis provides a snapshot of the current state of research in Gen-AI_ES, it may not capture ongoing developments or emerging trends in real time. Continuous monitoring and periodic updates could ensure that our findings remain relevant and reflective of the evolving nature of the field.
6.4 Directions for future research
Despite these constraints, our study contributes valuable insights that can inform future research directions, policy decisions and practical implementations of Gen-AI in education, with its unique value underscored by several key aspects that set it apart, particularly in addressing Bond et al.’s (2024) concern about the proliferation of reviews recycling the same studies and duplicating efforts. First, this study analyzed 817 articles from 2021 to 2024 using an extensive Scopus and Web of Science data set, incorporating innovative analytical techniques, namely, bibliometric and content analyses. Second, the study focused on the transformative potential of generative AI in education, specifically through tools like ChatGPT, focusing on its impact on creativity, curriculum development and academic integrity. Third, although this study used an inductive, literature-driven approach to uncover themes, we recognize that frameworks like UTAUT, technological pedagogical content knowledge and substitution, augmentation, modification, redefinition could enhance understanding of pedagogical impacts. Future research could apply these models to assess how Gen-AI tools like ChatGPT influence teaching practices, educator readiness and technology use in various educational settings. Fourth, this study focuses on ChatGPT due to its dominant presence in educational research. However, we recognize that other Gen-AI tools, such as Bard, Claude and Copilot, are also gaining use. The identified themes, creativity, ethics and instructional innovation are likely relevant across tools. Future research could expand by comparing the use and impact of different Gen-AI technologies. Fifth, this review focuses on Gen-AI trends in education. It does not cover recent developments in agile enterprise architecture, digital transformation, sociotechnical views, or related fields such as knowledge management, design science and organizational studies what are key to understanding systemic change. Future research could adopt cross-disciplinary frameworks to explore their intersection with Gen-AI in education. Finally, the paper uncovered seven future research areas, emphasizing ethical guidelines, innovative methodologies, integrating Gen-AI into personalized learning frameworks, advancing academic discourse and real-world application in education.

