As video content becomes increasingly central to hospitality marketing and customer engagement strategies, researchers face mounting challenges in developing robust analytical frameworks for this multimodal medium. This scoping review examines video analysis techniques in hospitality research, identifying current methodological practices, gaps and opportunities to guide researchers conducting video-based studies in this rapidly evolving field.
We conducted a systematic scoping review following Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews guidelines, analyzing 37 peer-reviewed English-language studies published between 2017 and 2025. The review involved comprehensive database searches across Web of Science and Scopus, systematic screening using Rayyan and data extraction focused on sample characteristics, analytical techniques, and methodological approaches.
Analysis revealed three key methodological patterns: researchers primarily analyze textual and visual elements while underutilizing auditory components; YouTube dominates as the source platform, limiting platform diversity, and qualitative content analysis represents the most common approach, with minimal adoption of advanced computational techniques, such as machine learning-based topic modeling.
Researchers should expand data collection beyond YouTube to include emerging platforms like TikTok and Bilibili, investigate underutilized auditory components for richer contextual insights and develop hybrid analytical frameworks that combine machine learning efficiency with qualitative depth to address scalability challenges.
This study represents the first systematic methodological review of video analysis techniques in hospitality research, mapping current practices and providing guidance for methodological advancement in this emerging area.
1. Introduction
The digital transformation of the hospitality industry, driven by rapid advancements in information and communication technologies, has reshaped the landscape of customer engagement and operational strategies (Buhalis and Law, 2008; Ali and Frew, 2014; Xiang et al., 2015; Moreno and Tejada, 2019; Law et al., 2020; Rodrigues et al., 2023). Within this evolving digital ecosystem, video content has emerged as a particularly powerful medium for communication, marketing and service delivery in the hospitality sector (Hudson et al., 2012; Coker et al., 2021; Deng et al., 2021; Agrawal and Mittal, 2022).
The growth of video-sharing platforms, such as YouTube and TikTok, has revolutionized how hospitality stakeholders create, share and consume content (Yetimoğlu and Uğurlu, 2020; Zhu et al., 2022; Ercegovac et al., 2023). These platforms have near-supplanted traditional communication channels (e.g. newspapers, radios), offering new opportunities for immersive storytelling, virtual property tours and real-time event showcases (Leung et al., 2017; Tiago et al., 2019; Briliana et al., 2020; Pratisto et al., 2022). The strategic importance of video content is reflected in general business projections, with 89% of businesses expected to use video as a marketing tool in 2025 and 95% of video marketing professionals regarding it as an essential element of their overall marketing strategy (Wyzowl, 2025).
The surge in video content production necessitates an increased focus on video analysis techniques in hospitality research. The potential for extracting meaningful insights from video data spans multiple domains, such as consumer behavior analysis, marketing strategy evaluation, and service quality assessment. However, the volume and complexity of video data may present challenges to researchers employing traditional analytical methods (Núñez et al., 2024). For instance, manual coding approaches in qualitative content analysis, while valuable for in-depth exploration, typically fall short when researchers attempt to process the extensive multimodal data contained in video material (Schreier, 2012). The multifaceted nature of video, encompassing visual, auditory, and textual elements, demands more sophisticated analytical techniques capable of effectively synthesizing these diverse data types (Grzenkowicz and Wildfeuer, 2025).
Despite the growing importance of video analysis in hospitality research, the field lacks a systematic understanding of current methodological practices. Zhu and Cheng (2024) recently conducted a review of automatic video analytics in tourism, providing a technical framework that advocates computational approaches to overcome scalability limitations of manual coding. However, the analytical approaches hospitality researchers are often employing, the types of video content and data components being analyzed, and the methodological patterns and gaps across the broader hospitality field remain unexamined. This absence of empirical evidence limits the field's ability to assess current practices and identify strategic directions for methodological advancement.
To address this gap, we conducted a methodological scoping review of video analysis techniques employed in hospitality research. A scoping review approach allowed us to systematically map the breadth of methodological approaches across this emerging research area (Arksey and O'Malley, 2005; Munn et al., 2018). Specifically, this review aimed to (1) examine the types and characteristics of video content being analyzed in hospitality research, (2) identify methodological trends in video analysis within the field and (3) propose recommendations for future research directions and methodological advancements.
2. Method
2.1 Search strategy
We conducted this methodological scoping review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines (Tricco et al., 2018). Specifically, we performed the literature search using Web of Science and Scopus, two databases commonly employed by researchers in hospitality studies (e.g. Gomezelj, 2016; Mariani et al., 2018; Mehraliyev et al., 2021). Search terms included combinations of (“video” OR “video on demand” OR “VOD” OR “user generated video” OR “vlog*” OR “video blog” OR “social media video”) AND (“content analysis” OR “thematic analysis” OR “textual analysis” OR “sentiment analysis” OR “video analytics” OR “natural language processing” OR “machine learning” OR “big data analytics” OR “engagement metrics” OR “performance metrics” OR “multimodal analysis”), using Boolean AND logic to require studies to include both video-related terminology and analytical method terms. Video-related terms encompassed various formats (e.g. vlogs, social media videos), while analytical method terms included both traditional qualitative approaches (e.g. content analysis, thematic analysis) and computational techniques (e.g. machine learning, natural language processing, sentiment analysis). We included “multimodal analysis” because videos inherently contain multiple data types (visual, auditory, textual, metadata) that researchers may analyze individually or in combination. We managed and organized all identified studies using Rayyan, a web-based tool designed for scoping reviews (Ouzzani et al., 2016). Database searches were completed on April 18, 2025, with the screening and review process finalized by May 2, 2025.
2.2 Inclusion and exclusion criteria and review process
To be included in this review, articles had to meet the following criteria: (1) published in peer-reviewed journals; (2) written in English; (3) published online up to January 1, 2025, with online first publication dates serving as our reference point; (4) related to the hospitality industry (e.g. tourism, destination marketing, food and beverage, lodging, and/or casino gambling); and (5) included methods for video analysis and/or commentary analysis. We included commentary analysis because user-generated comments on video platforms provide audience response data that many researchers use to understand video reception and impact. Lastly, we excluded review articles, editorials, conference abstracts, theses and dissertations from our analysis.
The search in Scopus returned 638 articles limited to the subject areas of Business, Management and Accounting – a specific filter provided by Scopus. The Web of Science returned 194 articles after filters were applied for Hospitality, Leisure, Sport and Tourism fields. A total of 832 articles were initially imported into Rayyan, from which 69 duplicates were removed, resulting in 763 articles for screening. Following a review of titles, abstracts and full texts, 36 articles met the inclusion criteria. An additional search using the references from the included articles and Google Scholar resulted in the inclusion of one more article, bringing the total to 37 articles included in this scoping review. Figure 1 presents a PRISMA flow diagram that illustrates the different phases of the scoping review process, from initial identification to final inclusion of studies.
The PRISMA flow diagram shows three section headings arranged vertically on the left side: “Identification”, “Screening”, and “Included”. The flowchart is divided into two columns. The left column contains five text boxes, which are labeled as follows: Text box 1: “Records identified from: Scopus (n equals 638) Web of Science (n equals 194)”. Text box 2: “Records screened: (n equals 763)”. Text box 3: “Full-text articles assessed for eligibility: (n equals 41)”. Text box 4: “Studies included in scoping review: (n equals 37)”. An additional text box 5 labeled “Additional articles identified: (n equals 1)” is present on the left, between text boxes 3 and 4. The right column contains four text boxes, which are labeled as follows: Text box 6: “Duplicate records removed before screening: (n equals 69)”. Text box 7: “Records excluded: (n equals 722)”. Text box 8: “Full-text articles excluded: (n equals 5)”. The text boxes 1 and 6 are placed under the heading “Identification”, the text boxes 2, 3, 5, 7, and 8 are placed under the heading “Screening”, and the text box 4 is placed under the heading “Included”. Text box 1 is connected to text box 2 with a downward arrow. Text box 2 is connected to text box 3 with a downward arrow. Text box 3 is connected to text box 4 with a downward arrow. Text box 1 is connected to text box 6 with a rightward arrow. Text box 2 is connected to text box 7 with a rightward arrow. Text box 3 is connected to text box 8 with a rightward arrow. Text box 5 is connected with a rightward arrow to the vertical flow between text box 3 and text box 4.PRISMA flow diagram of the scoping review phases. Source(s): Authors’ own work
The PRISMA flow diagram shows three section headings arranged vertically on the left side: “Identification”, “Screening”, and “Included”. The flowchart is divided into two columns. The left column contains five text boxes, which are labeled as follows: Text box 1: “Records identified from: Scopus (n equals 638) Web of Science (n equals 194)”. Text box 2: “Records screened: (n equals 763)”. Text box 3: “Full-text articles assessed for eligibility: (n equals 41)”. Text box 4: “Studies included in scoping review: (n equals 37)”. An additional text box 5 labeled “Additional articles identified: (n equals 1)” is present on the left, between text boxes 3 and 4. The right column contains four text boxes, which are labeled as follows: Text box 6: “Duplicate records removed before screening: (n equals 69)”. Text box 7: “Records excluded: (n equals 722)”. Text box 8: “Full-text articles excluded: (n equals 5)”. The text boxes 1 and 6 are placed under the heading “Identification”, the text boxes 2, 3, 5, 7, and 8 are placed under the heading “Screening”, and the text box 4 is placed under the heading “Included”. Text box 1 is connected to text box 2 with a downward arrow. Text box 2 is connected to text box 3 with a downward arrow. Text box 3 is connected to text box 4 with a downward arrow. Text box 1 is connected to text box 6 with a rightward arrow. Text box 2 is connected to text box 7 with a rightward arrow. Text box 3 is connected to text box 8 with a rightward arrow. Text box 5 is connected with a rightward arrow to the vertical flow between text box 3 and text box 4.PRISMA flow diagram of the scoping review phases. Source(s): Authors’ own work
3. Results
This scoping review identified 37 articles that employed video analysis techniques within hospitality research. As illustrated in Figure 2, the chronological distribution of these publications (2017–2025) revealed a notable acceleration in research output beginning in 2023, with this period accounting for over 60% of the reviewed studies (23 out of 37). Table 1 provides a comprehensive overview of the key characteristics of these studies, including their research focus, methodological frameworks and analytical approaches.
The vertical bar chart shows a horizontal axis that ranges from “2017” to “2024” in increments of 1 year. Eight vertical bars are shown, one for each year, with values labeled above each bar. Bar values are shown as follows: “2017” equals 2. “2018” equals 1. “2019” equals 1. “2020” equals 1. “2021” equals 6. “2022” equals 3. “2023” equals 9. “2024” equals 14.Distribution of publications by year. Source(s): Authors’ own work
The vertical bar chart shows a horizontal axis that ranges from “2017” to “2024” in increments of 1 year. Eight vertical bars are shown, one for each year, with values labeled above each bar. Bar values are shown as follows: “2017” equals 2. “2018” equals 1. “2019” equals 1. “2020” equals 1. “2021” equals 6. “2022” equals 3. “2023” equals 9. “2024” equals 14.Distribution of publications by year. Source(s): Authors’ own work
Key characteristics of the reviewed studies
| Authors | Online publication year | Study objective | Video content type(s) examined | Sample size and source | Research approach | Analysis method(s) | Tools/Software used |
|---|---|---|---|---|---|---|---|
| Antivo et al. | 2024 | Examining online viewers’ non-social emotions in comments on sex tourism vlogs in Southeast Asia | Travel vlogs (metadata) and associated comments (textual) | 27 YouTube videos and 10,864 comments | Qualitative | Phronetic iterative qualitative data analysis | Data Miner |
| Avraham | 2018 | Examining nation branding and marketing strategies used by destinations to combat tourism crises and negative stereotypes | Tourism promotional videos (metadata, visual, textual) | 63 YouTube videos | Mixed methods | Quantitative and qualitative content analysis | Not specified |
| Barnes | 2023 | Investigating the impact of voice and music characteristics in travel and tourism video advertisements on consumer responses | Tourism ads (metadata, auditory) | 291 YouTube videos | Quantitative | Acoustic analytics, regression analysis | Python, R, Google Colab Notebook, Soundgen, Tunebat, Ultimate Vocal Remover |
| Barrett and Feng | 2020 | Analyzing online recipe videos for food safety implications related to flour handling, with a focus on identifying potential risks and safety practices | Recipe videos featuring flour (visual, textual) | 85 blog recipes and 146 YouTube videos | Mixed methods | Qualitative content analysis, descriptive statistics | Microsoft Excel, SPSS |
| Bernal et al. | 2024 | Analyzing tourism resiliency approaches of Philippine local government units during COVID-19 | Tourism promotional videos (visual, textual) | 29 videos | Qualitative | Manifest content analysis | Not specified |
| Chen et al. | 2024 | Investigating how multimodal stimuli in tourism crowdfunding projects predict project success | Videos from travel crowdfunding projects (metadata, visual, textual) | 3,659 videos | Quantitative | Textual analysis, deep learning, predictive modeling | Python (FFmpeg, OpenCV, NLTK, Python Imaging), Google Cloud Video Intelligence API |
| Deng et al. | 2022 | Exploring the content of influencer-endorsed short videos about wine on TikTok (Douyin), with particular attention to gender and generational differences in perceptions and preferences | Comments associated with wine-related short videos (textual) | 10,042 comments on Douyin | Mixed methods | LDA topic modeling, qualitative content analysis, Mann–Whitney U tests | Python (Jieba for LDA topic modeling), SPSS |
| Dewantara et al. | 2023 | Exploring Parasocial Interaction (PSI) attributes in travel vlogs and their influence on viewers’ travel intentions | Travel vlogs (metadata, visual, textual) and associated comments (textual) | 10 YouTube videos and 9,086 comments | Qualitative | Qualitative content analysis, nominal group technique, thematic analysis | NVivo, Microsoft Teams (for transcription) |
| Georgescu Paquin and Cerdan Schwitzguébel | 2021 | Analyzing the tourist landscape representation in Barcelona’s promotional videos in an overtourism context | Tourism promotional videos (visual) | 24 YouTube videos | Mixed methods | Quantitative visual content analysis, qualitative semiotic analysis | Not specified |
| Hoebanx and French | 2023 | Examining how slot machine videos on YouTube portray gambling and align with the norms of YouTube’s platform economy | Slot machine videos (visual, textual) and associated comments (textual) | 21 YouTube videos and 186 comments | Qualitative | Thematic analysis | Not specified |
| Huertas et al. | 2017 | Analyzing how Spanish DMOs use YouTube to communicate their promotional videos and to study whether these videos communicate brands through attraction factors and emotional values | Tourism promotional videos (metadata, visual, textual) | 542 YouTube videos | Mixed methods | Quantitative content analysis, qualitative content analysis, statistical correlation | FanpageKarma |
| Ketter and Avraham | 2021 | Examining digital marketing strategies used by NTBs during the COVID-19 pandemic | Tourism promotional videos (visual, textual) | 29 YouTube videos | Qualitative | Qualitative content analysis | Not specified |
| Lakmali et al. | 2024 | Analyzing how tourists portray crisis-affected destinations through YouTube vlogs and their content creation motivations | Travel vlogs (metadata, textual) | 10 YouTube videos | Qualitative | Thematic analysis | QDA Miner Lite, Excel |
| Lang | 2024 | Analyzing how Hangzhou’s cultural and tourism bureau constructs its international image on YouTube | Tourism promotional videos (visual, textual, auditory) | 83 YouTube videos | Qualitative | Multimodal critical discourse analysis | Not specified |
| Lau et al. | 2024 | Examining how museums reach Generation Z virtual tourists using TikTok videos and the relationship between video elements and types of engagement | Museum promotional videos (metadata, textual, auditory) | 313 videos on TikTok | Mixed methods | Thematic analysis, descriptive analysis, regression analysis | Excel, SPSS |
| Li et al. | 2023 | Exploring cultural meaning construction in social media through analysis of Liziqi’s YouTube channel | Lifestyle vlogs (visual) and associated comments (textual) | 5 YouTube videos and 500 comments | Qualitative | Content analysis (coding), textual analysis (decoding) | Python, NVivo |
| Li et al. | 2024 | Comparing spatial behavior of Chinese and foreign tourists based on landmark recognition in travel vlogs | Travel vlogs (visual, metadata) | 439 YouTube videos, 1,059 videos on Bilibili | Quantitative | Spatial analysis, landmark recognition | Baidu AI APIs, Baidu Maps Coordinate Picking System, Python |
| Ma et al. | 2023 | Examining the effectiveness of bullet comments associated with food vlogs on the tourists’ travel intentions | Food vlogs (metadata, visual, textual, auditory) and associated bullet comments (textual) | 20 videos on Bilibili and 133,680 bullet comments | Mixed methods | Qualitative content analysis, thematic analysis, computational sentiment analysis | Python |
| Ma et al. | 2023 | Exploring how food travel vlogs awaken travel intentions through viewers’ social and non-social emotions | Food vlogs (metadata) and associated comments (textual) | 32 videos on Bilibili and 91,437 comments | Mixed methods | Qualitative content analysis, LDA topic modeling, computational sentiment analysis | Python, (SnowNLP for sentiment analysis) |
| Mesana et al. | 2024 | Mapping online viewers’ social and non-social emotions when watching UNESCO cultural heritage sites’ travel vlogs | Travel vlogs (metadata) and associated comments (textual) | 64 YouTube videos and 3,089 comments | Qualitative | Qualitative sentiment analysis, phronetic iterative data analysis | Data Miner |
| Motahar et al. | 2021 | Exploring how Iran is framed as a travel destination by Social Media Influencers (SMIs) on YouTube | Travel videos (metadata, visual, textual) | 10 YouTube videos | Qualitative | Netnography and narrative analysis | Not specified |
| Nazir | 2023 | Examining destination branding through social media, comparing foreign influencers’ narratives with official presentations | Travel vlogs (visual, textual) | 8 YouTube videos | Qualitative | Thematic analysis | NVivo |
| Rauf and Pasha | 2024 | Understanding Global North–South dynamics in YouTube gastronomic tourism videos and audience feedback | Food vlogs (metadata, textual) and associated comments (textual) | 9 YouTube videos and 128,000 comments | Mixed methods | Textual analysis, qualitative content analysis, computational sentiment analysis | Python, YouTube API |
| Salangsang et al. | 2022 | Examining elements of luxury travel in tourism video advertisements from Asian countries during COVID-19 | Tourism promotional videos (metadata, visual, textual) | 122 YouTube videos | Qualitative | Qualitative content analysis | NVivo, Ncapture |
| Sharma | 2023 | Comparing message strategies adopted by celebrities vs social media influencers in brand-related YouTube content | Fashion and food industry videos (metadata, visual, textual, auditory) | 638 YouTube videos | Mixed methods | Qualitative content analysis, chi-square test, ANOVA, Mann–Whitney U test, Kruskal–Wallis H | SPSS |
| Tavakoli and Ling | 2022 | Exploring consumer perceptions of virtual food consumption and its sociological implications using online comments about a promotional video | Comments associated with a virtual food promotion video (textual) | 250 comments on Facebook | Qualitative | Thematic analysis | Not specified |
| Tham et al. | 2023 | Examining how Penang, Malaysia, is marketed on TikTok by different stakeholders and how they present the destination’s image | Tourism promotional videos (metadata, visual, textual, auditory) | 30 videos on TikTok | Qualitative | Multimodal analysis, Burke’s Pentadic analysis | Not specified |
| Vujičić et al. | 2021 | Examining the techno-social dimensions of tourist drone videography to understand production practices and creator differences | Drone vacation videos (metadata, visual) | 351 YouTube videos | Mixed methods | Qualitative content analysis, descriptive statistics, chi-square tests, regression analysis | Webometric Analyst, YouTube Statistics, SPSS |
| Warton and Brander | 2017 | Assessing the value of the TV show Bondi Rescue for improving tourist beach safety awareness | TV shows (visual, textual) | 98 episodes | Mixed methods | Qualitative content analysis, t-tests, ANOVA | R |
| Wen et al. | 2021 | Exploring travel constraints related to physician-assisted suicide tourism | Documentaries (metadata, textual) and associated comments (textual) | 25 YouTube videos and 1,231 comments | Qualitative | Thematic analysis | Nvivo |
| Yayla et al. | 2024 | Examining food preferences and gastro-tourist typologies of digital nomads | Digital nomads videos (visual, textual) and associated comments (textual) | 21 YouTube videos and 326 comments | Qualitative | Qualitative content analysis | Not specified |
| Yıldırım and Kaya | 2024 | Investigating digital nomads’ impressions and reflections toward intangible cultural heritage (ICH) during travel | Travel videos (visual, textual) and associated comments (textual) | 5 videos on social media and associated comments | Qualitative | Thematic analysis | MAXQDA |
| Yoo, Kim et al. | 2024 | Examining the relationships between discrete emotions expressed by travel influencers and viewer engagement | Travel videos (metadata, textual) | 5,008 YouTube videos | Quantitative | Computational sentiment analysis, regression analysis | Amazon Rekognition API, Receptiviti API (Syntax-Aware Lexical Emotion Engine module) |
| Yoo, Piscarac et al. | 2024 | Investigating the effectiveness of digital outdoor advertising in redefining urban tourism appeal and city branding using Seoul’s “Wave” campaign | Comments associated with YouTube videos (textual) | 956 comments on YouTube | Mixed methods | Word frequency analysis, centrality analysis, network analysis | Ucinet 6, NetDraw |
| Yu | 2019 | Investigating public perceptions of humanlike robots as employees in the hotel industry | Humanlike robot videos (metadata) and associated comments (textual) | 2 YouTube videos and 1,621 comments | Mixed methods | Cluster analysis, thematic analysis | NVivo, Data Miner |
| Zhang | 2021 | Analyzing public perceptions of service robots in hospitality and tourism amid COVID-19 | Comments associated with news report videos (textual) | 1,852 comments on YouTube | Quantitative | Explorative analysis (word cloud), computational sentiment analysis, LDA topic modeling | Python (Gensim for topic modeling), SentiStrength |
| Zhu et al. | 2024 | Examining the impacts of content features of pro-environmental tourism videos on viewers’ in-consumption engagement | Pro-environmental videos (metadata, visual, textual, auditory) | 44 videos on Bilibili | Quantitative | Regression analysis | Microsoft Azure AI Video Indexer, Python (Librosa, FFmpeg), R |
| Authors | Online publication year | Study objective | Video content type(s) examined | Sample size and source | Research approach | Analysis method(s) | Tools/Software used |
|---|---|---|---|---|---|---|---|
| Antivo et al. | Examining online viewers’ non-social emotions in comments on sex tourism vlogs in Southeast Asia | Travel vlogs (metadata) and associated comments (textual) | 27 YouTube videos and 10,864 comments | Qualitative | Phronetic iterative qualitative data analysis | Data Miner | |
| Avraham | 2018 | Examining nation branding and marketing strategies used by destinations to combat tourism crises and negative stereotypes | Tourism promotional videos (metadata, visual, textual) | 63 YouTube videos | Mixed methods | Quantitative and qualitative content analysis | Not specified |
| Barnes | 2023 | Investigating the impact of voice and music characteristics in travel and tourism video advertisements on consumer responses | Tourism ads (metadata, auditory) | 291 YouTube videos | Quantitative | Acoustic analytics, regression analysis | Python, R, Google Colab Notebook, Soundgen, Tunebat, Ultimate Vocal Remover |
| Barrett and Feng | 2020 | Analyzing online recipe videos for food safety implications related to flour handling, with a focus on identifying potential risks and safety practices | Recipe videos featuring flour (visual, textual) | 85 blog recipes and 146 YouTube videos | Mixed methods | Qualitative content analysis, descriptive statistics | Microsoft Excel, |
| Bernal et al. | 2024 | Analyzing tourism resiliency approaches of Philippine local government units during COVID-19 | Tourism promotional videos (visual, textual) | 29 videos | Qualitative | Manifest content analysis | Not specified |
| Chen et al. | 2024 | Investigating how multimodal stimuli in tourism crowdfunding projects predict project success | Videos from travel crowdfunding projects (metadata, visual, textual) | 3,659 videos | Quantitative | Textual analysis, deep learning, predictive modeling | Python (FFmpeg, OpenCV, NLTK, Python Imaging), Google Cloud Video Intelligence API |
| Deng et al. | 2022 | Exploring the content of influencer-endorsed short videos about wine on TikTok (Douyin), with particular attention to gender and generational differences in perceptions and preferences | Comments associated with wine-related short videos (textual) | 10,042 comments on Douyin | Mixed methods | Python (Jieba for | |
| Dewantara et al. | 2023 | Exploring Parasocial Interaction (PSI) attributes in travel vlogs and their influence on viewers’ travel intentions | Travel vlogs (metadata, visual, textual) and associated comments (textual) | 10 YouTube videos and 9,086 comments | Qualitative | Qualitative content analysis, nominal group technique, thematic analysis | NVivo, |
| Georgescu Paquin and Cerdan Schwitzguébel | 2021 | Analyzing the tourist landscape representation in Barcelona’s promotional videos in an overtourism context | Tourism promotional videos (visual) | 24 YouTube videos | Mixed methods | Quantitative visual content analysis, qualitative semiotic analysis | Not specified |
| Hoebanx and French | 2023 | Examining how slot machine videos on YouTube portray gambling and align with the norms of YouTube’s platform economy | Slot machine videos (visual, textual) and associated comments (textual) | 21 YouTube videos and 186 comments | Qualitative | Thematic analysis | Not specified |
| Huertas et al. | 2017 | Analyzing how Spanish DMOs use YouTube to communicate their promotional videos and to study whether these videos communicate brands through attraction factors and emotional values | Tourism promotional videos (metadata, visual, textual) | 542 YouTube videos | Mixed methods | Quantitative content analysis, qualitative content analysis, statistical correlation | FanpageKarma |
| Ketter and Avraham | Examining digital marketing strategies used by NTBs during the COVID-19 pandemic | Tourism promotional videos (visual, textual) | 29 YouTube videos | Qualitative | Qualitative content analysis | Not specified | |
| Lakmali et al. | Analyzing how tourists portray crisis-affected destinations through YouTube vlogs and their content creation motivations | Travel vlogs (metadata, textual) | 10 YouTube videos | Qualitative | Thematic analysis | ||
| Lang | Analyzing how Hangzhou’s cultural and tourism bureau constructs its international image on YouTube | Tourism promotional videos (visual, textual, auditory) | 83 YouTube videos | Qualitative | Multimodal critical discourse analysis | Not specified | |
| Lau et al. | 2024 | Examining how museums reach Generation Z virtual tourists using TikTok videos and the relationship between video elements and types of engagement | Museum promotional videos (metadata, textual, auditory) | 313 videos on TikTok | Mixed methods | Thematic analysis, descriptive analysis, regression analysis | Excel, |
| Li et al. | 2023 | Exploring cultural meaning construction in social media through analysis of Liziqi’s YouTube channel | Lifestyle vlogs (visual) and associated comments (textual) | 5 YouTube videos and 500 comments | Qualitative | Content analysis (coding), textual analysis (decoding) | Python, |
| Li et al. | 2024 | Comparing spatial behavior of Chinese and foreign tourists based on landmark recognition in travel vlogs | Travel vlogs (visual, metadata) | 439 YouTube videos, 1,059 videos on Bilibili | Quantitative | Spatial analysis, landmark recognition | Baidu AI APIs, Baidu Maps Coordinate Picking System, Python |
| Ma et al. | 2023 | Examining the effectiveness of bullet comments associated with food vlogs on the tourists’ travel intentions | Food vlogs (metadata, visual, textual, auditory) and associated bullet comments (textual) | 20 videos on Bilibili and 133,680 bullet comments | Mixed methods | Qualitative content analysis, thematic analysis, computational sentiment analysis | Python |
| Ma et al. | 2023 | Exploring how food travel vlogs awaken travel intentions through viewers’ social and non-social emotions | Food vlogs (metadata) and associated comments (textual) | 32 videos on Bilibili and 91,437 comments | Mixed methods | Qualitative content analysis, | Python, (SnowNLP for sentiment analysis) |
| Mesana et al. | 2024 | Mapping online viewers’ social and non-social emotions when watching UNESCO cultural heritage sites’ travel vlogs | Travel vlogs (metadata) and associated comments (textual) | 64 YouTube videos and 3,089 comments | Qualitative | Qualitative sentiment analysis, phronetic iterative data analysis | Data Miner |
| Motahar et al. | 2021 | Exploring how Iran is framed as a travel destination by Social Media Influencers ( | Travel videos (metadata, visual, textual) | 10 YouTube videos | Qualitative | Netnography and narrative analysis | Not specified |
| Nazir | 2023 | Examining destination branding through social media, comparing foreign influencers’ narratives with official presentations | Travel vlogs (visual, textual) | 8 YouTube videos | Qualitative | Thematic analysis | NVivo |
| Rauf and Pasha | 2024 | Understanding Global North–South dynamics in YouTube gastronomic tourism videos and audience feedback | Food vlogs (metadata, textual) and associated comments (textual) | 9 YouTube videos and 128,000 comments | Mixed methods | Textual analysis, qualitative content analysis, computational sentiment analysis | Python, YouTube API |
| Salangsang et al. | 2022 | Examining elements of luxury travel in tourism video advertisements from Asian countries during COVID-19 | Tourism promotional videos (metadata, visual, textual) | 122 YouTube videos | Qualitative | Qualitative content analysis | NVivo, Ncapture |
| Sharma | 2023 | Comparing message strategies adopted by celebrities vs social media influencers in brand-related YouTube content | Fashion and food industry videos (metadata, visual, textual, auditory) | 638 YouTube videos | Mixed methods | Qualitative content analysis, chi-square test, ANOVA, Mann–Whitney U test, Kruskal–Wallis H | |
| Tavakoli and Ling | 2022 | Exploring consumer perceptions of virtual food consumption and its sociological implications using online comments about a promotional video | Comments associated with a virtual food promotion video (textual) | 250 comments on Facebook | Qualitative | Thematic analysis | Not specified |
| Tham et al. | 2023 | Examining how Penang, Malaysia, is marketed on TikTok by different stakeholders and how they present the destination’s image | Tourism promotional videos (metadata, visual, textual, auditory) | 30 videos on TikTok | Qualitative | Multimodal analysis, Burke’s Pentadic analysis | Not specified |
| Vujičić et al. | 2021 | Examining the techno-social dimensions of tourist drone videography to understand production practices and creator differences | Drone vacation videos (metadata, visual) | 351 YouTube videos | Mixed methods | Qualitative content analysis, descriptive statistics, chi-square tests, regression analysis | Webometric Analyst, YouTube Statistics, |
| Warton and Brander | 2017 | Assessing the value of the TV show Bondi Rescue for improving tourist beach safety awareness | TV shows (visual, textual) | 98 episodes | Mixed methods | Qualitative content analysis, t-tests, ANOVA | R |
| Wen et al. | 2021 | Exploring travel constraints related to physician-assisted suicide tourism | Documentaries (metadata, textual) and associated comments (textual) | 25 YouTube videos and 1,231 comments | Qualitative | Thematic analysis | Nvivo |
| Yayla et al. | 2024 | Examining food preferences and gastro-tourist typologies of digital nomads | Digital nomads videos (visual, textual) and associated comments (textual) | 21 YouTube videos and 326 comments | Qualitative | Qualitative content analysis | Not specified |
| Yıldırım and Kaya | 2024 | Investigating digital nomads’ impressions and reflections toward intangible cultural heritage ( | Travel videos (visual, textual) and associated comments (textual) | 5 videos on social media and associated comments | Qualitative | Thematic analysis | MAXQDA |
| Yoo, Kim et al. | 2024 | Examining the relationships between discrete emotions expressed by travel influencers and viewer engagement | Travel videos (metadata, textual) | 5,008 YouTube videos | Quantitative | Computational sentiment analysis, regression analysis | Amazon Rekognition API, |
| Yoo, Piscarac et al. | 2024 | Investigating the effectiveness of digital outdoor advertising in redefining urban tourism appeal and city branding using Seoul’s “Wave” campaign | Comments associated with YouTube videos (textual) | 956 comments on YouTube | Mixed methods | Word frequency analysis, centrality analysis, network analysis | Ucinet 6, NetDraw |
| Yu | 2019 | Investigating public perceptions of humanlike robots as employees in the hotel industry | Humanlike robot videos (metadata) and associated comments (textual) | 2 YouTube videos and 1,621 comments | Mixed methods | Cluster analysis, thematic analysis | NVivo, Data Miner |
| Zhang | 2021 | Analyzing public perceptions of service robots in hospitality and tourism amid COVID-19 | Comments associated with news report videos (textual) | 1,852 comments on YouTube | Quantitative | Explorative analysis (word cloud), computational sentiment analysis, | Python (Gensim for topic modeling), SentiStrength |
| Zhu et al. | 2024 | Examining the impacts of content features of pro-environmental tourism videos on viewers’ in-consumption engagement | Pro-environmental videos (metadata, visual, textual, auditory) | 44 videos on Bilibili | Quantitative | Regression analysis | Microsoft Azure AI Video Indexer, Python (Librosa, FFmpeg), R |
3.1 Characteristics of video content analyzed
The review of 37 studies revealed a diverse range of video content in hospitality research. Vlogs (i.e. video blogs) emerged as the most prevalent category, featuring in 10 studies (27%). This category encompassed travel vlogs (e.g. Dewantara et al., 2025; Li et al., 2024; Mesana et al., 2024), food vlogs (Ma et al., 2024, 2025; Rauf and Pasha, 2024) and lifestyle vlogs (Li et al., 2023). Tourism promotional ads and videos constituted the second most common category, appearing in nine studies (24%) (e.g. Avraham, 2020; Barns, 2024; Georgescu Paquin and Cerdan Schwitzguébel, 2021).
Other content types included travel-related videos (e.g. Chen et al., 2024; Motahar et al., 2024; Yoo et al., 2024a), food and fashion promotion videos (Sharma, 2025; Tavakoli and Ling, 2022), recipe videos (Barrett and Feng, 2021), wine promotion videos (Deng et al., 2022), slot machine videos (Hoebanx and French, 2023), drone vacation videos (Vujičić et al., 2022), TV shows (Warton and Brander, 2017), marketing campaign videos (Yoo et al., 2024b), news reports (Zhang, 2021), documentaries (Wen et al., 2023), digital nomad videos (Yayla et al., 2024), videos featuring humanlike robots (Yu, 2020), museum promotional videos (Lau et al., 2024) and pro-environmental videos (Zhu et al., 2025).
In our analysis of data types, textual data were most prevalent, examined in 33 studies (89%), followed by visual data in 23 studies (62%). Metadata was analyzed in 22 studies (59%), while auditory data appeared in seven studies (19%). The analysis of data-type combinations revealed that four studies (11%) examined all four data types, eight studies (22%) analyzed three types, 20 studies (54%) focused on two types and five studies (14%) examined a single data type.
Regarding analytical scope, we identified three distinct approaches: 21 studies (57%) concentrated exclusively on primary video content analysis, four studies (11%) focused solely on user-generated comments and 12 studies (32%) adopted an integrated approach examining both video content and audience responses through comments.
3.2 Sample characteristics
YouTube served as the predominant data source, with 25 out of 37 studies (68%) relying on it exclusively or primarily. Alternative platforms were utilized less frequently: Bilibili, a popular video-sharing platform in China, appeared in four studies (11%), while Douyin/TikTok appeared in three studies (8%). The remaining studies drew from diverse sources: Facebook (Tavakoli and Ling, 2022), television show episodes (Warton and Brander, 2017), travel crowdfunding projects (Chen et al., 2024) and government project videos (Bernal et al., 2024). Additionally, one study did not specify its data source (Yıldırım and Kaya, 2024).
The scale of video samples varied across studies, ranging from a focused examination of two videos (Yu, 2020) to a comprehensive analysis of 5,008 videos (Yoo et al., 2024a). Similarly, studies examining associated comments exhibited diverse sample sizes, spanning from 186 comments (Hoebanx and French, 2023) to an extensive collection of 133,680 comments (Ma et al., 2024).
3.3 Research approaches and analysis methods
Qualitative methodological approaches were most frequently employed in the reviewed studies, appearing in 17 out of 37 studies (46%). Mixed methods represented the second most common approach, utilized in 14 studies (38%), while quantitative methods were applied in six studies (16%).
Qualitative content analysis emerged as the dominant analytical method, appearing in 20 studies (54%). Statistical analysis represented the second most common analytical approach, appearing in 11 studies (30%). These studies (e.g. Sharma, 2025; Vujičić et al., 2022; Zhu et al., 2025) employed various statistical techniques ranging from descriptive statistics to more complex inferential methods, including both parametric tests (e.g. t-tests, ANOVA, chi-square) and nonparametric alternatives (e.g. Mann–Whitney U tests, Kruskal–Wallis H). Thematic analysis, which enables systematic identification and analysis of patterns within qualitative data (Braun and Clarke, 2006; Nowell et al., 2017), ranked as the third most common method, utilized in 10 studies (27%) (e.g. Hoebanx and French, 2023; Nazir, 2023; Wen et al., 2023).
Our review also identified an emerging trend toward computational and machine learning-based analytical techniques. Computational sentiment analysis, which quantifies emotional content in text (Liu, 2012), was applied in five studies (14%) (e.g. Ma et al., 2024, 2025; Rauf and Pasha, 2024), while Latent Dirichlet Allocation topic modeling, a statistical approach for identifying latent thematic structures within text corpora (Blei, 2012), was employed in 3 studies (8%) (Deng et al., 2022; Ma et al., 2025; Zhang, 2021). Additionally, several studies implemented specialized analytical approaches designed for specific research objectives, including acoustic analytics for voice and music characteristics analysis (Barnes, 2024) and spatial analysis using landmark recognition (Li et al., 2024).
3.4 Tools and software
Of the 37 studies reviewed, 27 (73%) explicitly specified the tools or software used in their research methodologies. Python emerged as the most frequently employed programming language, utilized in 10 studies (27%) through various specialized libraries. NVivo, a specialized qualitative data analysis software, was the second most frequently used tool, appearing in six studies (16%). Statistical Package for the Social Sciences (SPSS), a comprehensive statistical software package, ranked third, used in five studies (14%), followed by R, an open-source programming language for statistical computing and graphics, employed in three studies (8%), and Data Miner, a data extraction tool, also employed in three studies (8%).
The review also identified several specialized analytical tools deployed for specific research purposes. These included acoustic analysis software, such as Soundgen (Barnes, 2024), social media analytics tools like FanpageKarma (Huertas et al., 2017) and extraction software including QDA Miner Lite (Mesana et al., 2024) and Webometric Analyst (Vujičić et al., 2022). Other specialized tools included SentiStrength for sentiment analysis (Zhang, 2021), Microsoft Azure AI Video Indexer for scene detection (Zhu et al., 2025), Ucinet 6 and NetDraw for network analysis (Yoo et al., 2024b) and Baidu AI for geospatial recognition (Li et al., 2024).
4. Discussion and conclusions
4.1 Conclusions
This scoping review analyzed methodological approaches in video analysis within hospitality research, examining 37 studies published between 2017 and 2025. Our investigation revealed several distinct patterns and challenges, characterizing this dynamic field.
The distribution of data types analyzed across studies reflected the multifaceted nature of video content, with textual elements being examined most frequently, followed by visual components, metadata and auditory features. When examining the sources of this multimodal content, YouTube dominated as the data source platform, with alternative platforms like Bilibili, TikTok and Facebook having more limited representation. In terms of methodological approaches, the analytical methods showed a preference for qualitative methods and mixed methods, with qualitative content analysis being the most widely applied technique, followed by statistical analysis and thematic analysis. The tools employed across studies revealed the multidisciplinary nature of video analysis, with programming languages like Python and R used alongside specialized software such as Data Miner and NVivo. These patterns are examined in detail below.
4.1.1 Complexity of data types
Textual data appeared most frequently in the reviewed studies, with visual data following closely behind. Metadata analysis appeared in more than half of the studies, while auditory data remained notably underutilized. This distribution highlighted both the multimodal character of video content and specific methodological gaps, particularly the limited integration of auditory elements such as voice tone, music and ambient sounds that could provide valuable contextual information about hospitality experiences (e.g. Kemp et al., 2019; Liu et al., 2024).
The results also demonstrated a clear preference for multidimensional analytical frameworks, with most studies analyzing two or more data types. Only a small percentage of studies limited their analysis to a single data type, suggesting recognition among researchers that comprehensive video analysis requires examination of multiple elements. This methodological choice facilitated a more holistic view of video content, as illustrated by Salangsang et al. (2022), who integrated various elements to provide a richer understanding of travel promotion videos.
4.1.2 Dominance of YouTube as a data source
The prevalence of YouTube as the primary data source across the reviewed studies highlighted its central role in hospitality research. This platform’s prominence can be attributed to its extensive global reach (Statista, 2024) and researcher-friendly accessibility (YouTube, 2025). Additionally, the platform hosts diverse content types, from amateur vlogs to professional marketing videos, allowing researchers to examine various aspects of the hospitality industry through a single data source (Arthurs et al., 2018).
However, this heavy reliance on YouTube presented certain limitations. The platform’s content moderation policies and algorithmic recommendations may create an environment that does not fully represent the broader landscape of hospitality-related videos (Rieder et al., 2018). For example, prior research found YouTube’s algorithmic preference for mainstream professional sources over independent creators in news content (Nechushtai et al., 2024). This pattern suggests that hospitality researchers searching for industry-related news or destination updates may similarly encounter algorithmically curated results dominated by established media outlets, potentially overlooking authentic local perspectives and small business voices that remain buried in search results. Additionally, while YouTube has a diverse user base, it may not capture the full spectrum of demographics or cultural contexts relevant to global hospitality research, potentially leading to underrepresentation of certain perspectives (Arthurs et al., 2018). For instance, YouTube’s inaccessibility in China (Reuters, 2009) significantly reduces content creation from Chinese nationals, who are less likely to share their travel experiences and hospitality perspectives on a platform they cannot easily access, thereby underrepresenting voices from the world’s largest outbound tourism market (UN Tourism, 2025).
The emergence of alternative platforms like Bilibili and TikTok in several reviewed studies (e.g. Li et al., 2024; Zhu et al., 2022) indicated a growing recognition of the need for more diverse data sources. These platforms provide access to different geographical regions, user demographics and content formats, potentially enriching the field’s understanding of video content in hospitality. For instance, TikTok’s short-form videos represent a rapidly growing trend in content consumption with potential implications for hospitality marketing (Tham et al., 2024). Similarly, Bilibili’s popularity in China offers researchers opportunities for conducting cross-cultural comparisons and region-specific trend analysis (Li et al., 2024). However, incorporating these alternative platforms introduces new challenges that researchers must address, such as language barriers, cultural nuances and platform-specific features that may require developing new analytical approaches.
4.1.3 Methodological trends and challenges
Among the 37 studies reviewed, qualitative approaches were most frequently employed. Nearly half of the reviewed studies employed exclusively qualitative methods, while mixed methods studies similarly relied heavily on qualitative frameworks, typically supplementing them with only basic quantitative components such as descriptive statistics. Within these qualitative methodological frameworks, qualitative content analysis emerged as the most frequently employed analytical technique. The field’s concentrated reliance on this specific method, however, appears to have created several methodological limitations. The widespread adoption of qualitative content analysis has occurred without establishing standardized protocols specifically designed for video data analysis, creating potential difficulties for methodological consistency and cross-study comparison (Elo et al., 2014). Additionally, the resource-intensive nature of manual coding may become increasingly problematic as video content volume expands exponentially, raising questions about the scalability and representativeness of current analytical practices. Furthermore, the method’s inherent vulnerability to subjective interpretation remains an ongoing challenge for researchers seeking analytical rigor (Schreier, 2012).
The recent emergence of quantitative-only studies in the field may represent a response to these methodological limitations. This shift toward advanced computational approaches suggests that researchers are beginning to explore alternative frameworks that could address the scalability and standardization challenges inherent in traditional qualitative methods. Developing these computational capabilities could benefit from engaging with computer vision research, which has established sophisticated techniques (e.g. Chen et al., 2018; Redmon et al., 2016) for automated scene recognition and object detection that could be adapted for hospitality video analysis. However, rather than completely abandoning qualitative insights, the field may benefit most from developing integrative approaches that combine computational efficiency with the contextual sensitivity that characterizes hospitality research. Such hybrid methods could leverage machine learning’s capacity for pattern recognition across large datasets while preserving interpretive depth through targeted qualitative analysis (Ma et al., 2024, 2025). For instance, researchers examining food vlogs could use computational sentiment analysis to process thousands of viewer comments identifying overall emotional responses to specific cuisines or dining experiences, while employing qualitative analysis to understand the personal narratives that explain why certain foods evoke particular emotional reactions. Similarly, studies of destination marketing in travel videos could apply computer vision algorithms to automatically identify and categorize visual elements across hundreds of videos, such as natural landscapes, urban scenes or cultural activities, and then use qualitative interpretation to analyze how these visual patterns construct destination identities and influence tourist expectations.
4.1.4 Navigating tool selection in video analysis
The review revealed diverse analytics tools employed in hospitality video analysis, with Python, NVivo and SPSS emerging as the most commonly used. These tools presented researchers with specific advantages and implementation challenges. Python, the most frequently utilized programming language across the reviewed research, offers flexibility through its specialized libraries but necessitates coding skills that often fall outside the expertise of hospitality researchers (Guttag, 2016). NVivo, a qualitative data analysis software used in multiple studies, provides comprehensive functionality but involves licensing fees and requires training for effective use (Dhakal, 2022). SPSS, a comprehensive statistical software package, provides a user-friendly interface but has limitations in handling unstructured video data and involves licensing costs that may constrain researchers with limited budgets (Pallant, 2020).
These technical barriers point to a notable challenge in hospitality video analysis: the gap between available analytical capabilities and researcher expertise. Addressing this gap may require both educational and technological approaches. From an educational perspective, hospitality programs might consider integrating coding and analytics training into their curricula to broaden researchers’ technical skills and facilitate adoption of more sophisticated methodological approaches.
From a technological perspective, advances in analytical tools themselves may help address these barriers. AI-assisted software that integrates natural language processing and computer vision algorithms might automate routine aspects of video coding while preserving opportunities for nuanced human interpretation. Such developments could democratize access to advanced analytical methods by reducing both technical prerequisites and associated costs. Additionally, collaborative, cloud-based platforms would facilitate larger-scale video analysis projects, allowing researchers to pool both computational resources and methodological expertise across institutions.
4.2 Theoretical implications
This review advances methodological knowledge in hospitality research by providing the first systematic documentation of video analysis practices across the field. While previous work examined automatic analytics in tourism (Zhu and Cheng, 2024), our comprehensive mapping reveals a critical pattern: qualitative content analysis remains the dominant analytical technique, while computational and machine learning-based analytical approaches are emerging but remain underutilized. This gap between available capabilities and current practices contributes to understanding methodological adoption patterns in emerging research areas. By systematically documenting patterns in data-type utilization, platform concentration, analytical approaches and tool selection, this review provides an empirical foundation for methodological advancement in hospitality video analysis. Our findings enable researchers to situate their methodological choices within the broader landscape, identify underexplored approaches and make informed decisions about methodological innovation.
4.3 Practical implications
Based on the patterns observed in current methodological practices, we identify three strategic priorities for advancing video analysis in hospitality research: (1) platform diversification represents a critical need, given YouTube’s dominance in 68% of reviewed studies. Researchers should examine alternative platforms like Bilibili and TikTok to determine whether current findings are platform-specific or generalizable across different video environments. For example, food vlog engagement metrics on YouTube’s long-form content may significantly differ from TikTok’s algorithm-driven short videos, potentially affecting how hospitality brands should approach different platforms. (2) Investigating underutilized auditory components could enhance analytical depth, as our analysis found that auditory data appeared in only 19% of studies. Voice tone, background music and ambient sounds could provide rich contextual information about hospitality experiences that visual and textual analysis alone cannot capture. Researchers could integrate these components by analyzing vocal characteristics to assess emotional responses in customer review videos, examining background music choices in destination promotional videos or investigating ambient environmental sounds in travel vlogs to determine authenticity cues. For instance, studies examining restaurant review videos could investigate whether background ambient noise influences perceived authenticity beyond what visual food presentation alone conveys. (3) Hybrid analytical approaches offer solutions to current scalability limitations by integrating automated computational techniques with manual qualitative interpretation on the same dataset. These approaches use computational methods to achieve breadth of coverage across large datasets while preserving qualitative depth for interpretive insights. This methodological integration differs from multimodal analysis, which examines multiple data types (e.g. visual, auditory, textual) within videos; hybrid approaches instead focus on combining different analytical methods to address both scalability and interpretive depth. For example, researchers analyzing travel vlogs could employ automated scene detection and visual recognition to identify patterns in destination imagery across hundreds of videos and then use qualitative narrative analysis on selected examples to examine how vloggers construct destination identities through specific storytelling techniques and visual sequencing. Figure 3 summarizes these current practices, observed methodological patterns and strategic priorities for advancing video analysis in hospitality research.
The structured flowchart titled “Current Methodological Landscape (n equals 37)” presents three vertically connected sections. The top section contains four side-by-side boxes. The first box, “Platforms”, lists YouTube: 68 percent, Bilibili: 11 percent, Douyin or TikTok: 8 percent, and Others: 13 percent. The second box, “Data Analyzed”, lists Textual: 89 percent, Visual: 62 percent, Metadata: 59 percent, and Auditory: 19 percent. The third box, “Methodological Approaches”, lists Qualitative: 46 percent, Mixed methods: 38 percent, and Quantitative: 16 percent. The fourth box, “Analytical Techniques (Most Common)”, lists Qualitative Content Analysis, Statistical Analysis, and Thematic Analysis. A downward arrow leads to the middle section titled “Key Patterns Observed”, which states “Heavy reliance on single platform”, “Auditory data rarely analyzed despite presence in videos”, Manual qualitative coding limits scalability”, and “Minimal adoption of computational techniques”. Another downward arrow leads to the bottom section titled “Recommendations for Future Research”, which states “Diversify data sources beyond YouTube to emerging platforms”, “Integrate underutilized auditory components such as voice tone, music, and ambient sounds”, and “Develop hybrid approaches combining computational efficiency with qualitative depth”.Current methodological landscape and future directions for video analysis in hospitality research. Source(s): Authors’ own work
The structured flowchart titled “Current Methodological Landscape (n equals 37)” presents three vertically connected sections. The top section contains four side-by-side boxes. The first box, “Platforms”, lists YouTube: 68 percent, Bilibili: 11 percent, Douyin or TikTok: 8 percent, and Others: 13 percent. The second box, “Data Analyzed”, lists Textual: 89 percent, Visual: 62 percent, Metadata: 59 percent, and Auditory: 19 percent. The third box, “Methodological Approaches”, lists Qualitative: 46 percent, Mixed methods: 38 percent, and Quantitative: 16 percent. The fourth box, “Analytical Techniques (Most Common)”, lists Qualitative Content Analysis, Statistical Analysis, and Thematic Analysis. A downward arrow leads to the middle section titled “Key Patterns Observed”, which states “Heavy reliance on single platform”, “Auditory data rarely analyzed despite presence in videos”, Manual qualitative coding limits scalability”, and “Minimal adoption of computational techniques”. Another downward arrow leads to the bottom section titled “Recommendations for Future Research”, which states “Diversify data sources beyond YouTube to emerging platforms”, “Integrate underutilized auditory components such as voice tone, music, and ambient sounds”, and “Develop hybrid approaches combining computational efficiency with qualitative depth”.Current methodological landscape and future directions for video analysis in hospitality research. Source(s): Authors’ own work
4.4 Limitations and future research
This scoping review had several limitations that warrant consideration when interpreting findings. First, restricting our search to English-language publications excluded substantial relevant research published in other languages, particularly given the global nature of video content and platform usage. This limitation became especially apparent when considering regions like China, where extensive video analysis research likely existed in Chinese-language publications but remained inaccessible through English-focused academic searches. Future reviews could incorporate multilingual literature searches and foster international collaboration to capture diverse methodological approaches and cultural perspectives.
Second, the rapid evolution of video production and analysis technologies created inherent challenges for academic literature to reflect current industry developments. The time lag between research conduct and publication means that recent advancements in video analytics might not be fully reflected in scholarly literature. Our exclusive focus on peer-reviewed publications further limited access to innovative developments typically reported first in industry publications, conference proceedings and technical white papers (e.g. AXIS Communications, 2021; Pensees Singapore, 2023).
Third, our database selection strategy created potential coverage gaps. While Web of Science and Scopus are widely used in hospitality research (e.g. Gomezelj, 2016; Mariani et al., 2018; Mehraliyev et al., 2021), they may not encompass all relevant publications, particularly those focusing on technological innovation (Gomezelj, 2016). The interdisciplinary character of video content analysis means that relevant studies could appear in computer science, media studies or engineering journals that fall outside traditional hospitality research databases. Future reviews should expand database coverage (e.g. Science Citation Index Expanded, Social Sciences Citation Index) and disciplinary scope to achieve more comprehensive literature identification.
Fourth, we did not calculate inter-rater reliability measures during the screening process. While not required by PRISMA-ScR guidelines (Tricco et al., 2018), such measures would enhance methodological transparency and rigor in future scoping reviews.

