Skip to Main Content
Purpose

This study investigates the applications of computer vision (CV) technology in the tourism sector to predict visitors' facial and emotion detection, augmented reality (AR) visitor engagements, destination crowd management and sustainable tourism practices.

Design/methodology/approach

This study employed a systematic literature review, following the Preferred Reporting Items for Systematic reviews and Meta-Analyses methodology and bibliometric study on research articles related to the tourism sector. In total, 407 articles from the year, 2013 to 2024, all indexed in Scopus, were screened. However, only 150 relevant ones on CV in Tourism were selected based on the following criteria: academic journal publication, English language, empirical evidence provision and publication up to 2024.

Findings

The findings reveal a burgeoning interest in utilizing CV in tourism, highlighting its potential for crowd management and personalized experience. However, ethical concerns surrounding facial recognition and integration challenges need addressing. AR enhances engagement, but ethical and accessibility issues persist. Image processing aids sustainability efforts but requires precision and integration for effectiveness.

Originality/value

The study’s originality lies in its thorough examination of CV’s role in tourism, covering facial recognition, crowd insights, AR and image processing for sustainability. It addresses ethical concerns and proposes advancements for a more responsible and sustainable tourist experience, offering novel insights for industry development.

Numerous nations' economies have benefited from the tourist industry. The industry employed 330 million people in 2019, making it one of the world's most populous occupations, and it added $8.9tn to the global gross domestic product (GDP) or 10.3% of all GDP. In addition, the industry received $948bn in funding. Even while tourism is mostly a service industry, it can nevertheless be considered a product in its own right, with five main components: attractions, accessibility, lodging, amenities and activities.

The ongoing generation of large numbers of digital images, particularly in tourism, has created a new opportunity for marketing stakeholders to extract the information required for understanding consumer behaviour and their experiences with the product (Liu et al., 2020). This stakeholder who oversees picture libraries and design archives in huge image collections has long acknowledged the importance of the intricate digitization process, indexing procedure and the necessity for effective storage and retrieval of photographs (Wang and Wu, 2020). Thus, they, along with academic researchers, have developed many strategies to address issues in picture categorization and retrieval. Alkhawlani et al. (2015) identified three primary image retrieval techniques: text-based image retrieval, semantic-based image retrieval and content-based image retrieval (CBIR). CBIR appears to be the optimal method for managing image data by reducing labour costs, minimizing human errors in image data indexing and retrieval and enhancing the digital image era.

This study conducts a systematic literature review (SLR) on possible adoptions of cutting-edge artificial intelligence (AI) computer vision (CV) technology in the tourism industry to reveal the perceptual and behavioural distinctions between locals and tourists. Further, this research aims to assess the appropriateness of CV and the effectiveness of specific pre-trained models for analysing visual user-generated content relevant to brands based on the context-aware system (CAS) theory and goal question metrics (GQM) approach.

CAS theory (Schilit and Theimer, 1994) holds that systems sense, understand and react to real-time contextual data to improve user experiences and maximize processes. Important elements are context acquisition (sensing data), context modelling (structuring data), context reasoning (inferring high-level information), context adaptation (behaviour modification) and context distribution (sharing data). With issues in guaranteeing data accuracy, integration, privacy and resource management, CAS applications encompass smart tourism, healthcare, smart homes and transportation. The GQM approach is a systematic methodology that ensures measurement activities are in line with business objectives. It achieves this by defining precise goals, formulating targeted questions and identifying appropriate metrics. This approach facilitates continuous improvement in processes, products and services.

Applying the principles of CAS theory – which makes use of real-time data to deliver personalized recommendations, interesting interactive experiences and efficient crowd management – can improve destination tourism. This approach not only raises tourist satisfaction but also promotes sustainable tourism practices by spotting and lowering environmental damage and too high travel volume. As per the CAS theory and GQM approach (Kan et al., 1994), our study aims to review the existing scientific literature to describe how researchers and practitioners in the tourism industry use CV concerning approaches, methods and tools. We formulated the following research inquiries to help us reach our objective:

RQ1.

What sources of computer vision have been used in seeking solutions for tourism?

RQ2.

What is the impact of computer vision on the tourism business, taking into account the changes in important research fields, influential contributors and possible future directions?

CAS can greatly improve visitor experiences in the framework of destination tourism by using real-time data to offer individualized and adaptable services. To provide custom recommendations for attractions, meals and activities, CAS can compile information on visitor preferences, historical behaviours and emotional states. Combining augmented reality (AR) with CAS will provide interactive and immersive narratives at historical sites and museums, therefore enhancing the educational value of the experience. Surveillance cameras and CV algorithms can track visitor movement to find busy locations and maximize visitor flow by guiding them to less crowded areas and reallocating resources as needed in crowd control. With automated reporting alerting authorities to respond quickly, image processing systems can also identify environmental damage or overtourism using drone or camera footage. Destination tourism may provide guests with more individualized, interesting and environmentally friendly experiences by using CAS, therefore improving their satisfaction and maintaining the integrity of the tourist destinations. Thus, the following research question [RQ3] is framed:

RQ3.

Can facial recognition and emotion detection algorithms be implemented in tourist destinations to personalize experiences and tailor recommendations based on real-time visitor emotions?

Face recognition and emotion detection algorithms can also be applied in tourist hotspots. For instance, CAS captures the emotional states of visitors in real time by analysing facial expressions at the point of reaching a specific spot. It can identify delight, surprise or even boredom. Based on these feelings, it personalizes recommendations to enhance the experience of the visitor. For example, if he looks bored, it would offer more exciting events or interactive exhibitions. The dynamic and interactive experience, which could be made possible with digital guides, is on account of their ability to change their tone and suggestions based on the emotional feedback received from visitors. Such a high level of visitor satisfaction and engagement depends on the proper and incessant monitoring and reaction to emotional signals. Using cameras, record the facial expressions and emotions of visitors. The intelligent wristband or smartphone applications will analyse them in real time with machine learning algorithms. Therefore, based on current feelings, this research shows that the proposed system allows suggesting attractions, restaurants or events that are corresponding and appropriate for visitors. This approach not only improves the total tourist experience, raises satisfaction and creates loyalty to a customer but also helps travel firms to highlight weaknesses and to better optimize their products. Therefore, the research question [RQ4] presented is:

RQ4.

How can computer vision techniques be used to analyse tourist movement patterns in real time to improve crowd management and optimize resource allocation in tourist destinations?

Real-time movement patterns of tourists can be analysed using CV methods including object detection and tracking, so greatly enhancing crowd management and resource allocation. Tourist sites can monitor crowd density in various locations using surveillance cameras and CV algorithms, visualizing busy regions using heat maps to control congestion and guarantee safety. Monitoring movement patterns helps the system to find popular paths and congestion, thereby guiding choices on the best layout and flow of foot traffic. Dynamic resource allocation made possible by real-time visitor movement data allows workers to be directed to high-density locations or modify cleaning plans in heavily travelled-through zones. Cameras placed at popular tourist destinations, for instance, record footage of guests that CV algorithms examine to track density, flow and velocity, thereby pointing out regions of great traffic congestion. Through traffic flow optimization, identification of bottlenecks and real-time visitor information on crowd levels, wait times and other routes, the system offers insights into crowd management. By adding staff or facilities at high-demand points, this approach not only reduces the problem of crowd control for destinations but also allows tourism operators to better redistribute their resources. Hence the following research question [RQ5] is as follows:

RQ5.

Can tourists engaged at historical sites and museums have interactive experiences that are more enjoyable with the help of AR technology integrated with computer vision?

By overlaying digital information onto real-world locations, including interactive storytelling, customized guided tours and interesting aspects, AR coupled with CV can dramatically improve the tourist experience at historical sites and museums. AR can, for instance, overlay animations, historical data or reconstructions onto actual objects so enhancing the immersive and instructive value of the experience. As they tour many displays, visitors utilizing AR-enabled devices or smart glasses with CV algorithms can get context-aware guided tours with information catered to their interests. For younger guests especially, interactive AR components – games or quizzes – make learning more interesting. The AR technology monitors visitors' location and orientation as they approach an exhibit and then generates 3D overlays including historical background, interactive games and virtual reenactments. This strategy improves the whole experience, raises visitor participation and generates a more immersive and unforgettable visit. Hence the stated research question [RQ6] is as follows:

RQ6.

How can image processing algorithms of computer vision be used to automatically detect and report environmental damage or overtourism in popular destinations, enabling sustainable tourism practices?

The subsequent sections of this work are structured as follows. In Section 2, we provide the theoretical foundation for our SLR. Section 3 provides a detailed account of our approach, encompassing the objectives, research questions and SLR technique. In Section 4, we then give the findings derived from the primary research that has been scrutinized. In Section 5, we will provide the conclusions, outline the remaining tasks and discuss the potential limitations of our study.

There are several uses for AI systems in the travel sector. AI improves the tourist experience by making it easier to find what they're looking for, making it easier to move about and making better decisions (Samara et al., 2020). AI has the potential to revolutionize several managerial tasks, with an emphasis on improving promotion techniques and boosting productivity. It is believed that AI will influence consumers' societal views in a way that encourages sustainable travel (Tussyadiah, 2020).

AI systems in the tourism sector can function independently or be integrated into current applications and systems. The systems encompass recommender systems, personalization systems, conversational systems (chatbots and voice assistants), forecasting tools, autonomous agents, language translation applications and smart tourism destinations (Nannelli et al., 2023). While we examine each system individually, it should be noted that visitors typically engage with technologies that combine many systems (Gajdošík and Marciš, 2019). A guest can engage with a robot that incorporates a conversational system, along with a recommender system, personalization approach or autonomous agent based on their needs. The interaction with the user can be facilitated through a chatbot or voice assistant (Solakis et al., 2022).

CV is a branch of AI that primarily deals with object description. The theory around the deconstruction and reconstruction of object components from their most basic attributes – such as form, transparency, lightness, colour, size and layers – is crucial to CV. CV is useful for the hotel and tourist industry because it can identify and categorize large numbers of images. One example is robots that scan licence plates while they go about their business (Huang et al., 2022). Another is the ability of search and booking engines to automatically classify uploaded photos of houses as a specific type of room (Milman et al., 2020).

The advancement of computer technology has enabled the utilization of deep learning models and algorithms to analyse large amounts of tourism data, leading to new insights and knowledge (Liu et al., 2020). New technology has sparked a study trend in combining large visual data into studies on tourists' views. Various research based on user-generated photos has already begun regarding tourists' impressions. Prentice (2020) analysed international tourists' location preferences by studying geo-tagged images shared on social media. Vu et al. (2015) showed how a dataset created from geo-tagged images may comprehensively collect and enhance the comprehension of international tourists' actions. The author conducted a pilot study on urban perceptions using a deep learning model for scene recognition (Liu et al., 2016). Zhang et al. (2021) studied tourists' behaviours and impressions at a tourist location by examining the visual content of their images with a scene recognition model. A comparative study was conducted on tourists' perceptions of the place using a deep learning semantic segmentation model by Zhang et al. (2020).

In comparison to the conventional method, the coding and machine training phases of a well-designed and implemented deep learning system need a substantial investment of time and effort. Its strength is in the speed and accuracy, with which it processes standardized categorization tasks by identifying commonalities and differences using machine learning. According to Zhang et al. (2019), a large number of scientists will be pursuing careers in data-intensive fields because of how quickly they are expanding. In the fourth paradigm of research, researchers can evaluate and navigate massive datasets thanks to advanced computational capabilities (Cho et al., 2022). The use of user-generated photos and CV technologies in tourist studies is on the rise. In contrast to developments in CV, tourism research has only made little use of current technologies (Hou and Pan, 2023). Additionally, most tourism research only used a single CV deep learning model. Further application of CV technology in solving tourist problems is highly encouraged (Liu et al., 2021).

We constructed an SLR by adhering to the recommended guidelines for conducting a structured literature review outlined by Rahmadian et al. (2022). Figure 1 depicts our systematic review approach. The subsequent sections outline the procedures of the SLR that we carried out to identify influential research and analyse patterns in the literature.

We found 407 articles in the Scopus database. Table 1 outlines the study selection criteria, detailing their range and level of excellence. We employed the Preferred Reporting Items for Systematic reviews and Meta-Analyses approach (Figure 1) to refine the publications as follows.

  • (1)

    407 articles were identified in the database during the “identification” phase;

  • (2)

    220 publications were eliminated during the “screening” phase because of title and subject area problems;

  • (3)

    37 publications were eliminated during the “eligibility” phase because of problems related to the documents and language and

  • (4)

    Finally, 150 articles were included in the study.

Table 1

Inclusion and exclusion criterion

TopicCriterion
FieldComputer Vision and Tourism
LocationStudies from any geographical areas
StudyOnly Publication in academic journals
TopicPublications need to provide empirical evidence to support the research questions
LanguageWritten in English
DatePublished upto 2024

Source(s): Compiled by authors

The final search string, which the experts also validated, is as follows.

(TITLE-ABS-KEY (“Computer vision”) OR TITLE-ABS-KEY (“Image processing”) AND TITLE-ABS-KEY (“Tourism”)) AND (LIMIT-TO (SUBJAREA, “SOCI”) OR LIMIT-TO (SUBJAREA, “BUSI”)).

* SOCI = Social Science

* BUSI = Business Management and Accounting

* LIMIT-TO = Limited to

Figure 2 shows the yearly publication numbers of articles on tourism research utilizing CV. From the result of Figure 2, two primary conclusions may be readily inferred. Research on the application of CV technology in the field of tourism is now in its initial phase. Conversely, there seems to be a consistent increase in the yearly publication of publications from 2014 to 2024, suggesting a growing interest in this developing topic. This section will answer RQ1 regarding the CV that has been used in solutions for tourism.

Figure 2

Publication details year on year

Figure 2

Publication details year on year

Close modal

A study was conducted to identify emergent themes in the domain, which was then represented in Figure 3. The analysis was performed utilizing 100 author's keywords and a minimum cluster frequency of 5 per thousand documents, resulting in the detection of 9 clusters on the map. The diameter of a circle in a cluster is directly proportional to the number of phrases it represents. The cluster identified as “Image Processing” was the most prominent, with 29 unique phrases. By contrast, the cluster linked to “Remote sensing” consisted of 16 distinct keywords. Similarly, “Social media” encompasses a total of 11 unique terms. The map is characterized by two dimensions: centrality and density. Centrality refers to the degree of connectivity between a specific cluster and other clusters. Conversely, density pertains to the level of connection among the individual words inside the cluster.

Figure 3

Thematic analysis

Figure 3

Thematic analysis

Close modal

An extensive analysis of co-citation has been carried out, utilizing the 248 articles under investigation, to gain an in-depth understanding of the data. Upon inspection, it was found that five separate groups of authors are commonly referenced together. This can be observed in Figure 4. The chart highlights the significant contributions made by author “Li X.,” who has received an impressive 44 citations in the field of research. Similarly, authors Wang Y. and Zhang Y. have made noteworthy contributions in their respective study disciplines, with 40 and 37 citations, respectively. In contrast, authors “Li Y.” and “Li J.” have received a comparatively lower number of citations, with 36 and 32 citations, respectively, in the related publications.

Figure 4

Co-citation with cited author coupling

Figure 4

Co-citation with cited author coupling

Close modal

To have a good hold of the data, a full citation analysis was performed utilizing the 248 papers that were considered. Several documents were found to be referenced during the examination. The result is illustrated in Figure 5. With a whopping 1,735 citations, the work of author “Snavely et al. (2008)” had a tremendous influence, as seen in the accompanying chart. In addition, with 610 citations between them, “Tasci and Gartner (2007)” have made substantial contributions to their respective domains. Just one piece of information: “Lee and Gretzel (2012)” has 183 citations. Similarly, in the same area of study, “Hao q. et al.” received 150 citations, while “Curr r.h.f et al.” received 78 citations.

Figure 5

Citation with documents coupling

Figure 5

Citation with documents coupling

Close modal

The RQ3 thus highlights the click-by-click necessity of complementary research before tourism can broadly derive image processing utility, such as facial recognition. All thrilling personal recommendation possibilities or swift security checks are coupled with ethical concerns regarding data privacy and potential biases that have to sufficiently be catered to. There is, therefore, need for further research into whether image processing technologies can be effective and ethical in different eventualities in tourism from the considerations of both innovation and the individual rights of the tourists.

The potential for recognizing facial action units, emotions regarding speech and even face recognition algorithms on their own can make personalized travel experiences and recommendations a great possibility. Many researchers, including Wang et al. (2021) and Elbroch et al. (2023), have stated that image processing can be used in measuring quantitative factors for the safety of lone travellers in the hospitality industry. It can also be used in detecting human eating habits and further integrated with AR and virtual reality (VR) kits to create tailored experiences.

Similarly, Gupta et al. (2023) explored how this facial recognition technology could be applied in personalizing services, travel itineraries, data-driven service, customer identity and authentication, security and ease of making payments within the travel and tourism industry. It is important to equally point out that there are other possible ethical and privacy concerns attributed to the application of facial recognition technology, which calls for critical considerations before the technology begins to be widely used. According to Flaherty and Piyaphanee (2023), AI applications might go on to further personalize travel and tourism experiences. However, in all the excitement, ethical matters related to data privacy, the potential for data bias, data quality and user control need to be carefully pondered before embracing this technology. The security concerns of the system were enunciated in the investigation of Clark et al. (2020): marrying facial recognition with anti-spoofing technologies shows potential for addressing these concerns, enhancing security and accuracy and, perhaps, in the long run, enhancing trust and using this technology in the areas of security and personalized experience. However the effectiveness of the same can be questioned, and ethical issues related to privacy and bias need to be discussed well. Elbroch et al. (2023) talked about ethical issues like data collection and storage, concentration on the information being used and algorithmic bias related to facial recognition on the assistance of image processing on the visitors. In the same way, Ozkul et al. (2019) showed how personalization should be done to modify the surroundings, give targeted recommendations or enhance the experience based on the visitors' expressions.

More authors noted that emotion detection in marketing and, in general, tourism provides an avenue to give tourists a powerful solution to understand user experience and feelings among travellers. This vital information may aid more effective marketing, improved destination development and eventually happier and more loyal visitors. For example, Yuan and Vui (2023) and Liu et al. (2023a, b) show the potential of the model in the application of facial recognition in the security, personalization and accessibility domains; the main ethical problems, however, include matters of data privacy when it comes to image processing. This may cause potential biases that can be a source of discrimination, and no transparency in the making and the use of that technology make the problem much worse. Responsible adoption should be regulated to serve as safeguard mechanisms to ensure diversified datasets and put into play clear accountability mechanisms to protect personal rights and mitigate risks in its applications. Balancing the potential benefits of a technology with its ethical costs is a key step in the process of deciding whether to spread that technology throughout the travelling industry.

On the other hand, RQ4 explains that tourist destinations can unleash deep insights with the use of “computer vision,” a new technology that analyzes real-time visual data of tourists. CV uses camera networks that capture movement and image recognition that identifies people to give a comprehensive and intricate description of tourist behaviour (Zhou et al., 2023). Analysis of the data employs flow analysis, heatmaps and behaviour tracking, among others, which allow one to extract crowd patterns, hotspots and preferences. Such insight forms the basis for improved crowd management in that it is easier to prevent overcrowding, adjust deployment for resources like security personnel and even give the tourist more bespoke recommendations based on location and movement (Wang et al., 2022). That's to say, though CV is a huge achievement, data privacy issues, assurance of the accuracy of the data under different conditions and integration into already existing systems are challenges that ought to be handled in future developments as brought out by Ghosh et al. (2023). Essentially, CV becomes a tool in the domain of tourism that provides a better comprehension of tourists and puts forward the conditions for a more enjoyable and effective way of travelling.

This section addresses RQ5 – a pertinent question, because tourists who visit museums and historical places could have a more interactive and immersive experience with the implementation of AR and picture recognition. Some researchers present facts that could back up such a claim:

Ramtohul and Khedo (2024) created an AR system through which historical images are overlaid in actual physical settings. This permits tourists to conduct a “journey through time” so that some of the most famous places of interest can be seen just as they originally appeared when first built.

According to Liu et al. (2021), with image recognition, the learning output by visitors is significantly much better when using AR software compared to traditional tours. Tourists were required to use their smartphone cameras to view augmented video footage of landmarks in a field study (Liu et al., 2024). In the end, the technology was rated to be of great appeal and very useful. Jiang et al. (2023), in their paper, presented a prototype of an AR application that serves for the identification of landmarks and the display of relevant data. This enabled enhancement in interactivity and understanding of the site's visitors. Zhang et al. (2024) focused on how people use the AI service to tell apart different plant species using their phones. They came up with the bright idea that even in places with patchy Internet coverage, nature tourists could have an even better time using AR apps. The technology involves image recognition as a means of using AR for an in-depth understanding of sites with major cultural or historical significance. The possibilities are numerous, and this is just an example of that, as research suggests this will offer clear insight into the facts by superimposing historical pictures or data into real-world settings (Varol and Öksüz, 2024). Thus, the original purpose of the artefacts and the ways of that period are more learned while gazing upon the relics of ages past. Historic sites can be enhanced using AR in ways that more traditional approaches could never accomplish. However, AR isn't limited to the classroom; it might potentially be a huge hit with the kids. All ages of visitors can be enthralled and involved through the usage of 3D models, animations and games, which can increase their historical learning experience (Tom Dieck et al., 2018). One major advantage of AR is that it allows for a personalized experience for every single visitor. Users can personalize AR apps to their tastes, thus making more meaningful and unique experiences. To illustrate the point, one may make an AR app that targets history enthusiasts by showcasing the site's history or target lovers of art and architecture by showcasing the historical connections of the site being viewed. To optimize the visit, the visitors could be able to concentrate on those parts of the history that caught their interest the most (Basheer et al., 2023). AR could use this by giving access to cultural heritage by attracting more significant traffic to a site.

Information can be delivered using sign language interpretation or audio descriptions for people who are either blind or deaf, which makes it useful for everybody. Conclusion AR/VR with image recognition is a potent transforming element in the way we view sites of enormous cultural or historical significance. This would already bring history to life and make it an exciting subject instead of boring for anyone. This improves comprehension, enchants people, enables customization of experiences and adds new communities into contact with the resources (Chen et al., 2024). Generally, study results may suggest that AR combined with image identification can be a potent tool to create an engaging, interactive experience. Much more exciting and new experiences are likely to come when AR technology further develops. AR joined with picture recognition is going to revolutionize the way we experience historical locations and museums. Indeed, studies have shown that AR can enhance visitor engagement and comprehension tremendously by overlaying historical information and imagery onto real-world surroundings.

Moreover, some individual interests could be added and accessed for visitors with disabilities with the help of AR apps, augmenting it with features like audio description, sign language interpretation and many others. All in all, it makes history come alive, very powerfully and most engagingly for everyone.

This section defines that RQ6 tourism, being a key driver for economic growth in many countries, is facing an increasing challenge, concerning the balance of optimum benefits between maximizing financial gains on one hand and environmental protection on the other. At the very same time, it constitutes such a strong tool with immense potential for protecting valuable ecosystems, managing the pressure of tourist arrivals and finally enhancing the tourist experience in great measure, thereby laying a platform for long-term and sustainable development in tourism.

It becomes the vigilant guardian in this struggle to save the environment through image processing. Using algorithms, the most popularly read study by Shen et al. (2024) dabbles in underwater pictures in trying to identify distinct indicators for coral bleaching, which is an early sign of possible harm towards the reefs. Similarly, methods of deep learning are applied by Han et al. (2022) for the prediction of jellyfish blooms that could be harmful both for tourists and marine animals. In their contribution, Li et al. (2018) clarified underwater photography for more effective in situ measurement of the extent of impacts on such special ecosystems.

It does not, however, stop there in this particular domain of water. The field of image processing also massively contributes to the management of the burden of tourism, which has become a challenge to the communities suffering from overtourism. Ma et al. (2023) propose a method for estimating the extent of overcrowding at specific locations using face detection in tourist photos to undertake countermeasures against congestion in advance. According to Liu et al. (2023a, b), the development of a geographically aware AR system is foreseen. This will guide tourists and possibly reduce overcrowding in key monuments. On the same concept, Lacárcel et al. (2024) clearly demonstrate the potential for analysing user-generated photos to gain an understanding of the visitors' behaviour for establishing areas prone to excess tourism.

Image processing technology holds huge potential for significantly changing the tourism industry in the pursuit of both preserving the environment and improving the quality of the visitor experience. It also protects the environment from coral bleaching, monitoring jellyfish populations and locations that have the risk of overtourism by studying tourist behaviour in images, as noted by Li et al. (2023). It gives the authorities time to act and come up with plans to reduce the impact on the environment in effective management of resources and visitors and reduce the adverse impacts brought about by mass tourism to protect the ecosystem. Processing images thus improves the experience of visitors through real-time monitoring that is possible via sensor networks and aids in reducing congestion around main tourist places. However, challenges remain. We need to develop algorithms that precisely detect certain cases of environmental degradation and be able to tell them apart from tourist activity. Furthermore, it is necessary to couple image processing with sensor networks for real-time monitoring.

The tourism business is about to get its technology shake-up. Imagine a world where facial recognition technology makes recommendations for health services or changes the environment to suit your mood. CV can analyse crowds in real time. It, therefore, optimizes resource allocation and crowd control. Fans of history can use AR technology that makes historical sites come alive on their mobile devices. Algorithms in image processing could detect environmental damage, and authorities could quickly take proactive measures. Social media analysis allows tourist firms to understand the situation at hand and make necessary changes in their marketing strategies. These technologies give the potential for travel tailored by individual choices, more efficient and productive and environmentally friendly. Of course, the ethical concerns about data privacy and bias within such algorithms are moot. Responsible development is a must to enhance this travel experience for all.

This research takes a critical view of the application of AI CV technology in the tourism sector based on the CAS theory and the GQM approach. Besides discussing potential applications, it provides a critical review of other important aspects like data privacy issues, ethical dilemmas and the need for sophisticated algorithms. The paper assesses the challenges and benefits that CAS components bring to the congruence of GQM with corporate goals and the impact on tourism using these elements, considering the background of real-time data analysis with tailored recommendations. It also assesses ethical questions of facial recognition and emotion detection, the required infrastructure costs for real-time movement analysis and pragmatic challenges for merging AR. Furthermore, what has been underlined in the study is the role of CV in terms of facilitating sustainable tourism through environmental monitoring. After all, this is an issue of robust systems against privacy, ethical and infrastructural challenges. This stresses careful and effective application to help the larger travel business as well as visitors.

The paper highlights how CV applications can enhance the experience of tourists with extremely personalized destination site recommendations, tailored suggestions and increased safety through AR and facial recognition technologies. The study helps to analyse how AI CV technology can engage the tourist with historical landmarks and cultural attractions, enriching their understanding and admiration through the utilization of AR and image recognition. The study also explains how CV and image processing technologies can help manage crowds and reduce congestion, allowing tourists to navigate crowded travel areas more effortlessly. Tourist having increased accessibility to attractions, with personalized experiences and real-time information about their surroundings can boost their travel confidence. This paper discusses how CV can use image processing to manage visitor numbers, monitor environmental effects and predict crowd levels, thereby promoting a more responsible and eco-conscious travel industry. Image processing technologies enhance the tourist experience, making it more enjoyable, easily accessible and environmentally friendly for all.

Destination stakeholders to adopt image processing technology have CV to enhance the trip experience of the tourist optimize tourism operations and promote sustainable travel practices. Tour operators and travel agencies can customize experiences and optimize logistical operations, while tourist boards and governments can monitor the flow of visitors and manage population levels. Attractions and museums can engage guests through the use of AR and picture recognition technologies, while hotels and lodging firms have the potential to enhance guest experiences. Local communities can benefit from implementing sustainable tourism practices and contribute to the development of respectful and culturally sensitive travel experiences. Technology suppliers can enhance their image processing capabilities to meet the demands of businesses. It will be assisted by image processing techniques, which tourism stakeholders can use to build a very sensitive, accessible and pleasing travel industry.

While facial recognition and AR realize personalization and the enrichment of tourism experiences, there are still certain challenges. In-depth deliberation on ethical issues around data privacy and any bias that these algorithms harbour is quite necessary. CV development shall consider aspects of privacy concerns, data quality and smooth integration with existing systems. It is also necessary to improve further image processing algorithms, such that the reason for environmental degradation can be better detected and visitor activities can be well differentiated. This has to go hand in hand with sensor networks and real-time monitoring as a means to achieve sustainable practice. In such cases, the responsible development and ethical use of technologies in tourism must be aligned with the enforcement of stronger personal data protection regulations, transparent data collection procedures and awareness of the privacy rights of tourists. If we use this technology responsibly, it will lead to future generations inheriting a prosperous and sustainable tourism sector that coexists peacefully with the environment.

Data availability: Data underlying conclusions – a database in the public domain of the Scopus collection – is concluded with the key terms “Computer vision and tourism” for data extraction searches.

Competing interests: The authors declare no competing interests.

Ethical approval: This manuscript does not contain any studies with human participants performed by any of the authors.

Informed consent: Since there were no human subjects involved in this review study, no consent was required.

Alkhawlani
,
M.
,
Elmogy
,
M.
and
Elbakry
,
H.
(
2015
), “
Content-based image retrieval using local features descriptors and bag-of-visual words
”,
International Journal of Advanced Computer Science and Applications
, Vol. 
6
No. 
9
, doi: .
Basheer
,
S.
,
Farooq
,
S.
,
Hassan
,
V.
,
Malik
,
Y.M.
and
Reshi
,
M.A.
(
2023
), “
Augmented reality and virtual reality in cultural heritage tourism enhancing visitor experiences
”, pp. 
13
-
34
, doi: .
Chen
,
H.
,
Hou
,
L.
,
Wu
,
S.
,
Zhang
,
G.
,
Zou
,
Y.
,
Moon
,
S.
and
Bhuiyan
,
M.
(
2024
), “
Augmented reality, deep learning and vision-language query system for construction worker safety
”,
Automation in Construction
, Vol. 
157
, 105158, doi: .
Cho
,
N.
,
Kang
,
Y.
,
Yoon
,
J.
,
Park
,
S.
and
Kim
,
J.
(
2022
), “
Classifying tourists' photos and exploring tourism destination image using a deep learning model
”,
Journal of Quality Assurance in Hospitality and Tourism
, Vol. 
23
No. 
6
, pp. 
1480
-
1508
, doi: .
Clark
,
L.
,
Butler
,
K.
,
Ritchie
,
K.L.
and
Maréchal
,
L.
(
2020
), “
The importance of first impression judgements in interspecies interactions
”,
Scientific Reports
, Vol. 
10
No. 
1
, p.
2218
, doi: .
Elbroch
,
L.M.
,
Lagos
,
N.
,
Cárdenas
,
J.
,
Goic
,
D.
,
Moraga
,
R.
and
Ohrens
,
O.
(
2023
), “
Tourism and human computers offer new tools to monitor Patagonia's top carnivore
”,
Science of The Total Environment
, Vol. 
877
, 162916, doi: .
Flaherty
,
G.T.
and
Piyaphanee
,
W.
(
2023
), “
Predicting the natural history of artificial intelligence in travel medicine
”,
Journal of Travel Medicine
, Vol. 
30
No. 
1
, taac113, doi: .
Gajdošík
,
T.
and
Marciš
,
M.
(
2019
), “
Artificial intelligence tools for smart
”,
Tourism Development
, pp. 
392
-
402
, doi: .
Ghosh
,
S.
,
Basu
,
A.
,
Paul
,
S.
,
Chanda
,
B.
and
Das
,
S.
(
2023
), “
On regenerative and discriminative learning from digital heritages: a fractal dimension based approach
”, pp. 
405
-
417
, doi: .
Gupta
,
S.
,
Modgil
,
S.
,
Lee
,
C.-K.
and
Sivarajah
,
U.
(
2023
), “
The future is yesterday: use of AI-driven facial recognition to enhance value in the travel and tourism industry
”,
Information Systems Frontiers
, Vol. 
25
No. 
3
, pp. 
1179
-
1195
, doi: .
Han
,
Y.
,
Chang
,
Q.
,
Ding
,
S.
,
Gao
,
M.
,
Zhang
,
B.
and
Li
,
S.
(
2022
), “
Research on multiple jellyfish classification and detection based on deep learning
”,
Multimedia Tools and Applications
, Vol. 
81
No. 
14
, pp. 
19429
-
19444
, doi: .
Hou
,
L.
and
Pan
,
X.
(
2023
), “
Aesthetics of hotel photos and its impact on consumer engagement: a computer vision approach
”,
Tourism Management
, Vol. 
94
, 104653, doi: .
Huang
,
A.
,
Chao
,
Y.
,
de la Mora Velasco
,
E.
,
Bilgihan
,
A.
and
Wei
,
W.
(
2022
), “
When artificial intelligence meets the hospitality and tourism industry: an assessment framework to inform theory and management
”,
Journal of Hospitality and Tourism Insights
, Vol. 
5
No. 
5
, pp. 
1080
-
1100
, doi: .
Jiang
,
S.
,
Moyle
,
B.
,
Yung
,
R.
,
Tao
,
L.
and
Scott
,
N.
(
2023
), “
Augmented reality and the enhancement of memorable tourism experiences at heritage sites
”,
Current Issues in Tourism
, Vol. 
26
No. 
2
, pp. 
242
-
257
, doi: .
Kan
,
S.H.
,
Basili
,
V.R.
and
Shapiro
,
L.N.
(
1994
), “
Software quality: an overview from the perspective of total quality management
”,
IBM Systems Journal
, Vol. 
33
No. 
1
, pp. 
4
-
19
, doi: .
Lacárcel
,
F.J.S.
,
Huete
,
R.
and
Zerva
,
K.
(
2024
), “
Decoding digital nomad destination decisions through user-generated content
”,
Technological Forecasting and Social Change
, Vol. 
200
, 123098, doi: .
Lee
,
W.
and
Gretzel
,
U.
(
2012
), “
Designing persuasive destination websites: a mental imagery processing perspective
”,
Tourism Management
, Vol. 
33
No. 
5
, pp.
1270
-
1280
.
Li
,
X.
,
Ratti
,
C.
and
Seiferling
,
I.
(
2018
), “
Quantifying the shade provision of street trees in the urban landscape: a case study in Boston, USA, using Google Street View
”,
Landscape and Urban Planning
, Vol. 
169
, pp. 
81
-
91
, doi: .
Li
,
H.
,
Zhang
,
L.
and
Hsu
,
C.H.C.
(
2023
), “
Research on user-generated photos in tourism and hospitality: a systematic review and way forward
”,
Tourism Management
, Vol. 
96
, 104714, doi: .
Liu
,
L.
,
Zhou
,
B.
,
Zhao
,
J.
and
Ryan
,
B.D.
(
2016
), “
C-IMAGE: city cognitive mapping through geo-tagged photos
”,
Geojournal
, Vol. 
81
No. 
6
, pp. 
817
-
861
, doi: .
Liu
,
J.
,
Li
,
T.
,
Xie
,
P.
,
Du
,
S.
,
Teng
,
F.
and
Yang
,
X.
(
2020
), “
Urban big data fusion based on deep learning: an overview
”,
Information Fusion
, Vol. 
53
, pp. 
123
-
133
, doi: .
Liu
,
S.
,
Cao
,
Y.
,
Gao
,
L.
,
Xu
,
J.
and
Zeng
,
W.
(
2021
), “
Realization of mobile augmented reality system based on image recognition
”,
Journal of Information Hiding and Privacy Protection
, Vol. 
3
No. 
2
, pp. 
55
-
59
, doi: .
Liu
,
J.
,
Hu
,
S.
,
Mehraliyev
,
F.
,
Zhou
,
H.
,
Yu
,
Y.
and
Yang
,
L.
(
2023a
), “
Recognizing emotions in restaurant online reviews: a hybrid model integrating deep learning and a sentiment lexicon
”,
International Journal of Contemporary Hospitality Management
, Vol. 
36
No. 
9
, pp. 
2955
-
2976
, doi: .
Liu
,
X.
,
Nicolau
,
J.L.
,
Law
,
R.
and
Li
,
C.
(
2023b
), “
Applying image recognition techniques to visual information mining in hospitality and tourism
”,
International Journal of Contemporary Hospitality Management
, Vol. 
35
No. 
6
, pp. 
2005
-
2016
, doi: .
Liu
,
L.
,
Wang
,
Y.
and
Chi
,
W.
(
2024
), “
Image recognition technology based on machine learning
”,
IEEE Access
, p.
1
, doi: .
Ma
,
S.
,
Li
,
H.
,
Hu
,
M.
,
Yang
,
H.
and
Gan
,
R.
(
2023
), “
Tourism demand forecasting based on user-generated images on OTA platforms
”,
Current Issues in Tourism
, Vol. 
27
No. 
11
, pp. 
1
-
20
, doi: .
Milman
,
A.
,
Tasci
,
A.
and
Zhang
,
T.
(
2020
), “
Perceived robotic server qualities and functions explaining customer loyalty in the theme park context
”,
International Journal of Contemporary Hospitality Management
, Vol. 
32
No. 
12
, pp. 
3895
-
3923
, doi: .
Nannelli
,
M.
,
Capone
,
F.
and
Lazzeretti
,
L.
(
2023
), “
Artificial intelligence in hospitality and tourism. State of the art and future research avenues
”,
European Planning Studies
, Vol. 
31
No. 
7
, pp. 
1325
-
1344
, doi: .
Ozkul
,
E.
,
Boz
,
H.
,
Bilgili
,
B.
and
Koc
,
E.
(
2019
), “
What colour and light do in service atmospherics: a neuro-marketing perspective
”, pp. 
223
-
244
, doi: .
Prentice
,
C.
(
2020
), “
Enhancing the tourist experience with emotional intelligence
”,
Tourism Review
, Vol. 
75
No. 
5
, pp. 
733
-
744
, doi: .
Rahmadian
,
E.
,
Feitosa
,
D.
and
Zwitter
,
A.
(
2022
), “
A systematic literature review on the use of big data for sustainable tourism
”,
Current Issues in Tourism
, Vol. 
25
No. 
11
, pp. 
1711
-
1730
, doi: .
Ramtohul
,
A.
and
Khedo
,
K.K.
(
2024
), “
Augmented reality systems in the cultural heritage domains: a systematic review
”,
Digital Applications in Archaeology and Cultural Heritage
, Vol. 
32
, e00317, doi: .
Samara
,
D.
,
Magnisalis
,
I.
and
Peristeras
,
V.
(
2020
), “
Artificial intelligence and big data in tourism: a systematic literature review
”,
Journal of Hospitality and Tourism Technology
, Vol. 
11
No. 
2
, pp. 
343
-
367
, doi: .
Schilit
,
B.N.
and
Theimer
,
M.M.
(
1994
), “
Disseminating active map information to mobile hosts
”,
IEEE network
, Vol. 
8
No. 
5
, pp.
22
-
32
.
Shen
,
L.
,
Reda
,
M.
,
Zhang
,
X.
,
Zhao
,
Y.
and
Kong
,
S.G.
(
2024
), “
Polarization-driven solution for mitigating scattering and uneven illumination in underwater imagery
”,
IEEE Transactions on Geoscience and Remote Sensing
, Vol. 
62
, pp. 
1
-
15
, doi: .
Snavely
,
N.
,
Seitz
,
S.M.
and
Szeliski
,
R.
(
2008
), “
Modeling the world from internet photo collections
”,
International Journal of Computer Vision
, Vol. 
80
, pp.
189
-
210
.
Solakis
,
K.
,
Katsoni
,
V.
,
Mahmoud
,
A.B.
and
Grigoriou
,
N.
(
2022
), “
Factors affecting value co-creation through artificial intelligence in tourism: a general literature review
”,
Journal of Tourism Futures
, Vol. 
10
No. 
1
, pp. 
116
-
130
, doi: .
Tasci
,
A.D.
and
Gartner
,
W.C.
(
2007
), “
Destination image and its functional relationships
”,
Journal of Travel Research
, Vol. 
45
No. 
4
, pp.
413
-
425
.
tom Dieck
,
M.C.
,
Jung
,
T.H.
and
tom Dieck
,
D.
(
2018
), “
Enhancing art gallery visitors’ learning experience using wearable augmented reality: generic learning outcomes perspective
”,
Current Issues in Tourism
, Vol. 
21
No. 
17
, pp. 
2014
-
2034
, doi: .
Tussyadiah
,
I.
(
2020
), “
A review of research into automation in tourism: launching the annals of tourism research curated collection on artificial intelligence and robotics in tourism
”,
Annals of Tourism Research
, Vol. 
81
, 102883, doi: .
Varol
,
F.
and
Öksüz
,
M.
(
2024
), “
Use of advanced measurement and reality technologies in cultural heritage sites from the perspective of technology and tourism
”,
Current Issues in Tourism
, pp. 
1
-
19
, doi: .
Vu
,
H.Q.
,
Li
,
G.
,
Law
,
R.
and
Ye
,
B.H.
(
2015
), “
Exploring the travel behaviours of inbound tourists to Hong Kong using geotagged photos
”,
Tourism Management
, Vol. 
46
, pp. 
222
-
232
, doi: .
Wang
,
H.-J.
and
Wu
,
C.-H.
(
2020
), “
Research and design for hotel security experience for women traveling alone
”,
IOP Conference Series: Materials Science and Engineering
, Vol. 
825
, 012022, doi: .
Wang
,
J.
,
Li
,
Y.
,
Wu
,
B.
and
Wang
,
Y.
(
2021
), “
Tourism destination image based on tourism user-generated content on the internet
”,
Tourism Review
, Vol. 
76
No. 
1
, pp. 
125
-
137
, doi: .
Wang
,
L.
,
Han
,
X.
,
He
,
J.
and
Jung
,
T.
(
2022
), “
Measuring residents' perceptions of city streets to inform better street planning through deep learning and space syntax
”,
ISPRS Journal of Photogrammetry and Remote Sensing
, Vol. 
190
, pp. 
215
-
230
, doi: .
Yuan
,
F.
and
Vui
,
C.N.
(
2023
), “
The influence of destination image on tourists' behavioural intentions: explore how tourists' perceptions of a destination affect their intentions to visit, revisit, or recommend it to others
”,
Journal of Advanced Zoology
, Vol. 
44
No. 
S6
, pp. 
1391
-
1397
, doi: .
Zhang
,
K.
,
Chen
,
Y.
and
Li
,
C.
(
2019
), “
Discovering the tourists' behaviors and perceptions in a tourism destination by analyzing photos' visual content with a computer deep learning model: the case of Beijing
”,
Tourism Management
, Vol. 
75
, pp. 
595
-
608
, doi: .
Zhang
,
K.
,
Chen
,
Y.
and
Lin
,
Z.
(
2020
), “
Mapping destination images and behavioural patterns from user-generated photos: a computer vision approach
”,
Asia Pacific Journal of Tourism Research
, Vol. 
25
No. 
11
, pp. 
1199
-
1214
, doi: .
Zhang
,
K.
,
Lin
,
Z.
and
Zhang
,
J.
(
2021
), “
Tourist gaze through computer vision: differences between Asian, North American, and European tourists
”,
Annals of Tourism Research
, Vol. 
88
, 103039, doi: .
Zhang
,
K.
,
Wang
,
J.
,
Zhang
,
J.
,
Wang
,
Y.
and
Zeng
,
Y.
(
2024
), “
Exploring the impact of location-based augmented reality on tourists' spatial behaviour, experience, and intention through a field experiment
”,
Tourism Management
, Vol. 
102
, 104886, doi: .
Zhou
,
S.
,
Wu
,
X.
,
Jiang
,
F.
,
Huang
,
Q.
and
Huang
,
C.
(
2023
), “
Emotion recognition from large-scale video clips with cross-attention and hybrid feature weighting neural networks
”,
International Journal of Environmental Research and Public Health
, Vol. 
20
No. 
2
,
1400
, doi: .
Afzal
,
S.
,
Khan
,
I.U.
,
Mehmood
,
I.
and
Lee
,
J.W.
(
2024
), “
Leveraging augmented reality, semantic-segmentation, and VANETs for enhanced driver's safety assistance
”,
Computers, Materials and Continua
, Vol. 
78
No. 
1
, pp. 
1443
-
1460
, doi: .
Amparore
,
D.
,
Sica
,
M.
,
Verri
,
P.
,
Piramide
,
F.
,
Checcucci
,
E.
,
De Cillis
,
S.
,
Piana
,
A.
,
Campobasso
,
D.
,
Burgio
,
M.
,
Cisero
,
E.
,
Busacca
,
G.
,
Di Dio
,
M.
,
Piazzolla
,
P.
,
Fiori
,
C.
and
Porpiglia
,
F.
(
2024
), “
Computer vision and machine-learning techniques for automatic 3D virtual images overlapping during augmented reality guided robotic partial nephrectomy
”,
Technology in Cancer Research and Treatment
, Vol. 
23
, 15330338241229368, doi: .
Eswaran
,
U.
and
Khang
,
A.
(
2024
), “Artificial intelligence (AI)-Aided computer vision (CV) in healthcare system”, in
Computer Vision and AI-Integrated IoT Technologies in the Medical Ecosystem
,
CRC Press
,
Boca Raton
, pp. 
125
-
137
, doi: .
Fernandes
,
L.A.F.
(
2024
), “
Editorial to the special issue ‘vision sensors: image processing technologies and applications
”,
Sensors
, Vol. 
24
No. 
6
, p.
1803
, doi: .
Li
,
H.
,
Lien
,
C.-H.
,
Wang
,
S.W.
,
Wang
,
T.
and
Dong
,
W.
(
2021
), “
Event and city image: the effect on revisit intention
”,
Tourism Review
, Vol. 
76
No. 
1
, pp. 
212
-
228
, doi: .
Saroop Roy
,
B.R.
(
2022
), “Information and communication technology application in tourism events, fairs and festivals in India”, in
Technology Application in Tourism Fairs, Festivals and Events in Asia
,
Springer Singapore
,
Singapore
, pp. 
209
-
220
, doi: .
Uthaisar
,
S.
,
Eves
,
A.
and
Wang
,
X.L.
(
2023
), “
Tourists' online information search behavior: combined user-generated and marketer-generated content in restaurant decision making
”,
Journal of Travel Research
, Vol. 
63
No. 
6
, pp. 
1549
-
1573
, doi: .
Yang
,
Y.
,
Wang
,
Y.
and
Zhao
,
J.
(
2023
), “
Effect of user-generated image on review helpfulness: perspectives from object detection
”,
Electronic Commerce Research and Applications
, Vol. 
57
, 101232, doi: .
Published in Journal of Tourism Futures. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

or Create an Account

Close Modal
Close Modal