This study investigates the applications of computer vision (CV) technology in the tourism sector to predict visitors' facial and emotion detection, augmented reality (AR) visitor engagements, destination crowd management and sustainable tourism practices.
This study employed a systematic literature review, following the Preferred Reporting Items for Systematic reviews and Meta-Analyses methodology and bibliometric study on research articles related to the tourism sector. In total, 407 articles from the year, 2013 to 2024, all indexed in Scopus, were screened. However, only 150 relevant ones on CV in Tourism were selected based on the following criteria: academic journal publication, English language, empirical evidence provision and publication up to 2024.
The findings reveal a burgeoning interest in utilizing CV in tourism, highlighting its potential for crowd management and personalized experience. However, ethical concerns surrounding facial recognition and integration challenges need addressing. AR enhances engagement, but ethical and accessibility issues persist. Image processing aids sustainability efforts but requires precision and integration for effectiveness.
The study’s originality lies in its thorough examination of CV’s role in tourism, covering facial recognition, crowd insights, AR and image processing for sustainability. It addresses ethical concerns and proposes advancements for a more responsible and sustainable tourist experience, offering novel insights for industry development.
1. Introduction
Numerous nations' economies have benefited from the tourist industry. The industry employed 330 million people in 2019, making it one of the world's most populous occupations, and it added $8.9tn to the global gross domestic product (GDP) or 10.3% of all GDP. In addition, the industry received $948bn in funding. Even while tourism is mostly a service industry, it can nevertheless be considered a product in its own right, with five main components: attractions, accessibility, lodging, amenities and activities.
The ongoing generation of large numbers of digital images, particularly in tourism, has created a new opportunity for marketing stakeholders to extract the information required for understanding consumer behaviour and their experiences with the product (Liu et al., 2020). This stakeholder who oversees picture libraries and design archives in huge image collections has long acknowledged the importance of the intricate digitization process, indexing procedure and the necessity for effective storage and retrieval of photographs (Wang and Wu, 2020). Thus, they, along with academic researchers, have developed many strategies to address issues in picture categorization and retrieval. Alkhawlani et al. (2015) identified three primary image retrieval techniques: text-based image retrieval, semantic-based image retrieval and content-based image retrieval (CBIR). CBIR appears to be the optimal method for managing image data by reducing labour costs, minimizing human errors in image data indexing and retrieval and enhancing the digital image era.
This study conducts a systematic literature review (SLR) on possible adoptions of cutting-edge artificial intelligence (AI) computer vision (CV) technology in the tourism industry to reveal the perceptual and behavioural distinctions between locals and tourists. Further, this research aims to assess the appropriateness of CV and the effectiveness of specific pre-trained models for analysing visual user-generated content relevant to brands based on the context-aware system (CAS) theory and goal question metrics (GQM) approach.
CAS theory (Schilit and Theimer, 1994) holds that systems sense, understand and react to real-time contextual data to improve user experiences and maximize processes. Important elements are context acquisition (sensing data), context modelling (structuring data), context reasoning (inferring high-level information), context adaptation (behaviour modification) and context distribution (sharing data). With issues in guaranteeing data accuracy, integration, privacy and resource management, CAS applications encompass smart tourism, healthcare, smart homes and transportation. The GQM approach is a systematic methodology that ensures measurement activities are in line with business objectives. It achieves this by defining precise goals, formulating targeted questions and identifying appropriate metrics. This approach facilitates continuous improvement in processes, products and services.
Applying the principles of CAS theory – which makes use of real-time data to deliver personalized recommendations, interesting interactive experiences and efficient crowd management – can improve destination tourism. This approach not only raises tourist satisfaction but also promotes sustainable tourism practices by spotting and lowering environmental damage and too high travel volume. As per the CAS theory and GQM approach (Kan et al., 1994), our study aims to review the existing scientific literature to describe how researchers and practitioners in the tourism industry use CV concerning approaches, methods and tools. We formulated the following research inquiries to help us reach our objective:
What sources of computer vision have been used in seeking solutions for tourism?
What is the impact of computer vision on the tourism business, taking into account the changes in important research fields, influential contributors and possible future directions?
CAS can greatly improve visitor experiences in the framework of destination tourism by using real-time data to offer individualized and adaptable services. To provide custom recommendations for attractions, meals and activities, CAS can compile information on visitor preferences, historical behaviours and emotional states. Combining augmented reality (AR) with CAS will provide interactive and immersive narratives at historical sites and museums, therefore enhancing the educational value of the experience. Surveillance cameras and CV algorithms can track visitor movement to find busy locations and maximize visitor flow by guiding them to less crowded areas and reallocating resources as needed in crowd control. With automated reporting alerting authorities to respond quickly, image processing systems can also identify environmental damage or overtourism using drone or camera footage. Destination tourism may provide guests with more individualized, interesting and environmentally friendly experiences by using CAS, therefore improving their satisfaction and maintaining the integrity of the tourist destinations. Thus, the following research question [RQ3] is framed:
Can facial recognition and emotion detection algorithms be implemented in tourist destinations to personalize experiences and tailor recommendations based on real-time visitor emotions?
Face recognition and emotion detection algorithms can also be applied in tourist hotspots. For instance, CAS captures the emotional states of visitors in real time by analysing facial expressions at the point of reaching a specific spot. It can identify delight, surprise or even boredom. Based on these feelings, it personalizes recommendations to enhance the experience of the visitor. For example, if he looks bored, it would offer more exciting events or interactive exhibitions. The dynamic and interactive experience, which could be made possible with digital guides, is on account of their ability to change their tone and suggestions based on the emotional feedback received from visitors. Such a high level of visitor satisfaction and engagement depends on the proper and incessant monitoring and reaction to emotional signals. Using cameras, record the facial expressions and emotions of visitors. The intelligent wristband or smartphone applications will analyse them in real time with machine learning algorithms. Therefore, based on current feelings, this research shows that the proposed system allows suggesting attractions, restaurants or events that are corresponding and appropriate for visitors. This approach not only improves the total tourist experience, raises satisfaction and creates loyalty to a customer but also helps travel firms to highlight weaknesses and to better optimize their products. Therefore, the research question [RQ4] presented is:
How can computer vision techniques be used to analyse tourist movement patterns in real time to improve crowd management and optimize resource allocation in tourist destinations?
Real-time movement patterns of tourists can be analysed using CV methods including object detection and tracking, so greatly enhancing crowd management and resource allocation. Tourist sites can monitor crowd density in various locations using surveillance cameras and CV algorithms, visualizing busy regions using heat maps to control congestion and guarantee safety. Monitoring movement patterns helps the system to find popular paths and congestion, thereby guiding choices on the best layout and flow of foot traffic. Dynamic resource allocation made possible by real-time visitor movement data allows workers to be directed to high-density locations or modify cleaning plans in heavily travelled-through zones. Cameras placed at popular tourist destinations, for instance, record footage of guests that CV algorithms examine to track density, flow and velocity, thereby pointing out regions of great traffic congestion. Through traffic flow optimization, identification of bottlenecks and real-time visitor information on crowd levels, wait times and other routes, the system offers insights into crowd management. By adding staff or facilities at high-demand points, this approach not only reduces the problem of crowd control for destinations but also allows tourism operators to better redistribute their resources. Hence the following research question [RQ5] is as follows:
Can tourists engaged at historical sites and museums have interactive experiences that are more enjoyable with the help of AR technology integrated with computer vision?
By overlaying digital information onto real-world locations, including interactive storytelling, customized guided tours and interesting aspects, AR coupled with CV can dramatically improve the tourist experience at historical sites and museums. AR can, for instance, overlay animations, historical data or reconstructions onto actual objects so enhancing the immersive and instructive value of the experience. As they tour many displays, visitors utilizing AR-enabled devices or smart glasses with CV algorithms can get context-aware guided tours with information catered to their interests. For younger guests especially, interactive AR components – games or quizzes – make learning more interesting. The AR technology monitors visitors' location and orientation as they approach an exhibit and then generates 3D overlays including historical background, interactive games and virtual reenactments. This strategy improves the whole experience, raises visitor participation and generates a more immersive and unforgettable visit. Hence the stated research question [RQ6] is as follows:
How can image processing algorithms of computer vision be used to automatically detect and report environmental damage or overtourism in popular destinations, enabling sustainable tourism practices?
The subsequent sections of this work are structured as follows. In Section 2, we provide the theoretical foundation for our SLR. Section 3 provides a detailed account of our approach, encompassing the objectives, research questions and SLR technique. In Section 4, we then give the findings derived from the primary research that has been scrutinized. In Section 5, we will provide the conclusions, outline the remaining tasks and discuss the potential limitations of our study.
2. Theoretical background
2.1 Artificial intelligence in tourism
There are several uses for AI systems in the travel sector. AI improves the tourist experience by making it easier to find what they're looking for, making it easier to move about and making better decisions (Samara et al., 2020). AI has the potential to revolutionize several managerial tasks, with an emphasis on improving promotion techniques and boosting productivity. It is believed that AI will influence consumers' societal views in a way that encourages sustainable travel (Tussyadiah, 2020).
AI systems in the tourism sector can function independently or be integrated into current applications and systems. The systems encompass recommender systems, personalization systems, conversational systems (chatbots and voice assistants), forecasting tools, autonomous agents, language translation applications and smart tourism destinations (Nannelli et al., 2023). While we examine each system individually, it should be noted that visitors typically engage with technologies that combine many systems (Gajdošík and Marciš, 2019). A guest can engage with a robot that incorporates a conversational system, along with a recommender system, personalization approach or autonomous agent based on their needs. The interaction with the user can be facilitated through a chatbot or voice assistant (Solakis et al., 2022).
2.2 Computer vision in tourism
CV is a branch of AI that primarily deals with object description. The theory around the deconstruction and reconstruction of object components from their most basic attributes – such as form, transparency, lightness, colour, size and layers – is crucial to CV. CV is useful for the hotel and tourist industry because it can identify and categorize large numbers of images. One example is robots that scan licence plates while they go about their business (Huang et al., 2022). Another is the ability of search and booking engines to automatically classify uploaded photos of houses as a specific type of room (Milman et al., 2020).
The advancement of computer technology has enabled the utilization of deep learning models and algorithms to analyse large amounts of tourism data, leading to new insights and knowledge (Liu et al., 2020). New technology has sparked a study trend in combining large visual data into studies on tourists' views. Various research based on user-generated photos has already begun regarding tourists' impressions. Prentice (2020) analysed international tourists' location preferences by studying geo-tagged images shared on social media. Vu et al. (2015) showed how a dataset created from geo-tagged images may comprehensively collect and enhance the comprehension of international tourists' actions. The author conducted a pilot study on urban perceptions using a deep learning model for scene recognition (Liu et al., 2016). Zhang et al. (2021) studied tourists' behaviours and impressions at a tourist location by examining the visual content of their images with a scene recognition model. A comparative study was conducted on tourists' perceptions of the place using a deep learning semantic segmentation model by Zhang et al. (2020).
In comparison to the conventional method, the coding and machine training phases of a well-designed and implemented deep learning system need a substantial investment of time and effort. Its strength is in the speed and accuracy, with which it processes standardized categorization tasks by identifying commonalities and differences using machine learning. According to Zhang et al. (2019), a large number of scientists will be pursuing careers in data-intensive fields because of how quickly they are expanding. In the fourth paradigm of research, researchers can evaluate and navigate massive datasets thanks to advanced computational capabilities (Cho et al., 2022). The use of user-generated photos and CV technologies in tourist studies is on the rise. In contrast to developments in CV, tourism research has only made little use of current technologies (Hou and Pan, 2023). Additionally, most tourism research only used a single CV deep learning model. Further application of CV technology in solving tourist problems is highly encouraged (Liu et al., 2021).
3. Review methodology
We constructed an SLR by adhering to the recommended guidelines for conducting a structured literature review outlined by Rahmadian et al. (2022). Figure 1 depicts our systematic review approach. The subsequent sections outline the procedures of the SLR that we carried out to identify influential research and analyse patterns in the literature.
We found 407 articles in the Scopus database. Table 1 outlines the study selection criteria, detailing their range and level of excellence. We employed the Preferred Reporting Items for Systematic reviews and Meta-Analyses approach (Figure 1) to refine the publications as follows.
- (1)
407 articles were identified in the database during the “identification” phase;
- (2)
220 publications were eliminated during the “screening” phase because of title and subject area problems;
- (3)
37 publications were eliminated during the “eligibility” phase because of problems related to the documents and language and
- (4)
Finally, 150 articles were included in the study.
Inclusion and exclusion criterion
| Topic | Criterion |
|---|---|
| Field | Computer Vision and Tourism |
| Location | Studies from any geographical areas |
| Study | Only Publication in academic journals |
| Topic | Publications need to provide empirical evidence to support the research questions |
| Language | Written in English |
| Date | Published upto 2024 |
| Topic | Criterion |
|---|---|
| Field | Computer Vision and Tourism |
| Location | Studies from any geographical areas |
| Study | Only Publication in academic journals |
| Topic | Publications need to provide empirical evidence to support the research questions |
| Language | Written in English |
| Date | Published upto 2024 |
Source(s): Compiled by authors
The final search string, which the experts also validated, is as follows.
(TITLE-ABS-KEY (“Computer vision”) OR TITLE-ABS-KEY (“Image processing”) AND TITLE-ABS-KEY (“Tourism”)) AND (LIMIT-TO (SUBJAREA, “SOCI”) OR LIMIT-TO (SUBJAREA, “BUSI”)).
* SOCI = Social Science
* BUSI = Business Management and Accounting
* LIMIT-TO = Limited to
3.1 Inclusion and exclusion criterion
4. Results and discussion
4.1 Publication growth of computer vision in tourism
Figure 2 shows the yearly publication numbers of articles on tourism research utilizing CV. From the result of Figure 2, two primary conclusions may be readily inferred. Research on the application of CV technology in the field of tourism is now in its initial phase. Conversely, there seems to be a consistent increase in the yearly publication of publications from 2014 to 2024, suggesting a growing interest in this developing topic. This section will answer RQ1 regarding the CV that has been used in solutions for tourism.
4.2 Unveiling the tourist gaze: a computer vision analysis of research trends, key players and the future landscape
A study was conducted to identify emergent themes in the domain, which was then represented in Figure 3. The analysis was performed utilizing 100 author's keywords and a minimum cluster frequency of 5 per thousand documents, resulting in the detection of 9 clusters on the map. The diameter of a circle in a cluster is directly proportional to the number of phrases it represents. The cluster identified as “Image Processing” was the most prominent, with 29 unique phrases. By contrast, the cluster linked to “Remote sensing” consisted of 16 distinct keywords. Similarly, “Social media” encompasses a total of 11 unique terms. The map is characterized by two dimensions: centrality and density. Centrality refers to the degree of connectivity between a specific cluster and other clusters. Conversely, density pertains to the level of connection among the individual words inside the cluster.
An extensive analysis of co-citation has been carried out, utilizing the 248 articles under investigation, to gain an in-depth understanding of the data. Upon inspection, it was found that five separate groups of authors are commonly referenced together. This can be observed in Figure 4. The chart highlights the significant contributions made by author “Li X.,” who has received an impressive 44 citations in the field of research. Similarly, authors Wang Y. and Zhang Y. have made noteworthy contributions in their respective study disciplines, with 40 and 37 citations, respectively. In contrast, authors “Li Y.” and “Li J.” have received a comparatively lower number of citations, with 36 and 32 citations, respectively, in the related publications.
To have a good hold of the data, a full citation analysis was performed utilizing the 248 papers that were considered. Several documents were found to be referenced during the examination. The result is illustrated in Figure 5. With a whopping 1,735 citations, the work of author “Snavely et al. (2008)” had a tremendous influence, as seen in the accompanying chart. In addition, with 610 citations between them, “Tasci and Gartner (2007)” have made substantial contributions to their respective domains. Just one piece of information: “Lee and Gretzel (2012)” has 183 citations. Similarly, in the same area of study, “Hao q. et al.” received 150 citations, while “Curr r.h.f et al.” received 78 citations.
4.3 Facial recognition in tourists' experience
The RQ3 thus highlights the click-by-click necessity of complementary research before tourism can broadly derive image processing utility, such as facial recognition. All thrilling personal recommendation possibilities or swift security checks are coupled with ethical concerns regarding data privacy and potential biases that have to sufficiently be catered to. There is, therefore, need for further research into whether image processing technologies can be effective and ethical in different eventualities in tourism from the considerations of both innovation and the individual rights of the tourists.
The potential for recognizing facial action units, emotions regarding speech and even face recognition algorithms on their own can make personalized travel experiences and recommendations a great possibility. Many researchers, including Wang et al. (2021) and Elbroch et al. (2023), have stated that image processing can be used in measuring quantitative factors for the safety of lone travellers in the hospitality industry. It can also be used in detecting human eating habits and further integrated with AR and virtual reality (VR) kits to create tailored experiences.
Similarly, Gupta et al. (2023) explored how this facial recognition technology could be applied in personalizing services, travel itineraries, data-driven service, customer identity and authentication, security and ease of making payments within the travel and tourism industry. It is important to equally point out that there are other possible ethical and privacy concerns attributed to the application of facial recognition technology, which calls for critical considerations before the technology begins to be widely used. According to Flaherty and Piyaphanee (2023), AI applications might go on to further personalize travel and tourism experiences. However, in all the excitement, ethical matters related to data privacy, the potential for data bias, data quality and user control need to be carefully pondered before embracing this technology. The security concerns of the system were enunciated in the investigation of Clark et al. (2020): marrying facial recognition with anti-spoofing technologies shows potential for addressing these concerns, enhancing security and accuracy and, perhaps, in the long run, enhancing trust and using this technology in the areas of security and personalized experience. However the effectiveness of the same can be questioned, and ethical issues related to privacy and bias need to be discussed well. Elbroch et al. (2023) talked about ethical issues like data collection and storage, concentration on the information being used and algorithmic bias related to facial recognition on the assistance of image processing on the visitors. In the same way, Ozkul et al. (2019) showed how personalization should be done to modify the surroundings, give targeted recommendations or enhance the experience based on the visitors' expressions.
More authors noted that emotion detection in marketing and, in general, tourism provides an avenue to give tourists a powerful solution to understand user experience and feelings among travellers. This vital information may aid more effective marketing, improved destination development and eventually happier and more loyal visitors. For example, Yuan and Vui (2023) and Liu et al. (2023a, b) show the potential of the model in the application of facial recognition in the security, personalization and accessibility domains; the main ethical problems, however, include matters of data privacy when it comes to image processing. This may cause potential biases that can be a source of discrimination, and no transparency in the making and the use of that technology make the problem much worse. Responsible adoption should be regulated to serve as safeguard mechanisms to ensure diversified datasets and put into play clear accountability mechanisms to protect personal rights and mitigate risks in its applications. Balancing the potential benefits of a technology with its ethical costs is a key step in the process of deciding whether to spread that technology throughout the travelling industry.
4.4 Smarter tourism: real-time crowd insights
On the other hand, RQ4 explains that tourist destinations can unleash deep insights with the use of “computer vision,” a new technology that analyzes real-time visual data of tourists. CV uses camera networks that capture movement and image recognition that identifies people to give a comprehensive and intricate description of tourist behaviour (Zhou et al., 2023). Analysis of the data employs flow analysis, heatmaps and behaviour tracking, among others, which allow one to extract crowd patterns, hotspots and preferences. Such insight forms the basis for improved crowd management in that it is easier to prevent overcrowding, adjust deployment for resources like security personnel and even give the tourist more bespoke recommendations based on location and movement (Wang et al., 2022). That's to say, though CV is a huge achievement, data privacy issues, assurance of the accuracy of the data under different conditions and integration into already existing systems are challenges that ought to be handled in future developments as brought out by Ghosh et al. (2023). Essentially, CV becomes a tool in the domain of tourism that provides a better comprehension of tourists and puts forward the conditions for a more enjoyable and effective way of travelling.
4.5 Augmented reality and image recognition tourists experiences
This section addresses RQ5 – a pertinent question, because tourists who visit museums and historical places could have a more interactive and immersive experience with the implementation of AR and picture recognition. Some researchers present facts that could back up such a claim:
Ramtohul and Khedo (2024) created an AR system through which historical images are overlaid in actual physical settings. This permits tourists to conduct a “journey through time” so that some of the most famous places of interest can be seen just as they originally appeared when first built.
According to Liu et al. (2021), with image recognition, the learning output by visitors is significantly much better when using AR software compared to traditional tours. Tourists were required to use their smartphone cameras to view augmented video footage of landmarks in a field study (Liu et al., 2024). In the end, the technology was rated to be of great appeal and very useful. Jiang et al. (2023), in their paper, presented a prototype of an AR application that serves for the identification of landmarks and the display of relevant data. This enabled enhancement in interactivity and understanding of the site's visitors. Zhang et al. (2024) focused on how people use the AI service to tell apart different plant species using their phones. They came up with the bright idea that even in places with patchy Internet coverage, nature tourists could have an even better time using AR apps. The technology involves image recognition as a means of using AR for an in-depth understanding of sites with major cultural or historical significance. The possibilities are numerous, and this is just an example of that, as research suggests this will offer clear insight into the facts by superimposing historical pictures or data into real-world settings (Varol and Öksüz, 2024). Thus, the original purpose of the artefacts and the ways of that period are more learned while gazing upon the relics of ages past. Historic sites can be enhanced using AR in ways that more traditional approaches could never accomplish. However, AR isn't limited to the classroom; it might potentially be a huge hit with the kids. All ages of visitors can be enthralled and involved through the usage of 3D models, animations and games, which can increase their historical learning experience (Tom Dieck et al., 2018). One major advantage of AR is that it allows for a personalized experience for every single visitor. Users can personalize AR apps to their tastes, thus making more meaningful and unique experiences. To illustrate the point, one may make an AR app that targets history enthusiasts by showcasing the site's history or target lovers of art and architecture by showcasing the historical connections of the site being viewed. To optimize the visit, the visitors could be able to concentrate on those parts of the history that caught their interest the most (Basheer et al., 2023). AR could use this by giving access to cultural heritage by attracting more significant traffic to a site.
Information can be delivered using sign language interpretation or audio descriptions for people who are either blind or deaf, which makes it useful for everybody. Conclusion AR/VR with image recognition is a potent transforming element in the way we view sites of enormous cultural or historical significance. This would already bring history to life and make it an exciting subject instead of boring for anyone. This improves comprehension, enchants people, enables customization of experiences and adds new communities into contact with the resources (Chen et al., 2024). Generally, study results may suggest that AR combined with image identification can be a potent tool to create an engaging, interactive experience. Much more exciting and new experiences are likely to come when AR technology further develops. AR joined with picture recognition is going to revolutionize the way we experience historical locations and museums. Indeed, studies have shown that AR can enhance visitor engagement and comprehension tremendously by overlaying historical information and imagery onto real-world surroundings.
Moreover, some individual interests could be added and accessed for visitors with disabilities with the help of AR apps, augmenting it with features like audio description, sign language interpretation and many others. All in all, it makes history come alive, very powerfully and most engagingly for everyone.
4.6 Image processing in track sustainability
This section defines that RQ6 tourism, being a key driver for economic growth in many countries, is facing an increasing challenge, concerning the balance of optimum benefits between maximizing financial gains on one hand and environmental protection on the other. At the very same time, it constitutes such a strong tool with immense potential for protecting valuable ecosystems, managing the pressure of tourist arrivals and finally enhancing the tourist experience in great measure, thereby laying a platform for long-term and sustainable development in tourism.
It becomes the vigilant guardian in this struggle to save the environment through image processing. Using algorithms, the most popularly read study by Shen et al. (2024) dabbles in underwater pictures in trying to identify distinct indicators for coral bleaching, which is an early sign of possible harm towards the reefs. Similarly, methods of deep learning are applied by Han et al. (2022) for the prediction of jellyfish blooms that could be harmful both for tourists and marine animals. In their contribution, Li et al. (2018) clarified underwater photography for more effective in situ measurement of the extent of impacts on such special ecosystems.
It does not, however, stop there in this particular domain of water. The field of image processing also massively contributes to the management of the burden of tourism, which has become a challenge to the communities suffering from overtourism. Ma et al. (2023) propose a method for estimating the extent of overcrowding at specific locations using face detection in tourist photos to undertake countermeasures against congestion in advance. According to Liu et al. (2023a, b), the development of a geographically aware AR system is foreseen. This will guide tourists and possibly reduce overcrowding in key monuments. On the same concept, Lacárcel et al. (2024) clearly demonstrate the potential for analysing user-generated photos to gain an understanding of the visitors' behaviour for establishing areas prone to excess tourism.
Image processing technology holds huge potential for significantly changing the tourism industry in the pursuit of both preserving the environment and improving the quality of the visitor experience. It also protects the environment from coral bleaching, monitoring jellyfish populations and locations that have the risk of overtourism by studying tourist behaviour in images, as noted by Li et al. (2023). It gives the authorities time to act and come up with plans to reduce the impact on the environment in effective management of resources and visitors and reduce the adverse impacts brought about by mass tourism to protect the ecosystem. Processing images thus improves the experience of visitors through real-time monitoring that is possible via sensor networks and aids in reducing congestion around main tourist places. However, challenges remain. We need to develop algorithms that precisely detect certain cases of environmental degradation and be able to tell them apart from tourist activity. Furthermore, it is necessary to couple image processing with sensor networks for real-time monitoring.
5. Conclusion
The tourism business is about to get its technology shake-up. Imagine a world where facial recognition technology makes recommendations for health services or changes the environment to suit your mood. CV can analyse crowds in real time. It, therefore, optimizes resource allocation and crowd control. Fans of history can use AR technology that makes historical sites come alive on their mobile devices. Algorithms in image processing could detect environmental damage, and authorities could quickly take proactive measures. Social media analysis allows tourist firms to understand the situation at hand and make necessary changes in their marketing strategies. These technologies give the potential for travel tailored by individual choices, more efficient and productive and environmentally friendly. Of course, the ethical concerns about data privacy and bias within such algorithms are moot. Responsible development is a must to enhance this travel experience for all.
This research takes a critical view of the application of AI CV technology in the tourism sector based on the CAS theory and the GQM approach. Besides discussing potential applications, it provides a critical review of other important aspects like data privacy issues, ethical dilemmas and the need for sophisticated algorithms. The paper assesses the challenges and benefits that CAS components bring to the congruence of GQM with corporate goals and the impact on tourism using these elements, considering the background of real-time data analysis with tailored recommendations. It also assesses ethical questions of facial recognition and emotion detection, the required infrastructure costs for real-time movement analysis and pragmatic challenges for merging AR. Furthermore, what has been underlined in the study is the role of CV in terms of facilitating sustainable tourism through environmental monitoring. After all, this is an issue of robust systems against privacy, ethical and infrastructural challenges. This stresses careful and effective application to help the larger travel business as well as visitors.
6. Implications
The paper highlights how CV applications can enhance the experience of tourists with extremely personalized destination site recommendations, tailored suggestions and increased safety through AR and facial recognition technologies. The study helps to analyse how AI CV technology can engage the tourist with historical landmarks and cultural attractions, enriching their understanding and admiration through the utilization of AR and image recognition. The study also explains how CV and image processing technologies can help manage crowds and reduce congestion, allowing tourists to navigate crowded travel areas more effortlessly. Tourist having increased accessibility to attractions, with personalized experiences and real-time information about their surroundings can boost their travel confidence. This paper discusses how CV can use image processing to manage visitor numbers, monitor environmental effects and predict crowd levels, thereby promoting a more responsible and eco-conscious travel industry. Image processing technologies enhance the tourist experience, making it more enjoyable, easily accessible and environmentally friendly for all.
Destination stakeholders to adopt image processing technology have CV to enhance the trip experience of the tourist optimize tourism operations and promote sustainable travel practices. Tour operators and travel agencies can customize experiences and optimize logistical operations, while tourist boards and governments can monitor the flow of visitors and manage population levels. Attractions and museums can engage guests through the use of AR and picture recognition technologies, while hotels and lodging firms have the potential to enhance guest experiences. Local communities can benefit from implementing sustainable tourism practices and contribute to the development of respectful and culturally sensitive travel experiences. Technology suppliers can enhance their image processing capabilities to meet the demands of businesses. It will be assisted by image processing techniques, which tourism stakeholders can use to build a very sensitive, accessible and pleasing travel industry.
7. Future research direction
While facial recognition and AR realize personalization and the enrichment of tourism experiences, there are still certain challenges. In-depth deliberation on ethical issues around data privacy and any bias that these algorithms harbour is quite necessary. CV development shall consider aspects of privacy concerns, data quality and smooth integration with existing systems. It is also necessary to improve further image processing algorithms, such that the reason for environmental degradation can be better detected and visitor activities can be well differentiated. This has to go hand in hand with sensor networks and real-time monitoring as a means to achieve sustainable practice. In such cases, the responsible development and ethical use of technologies in tourism must be aligned with the enforcement of stronger personal data protection regulations, transparent data collection procedures and awareness of the privacy rights of tourists. If we use this technology responsibly, it will lead to future generations inheriting a prosperous and sustainable tourism sector that coexists peacefully with the environment.
Data availability: Data underlying conclusions – a database in the public domain of the Scopus collection – is concluded with the key terms “Computer vision and tourism” for data extraction searches.
Competing interests: The authors declare no competing interests.
Ethical approval: This manuscript does not contain any studies with human participants performed by any of the authors.
Informed consent: Since there were no human subjects involved in this review study, no consent was required.





