A review of machine learning approaches for improving calibration and performance of low-cost air quality sensor networks

Yan, Chaohua

doi:10.1108/SR-10-2025-0761

Review Article | February 05 2026

Chaohua Yan

School of Digital Media and Art Design, Nanyang Institute of Technology, Nanyang, China

Corresponding author Chaohua Yan yanchaohua568@163.com

Publisher: Emerald Publishing

Received: October 02 2025

Revision Received: December 20 2025

Revision Received: January 02 2026

Accepted: January 10 2026

Online ISSN: 1758-6828

Print ISSN: 0260-2288

Funding

Funding Group:

Award Group:
- Funder(s):
  Nanyang Institute of Technology Scientific Research Platforms; Virtual Digital Human Technology Research Center; Nanyang City Key Laboratory of Virtual Reality Technology; Henan Province Science and Technology Research, Project Name: Research and Application of Plant Image Recognition Method in Danjiang Basin Based on Deep Convolutional Neural Network Optimization; Henan Provincial Science and Technology Research Project
- Award Id(s):
  252102210043
Funding Statement(s):
This work has been supported by Nanyang Institute of Technology Scientific Research Platforms; Virtual Digital Human Technology Research Center; Nanyang City Key Laboratory of Virtual Reality Technology; Henan Province Science and Technology Research, Project Name: Research and Application of Plant Image Recognition Method in Danjiang Basin Based on Deep Convolutional Neural Network Optimization; Henan Provincial Science and Technology Research Project, Project No.: 252102210043, Project Title: Research and Application of Real-Time Recognition of Wheat and Corn Leaf Diseases Based on Visual Transformer Optimization.
The author acknowledges that ChatGPT has been used to assist in the editing of language and grammar in this manuscript. The author reviewed and verified all content generated by the AI tools to ensure accuracy and integrity and accepts full responsibility for the final version of the work.

2026

Emerald Publishing Limited

Licensed re-use rights only

Sensor Review 1–21.

https://doi.org/10.1108/SR-10-2025-0761

Purpose

This review critically examines the use of machine learning (ML) for calibrating low-cost air quality sensors (LCSs), which, despite their growing deployment for high-resolution monitoring, suffer from significant accuracy limitations. This paper aims to synthesize recent advances, evaluate methodological strengths and weaknesses and clarify ongoing debates regarding the reliability, transparency and generalizability of ML-based calibration strategies.

Design/methodology/approach

This review draws on more than 90 peer-reviewed studies published between 2013 and early 2024, identified through structured searches in Web of Science, Scopus, IEEE Xplore and Google Scholar using combinations of keywords such as “low-cost air quality sensor,” “machine learning calibration,” “drift correction” and “transferability.” It surveys calibration approaches applied to the major sensor types (optical, electrochemical, metal-oxide semiconductor and nondispersive infrared) across pollutants such as particulate matter, ozone and nitrogen dioxide. Both traditional regressions and advanced ML models are analyzed. The review highlights methodological practices, performance benchmarks and controversies regarding overfitting, model transferability and the role of ancillary variables.

Findings

The evidence indicates that ML calibration can reduce error metrics by more than 50% and raise correlation with reference monitors to R² values of 0.8–0.9 or higher. For optical particulate matter sensors, cross-study evaluations commonly report postcalibration R² in the 0.8–0.95 range with slopes close to unity under long-term colocation, whereas calibrated electrochemical gas sensors for NO₂ and O₃ more typically achieve R² between about 0.6 and 0.9, with larger site-to-site variability. Case studies from diverse environments illustrate that neural networks and gradient boosting often outperform simpler models when sufficient training data are available, while regression approaches remain robust and comparatively stable under limited-data conditions. However, challenges persist, including sensor drift, the lack of standardized protocols and limited generalizability across sites. Transparency concerns, particularly with black-box models, further complicate adoption in regulatory settings.
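To make the reported metrics concrete, the following is a minimal sketch of a regression-based calibration evaluated with RMSE and R² on held-out colocation data. It is not taken from the reviewed studies: the synthetic data, the additive humidity bias, the noise level and all function names (`fit_ols`, `predict`, `rmse`, `r2`) are assumptions made for illustration, and R² is computed here as the coefficient of determination, which may differ from the squared correlation used in some studies.

```python
import math
import random


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ols(features, target):
    """Return [intercept, b1, b2, ...] from the least-squares normal equations."""
    rows = [[1.0] + list(f) for f in features]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, target)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coef, features):
    return [coef[0] + sum(c * v for c, v in zip(coef[1:], f)) for f in features]


def rmse(y_true, y_pred):
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination (1 - SS_res / SS_tot)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Synthetic colocation data (assumed, not measured): the low-cost sensor
# over-reads the reference and carries an additive humidity-dependent bias.
random.seed(0)
N = 400
reference = [random.uniform(5.0, 80.0) for _ in range(N)]   # reference PM2.5, ug/m3
humidity = [random.uniform(30.0, 90.0) for _ in range(N)]   # relative humidity, %
raw = [1.5 * c + 0.3 * h + random.gauss(0.0, 3.0)
       for c, h in zip(reference, humidity)]

# Humidity enters the model as an ancillary variable next to the raw signal.
features = list(zip(raw, humidity))
train, test = slice(0, 300), slice(300, N)

coef = fit_ols(features[train], reference[train])
calibrated = predict(coef, features[test])

rmse_raw = rmse(reference[test], raw[test])
rmse_cal = rmse(reference[test], calibrated)
r2_cal = r2(reference[test], calibrated)
print(f"RMSE raw={rmse_raw:.2f}  calibrated={rmse_cal:.2f}  R2 calibrated={r2_cal:.3f}")
```

Because the synthetic bias is linear by construction, a linear model recovers it almost completely; real sensors with nonlinear humidity or temperature responses are where the more flexible models discussed above may be needed.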

Originality/value

By synthesizing results across pollutants, algorithms and deployment contexts, this review offers a balanced appraisal of ML’s potential and limitations for LCS calibration. It identifies best practices, emphasizes the importance of training data and validation strategies, and underscores emerging hybrid methods that integrate sensor physics with data-driven models. The analysis provides guidance for researchers, practitioners and policymakers seeking to enhance the reliability and scalability of low-cost sensor networks for air quality management.
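The emphasis on validation strategy can be illustrated with a small sketch of a chronological hold-out, in which a calibration fitted on an initial colocation period is scored on a later period. Everything here is a hypothetical setup chosen for illustration: the linear 30% gain decay standing in for sensor drift, the 90-day training window, the noise level and the helper names `fit_line` and `rmse` are assumptions, not values or methods reported in the review.

```python
import math
import random


def fit_line(x, y):
    """Ordinary least-squares slope and intercept for y ~ slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(y_true, y_pred):
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Synthetic daily series (assumed): the sensor gain decays linearly by 30%
# over the deployment, a simple stand-in for drift.
random.seed(1)
N_DAYS = 360
reference = [random.uniform(10.0, 60.0) for _ in range(N_DAYS)]
raw = [ref * (1.0 - 0.3 * day / N_DAYS) + random.gauss(0.0, 1.0)
       for day, ref in enumerate(reference)]

# Fit the calibration on the first 90 days only, then apply it to every day.
slope, intercept = fit_line(raw[:90], reference[:90])
calibrated = [slope * value + intercept for value in raw]

# Score the same model on the training window and on the final 90 days.
rmse_early = rmse(reference[:90], calibrated[:90])
rmse_late = rmse(reference[270:], calibrated[270:])
print(f"RMSE days 0-89: {rmse_early:.2f}   RMSE days 270-359: {rmse_late:.2f}")
```

A randomly shuffled train/test split would mix early and late days and could hide this degradation, which is one reason a time-ordered hold-out is a reasonable default when drift is a concern.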
