UniPercepNet-S: a lightweight dual-task framework with attention mechanisms for real-time object detection and instance segmentation

Ton, Quang Toai; Lưu, Khang Gia; Vu, Thanh Hien; Le, Tuong; Vu, Thanh Nguyen

doi:10.1108/DTA-08-2025-0745

Article navigation

Research Article| March 02 2026

UniPercepNet-S: a lightweight dual-task framework with attention mechanisms for real-time object detection and instance segmentation

Quang Toai Ton

0009-0000-7958-6195

;

Quang Toai Ton

Faculty of Information Technology, HUTECH University

, Ho Chi Minh City,

Vietnam

Faculty of Information Technology, Ho Chi Minh City University of Foreign Languages – Information Technology (HUFLIT)

, Ho Chi Minh City,

Vietnam

Search for other works by this author on:

This Site

PubMed

Google Scholar

Khang Gia Lưu

0009-0004-9101-499X

;

Khang Gia Lưu

Faculty of Information Technology, Ho Chi Minh City University of Foreign Languages – Information Technology (HUFLIT)

, Ho Chi Minh City,

Vietnam

Search for other works by this author on:

This Site

PubMed

Google Scholar

Thanh Hien Vu;

Thanh Hien Vu

Faculty of Information Technology, HUTECH University

, Ho Chi Minh City,

Vietnam

Search for other works by this author on:

This Site

PubMed

Google Scholar

Tuong Le

0000-0003-0909-4974

;

Tuong Le

Faculty of Information Technology, HUTECH University

, Ho Chi Minh City,

Vietnam

Search for other works by this author on:

This Site

PubMed

Google Scholar

Thanh Nguyen Vu

Faculty of Information Technology, Ho Chi Minh City University of Industry and Trade

, Ho Chi Minh City,

Vietnam

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Tuong Le can be contacted at: lc.tuong@hutech.edu.vn

Competing interests: The authors have no conflicts of interest to declare that are relevant to the content of this article.

Publisher: Emerald Publishing

Received: August 21 2025

Revision Received: January 14 2026

Accepted: February 03 2026

Online ISSN: 2514-9318

Print ISSN: 2514-9288

2026

Emerald Publishing Limited

Licensed re-use rights only

Data Technologies and Applications (2026) 60 (2): 367–391.

https://doi.org/10.1108/DTA-08-2025-0745

Purpose

Object detection and instance segmentation play an important role in autonomous driving, where vehicles must perceive their surroundings reliably. In practice, these tasks are commonly addressed using separate models, which increases both training complexity and deployment cost. To overcome this issue, we propose UniPercepNet-S, a lightweight dual-task framework inspired by YOLOF that brings detection and segmentation into a single unified network, aiming to support real-time perception in resource-constrained environments.

Design/methodology/approach

UniPercepNet-S follows a YOLOF-style one-level detection design and strengthens the backbone with a channel attention module to improve feature quality. To enable instance segmentation, we add a simple yet efficient mask prediction branch that operates directly on detected objects while keeping computation low. We evaluate the proposed framework on MS COCO and BDD100 K, covering both general object segmentation and autonomous-driving-oriented scenarios.

Findings

The proposed UniPercepNet-S achieves a mask AP of 38.0 on MS COCO, placing it among the top-performing entries in the COCO Detection Challenge for segmentation tasks. On BDD100 K, which reflects real-world driving conditions, the model reaches an AP of 20.3, showing that it generalizes well across different datasets. These results suggest that UniPercepNet-S can deliver accurate detection and segmentation while remaining suitable for real-time use.

Originality/value

This work contributes a unified and lightweight one-level framework that performs object detection and instance segmentation simultaneously, avoiding the need for heavy multi-scale architectures or separate task-specific models. By combining attention-enhanced representations with an efficient segmentation branch, UniPercepNet-S provides a practical solution for real-time perception. Its balance between simplicity, accuracy, and speed makes it especially valuable for autonomous driving and other embedded vision applications.

2026

Emerald Publishing Limited

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

UniPercepNet-S: a lightweight dual-task framework with attention mechanisms for real-time object detection and instance segmentation

New and popular articles

Email Alerts

Cited By

UniPercepNet-S: a lightweight dual-task framework with attention mechanisms for real-time object detection and instance segmentation

Sign in

Client Account

ICE Member Sign In

New and popular articles

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable