Current multi-source image fusion methods frequently overlook the issue of detailed features when employing deep learning technology, resulting in inadequate target feature information. In real-world mission scenarios, such as military information acquisition or medical image enhancement, the prominence of target feature information is of paramount importance. To address these challenges, this paper introduces a novel infrared-visible light fusion model.
Leveraging the foundational architecture of the traditional DenseFuse model, this paper optimizes the backbone network structure and incorporates a Unique Feature Encoder (UFE) to meticulously extract the distinctive features inherent in the two images. Furthermore, it integrates the Convolutional Block Attention Module (CBAM) and the Squeeze and Excitation Network (SE) to enhance and replace the original spatial and channel attention mechanisms.
Compared to other methods such as IFCNN, NestFuse, DenseFuse, etc., the values of entropy, standard deviation, and mutual information index of the method presented in this paper can reach 6.9985, 82.6652, and 13.6022, respectively, which are significantly improved compared with other methods.
This paper presents a UFEFusion framework that synergizes with the CBAM attention mechanism to markedly augment the extraction of detailed features relative to other methods. Moreover, the framework adeptly extracts and amplifies unique features from disparate images, thereby elevating the overall feature representation capability.
