Top 155 Pattern Recognition papers published in 2026

TL;DR: This paper proposes EWS, a weakly supervised binary semantic image segmentation framework that uses one-pixel annotations to achieve competitive results with low computational costs, eliminating the need for background annotations and hyperparameter tuning.

Highlights:
• Binary segmentation with sparse one-pixel annotations, even a single one per dataset.
• Our method operates without requiring background annotations.
• Novel contrastive loss using class-of-interest one-pixel annotations.
• Dynamic contrastive loss hyperparameter computation based on image features.

Abstract: Despite recent advancements, Unsupervised Semantic Segmentation (USS) methods still exhibit a significant performance deficit compared to supervised approaches, particularly in binary semantic segmentation. This limitation arises because, without supervision, USS methods struggle to distinguish foreground from background image regions, particularly when the foreground contains small or uncommon objects. This issue is addressed by our proposed Extremely Weakly Supervised Binary Semantic Segmentation (EWS) framework. EWS expects minimal supervision, consisting only of a small set of one-pixel annotations explicitly belonging to the foreground class across the entire image dataset. Our approach leverages these one-pixel annotations and employs two contrastive losses to map visual transformer features into well-separated foreground and background feature clusters. Additionally, we propose a novel loss function to eliminate the need for hyperparameter tuning of the contrastive loss threshold, by dynamically computing it based on the similarity between the input image features. Even if we employ a single one-pixel annotation, EWS achieves competitive results in binary segmentation tasks while maintaining low computational costs, making it an efficient solution for critical segmentation applications.

GitHub Repo: https://github.com/matJTzimas/EWS
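Editor's illustration: the abstract above mentions a threshold computed dynamically from the similarity between image features, but it does not give the formula. The sketch below is not the EWS loss. It only shows the general idea of a data-dependent threshold, under the assumption (ours, not the paper's) that the threshold is the mean cosine similarity between every feature and the feature at the single annotated pixel. The names `cosine`, `dynamic_threshold`, and `split_foreground`, and the toy 2-D features, are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def dynamic_threshold(features, anchor):
    """Assumed rule: threshold = mean similarity of all features to the anchor."""
    sims = [cosine(f, anchor) for f in features]
    return sum(sims) / len(sims), sims


def split_foreground(features, anchor_index):
    """Mark as foreground every feature more similar to the anchor than the threshold."""
    anchor = features[anchor_index]
    threshold, sims = dynamic_threshold(features, anchor)
    mask = [s > threshold for s in sims]
    return mask, threshold


# Toy per-pixel features; index 0 plays the role of the one-pixel annotation.
features = [
    [1.0, 0.0],
    [0.9, 0.1],
    [0.0, 1.0],
    [0.1, 0.9],
]
mask, threshold = split_foreground(features, anchor_index=0)
print(mask, round(threshold, 3))
```

Because the threshold is derived from the image's own feature similarities, no fixed cutoff needs to be tuned by hand, which is the property the abstract attributes to its loss. The actual computation used by EWS is in the linked repository.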

Journal Article•10.1016/j.patcog.2025.113037•

Interpretable Deep Learning Enables Reliable and Label-Efficient Fluorescence Imaging

Mingyang Chen, Luhong Jin, Xuwei Xuan, Defu Yang, Yun-Chien Cheng¹, Ju Zhang•Institutions (1)

National Chiao Tung University¹

01 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2025.112959•

BiPAZSL: A bidirectional progressive attention method for zero-shot learning domain shift mitigation

Chong Li, Jie Su¹, Jinsong Gao•Institutions (1)

University of Jinan¹

Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113113•

OctMamba: Mamba-based octree context entropy model for point cloud geometry compression

Zhaoyi Jiang, Yi Xu, Frederick W. B. Li, Gary K.L. Tam, Chao Song, Bailin Yang

20 Jan 2026-Pattern Recognition

TL;DR: OctMamba proposes a unified framework for point cloud geometry compression, jointly modeling spatial, channel, and topological redundancies with linear complexity, outperforming baselines and achieving state-of-the-art performance on LiDAR and dynamic human point cloud benchmarks.

Highlights:
• Jointly models spatial, channel, and topological redundancies, moving beyond conventional spatial-only designs.
• Embeds Mamba layers locally within specialized subcomponents instead of as a global backbone, enabling structured context modeling.
• Achieves efficient long-range modeling with linear complexity, yielding a smaller model and faster decoding while outperforming baselines.

Abstract: Existing learned point cloud compression frameworks face two major limitations: (1) they focus almost exclusively on spatial redundancy and (2) they rely on architectures built around local-global transformers or global Mamba blocks. Transformers incur quadratic complexity, while global Mamba lacks the granularity to capture structured correlations across multiple dimensions. We propose OctMamba, the first unified framework to jointly exploit spatial, channel, and topological redundancies, dimensions previously overlooked in point cloud geometry compression. Our approach introduces a new architectural principle: embedding Mamba modules within specialized subcomponents rather than applying them globally, challenging existing design paradigms. OctMamba combines two modules: Spatial-Channel Coupled Grouping Mamba (SCCGM) for spatial-channel fusion and Local Graph CNN-Mamba (LGCM) for topological encoding. This design enables efficient long-range modeling with linear complexity, delivering a smaller model and faster decoding while outperforming transformer-based and global Mamba baselines. On SemanticKITTI, OctMamba reduces bitrate by 60.2% over GPCC (D1 PSNR) and achieves state-of-the-art performance across LiDAR and dynamic human point cloud benchmarks with practical speed and scalability. By introducing multi-dimensional redundancy modeling, OctMamba has the potential to influence future research on efficient point cloud compression. Source code will be released.

Journal Article•10.1016/j.patcog.2026.113041•

Noise-Robust tiny object localization with flows

Huixin Sun, Linlin Yang, Ronyu Chen, Kerui Gu, Baochang Zhang, Angela Yao, Xianbin Cao

09 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113060•

Diffusion-based Laplacian frequency-aware network for low-light image enhancement

Li Zhou, Wenjie Li, Juncheng Li, G.F. Gao, Chia-Wen Lin

09 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113098•

A Novel Approach for Fast Circlet Transform: Dynamic Analysis of Coefficients for Circular Shapes Quantification

Hossein Mir, Alireza Mehridehnavi

01 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113083•

Insulator Shed Segmentation from 3D Point Cloud via Normal Reconstruction Based on Gaussian Mapping

You Tian, Minghui Li, Wanquan Liu¹•Institutions (1)

Sun Yat-sen University¹

01 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113055•

From temporal thumbnail to semantics: Debiasing multi-view action recognition

Wei Feng, Zixian Zhu, Wenxuan Liu, Xu Wang, Bao Liu, Xiaohan Yu

08 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113142•

VRDNet: Visual restoration dehazing network with triple color space feature fusion for clustered haze scenarios

Zhiyu Lyu¹•Institutions (1)

Dalian University of Technology¹

Pattern Recognition

Journal Article•10.1016/s0031-3203(26)00042-7•

Editorial Board

21 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113087•

Is multimodal conversational emotion recognition satisfactory? Exploring the gaps in performance, generalization, and confidence

Geng Tu, Ran Jing, Xuan Luo, E. Cambria, Wenjie Li, Ruifeng Xu

17 Jan 2026-Pattern Recognition

Journal Article•10.1016/j.patcog.2026.113099•

Outlier-robust learning with continuously differentiable least trimmed squares

Lei Xing, Yufei Liu, Linhai Xu, Badong Chen

15 Jan 2026-Pattern Recognition

Showing papers in "Pattern Recognition in 2026"

Probabilistic modeling of disparity uncertainty for robust and efficient stereo matching

Infrared-assisted single-stage framework for joint restoration and fusion of visible and infrared images under hazy conditions

Entropy-increasing linear attention for multi-class unsupervised anomaly detection

StyleSeg V2: Towards robust single-label-supervised segmentation of brain tissue via optimization-free registration error perception

End-to-end susceptibility-induced distortion correction for diffusion MRI with unsupervised deep learning

Fine-grained evaluation for offensive speech detection on social media

MPFR: Memory Prompt Feature Reconstruction for Continual Anomaly Detection and Segmentation

3D temporal-spatial convolutional LSTM network for assessing drug addiction treatment

FreeStyle: Free lunch for text-guided style transfer using diffusion models

Quantifying knowledge during full-layer ANN-to-SNN knowledge distillation

Haze has many faces: Multi-domain haze style transfer for diverse haze removal

Masked Autoencoders for Spatio-Temporal Audio Representations: Theory and Optimization

Prompt-level contrastive learning for context-aware multi-modal image representation in medical diagnosis

Dual dynamic guidance image filtering

Multi-directional decision fusion for black-box source-free anomaly detection

SmokeAttack: Physically-based adversarial smoke for LiDAR point cloud detectors

RGD-SLAM: Robust Gaussian splatting SLAM for dynamic environments

Extreme weakly supervised binary semantic image segmentation via one-pixel supervision

Interpretable Deep Learning Enables Reliable and Label-Efficient Fluorescence Imaging

BiPAZSL: A bidirectional progressive attention method for zero-shot learning domain shift mitigation

OctMamba: Mamba-based octree context entropy model for point cloud geometry compression

Noise-Robust tiny object localization with flows

Diffusion-based Laplacian frequency-aware network for low-light image enhancement

A Novel Approach for Fast Circlet Transform: Dynamic Analysis of Coefficients for Circular Shapes Quantification

Insulator Shed Segmentation from 3D Point Cloud via Normal Reconstruction Based on Gaussian Mapping

From temporal thumbnail to semantics: Debiasing multi-view action recognition

VRDNet: Visual restoration dehazing network with triple color space feature fusion for clustered haze scenarios

Editorial Board

Is multimodal conversational emotion recognition satisfactory? Exploring the gaps in performance, generalization, and confidence

Outlier-robust learning with continuously differentiable least trimmed squares