Top 15 papers published in the topic of Pyramid (image processing) in 2022

Showing papers on "Pyramid (image processing)" published in 2022

Journal Article•10.1016/J.INFFUS.2021.09.002•

Laplacian pyramid networks: A new approach for multispectral pansharpening

Cheng Jin¹, Liang-Jian Deng¹, Ting-Zhu Huang¹, Gemine Vivone•Institutions (1)

University of Electronic Science and Technology of China¹

01 Feb 2022-Information Fusion

TL;DR: A Laplacian pyramid pansharpening network architecture for accurately fusing a high spatial resolution panchromatic image and a low spatial resolution multispectral image, which outperforms state-of-the-art pansharpening methods.

86 citations

Journal Article•10.1016/J.CAGEO.2021.104969•

Semantic segmentation of high-resolution remote sensing images based on a class feature attention mechanism fused with Deeplabv3+

Zhimin Wang¹, Prof. Walace Rodrigues (PhD)², Yujun Li³, Jiasheng Wang¹, Kun Yang¹, Limeng Wang¹, Fanjie Su¹, Xinya Chen¹•Institutions (3)

Yunnan Normal University¹, University of Electronic Science and Technology of China², Franklin College³

01 Jan 2022-Computers & Geosciences

TL;DR: Wang et al. propose CFAMNet, a class feature attention mechanism fused with an improved Deeplabv3+ network, for semantic segmentation of common features in remote sensing images.

78 citations

Journal Article•10.1016/J.BSPC.2021.103261•

Endoscope image mosaic based on pyramid ORB

Lirong Yin¹, Ziyan Zhang, Wenfeng Zheng², Lixiao Wang, Hu Rongrong², Bo Yang²•Institutions (2)

Louisiana State University¹, University of Electronic Science and Technology of China²

01 Jan 2022-Biomedical Signal Processing and Control

TL;DR: The authors use a Gaussian pyramid to improve the basic ORB algorithm; theoretical analysis and experimental verification indicate that the improved algorithm is better suited to endoscopic image mosaicking in minimally invasive surgery.

64 citations

Journal Article•10.1016/J.AUTCON.2021.104009•

Quantitative loosening detection of threaded fasteners using vision-based deep learning and geometric imaging theory

Petra B. Holden¹, Hao Gong², Xinjian Deng², Jianhua Liu², Jiayu Huang²•Institutions (2)

Nanjing Forestry University¹, Beijing Institute of Technology²

01 Jan 2022-Automation in Construction

TL;DR: A method to quantitatively calculate the length of the exposed bolt for detecting loosening using vision-based deep learning and geometric imaging theory is proposed, outperforming other measurement methods and the state-of-the-art networks of human pose estimation.

37 citations

Journal Article•10.1016/J.BSPC.2021.103191•

Automated knee ligament injuries classification method based on exemplar pyramid local binary pattern feature extraction and hybrid iterative feature selection

Sukru Demir¹, Lara Tabet¹, Sefa Key², Mehmet Baygin³, Turker Tuncer¹, Sengul Dogan¹, Samir Brahim Belhaouari⁴, Ahmet Kursad Poyraz¹, Murat Gürger¹•Institutions (4)

Fırat University¹, Turkish Ministry of Health², Ardahan University³, Khalifa University⁴

01 Jan 2022-Biomedical Signal Processing and Control

TL;DR: The authors show that an intelligent health assistant for knee injuries could be developed using the proposed exemplar pyramid LBP method, and demonstrate the method's generality and high success rate.

12 citations

Journal Article•10.1016/J.DSP.2021.103289•

Attentive and context-aware deep network for saliency prediction on omni-directional images

Chunmei Qing¹, Huansheng Zhu¹, Xiaofen Xing¹, Dongwen Chen¹, Jianxiu Jin¹•Institutions (1)

South China University of Technology¹

01 Jan 2022-Digital Signal Processing

TL;DR: The authors propose ACSalNet, an attentive and context-aware network for saliency prediction on omni-directional images, and design a Context-aware Feature Pyramid Module (CFPM) to reduce the semantic gap between features of different levels.

10 citations

Journal Article•10.32604/CMC.2022.019328•

A position-aware transformer for image captioning

Zelin Deng¹, Bo Zhou¹, Pei He², Jianfeng Huang, Osama Alfarraj³, Amr Tolba³,⁴•Institutions (4)

Changsha University of Science and Technology¹, Guangzhou University², King Saud University³, Menoufia University⁴

01 Jan 2022-CMC-Computers Materials & Continua

TL;DR: The authors propose a Position-Aware Transformer model with image-feature attention and position-aware attention mechanisms for image captioning. The image-feature attention first extracts multi-level features using a Feature Pyramid Network (FPN), then fuses them with the scaled dot-product, which enables the model to detect objects of different scales in the image more effectively without increasing parameters.

Abstract: Image captioning aims to generate a corresponding description of an image. In recent years, neural encoder-decoder models have been the dominant approaches, in which the Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) are used to translate an image into a natural language description. Among these approaches, the visual attention mechanisms are widely used to enable deeper image understanding through fine-grained analysis and even multiple steps of reasoning. However, most conventional visual attention mechanisms are based on high-level image features, ignoring the effects of other image features, and giving insufficient consideration to the relative positions between image features. In this work, we propose a Position-Aware Transformer model with image-feature attention and position-aware attention mechanisms for the above problems. The image-feature attention firstly extracts multi-level features by using Feature Pyramid Network (FPN), then utilizes the scaled-dot-product to fuse these features, which enables our model to detect objects of different scales in the image more effectively without increasing parameters. In the position-aware attention mechanism, the relative positions between image features are obtained at first, afterwards the relative positions are incorporated into the original image features to generate captions more accurately. Experiments are carried out on the MSCOCO dataset and our approach achieves competitive BLEU-4, METEOR, ROUGE-L, CIDEr scores compared with some state-of-the-art approaches, demonstrating the effectiveness of our approach.
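
The abstract says multi-level FPN features are fused with the scaled dot-product. As a rough illustration only, the following is a minimal sketch of generic scaled dot-product attention in plain Python. It is not the paper's fusion module: the function names, the list-of-lists feature layout, and the toy values are all assumptions made for this example.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Generic attention: softmax(Q K^T / sqrt(d_k)) V.

    queries, keys, values are lists of equal-length float vectors
    (hypothetical stand-ins for feature vectors from different levels).
    """
    d_k = len(keys[0])
    fused = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        fused.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return fused


# Toy usage: two identical keys receive equal weight, so the output
# is the average of the two value vectors.
result = scaled_dot_product_attention(
    queries=[[1.0, 0.0]],
    keys=[[1.0, 0.0], [1.0, 0.0]],
    values=[[0.0, 2.0], [4.0, 6.0]],
)
```

The operation itself adds no learnable weights, which is consistent with the abstract's remark that the fusion does not increase parameters; how the paper arranges queries, keys, and values across pyramid levels is not stated here.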

10 citations

Journal Article•10.1007/S00138-021-01245-Y•

Semantic convolutional features for face detection

The-Anh Pham¹•Institutions (1)

Hong Duc University¹

1 Jan 2022

TL;DR: The author proposes a novel feature pyramid scheme that produces semantic features at all levels of the network for face detection, in which a Semantic Convolutional Box (SCBox) merges the features from different layers in a bottom-up fashion.

Abstract: Convolutional neural networks have played a key role in many computer vision applications. Traditionally, learning convolutional features is performed in a hierarchical manner along the dimension of network depth to create multi-scale feature maps. As a result, strong semantic features are derived at the top-level layers only. This paper proposes a novel feature pyramid scheme to produce semantic features at all levels of the network, specifically to address the problem of face detection. In particular, a Semantic Convolutional Box (SCBox) is presented by merging the features from different layers in a bottom-up fashion. The proposed lightweight detector is a stack of alternating SCBox and Inception residual modules that learns the visual features in both the dimensions of network depth and width. In addition, the newly introduced objective functions (e.g., focal and CIoU losses) are incorporated to effectively address the problem of unbalanced data, resulting in stable training. The proposed model has been validated on the standard benchmarks FDDB and WIDER FACES, in comparison with the state-of-the-art methods. Experiments showed promising results in terms of both processing time and detection accuracy. For instance, the proposed network achieves an average precision of 96.8% on FDDB and 82.4% on WIDER FACES, and reaches an inference speed of 106 FPS on a moderate GPU configuration or 20 FPS on a CPU machine.

3 citations

Book Chapter•10.1007/978-981-16-6372-7_10•

Global Context Guided Multi-scale Feature Network for Salient Object Detection

Zhenyu Zhao¹, Yachao Fang¹, Qing Zhang¹, Xiaowei Chen¹, Meng Dai¹, Jiajun Lin²•Institutions (2)

Shanghai Institute of Technology¹, East China University of Science and Technology²

1 Jan 2022

TL;DR: The authors propose a salient object detection approach that uses global context and multi-scale feature representation to estimate saliency maps in a pixel-wise manner, helping the network locate salient objects and suppress background noise.

Abstract: Currently, fully convolutional network based salient object detection approaches have some challenging problems. This paper proposes a novel salient object detection approach using global context and multi-scale feature representation to estimate saliency maps in a pixel-wise manner. Firstly, we explore and design a multi-scale feature enhancement module to improve the capability of feature representation and learning of multi-level side-output features. Moreover, we use global features to guide side-output multi-scale features to focus on the useful information, which could help the network effectively locate salient objects and suppress background noise. Finally, the feature pyramid network structure is utilized to refine the estimated results in a coarse-to-fine manner, and then obtain the final predicted results. The comparisons of our approach and 15 state-of-the-art methods demonstrate the effectiveness and robustness of the proposed approach in various scenarios.

1 citation

Book Chapter•10.1007/978-981-16-6372-7_71•

A Traffic Video Completion Model Based on Generative Adversarial Networks

Lan Wu¹, Han Wang¹, Tian Gao¹, Binquan Li¹, Fanshi Kong•Institutions (1)

Henan University of Technology¹

1 Jan 2022

TL;DR: The authors propose a GAN-based completion model for missing traffic video frames, which uses the Feature Pyramid Network (FPN) to obtain feature maps of multiple scales on the input video frame.

Abstract: Aiming at the problem of missing traffic video frames, this paper proposes a completion model based on generative adversarial networks. The model uses the Feature Pyramid Network (FPN) to obtain feature maps of multiple scales on the input video frame. By fusing feature maps of different scales, it can better integrate the semantic information on the frame. The local patch discriminator added to the discriminator model effectively ensures the accuracy and continuity of the completed frame. Experimental results on the Caltech pedestrian dataset and the KITTI dataset show the good performance of the proposed model.

10.1007/978-981-15-8155-7_244•

An Efficient and Accurate Method for Detecting Objects in Aerial Images

Peng Sun¹, Yongbin Zheng¹, Zongtan Zhou¹•Institutions (1)

National University of Defense Technology¹

1 Jan 2022

TL;DR: The authors propose an efficient and accurate method for aerial image object detection in which oriented bounding boxes of objects are predicted by a simple network in the first stage, and the rotated bounding box predictions are then sent to non-maximum suppression (NMS) to produce final detection results.

Abstract: Object detection in aerial images is a hot and challenging task in computer vision, due to the bird-view perspective, complex backgrounds, varying scales and appearance of objects, and extremely dense object distribution. It has previously been observed that existing methods cannot meet the application requirements of accuracy and speed at the same time. In this paper, we propose an efficient and accurate method for aerial image object detection. The pipeline of our method has only two stages. The oriented bounding boxes of objects are predicted by utilizing a simple network in the first stage, and the rotated bounding box predictions are then sent to non-maximum suppression (NMS) to produce final detection results. Besides, an atrous spatial pyramid pooling (ASPP) network is added to the pipeline to extract multi-scale features, and a bi-directional long short-term memory network (BiLSTM) is adopted to improve detection performance on long and slender instances. Experiments on the challenging DOTA dataset have shown that the proposed method outperforms existing methods in terms of detection rate and speed.
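
The second stage described above is non-maximum suppression. The sketch below shows the usual greedy form of NMS in plain Python, under one important simplification: it uses axis-aligned boxes `(x1, y1, x2, y2)`, whereas the paper works with rotated boxes, whose overlap needs a polygon-intersection routine that is not shown. The function names, the box layout, and the 0.5 threshold are assumptions for illustration.

```python
def iou(a, b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: visit boxes from highest to lowest score and keep a box
    only if it does not overlap an already kept box above the threshold.
    Returns the indices of the kept boxes, best score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Toy usage: the second box overlaps the first heavily and is suppressed.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
```

For oriented boxes only the `iou` function would need to change; the greedy loop stays the same.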

10.1007/978-981-16-6554-7_13•

Image Semantic Segmentation Based on Joint Normalization

Jiexin Zheng¹, Taiwei Qiu¹, Lihong Chen², Shengyang Liang¹•Institutions (2)

Chang'an University¹, South China Agricultural University²

1 Jan 2022

TL;DR: Starting from the depthwise separable convolution-joint feature pyramid (DSC-JFP) model, the authors remove the ASPP model and the auxiliary network to improve the real-time performance of semantic segmentation.

Abstract: Image semantic segmentation is an important research direction in image processing, computer vision and deep learning. Semantic segmentation is to classify the image pixel by pixel, so that the original image is divided into semantic segmentation images with specific pixel marks, which is the most challenging in image processing. Based on DSC-JFP (depthwise separable convolution-joint feature pyramid) model, ASPP model and auxiliary network are removed to improve the real-time performance of semantic segmentation. Combined with batch normalization and instance normalization, parallel batch and instance normalization (PBIN) and cascaded batch and instance normalization (CBIN) methods are proposed to improve the effect of semantic segmentation. The experimental results also show that the proposed method improves the real-time performance of semantic segmentation while ensuring the effect of semantic segmentation.

Journal Article•10.32604/CMC.2022.020820•

Kernel granulometric texture analysis and Light RES-ASPP-Unet classification for COVID-19 detection

R. Gopi, P. Muthusamy, P. Suresh, C. G. Gabriel Santhosh Kumar, Irina V. Pustokhina, Denis A. Pustokhin, K. Shankar

01 Jan 2022-CMC-Computers Materials & Continua

TL;DR: In this article, the authors propose an automatic framework for detecting COVID-19 at an early stage using chest X-ray images, and report 99.6% accuracy in detecting the virus at its early stage.

Abstract: This research article proposes an automatic framework for detecting COVID-19 at an early stage using chest X-ray images. It is an undeniable fact that coronavirus is a serious disease, but early detection of the virus present in human bodies can save lives. In recent times, many research solutions have been presented for early detection, but there is still a lack of the right technology for it. The proposed deep learning model analyses the pixels of every image and judges whether the virus is present. The classifier is designed so that it automatically detects the virus present in the lungs using a chest image. This approach uses an image texture analysis technique called the granulometric mathematical model. Selected features are heuristically processed for optimization using a novel multi-scale deep learning model called the lightweight residual-atrous spatial pyramid pooling Unet (LightRES-ASPP-Unet). The proposed deep LightRES-ASPP-Unet technique has a higher level of contracting solution by extracting major level of image features. Moreover, the coronavirus has been detected using high resolution output. In the framework, the atrous spatial pyramid pooling (ASPP) method is employed at its bottom level to incorporate the deep multi-scale features into the discriminative model. The architecture starts by selecting features from the image using the granulometric mathematical model, and the selected features are then optimized using LightRES-ASPP-Unet. ASPP in the analysis of images has performed better than the existing Unet model. The proposed algorithm has achieved 99.6% accuracy in detecting the virus at its early stage.

Journal Article•10.1016/J.MEDIA.2021.102251•

High resolution histopathology image generation and segmentation through adversarial training

Wenyuan Li¹, Jiayun Li¹, Jennifer S. Polson¹, Zichen Wang¹, William Speier¹, Corey W. Arnold•Institutions (1)

University of California, Los Angeles¹

01 Jan 2022-Medical Image Analysis

TL;DR: In this article, a multi-scale conditional GAN is proposed for high-resolution, large-scale histopathology image generation and segmentation, which consists of a pyramid of GAN structures, each responsible for generating and segmenting images at a different scale.

Journal Article•10.1016/J.AEJ.2021.05.004•

Modified phase correlation algorithm for image registration based on pyramid

Yang Li¹, b4migpd985², Jianli Wang¹, b4migpd985³, Kainan Yao¹•Institutions (3)

Chinese Academy of Sciences¹, Tongji Medical College², Huazhong University of Science and Technology³

01 Jan 2022-Alexandria Engineering Journal

TL;DR: The authors propose a pyramid phase correlation algorithm (PCA) for image registration that combines the frequency-domain PCA with the spatial-domain normalized cross correlation-pyramid (NCCP) algorithm, improving on NCCP in accuracy for partially occluded images and on PCA in speed.

Abstract: Image registration is an important process for applications in various fields, such as remote sensing and medical imaging; thus, its accuracy significantly affects the efficacy as well as efficiency of those applications. Phase correlation algorithm (PCA) and normalized cross correlation-pyramid (NCCP) algorithm are the state-of-the-art frequency domain and spatial domain methods for image registration, respectively. However, these algorithms have some limitations. In particular, the registration speed of PCA needs to be improved, while the NCCP algorithm leads to errors if the image to be registered is partially occluded. Thus, to overcome these limitations, we propose a pyramid PCA that combines both algorithms. To verify the performance of our proposed algorithm, its results are compared with those obtained using the traditional PCA and NCCP algorithm. Our simulation results for partially occluded images indicate that the proposed algorithm outperforms the NCCP algorithm in terms of accuracy; in addition, it outperforms PCA in terms of speed. Furthermore, to test the feasibility of the proposed algorithm for real-time applications, a panoramic target detection system was set up, and the results obtained using the system proved that our method for image registration was both feasible and effective.
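
For readers unfamiliar with the frequency-domain building block named above, here is a minimal sketch of plain phase correlation on a 1-D signal with a circular shift, written with only the Python standard library. It does not reproduce the paper's pyramid scheme or its combination with NCCP. The naive DFT, the function names, and the test signal are assumptions chosen to keep the example self-contained.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (inverse divides by n)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(
            x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
            for t in range(n)
        )
        out.append(s / n if inverse else s)
    return out


def phase_correlation_shift(a, b):
    """Estimate the circular shift s such that b[t] == a[(t - s) % n].

    The cross-power spectrum is normalized to unit magnitude, so only
    phase differences remain; its inverse transform peaks at the shift.
    """
    fa, fb = dft(a), dft(b)
    cross = []
    for u, v in zip(fa, fb):
        c = v * u.conjugate()
        mag = abs(c)
        # Frequencies with no energy carry no phase information.
        cross.append(c / mag if mag > 1e-12 else 0j)
    corr = [z.real for z in dft(cross, inverse=True)]
    return max(range(len(corr)), key=corr.__getitem__)


# Toy usage: shift a short signal circularly by 3 samples and recover it.
signal = [0.0, 1.0, 3.0, 2.0, 0.0, 0.0, 5.0, 1.0]
n = len(signal)
shifted = [signal[(t - 3) % n] for t in range(n)]
estimated = phase_correlation_shift(signal, shifted)
```

In a pyramid variant, one would typically run such an estimate on downsampled copies first and refine it at finer levels, which is one plausible reading of how the speed gain reported in the abstract arises; the paper's exact procedure is not given in this listing.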
