Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Pyramid (image processing)
  4. 2022
  1. Home
  2. Topics
  3. Pyramid (image processing)
  4. 2022
Showing papers on "Pyramid (image processing) published in 2022"
Journal Article•10.1016/J.INFFUS.2021.09.002•
Laplacian pyramid networks: A new approach for multispectral pansharpening

[...]

Cheng Jin1, Liang-Jian Deng1, Ting-Zhu Huang1, Gemine Vivone•
University of Electronic Science and Technology of China1
01 Feb 2022-Information Fusion
TL;DR: A Laplacian pyramid pansharpening network architecture for accurately fusing a high spatial resolution panchromatic image and a low spatial resolution multispectral image, which outperforms state-of-the-art panshARPening methods.

86 citations

Journal Article•10.1016/J.CAGEO.2021.104969•
Semantic segmentation of high-resolution remote sensing images based on a class feature attention mechanism fused with Deeplabv3+

[...]

Zhimin Wang1, Prof. Walace Rodrigues (PhD)2, Yujun Li3, Jiasheng Wang1, Kun Yang1, Limeng Wang1, Fanjie Su1, Xinya Chen1 •
Yunnan Normal University1, University of Electronic Science and Technology of China2, Franklin College3
01 Jan 2022-Computers & Geosciences
TL;DR: Wang et al. as mentioned in this paper proposed a class feature attention mechanism fused with an improved Deeplabv3+ network called CFAMNet for semantic segmentation of common features in remote sensing images.

78 citations

Journal Article•10.1016/J.BSPC.2021.103261•
Endoscope image mosaic based on pyramid ORB

[...]

Lirong Yin1, Ziyan Zhang, Wenfeng Zheng2, Lixiao Wang, Wenfeng Zheng2, Lirong Yin1, Hu Rongrong2, Bo Yang2 •
Louisiana State University1, University of Electronic Science and Technology of China2
01 Jan 2022-Biomedical Signal Processing and Control
TL;DR: In this article, the authors used the Gaussian pyramid to improve the simple ORB-oriented algorithm, which is more suitable for minimally invasive surgery endoscopic image mosaic through theoretical analysis and experimental verification.

64 citations

Journal Article•10.1016/J.AUTCON.2021.104009•
Quantitative loosening detection of threaded fasteners using vision-based deep learning and geometric imaging theory

[...]

Petra B. Holden1, Hao Gong2, Xinjian Deng2, Jianhua Liu2, Jiayu Huang2 •
Nanjing Forestry University1, Beijing Institute of Technology2
01 Jan 2022-Automation in Construction
TL;DR: A method to quantitatively calculate the length of the exposed bolt for detecting loosening using vision-based deep learning and geometric imaging theory is proposed, outperforming other measurement methods and the state-of-the-art networks of human pose estimation.

37 citations

Journal Article•10.1016/J.BSPC.2021.103191•
Automated knee ligament injuries classification method based on exemplar pyramid local binary pattern feature extraction and hybrid iterative feature selection

[...]

Sukru Demir1, Lara Tabet1, Sefa Key2, Mehmet Baygin3, Turker Tuncer1, Sengul Dogan1, Samir Brahim Belhaouari4, Ahmet Kursad Poyraz1, Murat Gürger1 •
Fırat University1, Turkish Ministry of Health2, Ardahan University3, Khalifa University4
01 Jan 2022-Biomedical Signal Processing and Control
TL;DR: It was shown that an intelligent health assistant for knee injuries could be developed by using the proposed exemplar pyramid LBP method, and the general and high success of this method were demonstrated.

12 citations

Journal Article•10.1016/J.DSP.2021.103289•
Attentive and context-aware deep network for saliency prediction on omni-directional images

[...]

Chunmei Qing1, Huansheng Zhu1, Xiaofen Xing1, Dongwen Chen1, Jianxiu Jin1 •
South China University of Technology1
01 Jan 2022-Digital Signal Processing
TL;DR: Zhang et al. as discussed by the authors proposed a novel attentive and context-aware network for saliency prediction on omni-directional images, which is named as ACSalNet, and they further designed a Context-aware Feature Pyramid Module (CFPM) to reduce the semantic gap between features of different levels.

10 citations

Journal Article•10.32604/CMC.2022.019328•
A position-aware transformer for image captioning

[...]

Zelin Deng1, Bo Zhou1, Pei He2, Jianfeng Huang, Osama Alfarraj3, Amr Tolba4, Amr Tolba3 •
Changsha University of Science and Technology1, Guangzhou University2, King Saud University3, Menoufia University4
01 Jan 2022-Cmc-computers Materials & Continua
TL;DR: Zhang et al. as mentioned in this paper proposed a Position-Aware Transformer model with image-feature attention and position-aware attention mechanisms for image captioning, which first extracts multi-level features by using Feature Pyramid Network (FPN), then utilizes the scaled-dot-product to fuse these features, which enables the model to detect objects of different scales in the image more effectively without increasing parameters.
Abstract: Image captioning aims to generate a corresponding description of an image. In recent years, neural encoder-decoder models have been the dominant approaches, in which the Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) are used to translate an image into a natural language description. Among these approaches, the visual attention mechanisms are widely used to enable deeper image understanding through fine-grained analysis and even multiple steps of reasoning. However, most conventional visual attention mechanisms are based on high-level image features, ignoring the effects of other image features, and giving insufficient consideration to the relative positions between image features. In this work, we propose a Position-Aware Transformer model with image-feature attention and position-aware attention mechanisms for the above problems. The image-feature attention firstly extracts multi-level features by using Feature Pyramid Network (FPN), then utilizes the scaled-dot-product to fuse these features, which enables our model to detect objects of different scales in the image more effectively without increasing parameters. In the position-aware attention mechanism, the relative positions between image features are obtained at first, afterwards the relative positions are incorporated into the original image features to generate captions more accurately. Experiments are carried out on the MSCOCO dataset and our approach achieves competitive BLEU-4, METEOR, ROUGE-L, CIDEr scores compared with some state-of-the-art approaches, demonstrating the effectiveness of our approach.

10 citations

Journal Article•10.1007/S00138-021-01245-Y•
Semantic convolutional features for face detection

[...]

The-Anh Pham1•
Hong Duc University1
1 Jan 2022
TL;DR: Wang et al. as mentioned in this paper proposed a novel feature pyramid fashion to produce semantic features at all levels of the network for specially addressing the problem of face detection, where a Semantic Convolutional Box (SCBox) is presented by merging the features from different layers in a bottom-up fashion.
Abstract: Convolutional neural networks have been extensively used as the key role to address many computer vision applications. Traditionally, learning convolutional features is performed in a hierarchical manner along the dimension of network depth to create multi-scale feature maps. As a result, strong semantic features are derived at the top-level layers only. This paper proposes a novel feature pyramid fashion to produce semantic features at all levels of the network for specially addressing the problem of face detection. Particularly, a Semantic Convolutional Box (SCBox) is presented by merging the features from different layers in a bottom-up fashion. The proposed lightweight detector is stacked of alternating SCBox and Inception residual modules to learn the visual features in both the dimensions of network depth and width. In addition, the newly introduced objective functions (e.g., focal and CIoU losses) are incorporated to effectively address the problem of unbalanced data, resulting in stable training. The proposed model has been validated on the standard benchmarks FDDB and WIDER FACES, in comparison with the state-of-the-art methods. Experiments showed promising results in terms of both processing time and detection accuracy. For instance, the proposed network achieves an average precision of $$96.8\%$$ on FDDB, $$82.4\%$$ on WIDER FACES, and gains an inference speed of 106 FPS on a moderate GPU configuration or 20 FPS on a CPU machine.

3 citations

Book Chapter•10.1007/978-981-16-6372-7_10•
Global Context Guided Multi-scale Feature Network for Salient Object Detection

[...]

Zhenyu Zhao1, Yachao Fang1, Qing Zhang1, Xiaowei Chen1, Meng Dai1, Jiajun Lin2 •
Shanghai Institute of Technology1, East China University of Science and Technology2
1 Jan 2022
TL;DR: Zhang et al. as discussed by the authors proposed a salient object detection approach using global context and multi-scale feature representation to estimate saliency maps in a pixel-wise manner, which could help the network effectively locate salient objects and suppress background noises.
Abstract: Currently, fully convolutional network based salient object detection approaches have some challenging problems. This paper proposes a novel salient object detection approach using global context and multi-scale feature representation to estimate saliency maps in a pixel-wise manner. Firstly, we explore and design a multi-scale feature enhancement module to improve the capability of feature representation and learning of multi-level side-output features. Moreover, we use global features to guide side-output multi-scale features to focus on the useful information, which could help the network effectively locate salient objects and suppress background noises. Finally, the feature pyramid network structure is utilized to refine the estimated results in a coarse-to-fine manner, and then obtain the final predicted results. The comparisons of our approach and 15 state-of-the-art methods demonstrate the effictiveness and robustness of the proposed approach on various scenarios.

1 citations

Book Chapter•10.1007/978-981-16-6372-7_71•
A Traffic Video Completion Model Based on Generative Adversarial Networks

[...]

Lan Wu1, Han Wang1, Tian Gao1, Binquan Li1, Fanshi Kong •
Henan University of Technology1
1 Jan 2022
TL;DR: Wang et al. as discussed by the authors proposed a complementary model of GANs for missing traffic video frames, which uses the Feature Pyramid Network (FPN) to obtain feature maps of multiple scales on the input video frame.
Abstract: Aiming at the problem of missing traffic video frames, this paper proposes a complementary model of generative adversarial networks. The model uses the Feature Pyramid Network (FPN) to obtain feature maps of multiple scales on the input video frame. By fusing feature maps of different scales, it can better integrate the semantic information on the frame. The local patch discriminator added to the discriminator model effectively ensures the accuracy and continuity of the completed frame. Experimental results on Caltech pedestrian dataset and KITTI dataset show the good performance of the proposed model.
10.1007/978-981-15-8155-7_244•
An Efficient and Accurate Method for Detecting Objects in Aerial Images

[...]

Peng Sun1, Yongbin Zheng1, Zongtan Zhou1•
National University of Defense Technology1
1 Jan 2022
TL;DR: Zhang et al. as mentioned in this paper proposed an efficient and accurate method for aerial image object detection in which oriented bounding boxes of objects are predicted by utilizing a simple network in the first stage, and the rotated bounding box predictions are then sent to non-maximum suppression (NMS) to produce final detection results.
Abstract: Aerial image object detection in aerial images is a hot and challenging task in computer vision, due to the bird-view perspective, complex backgrounds, variant scales and appearance of objects and extremely dense objects distribution. It has previously been observed that existing methods cannot meet the application requirements of accuracy and speed at the same time. In this paper, we propose an efficient and accurate method for aerial image object detection. The pipeline of our method has only two stages. The oriented bounding boxes of objects are predicted by utilizing a simple network in the first stage, and the rotated bounding box predictions are then sent to non-maximum suppression (NMS) to produce final detection results. Besides, atrous spatial pyramid pooling (ASPP) network is added to the pipeline to extract multi-scale features, and Bi-directional long short term memory network (BiLSTM) is adopted to improve detection performance of long and slender instances. Experiments on the challenging DOTA dataset have shown the propose method outperforms existing methods in terms of detection rate and speed.
10.1007/978-981-16-6554-7_13•
Image Semantic Segmentation Based on Joint Normalization

[...]

Jiexin Zheng1, Taiwei Qiu1, Lihong Chen2, Shengyang Liang1•
Chang'an University1, South China Agricultural University2
1 Jan 2022
TL;DR: Zhang et al. as mentioned in this paper proposed a depthwise separable convolution-joint feature pyramid (DSC-JFP) model, ASPP model and auxiliary network are removed to improve the real-time performance of semantic segmentation.
Abstract: Image semantic segmentation is an important research direction in image processing, computer vision and deep learning. Semantic segmentation is to classify the image pixel by pixel, so that the original image is divided into semantic segmentation images with specific pixel marks, which is the most challenging in image processing. Based on DSC-JFP (depthwise separable convolution-joint feature pyramid) model, ASPP model and auxiliary network are removed to improve the real-time performance of semantic segmentation. Combined with batch normalization and instance normalization, parallel batch and instance normalization (PBIN) and cascaded batch and instance normalization (CBIN) methods are proposed to improve the effect of semantic segmentation. The experimental results also show that the proposed method improves the real-time performance of semantic segmentation while ensuring the effect of semantic segmentation.
Journal Article•10.32604/CMC.2022.020820•
Kernel granulometric texture analysis and light res-aspp-unet classification for covid-19 detection

[...]

R. Gopi, P. Muthusamy, P. Suresh, C. G. Gabriel Santhosh Kumar, Irina V. Pustokhina, Denis A. Pustokhin, K. Shankar 
01 Jan 2022-Cmc-computers Materials & Continua
TL;DR: In this article, the authors proposed an automatic frame work for detecting COVID-19 at the early stage using chest X-ray image and achieved 99.6% accuracy in detecting the virus at its early stage.
Abstract: This research article proposes an automatic frame work for detecting COVID -19 at the early stage using chest X-ray image. It is an undeniable fact that coronovirus is a serious disease but the early detection of the virus present in human bodies can save lives. In recent times, there are somany research solutions that have been presented for early detection, but there is still a lack in need of right and even rich technology for its early detection. The proposed deep learning model analysis the pixels of every image and adjudges the presence of virus. The classifier is designed in such a way so that, it automatically detects the virus present in lungs using chest image. This approach uses an image texture analysis technique called granulometric mathematical model. Selected features are heuristically processed for optimization using novel multi scaling deep learning called light weight residual-atrous spatial pyramid pooling (LightRES-ASPP-Unet) Unet model. The proposed deep LightRES-ASPPUnet technique has a higher level of contracting solution by extracting major level of image features. Moreover, the corona virus has been detected using high resolution output. In the framework, atrous spatial pyramid pooling (ASPP) method is employed at its bottom level for incorporating the deep multi scale features in to the discriminative mode. The architectural working starts from the selecting the features from the image using granulometric mathematical model and the selected features are optimized using LightRESASPP- Unet. ASPP in the analysis of images has performed better than the existing Unet model. The proposed algorithm has achieved 99.6% of accuracy in detecting the virus at its early stage. © 2022 Tech Science Press. All rights reserved.
Journal Article•10.1016/J.MEDIA.2021.102251•
High resolution histopathology image generation and segmentation through adversarial training.

[...]

Wenyuan Li1, Jiayun Li1, Jennifer S. Polson1, Zichen Wang1, William Speier1, Corey W. Arnold •
University of California, Los Angeles1
01 Jan 2022-Medical Image Analysis
TL;DR: In this article, a multi-scale conditional GAN is proposed for high-resolution, large-scale histopathology image generation and segmentation, which consists of a pyramid of GAN structures, each responsible for generating and segmenting images at a different scale.
Journal Article•10.1016/J.AEJ.2021.05.004•
Modified phase correlation algorithm for image registration based on pyramid

[...]

Yang Li1, b4migpd9852, Jianli Wang1, b4migpd9853, Kainan Yao1 •
Chinese Academy of Sciences1, Tongji Medical College2, Huazhong University of Science and Technology3
01 Jan 2022-alexandria engineering journal
TL;DR: In this paper, a pyramid phase correlation algorithm (PCA) and normalized cross correlation-pyramid (NCCP) algorithm are combined for image registration in frequency domain and spatial domain, respectively.
Abstract: Image registration is an important process for applications in various fields, such as remote sensing and medical imaging; thus, its accuracy significantly affects the efficacy as well as efficiency of those applications. Phase correlation algorithm (PCA) and normalized cross correlation-pyramid (NCCP) algorithm are the state-of-the-art frequency domain and spatial domain methods for image registration, respectively. However, these algorithms have some limitations. In particular, the registration speed of PCA needs to be improved, while the NCCP algorithm leads to errors if the image to be registered is partially occluded. Thus, to overcome these limitations, we propose a pyramid PCA that combines both algorithms. To verify the performance of our proposed algorithm, its results are compared with those obtained using the traditional PCA and NCCP algorithm. Our simulation results for partially occluded images indicate that the proposed algorithm outperforms the NCCP algorithm in terms of accuracy; in addition, it outperforms PCA in terms of speed. Furthermore, to test the feasibility of the proposed algorithm for real-time applications, a panoramic target detection system was set up, and the results obtained using the system proved that our method for image registration was both feasible and effective.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve