Journal Article10.1109/LGRS.2022.3228689
A Framework of Maximum Feature Exploration Oriented Remote Sensing Object Detection
Xuelong Li,Yue Xing,Ziwei Wang,Teng-Jiao Xiao,QingZeng Song,Weiwei Li,Jianming Wang +6 more
- Vol. 20, pp 1-5
5
TL;DR: Wang et al. as mentioned in this paper proposed a general framework of accurate remote sensing object localization constructed with the core idea of maximum image feature exploration, which is mainly comprised of cascaded tiny patch correlation (CTPC) based feature digging, averaging local patch regression (ALPR) centered coarse location acquisition, and instance segmentation oriented further refinement (ISOFR).
read more
Abstract: Object detection from remote sensing images is an interesting research topic full of theoretical and application values. Compared with generic images, those of remote sensing contain much less distinguishing representation information due to far-away capturing position and limited camera lens resolution, the perpendicular to ground capturing view, almost unacquirable 3-D object structure and serious roof occlusion, and the inherent unimportance of top roof view. Focusing on this intrinsic disadvantage, in this letter, we propose a general framework of accurate remote sensing object localization constructed with the core idea of maximum image feature exploration. This integrated structure is mainly comprised of cascaded tiny patch correlation (CTPC) based feature digging, averaging local patch regression (ALPR) centered coarse location acquisition, and instance segmentation oriented further refinement (ISOFR). All those components are designed to maximize feature digging and usage. We believe this is the first to implement the idea of maximum feature exploration to each corner of object detection architecture. Extensive experiments on challenging aerial image datasets DOTA and NWPU VHR-10 show state-of-the-art performance.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Physical-Simulation-based Dynamic Template Matching Method for Remote Sensing Small Object Detection
Yaming Cao,Lei Guo,Fengguang Xiong,Liqun Kuang,Xie Han +4 more
TL;DR: This article proposes a template matching method based on physical simulation images (TMSI) and adds it to the modified Darknet-53 (named TMSI-Net) for small target detection and proposes DTMSI-Net with a dynamic template library (Dynamic TMSI-Net).
1
Learning Point Processes and Convolutional Neural Networks for Object Detection in Satellite Images
TL;DR: Learning point processes and CNNs for object detection in satellite images combines object detection with object interactions using a Markov Marked Point Process framework.
Class-Aware Prediction: A Solution to Center Point Collision for Anchor-Free Object Detection in Aerial Images
Yixing Yong,Jian Wang,Fan Li +2 more
Learning and Scoring Point Process Models for Object Detection in Satellite Images
Jules Mabon,Mathias Ortner,Josiane Zerubia +2 more
- 26 Aug 2024
TL;DR: This paper proposes a joint Point Process and CNN method for object detection in satellite images, leveraging interaction priors to complement limited visual information, with matching parameter estimation and result scoring procedures for improved accuracy and interpretability.
Learning point process models for vehicles detection using CNNs in satellite images
Jules Mabon,Mathias Ortner,Josiane Zerubia +2 more
- 08 Nov 2023
TL;DR: A novel point process model for vehicle detection in satellite images using CNNs, improving results over straight CNN outputs.
References
•Posted Content
Deep Residual Learning for Image Recognition
TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
117.9K
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin,Michael Maire,Serge Belongie,James Hays,Pietro Perona,Deva Ramanan,Piotr Dollár,C. Lawrence Zitnick +7 more
- 06 Sep 2014
TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon,Santosh K. Divvala,Ross Girshick,Ali Farhadi +3 more
- 27 Jun 2016
TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
•Posted Content
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy,Lucas Beyer,Alexander Kolesnikov,Dirk Weissenborn,Xiaohua Zhai,Thomas Unterthiner,Mostafa Dehghani,Matthias Minderer,Georg Heigold,Sylvain Gelly,Jakob Uszkoreit,Neil Houlsby +11 more
TL;DR: Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.