Qiran Kong
3 Papers
Qiran Kong is an academic researcher. The author has contributed to research in topics: Computer science & Pattern recognition (psychology). The author has an hindex of 1, co-authored 2 publications.
Chat about Author
Papers
CDT-CAD: Context-Aware Deformable Transformers for End-to-End Chest Abnormality Detection on X-Ray Images.
TL;DR: CDT-CAD as discussed by the authors constructs an iterative context-aware feature extractor, which not only enlarges receptive fields to encode multi-scale context information via dilated context encoding blocks, but also captures unique and scalable feature variation patterns in wavelet frequency domain via frequency pooling blocks.
30
CDText: Scene Text Detector based on Context-aware Deformable Transformer
Yiru Wu,Qiran Kong,Lai Yong,Fabio Narducci,Shaohua Wan +4 more
TL;DR: CDText as mentioned in this paper adopts different convolution kernel designs for feature extraction, which designs receptive fields with different size for multi-scale feature perception and fusion, and a multi-head self-attention mechanism is used to strengthen the reasoning ability of CDText in a global sense, thus enhancing feature maps with abundant context information by extracting implicit relationship between multiscale text features.
6
End-PolarT: Polar Representation for End-to-End Scene Text Detection
Yiru Wu,Qiran Kong,Cheng Qian,Michele Nappi,Shaohua Wan +4 more
TL;DR: End-PolarT network proposes an end-to-end, single-stage method for scene text detection using polar coordinates, reducing computation cost by regressing contour points instead of pixels, and achieving superior results on public datasets with balanced efficiency and effectiveness.
1