TL;DR: Cascade R-CNN as mentioned in this paper proposes a multi-stage object detection architecture, which consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selective against close false positives.
Abstract: In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing positive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of the input hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is proposed to address these problems. It consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selective against close false positives. The detectors are trained stage by stage, leveraging the observation that the output of a detector is a good distribution for training the next higher quality detector. The resampling of progressively improved hypotheses guarantees that all detectors have a positive set of examples of equivalent size, reducing the overfitting problem. The same cascade procedure is applied at inference, enabling a closer match between the hypotheses and the detector quality of each stage. A simple implementation of the Cascade R-CNN is shown to surpass all single-model object detectors on the challenging COCO dataset. Experiments also show that the Cascade R-CNN is widely applicable across detector architectures, achieving consistent gains independently of the baseline detector strength. The code is available at https://github.com/zhaoweicai/cascade-rcnn.
TL;DR: This work proposes a cascade architecture built on convolutional neural networks (CNNs) with very powerful discriminative capability, while maintaining high performance, and introduces a CNN-based calibration stage after each of the detection stages in the cascade.
Abstract: In real-world face detection, large visual variations, such as those due to pose, expression, and lighting, demand an advanced discriminative model to accurately differentiate faces from the backgrounds. Consequently, effective models for the problem tend to be computationally prohibitive. To address these two conflicting challenges, we propose a cascade architecture built on convolutional neural networks (CNNs) with very powerful discriminative capability, while maintaining high performance. The proposed CNN cascade operates at multiple resolutions, quickly rejects the background regions in the fast low resolution stages, and carefully evaluates a small number of challenging candidates in the last high resolution stage. To improve localization effectiveness, and reduce the number of candidates at later stages, we introduce a CNN-based calibration stage after each of the detection stages in the cascade. The output of each calibration stage is used to adjust the detection window position for input to the subsequent stage. The proposed method runs at 14 FPS on a single CPU core for VGA-resolution images and 100 FPS using a GPU, and achieves state-of-the-art detection performance on two public face detection benchmarks.
TL;DR: In this paper, a simple procedure is proposed for calculating the number of atomic displacements produced in a damage cascade by a primary knock-on atom of known energy, which is in close accord with recent computer simulations of radiation damage phenomena.
TL;DR: Chen et al. as discussed by the authors proposed a Hybrid Task Cascade (HTC) framework, which interweaves the two tasks for a joint multi-stage processing and adopted a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background.
Abstract: Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features progressively while integrating complementary features together in each stage. Without bells and whistles, a single HTC obtains 38.4% and 1.5% improvement over a strong Cascade Mask R-CNN baseline on MSCOCO dataset. Moreover, our overall system achieves 48.6 mask AP on the test-challenge split, ranking 1st in the COCO 2018 Challenge Object Detection Task. Code is available at https://github.com/open-mmlab/mmdetection.
TL;DR: In this paper, the authors studied the geophysical beta-effect of two-dimensional eddies in a homogeneous fluid at large Reynolds number, and showed that when the energy is intermittent in space, the cascade is halted simply by the spreading of energy about space, and then the end state of a zonal flow is probably not achieved.
Abstract: Two-dimensional eddies in a homogeneous fluid at large Reynolds number, if closely packed, are known to evolve towards larger scales. In the presence of a restoring force, the geophysical beta-effect, this cascade produces a field of waves without loss of energy, and the turbulent migration of the dominant scale nearly ceases at a wavenumber kβ = (β/2U)½ independent of the initial conditions other than U, the r.m.s. particle speed, and β, the northward gradient of the Coriolis frequency.The conversion of turbulence into waves yields, in addition, more narrowly peaked wavenumber spectra and less fine-structure in the spatial maps, while smoothly distributing the energy about physical space.The theory is discussed, using known integral constraints and similarity solutions, model equations, weak-interaction wave theory (which provides the terminus for the cascade) and other linearized instability theory. Computer experiments with both finite-difference and spectral codes are reported. The central quantity is the cascade rate, defined as
\[
T = 2\int_0^{\infty} kF(k)dk/U^3\langle k\rangle ,
\]
where F is the nonlinear transfer spectrum and 〈k〉 the mean wavenumber of the energy spectrum. (In unforced inviscid flow T is simply U−1d〈k〉−1/dt, or the rate at which the dominant scale expands in time t.) T is shown to have a mean value of 3·0 × 10−2 for pure two-dimensional turbulence, but this decreases by a factor of five at the transition to wave motion. We infer from weak-interaction theory even smaller values for k [Lt ] kβ.After passing through a state of propagating waves, the homogeneous cascade tends towards a flow of alternating zonal jets which, we suggest, are almost perfectly steady. When the energy is intermittent in space, however, model equations show that the cascade is halted simply by the spreading of energy about space, and then the end state of a zonal flow is probably not achieved.The geophysical application is that the cascade of pure turbulence to large scales is defeated by wave propagation, helping to explain why the energy-containing eddies in the ocean and atmosphere, though significantly nonlinear, fail to reach the size of their respective domains, and are much smaller. For typical ocean flows, . In addition the cascade generates, by itself, zonal flow (or more generally, flow along geostrophic contours).