TL;DR: A real-time algorithm for foreground-background segmentation that can handle scenes containing moving backgrounds or illumination variations, and it achieves robust detection for different types of videos is presented.
Abstract: We present a real-time algorithm for foreground-background segmentation. Sample background values at each pixel are quantized into codebooks which represent a compressed form of background model for a long image sequence. This allows us to capture structural background variation due to periodic-like motion over a long period of time under limited memory. The codebook representation is efficient in memory and speed compared with other background modeling techniques. Our method can handle scenes containing moving backgrounds or illumination variations, and it achieves robust detection for different types of videos. We compared our method with other multimode modeling techniques. In addition to the basic algorithm, two features improving the algorithm are presented-layered modeling/detection and adaptive codebook updating. For performance evaluation, we have applied perturbation detection rate analysis to four background subtraction algorithms and two videos of different types of scenes.
TL;DR: LOCUS (learning object classes with unsupervised segmentation) is introduced which uses a generative probabilistic model to combine bottom-up cues of color and edge with top-down cues of shape and pose, allowing for significant within-class variation.
Abstract: We address the problem of learning object class models and object segmentations from unannotated images. We introduce LOCUS (learning object classes with unsupervised segmentation) which uses a generative probabilistic model to combine bottom-up cues of color and edge with top-down cues of shape and pose. A key aspect of this model is that the object appearance is allowed to vary from image to image, allowing for significant within-class variation. By iteratively updating the belief in the object's position, size, segmentation and pose, LOCUS avoids making hard decisions about any of these quantities and so allows for each to be refined at any stage. We show that LOCUS successfully learns an object class model from unlabeled images, whilst also giving segmentation accuracies that rival existing supervised methods. Finally, we demonstrate simultaneous recognition and segmentation in novel images using the learned models for a number of object classes, as well as unsupervised object discovery and tracking in video.
TL;DR: This work devise a graph cut algorithm for interactive segmentation which incorporates shape priors, and positive results on both medical and natural images are demonstrated.
Abstract: Interactive or semi-automatic segmentation is a useful alternative to pure automatic segmentation in many applications. While automatic segmentation can be very challenging, a small amount of user input can often resolve ambiguous decisions on the part of the algorithm. In this work, we devise a graph cut algorithm for interactive segmentation which incorporates shape priors. While traditional graph cut approaches to interactive segmentation are often quite successful, they may fail in cases where there are diffuse edges, or multiple similar objects in close proximity to one another. Incorporation of shape priors within this framework mitigates these problems. Positive results on both medical and natural images are demonstrated.
TL;DR: A novel hierarchical mesh segmentation algorithm, which is based on new methods for prominent feature point and core extraction, which produces correct hierarchical segmentations of meshes, both in the coarse levels of the hierarchy and in the fine levels, where tiny segments are extracted.
Abstract: Mesh segmentation has be- come a necessary ingredient in many applications in computer graphics. This paper proposes a novel hierarch- ical mesh segmentation algorithm, which is based on new methods for prominent feature point and core extraction. The algorithm has several benefits. First, it is invariant both to the pose of the model and to different proportions between the model's components. Second, it produces correct hierarchical segmentations of meshes, both in the coarse levels of the hierarchy and in the fine levels, where tiny segments are extracted. Finally, the boundaries between the segments go along the natural seams of the models.
TL;DR: This work addresses the problem of segmenting 3D scan data into objects or object classes by using a recently proposed maximum-margin framework to discriminatively train the model from a set of labeled scans and automatically learn the relative importance of the features for the segmentation task.
Abstract: We address the problem of segmenting 3D scan data into objects or object classes. Our segmentation framework is based on a subclass of Markov random fields (MRFs) which support efficient graph-cut inference. The MRF models incorporate a large set of diverse features and enforce the preference that adjacent scan points have the same classification label. We use a recently proposed maximum-margin framework to discriminatively train the model from a set of labeled scans; as a result we automatically learn the relative importance of the features for the segmentation task. Performing graph-cut inference in the trained MRF can then be used to segment new scenes very efficiently. We test our approach on three large-scale datasets produced by different kinds of 3D sensors, showing its applicability to both outdoor and indoor environments containing diverse objects.
TL;DR: In this paper, a multiscale object-specific segmentation (MOSS) approach is presented for automatically delineating image-objects (i.e., segments) at multiple scales from a high-spatial resolution remotely sensed forest scene.
TL;DR: A new airway segmentation method based on fuzzy connectivity that works on various types of scans (low-dose and regular dose, normal subjects and diseased subjects) without the need for the user to manually adjust any parameters is presented.
Abstract: The segmentation of the human airway tree from volumetric computed tomography (CT) images builds an important step for many clinical applications and for physiological studies. Previously proposed algorithms suffer from one or several problems: leaking into the surrounding lung parenchyma, the need for the user to manually adjust parameters, excessive runtime. Low-dose CT scans are increasingly utilized in lung screening studies, but segmenting them with traditional airway segmentation algorithms often yields less than satisfying results. In this paper, a new airway segmentation method based on fuzzy connectivity is presented. Small adaptive regions of interest are used that follow the airway branches as they are segmented. This has several advantages. It makes it possible to detect leaks early and avoid them, the segmentation algorithm can automatically adapt to changing image parameters, and the computing time is kept within moderate values. The new method is robust in the sense that it works on various types of scans (low-dose and regular dose, normal subjects and diseased subjects) without the need for the user to manually adjust any parameters. Comparison with a commonly used region-grow segmentation algorithm shows that the newly proposed method retrieves a significantly higher count of airway branches. A method that conducts accurate cross-sectional airway measurements on airways is presented as an additional processing step. Measurements are conducted in the original gray-level volume. Validation on a phantom shows that subvoxel accuracy is achieved for all airway sizes and airway orientations.
TL;DR: The second international Chinese word segmentation bakeoff was held in the summer of 2005 and it was found that the technology has improved over the intervening two years, though the out-of-vocabulary problem is still or paramount importance.
Abstract: The second international Chinese word segmentation bakeoff was held in the summer of 2005 to evaluate the current state of the art in word segmentation. Twenty three groups submitted 130 result sets over two tracks and four different corpora. We found that the technology has improved over the intervening two years, though the out-of-vocabulary problem is still or paramount importance.
TL;DR: In this paper, four algorithms from the two main groups of segmentation algorithms (boundary-based and region-based) were evaluated and compared and an evaluation of each algorithm was carried out with empirical discrepancy evaluation methods.
Abstract: Since 1999, very high spatial resolution satellite data represent the surface of the Earth with more detail. However, information extraction by per pixel multispectral classification techniques proves to be very complex owing to the internal variability increase in land-cover units and to the weakness of spectral resolution. Image segmentation before classification was proposed as an alternative approach, but a large variety of segmentation algorithms were developed during the last 20 years, and a comparison of their implementation on very high spatial resolution images is necessary. In this study, four algorithms from the two main groups of segmentation algorithms (boundarybased and region-based) were evaluated and compared. In order to compare the algorithms, an evaluation of each algorithm was carried out with empirical discrepancy evaluation methods. This evaluation is carried out with a visual segmentation of Ikonos panchromatic images. The results show that the choice of parameters is very important and has a great influence on the segmentation results. The selected boundary-based algorithms are sensitive to the noise or texture. Better results are obtained with regionbased algorithms, but a problem with the transition zones between the contrasted objects can be present.
TL;DR: A new method based on color information is proposed that is robust in the presence of noise, shadows, pavement, and obstacles such like cars, motorcycles and pedestrians conditions and this method is applicable in complex environment.
Abstract: Lane boundary detection is the problem of estimating the geometric structure of the lane boundaries of a road on the images captured by a camera. To be an intelligent vehicle, lane boundary is necessary information, so the system and the algorithm should be as simple and fast as possible. In this paper, we propose a new method based on color information and this method is applicable in complex environment. In this system, we first choose a region of interest to find out a threshold using statistical method in a color image. The threshold then will be used to distinguish possible lane boundary from the road. We use color-based segmentation to find out the lane boundary and use a quadratic function to approach it. This system demands low computational power and memory requirements, and is robust in the presence of noise, shadows, pavement, and obstacles such like cars, motorcycles and pedestrians conditions. The result images can be used as pre-processed images for lane tracking, road following or obstacle detection.
TL;DR: The goal was to first develop a system for segmentation of the audio signal, and then classification into one of two main categories: speech or music, and results show that efficiency is exceptionally good, without sacrificing performance.
Abstract: Over the last several years, major efforts have been made to develop methods for extracting information from audiovisual media, in order that they may be stored and retrieved in databases automatically, based on their content. In this work we deal with the characterization of an audio signal, which may be part of a larger audiovisual system or may be autonomous, as for example in the case of an audio recording stored digitally on disk. Our goal was to first develop a system for segmentation of the audio signal, and then classification into one of two main categories: speech or music. Among the system's requirements are its processing speed and its ability to function in a real-time environment with a small responding delay. Because of the restriction to two classes, the characteristics that are extracted are considerably reduced and moreover the required computations are straightforward. Experimental results show that efficiency is exceptionally good, without sacrificing performance. Segmentation is based on mean signal amplitude distribution, whereas classification utilizes an additional characteristic related to the frequency. The classification algorithm may be used either in conjunction with the segmentation algorithm, in which case it verifies or refutes a music-speech or speech-music change, or autonomously, with given audio segments. The basic characteristics are computed in 20 ms intervals, resulting in the segments' limits being specified within an accuracy of 20 ms. The smallest segment length is one second. The segmentation and classification algorithms were benchmarked on a large data set, with correct segmentation about 97% of the time and correct classification about 95%.
TL;DR: It is shown that incorporation of a nonparametric probability density model allows for an extended random walkers algorithm that can locate disconnected objects and does not require user-specified labels.
Abstract: The recently introduced random walker segmentation algorithm by Grady and Funka-Lea (2004) has been shown to have desirable theoretical properties and to perform well on a wide variety of images in practice. However, this algorithm requires user-specified labels and produces a segmentation where each segment is connected to a labeled pixel. We show that incorporation of a nonparametric probability density model allows for an extended random walkers algorithm that can locate disconnected objects and does not require user-specified labels. Finally, we show that this formulation leads to a deep connection with the popular graph cuts method by Boykov et al. (2001) and Wu and Leahy (1993).
TL;DR: The experimental results show that the proposed image segmentation system has the desired ability for the segmentation of color image in a variety of vision tasks.
Abstract: An image segmentation system is proposed for the segmentation of color image based on neural networks. In order to measure the color difference properly, image colors are represented in a modified L/sup */u/sup */v/sup */ color space. The segmentation system comprises unsupervised segmentation and supervised segmentation. The unsupervised segmentation is achieved by a two-level approach, i.e., color reduction and color clustering. In color reduction, image colors are projected into a small set of prototypes using self-organizing map (SOM) learning. In color clustering, simulated annealing (SA) seeks the optimal clusters from SOM prototypes. This two-level approach takes the advantages of SOM and SA, which can achieve the near-optimal segmentation with a low computational cost. The supervised segmentation involves color learning and pixel classification. In color learning, color prototype is defined to represent a spherical region in color space. A procedure of hierarchical prototype learning (HPL) is used to generate the different sizes of color prototypes from the sample of object colors. These color prototypes provide a good estimate for object colors. The image pixels are classified by the matching of color prototypes. The experimental results show that the system has the desired ability for the segmentation of color image in a variety of vision tasks.
TL;DR: This work presents a novel texture and shape priors based method for kidney segmentation in ultrasound (US) images that is demonstrated through experimental results on both natural images and US data compared with other image segmentation methods and manual segmentation.
Abstract: This work presents a novel texture and shape priors based method for kidney segmentation in ultrasound (US) images. Texture features are extracted by applying a bank of Gabor filters on test images through a two-sided convolution strategy. The texture model is constructed via estimating the parameters of a set of mixtures of half-planed Gaussians using the expectation-maximization method. Through this texture model, the texture similarities of areas around the segmenting curve are measured in the inside and outside regions, respectively. We also present an iterative segmentation framework to combine the texture measures into the parametric shape model proposed by Leventon and Faugeras. Segmentation is implemented by calculating the parameters of the shape model to minimize a novel energy function. The goal of this energy function is to partition the test image into two regions, the inside one with high texture similarity and low texture variance, and the outside one with high texture variance. The effectiveness of this method is demonstrated through experimental results on both natural images and US data compared with other image segmentation methods and manual segmentation.
TL;DR: In this paper, a geometric model and a computational algorithm for segmentation of ultrasound images are presented, where a partial differential equation (PDE)-based flow is designed in order to achieve a maximum likelihood segmentation.
Abstract: This study presents a geometric model and a computational algorithm for segmentation of ultrasound images. A partial differential equation (PDE)-based flow is designed in order to achieve a maximum likelihood segmentation of the target in the scene. The flow is derived as the steepest descent of an energy functional taking into account the density probability distribution of the gray levels of the image as well as smoothness constraints. To model gray level behavior of ultrasound images, the classic Rayleigh probability distribution is considered. The steady state of the flow presents a maximum likelihood segmentation of the target. A finite difference approximation of the flow is derived, and numerical experiments are provided. Results are presented on ultrasound medical images as fetal echography arid echocardiography.
TL;DR: Two algorithms capable of real-time segmentation of foreground from background layers in stereo video sequences are described and found to have similar p performance, substantially better than stereo or colour/contrast alone.
Abstract: This paper demonstrates the high quality, real-time segmentation techniques. We achieve real-time segmentation of foreground from background layers in stereo video sequences. Automatic separation of layers from colour/contrast or from stereo alone is known to be error-prone. Here, colour, contrast and stereo matching information are fused to infer layers accurately and efficiently. The first algorithm, layered dynamic programming (LDP), solves stereo in an extended 6-state space that represents both foreground/background layers and occluded regions. The stereo-match likelihood is then fused with a contrast-sensitive colour model that is learned on the fly, and stereo disparities are obtained by dynamic programming. The second algorithm, layered graph cut (LGC), does not directly solve stereo. Instead the stereo match likelihood is marginalised over foreground and background hypotheses, and fused with a contrast-sensitive colour model like the one used in LDP. Segmentation is solved efficiently by ternary graph cut. Both algorithms are evaluated with respect to ground truth data and found to have similar performance, substantially better than stereo or colour/contrast alone. However, their characteristics with respect to computational efficiency are rather different. The algorithms are demonstrated in the application of background substitution and shown to give good quality composite video output.
TL;DR: In this research, images containing visually separable classes of either ice and water or multiple ice classes are segmented and a novel Bayesian segmentation approach is developed and applied.
Abstract: Environmental and sensor challenges pose difficulties for the development of computer-assisted algorithms to segment synthetic aperture radar (SAR) sea ice imagery. In this research, in support of operational activities at the Canadian Ice Service, images containing visually separable classes of either ice and water or multiple ice classes are segmented. This work uses image intensity to discriminate ice from water and uses texture features to identify distinct ice types. In order to seamlessly combine image spatial relationships with various image features, a novel Bayesian segmentation approach is developed and applied. This new approach uses a function-based parameter to weight the two components in a Markov random field (MRF) model. The devised model allows for automatic estimation of MRF model parameters to produce accurate unsupervised segmentation results. Experiments demonstrate that the proposed algorithm is able to successfully segment various SAR sea ice images and achieve improvement over existing published methods including the standard MRF-based method, finite Gamma mixture model, and K-means clustering.
TL;DR: In this article, a new approach for filtering is presented, which is a combination of both approaches, specifically exploiting their strengths, and it is demonstrated by examples in the field of aerial laser scanning.
Abstract: With airborne laser scanning points are measured on the terrain surface, and on other objects as buildings and vegetation. With socalled filtering methods a classification of the points into terrain and object points is performed. In the literature two approaches – i.e. a general strategies for solving the problem – for filtering can be identified. The first work directly on the measured points and geometric criteria are used for the decision, if a point is on the ground or an object point. The methods from the second approach first segment the data and then make a classification based on segments. In this paper we present a new approach for filtering. It is a combination of both approaches, specifically exploiting their strengths. A filter method following this new approach is developed and demonstrated by examples.
TL;DR: A generic framework for segmentation evaluation is introduced and a metric based on the distance between segmentation partitions is proposed to overcome some of the limitations of existing approaches.
Abstract: Image segmentation plays a major role in a broad range of applications. Evaluating the adequacy of a segmentation algorithm for a given application is a requisite both to allow the appropriate selection of segmentation algorithms as well as to tune their parameters for optimal performance. However, objective segmentation quality evaluation is far from being a solved problem. In this paper, a generic framework for segmentation evaluation is introduced after a brief review of previous work. A metric based on the distance between segmentation partitions is proposed to overcome some of the limitations of existing approaches. Symmetric and asymmetric distance metric alternatives are presented to meet the specificities of a wide class of applications. Experimental results confirm the potential of the proposed measures.
TL;DR: In this article, a digital segmentation method and apparatus determines foreground and background within at least one portion of a captured image, which can be implemented as part of a digital camera acquisition chain having effective computation complexity.
Abstract: A digital segmentation method and apparatus determines foreground and/or background within at least one portion of a captured image. The determining includes comparing a captured image to a pre-captured or post captured reference image of nominally the same scene. One of the images is taken with flash and the other without. The system can be implemented as part of a digital camera acquisition chain having effective computation complexity.
TL;DR: Qualitative and quantitative results obtained for benchmark image pairs show that the proposed algorithm outperforms most state-of-the-art matching algorithms currently listed on the Middlebury stereo evaluation website.
Abstract: This work describes a stereo algorithm that takes advantage of image segmentation, assuming that disparity varies smoothly inside a segment of homogeneous colour and depth discontinuities coincide with segment borders. Image segmentation allows our method to generate correct disparity estimates in large untextured regions and precisely localize depth boundaries. The disparity inside a segment is represented by a planar equation. To derive the plane model, an initial disparity map is generated. We use a window-based approach that exploits the results of segmentation. The size of the match window is chosen adaptively. A segment's planar model is then derived by robust least squared error fitting using the initial disparity map. In a layer extraction step, disparity segments that are found to be similar according to a plane dissimilarity measurement are combined to form a single robust layer. We apply a modified mean-shift algorithm to extract clusters of similar disparity segments. Segments of the same cluster build a layer, the plane parameters of which are computed from its spatial extent using the initial disparity map. We then optimize the assignment of segments to layers using a global cost function. The quality of the disparity map is measured by warping the reference image to the second view and comparing it with the real image. Z-buffering enforces visibility and allows the explicit detection of occlusions. The cost function measures the colour dissimilarity between the warped and real views, and penalizes occlusions and neighbouring segments that are assigned to different layers. Since the problem of finding the assignment of segments to layers that minimizes this cost function is N P -complete, an efficient greedy algorithm is applied to find a local minimum. Layer extraction and assignment are alternately applied. Qualitative and quantitative results obtained for benchmark image pairs show that the proposed algorithm outperforms most state-of-the-art matching algorithms currently listed on the Middlebury stereo evaluation website. The technique achieves particularly good results in areas with depth discontinuities and related occlusions, where missing stereo information is substituted from surrounding regions. Furthermore, we apply the algorithm to a self-recorded image set and show 3D visualizations of the derived results.
TL;DR: This paper investigates the design, development, and implementation of a robust color gradient vector flow (GVF) active contour model for performing segmentation, using a database of 1791 imaged cells and shows the results were superior to the other unsupervised approaches, and comparable with supervised segmentation.
Abstract: One of the most commonly used clinical tests performed today is the routine evaluation of peripheral blood smears. In this paper, we investigate the design, development, and implementation of a robust color gradient vector flow (GVF) active contour model for performing segmentation, using a database of 1791 imaged cells. The algorithms developed for this research operate in Luv color space, and introduce a color gradient and L/sub 2/E robust estimation into the traditional GVF snake. The accuracy of the new model was compared with the segmentation results using a mean-shift approach, the traditional color GVF snake, and several other commonly used segmentation strategies. The unsupervised robust color snake with L/sub 2/E robust estimation was shown to provide results which were superior to the other unsupervised approaches, and was comparable with supervised segmentation, as judged by a panel of human experts.
TL;DR: The proposed method retains spatial coherence on initial data characteristic of curve evolution techniques, as well as the balance between a pixel/voxel’s proximity to the curve and its intention to cross over the curve from the underlying energy.
Abstract: In this paper, we first draw a connection between a level set algorithm and k-Means plus nonlinear diffusion preprocessing. Then, we exploit this link to develop a new hybrid numerical technique for segmentation that draws on the speed and simplicity of k-Means procedures, and the robustness of level set algorithms. The proposed method retains spatial coherence on initial data characteristic of curve evolution techniques, as well as the balance between a pixel/voxel’s proximity to the curve and its intention to cross over the curve from the underlying energy. However, it is orders of magnitude faster than standard curve evolutions. Moreover, it does not suffer from the limitations of k-Means due to inaccurate local minima and allows for segmentation results ranging from k-Means clustering type partitioning to level set partitions.
TL;DR: This method is based on statistical modeling of an image pair using constraints on appearance and motion and is extended to video by chaining the pairwise models to produce a joint probability distribution to be maximized.
Abstract: In this paper, we propose a method for jointly computing optical flow and segmenting video while accounting for mixed pixels (matting). Our method is based on statistical modeling of an image pair using constraints on appearance and motion. Segments are viewed as overlapping regions with fractional (alpha) contributions. Bidirectional motion is estimated based on spatial coherence and similarity of segment colors. Our model is extended to video by chaining the pairwise models to produce a joint probability distribution to be maximized. To make the problem more tractable, we factorize the posterior distribution and iteratively minimize its parts. We demonstrate our method on frame interpolation
TL;DR: In this article, a system, method and apparatus for anatomical mapping utilizing optical coherence tomography was proposed, which can provide quantitative three-dimensional information about the spatial location and extent of macular edema and other pathologies.
Abstract: A system, method and apparatus for anatomical mapping utilizing optical coherence tomography. In the present invention, 3-dimensional fundus intensity imagery can be acquired from a scanning of light back-reflected from an eye. The scanning can include spectral domain scanning, as an example. A fundus intensity image can be acquired in real-time. The 3-dimensional data set can be reduced to generate an anatomical mapping, such as an edema mapping and a thickness mapping. Optionally, a partial fundus intensity image can be produced from the scanning of the eye to generate an en face view of the retinal structure of the eye without first requiring a full segmentation of the 3-D data set. Advantageously, the system, method and apparatus of the present invention can provide quantitative three-dimensional information about the spatial location and extent of macular edema and other pathologies. This three-dimensional information can be used to determine the need for treatment, monitor the effectiveness of treatment and identify the return of fluid that may signal the need for re-treatment.
TL;DR: The HIP user studies show that given correct segmentation, computers are much better at HIP character recognition than humans, and it is proposed that segmentation-based reading challenges are the future for building stronger human-friendly HIPs.
Abstract: Human interaction proofs (HIPs) have become common place on the internet due to their effectiveness in deterring automated abuse of online services intended for humans. However, there is a co-evolutionary arms race in progress and these proofs are becoming more difficult for genuine users while attackers are getting better at breaking existing HIPs. We studied various popular HIPs on the internet to understand their strength and human friendliness. To determine HIP strength, we adopted a direct approach of building computer attacks using image processing and machine learning techniques. To understand human-friendliness, a sequence of users studies were conducted to investigate HIP character recognition by humans under a variety of visual distortions and clutter commonly employed in reading-based HIPs. We found that many of the online HIPs are pure recognition tasks that can be easily broken using machine learning. The stronger HIPs tend to pose a combination of segmentation and recognition challenges. Further, the HIP user studies show that given correct segmentation, computers are much better at HIP character recognition than humans. In light of these results, we propose that segmentation-based reading challenges are the future for building stronger human-friendly HIPs. An example of such a segmentation-based HIP is presented with a preliminary assessment of its strength and human-friendliness.
TL;DR: This paper investigates the tumor segmentation performance of a recent variant of DRF models that takes advantage of the powerful Support Vector Machine (SVM) classification method, and indicates that the SVM-based DRFs offer a significant advantage over the other approaches.
Abstract: Markov Random Fields (MRFs) are a popular and well-motivated model for many medical image processing tasks such as segmentation. Discriminative Random Fields (DRFs), a discriminative alternative to the traditionally generative MRFs, allow tractable computation with less restrictive simplifying assumptions, and achieve better performance in many tasks. In this paper, we investigate the tumor segmentation performance of a recent variant of DRF models that takes advantage of the powerful Support Vector Machine (SVM) classification method. Combined with a powerful Magnetic Resonance (MR) preprocessing pipeline and a set of ‘alignment-based’ features, we evaluate the use of SVMs, MRFs, and two types of DRFs as classifiers for three segmentation tasks related to radiation therapy target planning for brain tumors, two of which do not rely on ‘contrast agent’ enhancement. Our results indicate that the SVM-based DRFs offer a significant advantage over the other approaches.
TL;DR: This paper introduces database-guided segmentation as a new data-driven paradigm that directly exploits expert annotation of interest structures in large medical databases and proposes a feature selection mechanism and the corresponding metric.
Abstract: The segmentation of anatomical structures has been traditionally formulated as a perceptual grouping task, and solved through clustering and variational approaches. However, such strategies require the a priori knowledge to be explicitly defined in the optimization criterion, e.g., "high-gradient border", "smoothness"', or "similar intensity or texture". This approach is limited by the validity of underlying assumptions and cannot capture complex structure appearance. This paper introduces database-guided segmentation as a new data-driven paradigm that directly exploits expert annotation of interest structures in large medical databases. Segmentation is formulated as a two-step learning problem. The first step is structure detection where we learn how to discriminate between the object of interest and background. The resulting classifier based on a boosted cascade of simple features also provides a global rigid transformation of the structure. The second step is shape inference where we use a sample-based representation of the joint distribution of appearance and shape annotations. To learn the association between the complex appearance and shape we propose a feature selection mechanism and the corresponding metric. We show that the selected features are better than using directly the appearance and illustrate the performance of the proposed method on a large set of ultrasound heart images.
TL;DR: A new approach to quantitative analysis of live cell image data using time as an extra dimension, level set methods are employed to determine cell trajectories from 2D + time data sets to help solving cell analysis problems of general importance including cell pedigree analysis and cell tracking.
Abstract: The Large Scale Digital Cell Analysis System (LSDCAS) developed at the University of Iowa provides capabilities for extended-time live cell image acquisition. This paper presents a new approach to quantitative analysis of live cell image data. By using time as an extra dimension, level set methods are employed to determine cell trajectories from 2D + time data sets. When identifying the cell trajectories, cell cluster separation and mitotic cell detection steps are performed. Each of the trajectories corresponds to the motion pattern of an individual cell in the data set. At each time frame, number of cells, cell locations, cell borders, cell areas, and cell states are determined and recorded. The proposed method can help solving cell analysis problems of general importance including cell pedigree analysis and cell tracking. The developed method was tested on cancer cell image sequences and its performance compared with manually-defined ground truth. The similarity Kappa Index is 0.84 for segmentation area and the signed border positioning segmentation error is 1.6 ± 2.1 μm.
TL;DR: This paper presents an evaluation of two popular segmentation algorithms, the mean shift-based segmentation algorithm and a graph- based segmentation scheme, and considers a hybrid method which combines the other two methods.
Abstract: Unsupervised image segmentation algorithms have matured to the point where they generate reasonable segmentations, and thus can begin to be incorporated into larger systems. A system designer now has an array of available algorithm choices, however, few objective numerical evaluations exist of these segmentation algorithms. As a first step towards filling this gap, this paper presents an evaluation of two popular segmentation algorithms, the mean shift-based segmentation algorithm and a graph-based segmentation scheme. We also consider a hybrid method which combines the other two methods. This quantitative evaluation is made possible by the recently proposed measure of segmentation correctness, the Normalized Probabilistic Rand (NPR) index, which allows a principled comparison between segmentations created by different algorithms, as well as segmentations on different images. For each algorithm, we consider its correctness as measured by the NPR index, as well as its stability with respect to changes in parameter settings and with respect to different images. An algorithm which produces correct segmentation results with a wide array of parameters on any one image, as well as correct segmentation results on multiple images with the same parameters, will be a useful, predictable and easily adjustable preprocessing step in a larger system. Our results are presented on the Berkeley image segmentation database, which contains 300 natural images along with several ground truth hand segmentations for each image. As opposed to previous results presented on this database, the algorithms we compare all use the same image features (position and colour) for segmentation, thereby making their outputs directly comparable.