TL;DR: Intelligent Scissors allow objects within digital images to be extracted quickly and accurately using simple gesture motions with a mouse, using live-wire boundary detection as an optimal path search in a weighted graph.
TL;DR: A new image representation is presented which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space based on segmentation using the expectation-maximization algorithm on combined color andtexture features.
Abstract: Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. This so-called "blobworld" representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric.
TL;DR: A system that automatically segments and labels glioblastoma-multiforme tumors in magnetic resonance images (MRIs) of the human brain is presented and generally correspond well to ground truth, both on a per slice basis and more importantly in tracking total tumor volume during treatment over time.
Abstract: A system that automatically segments and labels glioblastoma-multiforme tumors in magnetic resonance images (MRIs) of the human brain is presented. The MRIs consist of T1-weighted, proton density, and T2-weighted feature images and are processed by a system which integrates knowledge-based (KB) techniques with multispectral analysis. Initial segmentation is performed by an unsupervised clustering algorithm. The segmented image, along with cluster centers for each class are provided to a rule-based expert system which extracts the intracranial region. Multispectral histogram analysis separates suspected tumor from the rest of the intracranial region, with region analysis used in performing the final tumor labeling. This system has been trained on three volume data sets and tested on thirteen unseen volume data sets acquired from a single MRI system. The KB tumor segmentation was compared with supervised, radiologist-labeled "ground truth" tumor volumes and supervised K-nearest neighbors tumor segmentations. The results of this system generally correspond well to ground truth, both on a per slice basis and more importantly in tracking total tumor volume during treatment over time.
TL;DR: An algorithm based on morphological watersheds has been implemented and tested on the segmentation of microscopic nuclei clusters and provides a tool that can be used for the implementation of both gradient- and domain-based algorithms, and, more importantly, for the Implementation of mixed (gradient- and shape-based) algorithms.
Abstract: Cluster division is a critical issue in fluorescence microscopy-based analytical cytology when preparation protocols do not provide appropriate separation of objects. Overlooking clustered nuclei and analyzing only isolated nuclei may dramatically increase analysis time or affect the statistical validation of the results. Automatic segmentation of clustered nuclei requires the implementation of specific image segmentation tools. Most algorithms are inspired by one of the two following strategies: 1) cluster division by the detection of internuclei gradients; or 2) division by definition of domains of influence (geometrical approach). Both strategies lead to completely different implementations, and usually algorithms based on a single view strategy fail to correctly segment most clustered nuclei, or perform well just for a specific type of sample. An algorithm based on morphological watersheds has been implemented and tested on the segmentation of microscopic nuclei clusters. This algorithm provides a tool that can be used for the implementation of both gradient- and domain-based algorithms, and, more importantly, for the implementation of mixed (gradient- and shape-based) algorithms. Using this algorithm, almost 90% of the test clusters were correctly segmented in peripheral blood and bone marrow preparations. The algorithm was valid for both types of samples, using the appropriate markers and transformations.
TL;DR: This paper considers the problem of the automatic evaluation of the results of color image segmentation, and identifies some limitations in this evaluation function, and proposes two enhanced functions that correspond more closely to visual judgment.
TL;DR: This work proposes a robust algorithm for the segmentation of three-dimensional (3-D) image data based on a novel combination of adaptive K-mean clustering and knowledge-based morphological operations that has been successfully applied to a sequence of cardiac CT volumetric images.
Abstract: Image segmentation remains one of the major challenges in image analysis. In medical applications, skilled operators are usually employed to extract the desired regions that may be anatomically separate but statistically indistinguishable. Such manual processing is subject to operator errors and biases, is extremely time consuming, and has poor reproducibility. We propose a robust algorithm for the segmentation of three-dimensional (3-D) image data based on a novel combination of adaptive K-mean clustering and knowledge-based morphological operations. The proposed adaptive K-mean clustering algorithm is capable of segmenting the regions of smoothly varying intensity distributions. Spatial constraints are incorporated in the clustering algorithm through the modeling of the regions by Gibbs random fields. Knowledge-based morphological operations are then applied to the segmented regions to identify the desired regions according to the a priori anatomical knowledge of the region-of-interest. This proposed technique has been successfully applied to a sequence of cardiac CT volumetric images to generate the volumes of left ventricle chambers at 16 consecutive temporal frames. Our final segmentation results compare favorably with the results obtained using manual outlining. Extensions of this approach to other applications can be readily made when a priori knowledge of a given object is available.
TL;DR: Simulation results demonstrate that the proposed technique can efficiently segment video sequences with fast moving, newly appearing, or disappearing objects in the scene.
Abstract: This paper presents a technique for unsupervised video segmentation. This technique consists of two phases: initial segmentation and temporal tracking, similar to a number of existing techniques. However, new algorithms for spatial segmentation, marker extraction, and modified watershed transformation are proposed for the present technique. The new algorithms make this technique differ from existing techniques by the following features: (1) it can effectively track fast moving objects, (2) it can detect the appearance of new objects as well as the disappearance of existing objects, and (3) it is computationally efficient because of the use of watershed transformations and a fast motion estimation algorithm. Simulation results demonstrate that the proposed technique can efficiently segment video sequences with fast moving, newly appearing, or disappearing objects in the scene.
TL;DR: A new automatic video sequence segmentation algorithm that extracts moving objects from the sequence using an object tracker that matches a two-dimensional binary model of the object against subsequent frames using the Hausdorff distance.
Abstract: The new video coding standard MPEG-4 is enabling content-based functionalities. It takes advantage of a prior decomposition of sequences into video object planes (VOPs) so that each VOP represents one moving object. A comprehensive review summarizes some of the most important motion segmentation and VOP generation techniques that have been proposed. Then, a new automatic video sequence segmentation algorithm that extracts moving objects is presented. The core of this algorithm is an object tracker that matches a two-dimensional (2-D) binary model of the object against subsequent frames using the Hausdorff distance. The best match found indicates the translation the object has undergone, and the model is updated every frame to accommodate for rotation and changes in shape. The initial model is derived automatically, and a new model update method based on the concept of moving connected components allows for comparatively large changes in shape. The proposed algorithm is improved by a filtering technique that removes stationary background. Finally, the binary model sequence guides the extraction objects of the VOPs from the sequence. Experimental results demonstrate the performance of our algorithm.
TL;DR: In this paper, the authors describe the importance of market segmentation as the bedrock of successful marketing, and propose a set of guidelines for success in the segmentation process, including fast tracking through the process of market mapping and determining the scope of a segmentation project.
Abstract: Foreword vii Preface and acknowledgements ix An important note to the reader from the authors xi List of figures xv List of tables xix 1 Market segmentation the bedrock of successful marketing 1 2 Preparing for segmentation additional guidelines for success 21 3 Fast tracking through the segmentation process 47 4 Determining the scope of a segmentation project 71 5 Portraying how a market works and identifying decision-makers 105 6 Developing a representative sample of different decision-makers 143 7 Accounting for the behaviour of decision-makers 213 8 Forming market segments out of like-minded decision-makers 255 9 Determining the attractiveness of market segments 303 10 Assessing company competitiveness and the portfolio matrix 329 11 Realizing the full potential of market mapping 349 12 Predicting channel transformation 369 13 Setting marketing objectives and strategies for identified segments 407 14 Organizational issues in market segmentation 449 15 Using segmentation to improve performance a case study 469 Index 481
TL;DR: A novel semantic video object extraction system using mathematical morphology and a perspective motion model to solve the semantic videoobject extraction problem in two separate steps: supervised I-frame segmentation, and unsupervised P-frame tracking.
Abstract: This paper introduces a novel semantic video object extraction system using mathematical morphology and a perspective motion model. Inspired by the results from the study of the human visual system, we intend to solve the semantic video object extraction problem in two separate steps: supervised I-frame segmentation, and unsupervised P-frame tracking. First, the precise semantic video object boundary can be found using a combination of human assistance and a morphological segmentation tool. Second, the semantic video objects in the remaining frames are obtained using global perspective motion estimation and compensation of the previous semantic video object plus boundary refinement as used for I frames.
TL;DR: A novel image segmentation algorithm was developed, in which an explicit anatomical template is used to moderate the segmentation obtained by statistical classification, which forms an adaptive, template moderated, spatially varying statistical classification (SVC).
Abstract: A novel image segmentation algorithm was developed to allow the automatic segmentation of both normal and abnormal anatomy. The new algorithm is a form of spatially varying classification (SVC), in which an explicit anatomical template is used to moderate the segmentation obtained by k Nearest Neighbour (k-NN) statistical classification. The new algorithm consists of an iterated sequence of spatially varying classification and nonlinear registration, which creates an adaptive, template moderated (ATM), spatially varying classification (SVC).
TL;DR: This task may be accomplished at the coder site to support the functionalities foreseen by new multimedia scenarios, and in particular the content-based functionalities focused by the MPEG-4 activity, allowing the user to access and decode single objects of a video sequence.
TL;DR: Experimental results on different types of scenes demonstrate the ability of the proposed technique for spatio-temporal segmentation to automatically partition the scene into its constituent objects.
Abstract: This paper proposes a technique for spatio-temporal segmentation to identify the objects present in the scene represented in a video sequence. This technique processes two consecutive frames at a time. A region-merging approach is used to identify the objects in the scene. Starting from an oversegmentation of the current frame, the objects are formed by iteratively merging regions together. Regions are merged based on their mutual spatio-temporal similarity. We propose a modified Kolmogorov-Smirnov test for estimating the temporal similarity. The region-merging process is based on a weighted, directed graph. Two complementary graph-based clustering rules are proposed, namely, the strong rule and the weak rule. These rules take advantage of the natural structures present in the graph. Experimental results on different types of scenes demonstrate the ability of the proposed technique to automatically partition the scene into its constituent objects.
TL;DR: This work derives and demonstrates a method for measuring probable scene boundaries, by calculating a short term memory-based model of shot-to-shot "coherence", and derive and demonstrate a one-pass on-the-fly shot clustering algorithm.
Abstract: In extended video sequences, individual frames are grouped into shots which are defined as a sequence taken by a single camera, and related shots are grouped into scenes which are defined as a single dramatic event taken by a small number of related cameras. This hierarchical structure is deliberately constructed, dictated by the limitations and preferences of the human visual and memory systems. We present three novel high-level segmentation results derived from these considerations, some of which are analogous to those involved in the perception of the structure of music. First and primarily, we derive and demonstrate a method for measuring probable scene boundaries, by calculating a short term memory-based model of shot-to-shot "coherence". The detection of local minima in this continuous measure permits robust and flexible segmentation of the video into scenes, without the necessity for first aggregating shots into clusters. Second, and independently of the first, we then derive and demonstrate a one-pass on-the-fly shot clustering algorithm. Third, we demonstrate partially successful results on the application of these two new methods to the next higher, "theme", level of video structure.
TL;DR: Core-related geometric properties and image object representations are laid out which, together with the aforementioned insensitivities, allow the core to be used effectively for a variety of image analysis objectives.
TL;DR: A way of incorporating curvilinear grouping into region-based image segmentation through normalized cut approach and results on a large variety of images are shown.
Abstract: Region-based image segmentation techniques make use of similarity in intensity, color and texture to determine the partitioning of an image. The powerful cue of contour continuity is not exploited at all. In this paper, we provide a way of incorporating curvilinear grouping into region-based image segmentation. Soft contour information is obtained through orientation energy. Weak contrast gaps and subjective contours are completed by contour propagation. The normalized cut approach proposed by Shi and Malik is used for the segmentation. Results on a large variety of images are shown.
TL;DR: For a convex smoothing penalty, the global optimal solution for an energy function that fits the data can be obtained in a polynomial time, by a novel use of the maximum-flow algorithm.
Abstract: We propose a method for segmenting gray-value images. By segmentation, we mean a map from the set of pixels to a small set of levels such that each connected component of the set of pixels with the same level forms a relatively large and "meaningful" region. The method finds a set of levels with associated gray values by first finding junctions in the image and then seeking a minimum set of threshold values that preserves the junctions. Then it finds a segmentation map that maps each pixel to the level with the closest gray value to the pixel data, within a smoothness constraint. For a convex smoothing penalty, we show the global optimal solution for an energy function that fits the data can be obtained in a polynomial time, by a novel use of the maximum-flow algorithm. Our approach is in contrast to a view in computer vision where segmentation is driven by intensity, gradient, usually not yielding closed boundaries.
TL;DR: In this paper, a mix of published evidence and case examples is used to explore three questions: Is segmentation a good idea? Why does it sometimes fail? What can be done to reduce the chance of failure? The authors conclude that if marketers are to overcome their segmentation implementation difficulties, they need practical guidance at three stages in the segmentation process.
Abstract: Despite the well‐documented benefits which segmentation offers, businesses continue to encounter implementation difficulties. This raises concerns about the cause of these problems and how they might be overcome. These concerns are addressed in this paper in the form of three questions: Is segmentation a good idea? If segmentation is such a good idea, why does it sometimes fail? What can be done to reduce the chance of failure? A mix of published evidence and case examples is used to explore these questions. The paper concludes by suggesting that if marketers are to overcome their segmentation implementation difficulties, they need practical guidance at three stages in the segmentation process. Before the project begins they must understand the role of success factors contributing to a successful result. During the segmentation project the qualities of the emerging segments must be clarified. After segmentation is complete the question of segment attractiveness must be considered. There is currently a gulf between the priorities of academics and practitioners carrying out segmentation. If this is to be bridged, further research is needed to provide guidance on segmentation success factors.
TL;DR: The improved image segmentation of the EASA technique allowed up to 32 times more plant cotyledons to be recognized under overcast lighting conditions when compared with a static segmentation technique trained under sunny conditions.
TL;DR: A scheme for interactive video segmentation that splitting relieves the computer of ill-posed semantic problems, and allows a higher level of flexibility of the method, is presented.
Abstract: We present a scheme for interactive video segmentation. A key feature of the system is the distinction between two levels of segmentation, namely, regions and object segmentation. Regions are homogeneous areas of the images, which are extracted automatically by the computer. Semantically meaningful objects are obtained through user interaction by grouping of regions according to the specific application. This splitting relieves the computer of ill-posed semantic problems, and allows a higher level of flexibility of the method. The extraction of regions is based on the multidimensional analysis of several image features by a spatially constrained fuzzy C-means algorithm. The local level of reliability of the different features is taken into account in order to adaptively weight the contribution of each feature to the segmentation process. Results on the extraction of regions as well as on the tracking of spatiotemporal objects are presented.
TL;DR: It is proposed that contextual influences serve pre-attentive visual segmentation by causing relatively higher neural responses to important or conspicuous image locations, making them more salient for perceptual pop-out.
Abstract: Stimuli outside classical receptive fields have been shown to exert significant influence over the activities of neurons in primary visual cortexWe propose that contextual influences are used for pre-attentive visual segmentation, in a new framework called {\it segmentation without classification}. This means that segmentation of an image into regions occurs without classification of features within a region or comparison of features between regions. This segmentation framework is simpler than previous computational approaches, making it implementable by V1 mechanisms, though higher leve l visual mechanisms are needed to refine its output. However, it easily handles a class of segmentation problems that are tricky in conventional methods. The cortex computes {\it global} region boundaries by detecting the breakdown of homogeneity or translation invariance in the input, using {\it local} intra-cortical interactions mediated by the horizontal connections. The difference between contextual influences near and far from region boundaries makes neural activities near region boundaries higher than elsewhere, making boundaries more salient for perceptual pop-out. This proposal is implemented in a biologically based model of V1, and demonstrated using examples of texture segmentation and figure-ground segregation. The model performs segmentation in exactly the same neural circuit that solves the dual problem of the enhancement of contours, as is suggested by experimental observations. Its behavior is compared with psychophysical and physiological data on segmentation, contour enhancement, and contextual influences. We discuss the implications of {\it segmentation without classification} and the predictions of our V1 model, and relate it to other phenomena such as asymmetry in visual search. .
TL;DR: A novel image segmentation technique using the robust, adaptive least kth order squares (ALKS) estimator which minimizes the kth Order statistics of the squares of residuals.
Abstract: We propose a novel image segmentation technique using the robust, adaptive least kth order squares (ALKS) estimator which minimizes the kth order statistics of the squares of residuals. The optimal value of k is determined from the data, and the procedure detects the homogeneous surface patch representing the relative majority of the pixels. The ALKS shows a better tolerance to structured outliers than other recently proposed similar techniques. The performance of the new, fully autonomous, range image segmentation algorithm is compared to several other methods.
TL;DR: In this article, a method for extracting environment features from 1D range data and their interpretation is presented, where the segmentation process is considered to include a subsequent matching step where segments which belong to the same landmark are merged while keeping track of those which originate from distinct features.
Abstract: A scheme for extracting environment features from 1D range data and their interpretation is presented. Segmentation is done by deciding on a measure of model fidelity which is applied to adjacent groups of measurements. The extraction process is considered to include a subsequent matching step where segments which belong to the same landmark are to be merged while keeping track of those which originate from distinct features. This is done by an agglomerative hierarchical clustering algorithm with a Mahalanobis distance matrix. The method is discussed with straight line segments which are found in a generalized least squares sense using polar coordinates including their first-order covariance estimates. As a consequence, extraction is no longer a real time problem on the level of single range readings, but must be treated on the level of whole scans. Experimental results with three commercially available laser scanners are presented. The implementation on a mobile robot which performs a mapbased localization demonstrate the accuracy and applicability of the method under real time conditions. The collection of line segments and associated covariance matrices obtained from the extraction process contains more information about the scene than is required for map-based localization. In a subsequent reasoning step this information is made explicit. By successive abstraction and consequent propagation of uncertainties, a compact scene model is finally obtained in the form of a weighted symbolic description preserving topology information and reflecting the main characteristics of a local observation.
TL;DR: A significant effort of the COST 211/sup ter/ group activities is dedicated toward image and video sequence analysis and segmentation-an important technological aspect for the success of emerging object-based MPEG-4 and MPEG-7 multimedia applications.
Abstract: Flexibility and efficiency of coding, content extraction, and content-based search are key research topics in the field of interactive multimedia. Ongoing ISO MPEG-4 and MPEG-7 activities are targeting standardization to facilitate such services. European COST Telecommunications activities provide a framework for research collaboration. At present a significant effort of the COST 211/sup ter/ group activities is dedicated toward image and video sequence analysis and segmentation-an important technological aspect for the success of emerging object-based MPEG-4 and MPEG-7 multimedia applications. The current work of COST 211 is centered around the test model, called the analysis model (AM). The essential feature of the AM is its ability to fuse information from different sources to achieve a high-quality object segmentation. The current information sources are the intermediate results from frame-based (still) color segmentation, motion vector based segmentation, and change-detection-based segmentation. Motion vectors, which form the basis for the motion vector based intermediate segmentation, are estimated from consecutive frames. A recursive shortest spanning tree (RSST) algorithm is used to obtain intermediate color and motion vector based segmentation results. A rule-based region processor fuses the intermediate results; a postprocessor further refines the final segmentation output. The results of the current AM are satisfactory.
TL;DR: A method of statistical background modeling for stereo sequences that improves the reliability and sensitivity of segmentation in the presence of object clutter is presented, and the dynamic version of the method, called gated background adaptation, can reliably learn background statistics in the absence of corrupting foreground motion.
Abstract: Stereo sequences promise to be a powerful method for segmenting images for applications such as tracking human figures. We present a method of statistical background modeling for stereo sequences that improves the reliability and sensitivity of segmentation in the presence of object clutter. The dynamic version of the method, called gated background adaptation, can reliably learn background statistics in the presence of corrupting foreground motion. The method has been used with a simple head discriminator to detect and track people using a stereo head mounted on a pan/tilt platform. It runs at video rates using standard PC hardware.
TL;DR: An efficient approach to identify tabular structures within either electronic or paper documents by taking word bounding box information as input, and outputs the corresponding logical text block units through the T-Recs system.
TL;DR: A novel genre of optimization problems, which are motivated in part by certain aspects of clustering and data mining, is studied, and a general greedy scheme is presented, which can be specialized to approximate any segmentation problem.
Abstract: We study a novel genre of optimization problems, which we call segmentation problems, motivated in part by certain aspects of clustering and data mining. For any classical optimization problem, the corresponding segmentation problem seeks to partition a set of cost vectors into several segments, so that the overall cost is optimized. We focus on two natural and interesting (but MAXSNP-complete) problems in this class, the hypercube segmentation problem and the catalog segmentation problem, and present approximation algorithms for them. We also present a general greedy scheme, which can be specialized to approximate any segmentation problem.
TL;DR: A high-performance shot boundary detection-based video segmentation algorithm that uses unsupervised clustering on a multiple feature input space, followed by a heuristic elimination process to detect, with almost perfect accuracy, shot boundaries in the video.
Abstract: A central step in content-based video retrieval is the temporal segmentation of video. An application independent approach to video segmentation is to detect temporally contiguous segments without significant content change between successive frames. Each such segment is termed a shot. A high-performance shot boundary detection-based video segmentation algorithm is proposed. The technique uses unsupervised clustering on a multiple feature input space, followed by a heuristic elimination process to detect, with almost perfect accuracy, shot boundaries in the video. With an extremely high accuracy coupled with a very small number of false positives, this algorithm outperforms most of the existing techniques.
TL;DR: One of the interests of the method is its ability to give the best solution, according to the resolution level required by the user, that is, to the prior distribution chosen.
Abstract: Segmentation of a nonstationary process consists in assuming piecewise stationarity and in detecting the instants of change. We consider the case where all the data is available at the same time and perform a global segmentation instead of a sequential procedure. We build a change process and define arbitrarily its prior distribution. This allows us to propose the MAP estimate as well as some minimum contrast estimate as a solution. One of the interests of the method is its ability to give the best solution, according to the resolution level required by the user, that is, to the prior distribution chosen. The method can address a wide class of parametric and nonparametric models. Simulations and applications to real data are proposed.