Image analysis using threshold reduction

doi:10.1117/12.49893

Open AccessProceedings Article10.1117/12.49893

Image analysis using threshold reduction

Dan S. Bloomberg

- 01 Jul 1991

- Proceedings of SPIE

- Vol. 1568, pp 38-52

12

TL;DR: A class of shift-variant reduction operations is introduced, that is useful for performing efficient and controllable shape and texture transformations between resolution levels, and some general properties of the cycle are derived.

Abstract: A class of shift-variant reduction operations is introduce d, that is useful for performing efficient and controllable shape and texture transformations between resolution levels. In their most general form, the operations proceed in three steps: (a) convolve a binary image with a kernel of arbitrary size; (b) threshold the result; (c) subsample to produce the reduced image. Taking a binary structuring element for the kernel, the threshold convolution on a binary image is equivalent to a rank order filter, and the full reduction operation is a threshold reduction. Threshold reductions that use convolution filters and subs ample tiles of equal size are optimized by combining the three operations, using only logical raster operations and producing threshold convolution values only at the sampling points. For 2x reduction, the four possible threshold values (1, 2, 3, and 4) refer to the minimum number of ON pixels within each 2x2 tile for which a pixel in the reduced image will be ON. Algorithms for boolean raster operations are given for 2x, 3x, and 4x threshold reduction, and lookup tables that effici ently implement column raster operations are provided. Threshold reduction rates of 2.5x pixel/second can be achieved with a Sun SparcStation2 . A mask-forming image analysis cycle of threshold reduction, augmented by morphology and followed by replicative expansion to full resolution, is described, an d some general properties of the cycle are derived. A simple application of threshold reduction to document image analysis, the extraction of halftone regions from scanned images that also contain text and line graphics, is illustrated. A sequence of 2x reductions with first low and then high thresholds is used to create a redu ced image consisting of a mask over the halftone regions. In this way, the extraction occurs as a nat ural consequence of the reductions.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Figure 4. (a) 4x reduction withm = 1 for each stage. Resolution: sampling (75/in), rendering (196/in). (b) Closing with 3x3 SE. Resolution: same as (a). (c) Further4x eduction withm = 4 for each stage. Resolution: sampling (19/in), rendering (49/in). (d) Opening with 3x3 SE. Resolution: same as (c).

Figure 3. Scanned image containing halftone image area(s). Sampling resolution is 300/in; rendering resolution is 375/in.

Figure 2. (a) initial image, (b) image after first cycle, (c) image in second cycle after closing with 3x3 SE in (e), (d) image after second cycle.

Figure 1. (a) Dilation filter for threshold 1; (b) erosion filter for threshold 4

Table 1. Implementation of 2x threshold reduction with boolean operations.

Citations

Proceedings Article•10.1117/12.205832

Measuring document image skew and orientation

Dan S. Bloomberg, +2 more

- 30 Mar 1995

TL;DR: This method does not indicate when text is upside-down, and it also requires sampling the function at 90 degrees of rotation to measure text skew in landscape mode, but such text orientation can be determined by noting that Roman characters in all languages have many more ascenders than descenders, and using morphological operations to identify such pixels.

...read moreread less

137

Multiresolution Morphological Approach to Document Image Analysis

Dan S. Bloomberg

- 01 Jan 1991

TL;DR: An image-based approach to document image analysis is presented, motivated by a merged view of shape and textural image properties at multiple scales, and the computational costs of the basic operations are given, so that algorithm efficiencies can be estimated.

...read moreread less

82

Patent

Methods and apparatus for selecting semantically significant images in a document image without decoding image content

M Margaret Withgott, +8 more

- 01 Sep 1992

TL;DR: In this paper, a method and apparatus for processing a document image, using a programmed general or special purpose computer, includes forming the image into image units, and at least one image unit classifier of each image unit is determined, without decoding the content of the image units.

...read moreread less

70

Proceedings Article•10.1117/12.131480

Multiresolution morphological analysis of document images

Dan S. Bloomberg

- 01 Nov 1992

TL;DR: An image-based approach to document image analysis is presented, that uses shape and textural properties interchangeably at multiple scales, and the importance of operating at the lowest feasable resolution is demonstrated.

...read moreread less

36

•Proceedings Article•10.1117/12.526615

Using mathematical morphology for document skew estimation

Laurent Najman

- 19 Dec 2003

TL;DR: This work proposes a concise definition of the skew angle of document, based on mathematical morphology, that has the advantages to be applicable both for binary and grey-scale images.

...read moreread less

36

References

•Book

Image Analysis and Mathematical Morphology

Jean Serra

- 11 Feb 1984

TL;DR: This invaluable reference helps readers assess and simplify problems and their essential requirements and complexities, giving them all the necessary data and methodology to master current theoretical developments and applications, as well as create new ones.

...read moreread less

10.1K

Journal Article•10.1109/TPAMI.1987.4767941

Image Analysis Using Mathematical Morphology

Robert M. Haralick, +2 more

- 01 Apr 1987

- IEEE Transactions on Pattern Analysis an...

TL;DR: The tutorial provided in this paper reviews both binary morphology and gray scale morphology, covering the operations of dilation, erosion, opening, and closing and their relations.

...read moreread less

2.9K

Journal Article•10.1109/TASSP.1987.1165254

Morphological filters--Part II: Their relations to median, order-statistic, and stack filters

Petros Maragos, +1 more

- 24 Mar 1987

- IEEE Transactions on Acoustics, Speech, ...

TL;DR: This paper extends the theory of median, order-statistic (OS), and stack filters by using mathematical morphology to analyze them and by relating them to those morphological erosions, dilations, openings, closings, and open-closings that commute with thresholding.

...read moreread less

582

Book Chapter•10.1007/978-3-642-51590-3_2

The Pyramid as a Structure for Efficient Computation

P. J. Burt

- 01 Jan 1984

TL;DR: Here the pyramid will be viewed primarily as a computational tool, however, interesting similarities will be noted between pyramid processing and processing within the human visual system.

...read moreread less

484

Journal Article•10.1109/34.41371

Segmentation of document images

Torfinn Taxt, +2 more

- 01 Dec 1989

- IEEE Transactions on Pattern Analysis an...

TL;DR: Several methods for segmentation of document images (maps, drawings, etc.) are explored and a noncontextual Bayes classifier performed best, and automatic updating improved the results for both classifiers.

...read moreread less

165