Segmentation of heterogeneous document images : an ... - Tel


tel-00912566, version 1 - 2 Dec 2013

Figure 2.12: The left image shows segmented lines from a document in [7]; the right image displays projection profiles of vertical strips.

segmentation contests. Hough-based methods can also be considered projection-based methods. They rely on the Hough transform, but instead of projecting pixel values onto the x or y axis, they project intensities into a parameter space. In general, the purpose of the Hough transform is to find imperfect instances of objects within a certain class of shapes by a voting procedure. The classical Hough transform is concerned with identifying lines in an image, which correspond to text lines in the case of a document image. However, it is not limited to lines, and other kinds of shapes can be found in an image as well.

In document image analysis, the Hough transform is used in a variety of situations. Some methods use it to detect text lines or their skew angle, to derive characteristics such as the direction of a connected handwritten word, or even to detect table lines. Here we note two methods that use the Hough transform specifically for detecting text lines.

In [58], G. Louloudis et al. propose a Hough transform based method for segmenting handwritten text lines. Earlier results were published in [56, 57]. The method is adapted to the challenges of handwritten documents, including arbitrarily slanted and skewed text lines, accent marks above or below the text lines, and touching lines. After the document image is divided into equally sized blocks, the Hough transform is used to find the dominant direction of the connected components in each block. The algorithm also keeps track of the dominant direction of all components in the document. To form text lines, a decision is made for each connected component based on rules that compare the direction of each component in a block with those in its adjacent blocks.
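The voting idea behind the block-wise dominant direction can be illustrated with a minimal sketch. This is not the implementation of [58]: it assumes that each connected component is reduced to one or more (x, y) voting points, that angles are searched on a 0.5° grid, and that blocks are axis-aligned squares. The function names `hough_accumulator`, `dominant_skew` and `block_skews` are hypothetical.

```python
import numpy as np

def hough_accumulator(points, thetas_deg, rho_res=1.0):
    """Vote in (rho, theta) space using rho = x*cos(theta) + y*sin(theta)."""
    pts = np.asarray(points, dtype=float)
    thetas = np.deg2rad(thetas_deg)
    # One rho value per (point, angle) pair, shape (N, T).
    rhos = pts[:, [0]] * np.cos(thetas) + pts[:, [1]] * np.sin(thetas)
    rho_idx = np.round(rhos / rho_res).astype(int)
    rho_min = rho_idx.min()
    acc = np.zeros((rho_idx.max() - rho_min + 1, len(thetas)), dtype=int)
    cols = np.broadcast_to(np.arange(len(thetas)), rho_idx.shape)
    # Unbuffered add, so repeated (rho, theta) cells accumulate every vote.
    np.add.at(acc, (rho_idx - rho_min, cols), 1)
    return acc, rho_min

def dominant_skew(points, thetas_deg=np.arange(60.0, 120.5, 0.5)):
    """Skew in degrees of the strongest line through the points.

    theta is the angle of the line normal, so the line direction is
    theta - 90 (0 means horizontal in the given coordinate system).
    """
    acc, _ = hough_accumulator(points, thetas_deg)
    best_theta = thetas_deg[np.argmax(acc.max(axis=0))]
    return float(best_theta - 90.0)

def block_skews(points, block_size):
    """Dominant skew per square block (assumed layout), keyed by (bx, by)."""
    pts = np.asarray(points, dtype=float)
    keys = np.floor(pts / block_size).astype(int)
    result = {}
    for key in np.unique(keys, axis=0):
        mask = np.all(keys == key, axis=1)
        if mask.sum() >= 2:  # a direction needs at least two points
            result[tuple(int(k) for k in key)] = dominant_skew(pts[mask])
    return result
```

With very few or closely spaced points, several neighbouring angles can collect the same number of votes, so the angular resolution that can be trusted depends on how far the voting points are spread along the line.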
An additional constraint is applied: a text line is valid only if its skew angle deviates from the dominant direction by less than 2°. This method is successful provided that the free parameters are set correctly. Furthermore, because the algorithm keeps track of the dominant direction, all text lines in the document must have roughly the same direction.

Figure 2.13: Steps for locating text line separators in part of a document image [75]. (a) Five zones 1-5; (b) projection profile of zone 3; (c) first derivative; (d) initial, refined and final regions.

The method published in [61] is another method based on the Hough transform. In the first step, the authors apply a Hough transform to each connected component, namely each handwritten word, to find its direction. The algorithm then searches for the nearest neighbors of each component in four principal directions. Once the neighbors are found, a weighted directed graph is built by connecting each component to its neighbors with an edge whose weight is proportional to the geometric distance between the components. Finally, to form text lines, the algorithm removes top-to-bottom edges by thresholding the edge length.

Texture based methods

Any method that is based on some kind of filtering, be it Gabor, wavelet, Gaussian or simply an averaging operator, fits into this category. The first method that we review performs text line segmentation on freestyle, script-independent handwritten or printed documents. Y. Li et al. first published preliminary results for this method in [52] and later in [53]. The method assumes that text lines have a horizontally elongated shape, although a variation of ±10° is allowed. It estimates a probability density function by convolving the image with a non-parametric anisotropic Gaussian kernel. Initial estimates of the text line boundaries are computed by thresholding this density map, and a level set method then evolves from these initial estimates to obtain the final text line boundaries.
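The first stage of such a density-based approach, smoothing followed by thresholding, can be sketched as follows. This is only an illustration under stated assumptions, not the method of [52, 53]: the kernel here is an axis-aligned separable Gaussian that is wider horizontally than vertically, the values of `sigma_x`, `sigma_y` and `rel_threshold` are arbitrary, and the level set evolution is omitted. All function names are hypothetical.

```python
import numpy as np

def gaussian_kernel_1d(sigma):
    """Normalised 1D Gaussian truncated at about three standard deviations."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1)
    k = np.exp(-0.5 * (x / sigma) ** 2)
    return k / k.sum()

def text_density(binary, sigma_x=15.0, sigma_y=2.0):
    """Smooth a binary text image (text = 1) with an anisotropic Gaussian.

    Assumes the image is at least as large as each kernel, since
    np.convolve(mode="same") returns the length of the longer input.
    """
    img = np.asarray(binary, dtype=float)
    kx = gaussian_kernel_1d(sigma_x)
    ky = gaussian_kernel_1d(sigma_y)
    # Separable filtering: along rows first, then along columns.
    out = np.apply_along_axis(np.convolve, 1, img, kx, mode="same")
    out = np.apply_along_axis(np.convolve, 0, out, ky, mode="same")
    return out

def initial_line_mask(binary, rel_threshold=0.5, **kernel_args):
    """Initial text line regions: density above a fraction of its maximum."""
    density = text_density(binary, **kernel_args)
    return density >= rel_threshold * density.max()
```

Choosing `sigma_x` much larger than `sigma_y` merges the words of one line into a single elongated blob while keeping vertically adjacent lines apart, which is the reason for using an anisotropic kernel in the first place.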
Another method is proposed in [29]. Du et al. propose a script-independent method for segmenting handwritten text lines based on a piecewise ap-