Segmentation of heterogeneous document images : an ... - Tel

14.01.2014 Views
[78] L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. IEEE, 77(2):257–286, 1989. [79] Z. Razak, K. Zulkiflee, M. Yamani, I. Idris, E. M. Tamil, M. Noorzaily, M. Noor, R. Salleh, M. Yaacob, and Z. M. Yusof. Offline handwriting text line segmentation: a review. International Journal of Computer Science and Network Security, 8(7):12–20, 2008. [80] M. D. Riley. Time-frequency representations for speech signals. Technical report, 1987. [81] F. Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, 65(6):386– 408, 1958. [82] J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33(2):225–236, 2000. tel-00912566, version 1 - 2 Dec 2013 [83] F. Shafait and T. M. Breuel. Document image dewarping contest. International Workshop on Camera-Based Document Analysis and Recognition, 2007. [84] F. Shafait, D. Keysers, and T. M. Breuel. Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6):941–954, 2008. [85] F. Shafait, J. van Beusekom, D. Keysers, and T. M. Breuel. Background variability modeling for statistical layout analysis. 2008 19th International Conference on Pattern Recognition, pages 1–4, December 2008. [86] F. Shafait, J. Van Beusekom, D. Keysers, and T. M. Breuel. Structural mixtures for statistical layout analysis. In 2008 The Eighth IAPR International Workshop on Document Analysis Systems, pages 415–422. IEEE, September 2008. [87] Z. Shi, S. Setlur, and V. Govindaraju. Text extraction from gray scale historical document images using adaptive local connectivity map. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 794–798, 2005. [88] Z. Shi, S. Setlur, and V. Govindaraju. A steerable directional local profile technique for extraction of handwritten arabic text lines. 10th International Conference on Document Analysis and Recognition (ICDAR ’09), 2009. [89] R. Smith. An overview of the Tesseract OCR engine. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 2:629– 633, 2007. [90] T. Stafylakis, V. Papavassiliou, V. Katsouros, and G. Carayannis. Robust text-line and word segmentation for handwritten documents images. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 3393–3396, March 2008. 124

[91] M. Stamp. A revealing introduction to hidden Markov models. Technical report, Department of Computer Science San Jose State University, 2004. [92] T. Su, T. Zhang, and D. Guan. Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 10(1):27–38, March 2007. [93] H. M. Sun. Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 116–120, 2005. [94] C. Sutton and A. McCallum. An introduction to conditional random fields for relational learning. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning, volume 7 of Adaptive Computation and Machine Learning, chapter 4, page 93. The MIT Press, 2006. tel-00912566, version 1 - 2 Dec 2013 [95] M. S. Taylor, F. S. Brundick, and A. E. Brodeen. A statistical approach to the generation of a database for evaluating OCR software. International Journal on Document Analysis and Recognition, 4:170–176, 2002. [96] K. Tombre, S. Tabbone, and L. Pélissier. Text/graphics separation revisited. DAS ’02 Proceedings of the 5th International Workshop on Document Analysis Systems V, pages 200–211, August 2002. [97] A Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 13(2):260–269, 1967. [98] F. M. Wahl, K. Y. Wong, and R. G. Casey. Block segmentation and text extraction in mixed text/image documents. Computer Graphics and Image Processing, 20(4):375–390, December 1982. [99] B. Waked. Page segmentation and identification for document image analysis. PhD thesis, Concordia University, Montreal, Canada, 2001. [100] Y. Weiss. Correctness of local probability propagation in graphical models with loops. Neural Computation, 12(1):1–41, 2000. [101] K. Y. Wong, R. G. Casey, and F. M. Wahl. Document analysis system. IBM Journal of Research and Development, 26(6):647–656, 1982. [102] Y. Xiao and H. Yan. Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition, 36(3):799–809, March 2003. [103] F. Yin and C. Liu. Handwritten Chinese text line segmentation by clustering with distance metric learning. Pattern Recognition, 42(12):3146–3157, 2009. [104] L. A. Zadeh. A simple view of the Dempster-Shafer theory of evidence and its implication for the rule of combination. The AI Magazine, 7(2):85–90, July 1986. 125

Page 1 and 2: tel-00912566, version 1 - 2 Dec 201

Page 3 and 4: Resumé La segmentation de page est

Page 5 and 6: Acknowledgements This work would no

Page 7 and 8: 4.3.2 Text components . . . . . . .

Page 9 and 10: 3.6 Two documents that have obtaine

Page 11 and 12: 6.1 PARAGRAPH DETECTION SUCCESS RAT


Page 15 and 16: detection and we conclude that the



Page 21 and 22: Figure 1.8: A screen shot that show

Page 23 and 24: Chapter 2 Related work tel-00912566


Page 27 and 28: them. In such circumstances, it wou


Page 31 and 32: [21] is another texture-based metho

Page 33 and 34: Figure 2.4: Part of a document in o

Page 35 and 36: • Degraded quality due to ageing

Page 37 and 38: 2.3.2 Handwritten text line detecti

Page 39 and 40: (a) Divided strips and their projec

Page 41 and 42: (a) Five zones 1-5 (b) Projection p

Page 43 and 44: would be difficult to draw a conclu

Page 45 and 46: The proposed methods by Xiao [102],


Page 49 and 50: is assigning a label to a region of

Page 51 and 52: fixed range. When the elongation ap


Page 55 and 56: The second method calculates the co

Page 57 and 58: 3. Repeat for m = 1, 2, ..., M •


Page 61 and 62: Chapter 4 Region detection tel-0091

Page 63 and 64: The next advantage of using CRFs is

Page 65 and 66: weights that are assigned to edge a

Page 67 and 68: { 1 if ys = text and y f 1 (y s , y

Page 69 and 70: (a) Document (b) Filled text compon



Page 75 and 76: f = [y c = 0] × [y tl = 0] f = [y

Page 77 and 78: (a) Ground-truth (b) y c = 0 tel-00

Page 79 and 80: ∂l λ = ∑ ( ∑y∈Y f k (y s ,

Page 81 and 82: incorrect [100]. Several sufficient





Page 91 and 92: Table 4.3: TION COUNT WEIGHTED SUCC



Page 97 and 98: Chapter 5 Text line detection tel-0



Page 103 and 104: Having specified the model, a verti

Page 105 and 106: • The fifth step is to remove ext


Page 109 and 110: text lines can be divided into two

Page 111 and 112: the two children. The root node rep

Page 113 and 114: leaves of the tree which contain on




Page 121 and 122: currently working on some of these

Page 123 and 124: • fn (false negative) is the numb

Page 125 and 126: 2 ∗ RA ∗ DR F − Measure = RA

Page 127 and 128: • ”-tn”: This option uses the

Page 129 and 130: [12] T. M. Breuel. Two geometric al

Page 131 and 132: [39] B. Gatos, A. Antonacopoulos, a

Page 133: [64] K. P. Murphy, Y. Weiss, and M.

Page 137 and 138: Index tel-00912566, version 1 - 2 D

method

methods

components

detection

segmentation

documents

feature

region

analysis

regions

heterogeneous

images

tel.archives-ouvertes.fr

Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel ... View more Segmentation of heterogeneous document images : an ... - Tel

Delete template?

Save as template ?

Segmentation of heterogeneous document images : an ... - Tel Segmentation of heterogeneous document images : an ... - Tel