Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel Segmentation of heterogeneous document images : an ... - Tel
[78] L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. IEEE, 77(2):257–286, 1989. [79] Z. Razak, K. Zulkiflee, M. Yamani, I. Idris, E. M. Tamil, M. Noorzaily, M. Noor, R. Salleh, M. Yaacob, and Z. M. Yusof. Offline handwriting text line segmentation: a review. International Journal of Computer Science and Network Security, 8(7):12–20, 2008. [80] M. D. Riley. Time-frequency representations for speech signals. Technical report, 1987. [81] F. Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, 65(6):386– 408, 1958. [82] J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33(2):225–236, 2000. tel-00912566, version 1 - 2 Dec 2013 [83] F. Shafait and T. M. Breuel. Document image dewarping contest. International Workshop on Camera-Based Document Analysis and Recognition, 2007. [84] F. Shafait, D. Keysers, and T. M. Breuel. Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6):941–954, 2008. [85] F. Shafait, J. van Beusekom, D. Keysers, and T. M. Breuel. Background variability modeling for statistical layout analysis. 2008 19th International Conference on Pattern Recognition, pages 1–4, December 2008. [86] F. Shafait, J. Van Beusekom, D. Keysers, and T. M. Breuel. Structural mixtures for statistical layout analysis. In 2008 The Eighth IAPR International Workshop on Document Analysis Systems, pages 415–422. IEEE, September 2008. [87] Z. Shi, S. Setlur, and V. Govindaraju. Text extraction from gray scale historical document images using adaptive local connectivity map. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 794–798, 2005. [88] Z. Shi, S. Setlur, and V. Govindaraju. A steerable directional local profile technique for extraction of handwritten arabic text lines. 10th International Conference on Document Analysis and Recognition (ICDAR ’09), 2009. [89] R. Smith. An overview of the Tesseract OCR engine. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 2:629– 633, 2007. [90] T. Stafylakis, V. Papavassiliou, V. Katsouros, and G. Carayannis. Robust text-line and word segmentation for handwritten documents images. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 3393–3396, March 2008. 124
[91] M. Stamp. A revealing introduction to hidden Markov models. Technical report, Department of Computer Science San Jose State University, 2004. [92] T. Su, T. Zhang, and D. Guan. Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 10(1):27–38, March 2007. [93] H. M. Sun. Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 116–120, 2005. [94] C. Sutton and A. McCallum. An introduction to conditional random fields for relational learning. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning, volume 7 of Adaptive Computation and Machine Learning, chapter 4, page 93. The MIT Press, 2006. tel-00912566, version 1 - 2 Dec 2013 [95] M. S. Taylor, F. S. Brundick, and A. E. Brodeen. A statistical approach to the generation of a database for evaluating OCR software. International Journal on Document Analysis and Recognition, 4:170–176, 2002. [96] K. Tombre, S. Tabbone, and L. Pélissier. Text/graphics separation revisited. DAS ’02 Proceedings of the 5th International Workshop on Document Analysis Systems V, pages 200–211, August 2002. [97] A Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 13(2):260–269, 1967. [98] F. M. Wahl, K. Y. Wong, and R. G. Casey. Block segmentation and text extraction in mixed text/image documents. Computer Graphics and Image Processing, 20(4):375–390, December 1982. [99] B. Waked. Page segmentation and identification for document image analysis. PhD thesis, Concordia University, Montreal, Canada, 2001. [100] Y. Weiss. Correctness of local probability propagation in graphical models with loops. Neural Computation, 12(1):1–41, 2000. [101] K. Y. Wong, R. G. Casey, and F. M. Wahl. Document analysis system. IBM Journal of Research and Development, 26(6):647–656, 1982. [102] Y. Xiao and H. Yan. Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition, 36(3):799–809, March 2003. [103] F. Yin and C. Liu. Handwritten Chinese text line segmentation by clustering with distance metric learning. Pattern Recognition, 42(12):3146–3157, 2009. [104] L. A. Zadeh. A simple view of the Dempster-Shafer theory of evidence and its implication for the rule of combination. The AI Magazine, 7(2):85–90, July 1986. 125
- Page 83 and 84: tel-00912566, version 1 - 2 Dec 201
- Page 85 and 86: tel-00912566, version 1 - 2 Dec 201
- Page 87 and 88: tel-00912566, version 1 - 2 Dec 201
- Page 89 and 90: tel-00912566, version 1 - 2 Dec 201
- Page 91 and 92: Table 4.3: TION COUNT WEIGHTED SUCC
- Page 93 and 94: tel-00912566, version 1 - 2 Dec 201
- Page 95 and 96: tel-00912566, version 1 - 2 Dec 201
- Page 97 and 98: Chapter 5 Text line detection tel-0
- Page 99 and 100: tel-00912566, version 1 - 2 Dec 201
- Page 101 and 102: tel-00912566, version 1 - 2 Dec 201
- Page 103 and 104: Having specified the model, a verti
- Page 105 and 106: • The fifth step is to remove ext
- Page 107 and 108: tel-00912566, version 1 - 2 Dec 201
- Page 109 and 110: text lines can be divided into two
- Page 111 and 112: the two children. The root node rep
- Page 113 and 114: leaves of the tree which contain on
- Page 115 and 116: tel-00912566, version 1 - 2 Dec 201
- Page 117 and 118: tel-00912566, version 1 - 2 Dec 201
- Page 119 and 120: tel-00912566, version 1 - 2 Dec 201
- Page 121 and 122: currently working on some of these
- Page 123 and 124: • fn (false negative) is the numb
- Page 125 and 126: 2 ∗ RA ∗ DR F − Measure = RA
- Page 127 and 128: • ”-tn”: This option uses the
- Page 129 and 130: [12] T. M. Breuel. Two geometric al
- Page 131 and 132: [39] B. Gatos, A. Antonacopoulos, a
- Page 133: [64] K. P. Murphy, Y. Weiss, and M.
- Page 137 and 138: Index tel-00912566, version 1 - 2 D
[91] M. Stamp. A revealing introduction to hidden Markov models. Technical<br />
report, Department <strong>of</strong> Computer Science S<strong>an</strong> Jose State University, 2004.<br />
[92] T. Su, T. Zh<strong>an</strong>g, <strong>an</strong>d D. Gu<strong>an</strong>. Corpus-based HIT-MW database for<br />
<strong>of</strong>fline recognition <strong>of</strong> general-purpose Chinese h<strong>an</strong>dwritten text. 9th International<br />
Conference on Document Analysis <strong>an</strong>d Recognition (ICDAR<br />
’07), 10(1):27–38, March 2007.<br />
[93] H. M. Sun. Page segmentation for M<strong>an</strong>hatt<strong>an</strong> <strong>an</strong>d non-M<strong>an</strong>hatt<strong>an</strong> layout<br />
<strong>document</strong>s via selective CRLA. 8th International Conference on Document<br />
Analysis <strong>an</strong>d Recognition (ICDAR ’05), pages 116–120, 2005.<br />
[94] C. Sutton <strong>an</strong>d A. McCallum. An introduction to conditional r<strong>an</strong>dom fields<br />
for relational learning. In Lise Getoor <strong>an</strong>d Ben Taskar, editors, Introduction<br />
to Statistical Relational Learning, volume 7 <strong>of</strong> Adaptive Computation<br />
<strong>an</strong>d Machine Learning, chapter 4, page 93. The MIT Press, 2006.<br />
tel-00912566, version 1 - 2 Dec 2013<br />
[95] M. S. Taylor, F. S. Brundick, <strong>an</strong>d A. E. Brodeen. A statistical approach to<br />
the generation <strong>of</strong> a database for evaluating OCR s<strong>of</strong>tware. International<br />
Journal on Document Analysis <strong>an</strong>d Recognition, 4:170–176, 2002.<br />
[96] K. Tombre, S. Tabbone, <strong>an</strong>d L. Pélissier. Text/graphics separation revisited.<br />
DAS ’02 Proceedings <strong>of</strong> the 5th International Workshop on Document<br />
Analysis Systems V, pages 200–211, August 2002.<br />
[97] A Viterbi. Error bounds for convolutional codes <strong>an</strong>d <strong>an</strong> asymptotically<br />
optimum decoding algorithm. IEEE Tr<strong>an</strong>sactions on Information Theory,<br />
13(2):260–269, 1967.<br />
[98] F. M. Wahl, K. Y. Wong, <strong>an</strong>d R. G. Casey. Block segmentation <strong>an</strong>d<br />
text extraction in mixed text/image <strong>document</strong>s. Computer Graphics <strong>an</strong>d<br />
Image Processing, 20(4):375–390, December 1982.<br />
[99] B. Waked. Page segmentation <strong>an</strong>d identification for <strong>document</strong> image <strong>an</strong>alysis.<br />
PhD thesis, Concordia University, Montreal, C<strong>an</strong>ada, 2001.<br />
[100] Y. Weiss. Correctness <strong>of</strong> local probability propagation in graphical models<br />
with loops. Neural Computation, 12(1):1–41, 2000.<br />
[101] K. Y. Wong, R. G. Casey, <strong>an</strong>d F. M. Wahl. Document <strong>an</strong>alysis system.<br />
IBM Journal <strong>of</strong> Research <strong>an</strong>d Development, 26(6):647–656, 1982.<br />
[102] Y. Xiao <strong>an</strong>d H. Y<strong>an</strong>. Text region extraction in a <strong>document</strong> image based<br />
on the Delaunay tessellation. Pattern Recognition, 36(3):799–809, March<br />
2003.<br />
[103] F. Yin <strong>an</strong>d C. Liu. H<strong>an</strong>dwritten Chinese text line segmentation by clustering<br />
with dist<strong>an</strong>ce metric learning. Pattern Recognition, 42(12):3146–3157,<br />
2009.<br />
[104] L. A. Zadeh. A simple view <strong>of</strong> the Dempster-Shafer theory <strong>of</strong> evidence <strong>an</strong>d<br />
its implication for the rule <strong>of</strong> combination. The AI Magazine, 7(2):85–90,<br />
July 1986.<br />
125