Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel Segmentation of heterogeneous document images : an ... - Tel

tel.archives.ouvertes.fr
from tel.archives.ouvertes.fr More from this publisher
14.01.2014 Views

[78] L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. IEEE, 77(2):257–286, 1989. [79] Z. Razak, K. Zulkiflee, M. Yamani, I. Idris, E. M. Tamil, M. Noorzaily, M. Noor, R. Salleh, M. Yaacob, and Z. M. Yusof. Offline handwriting text line segmentation: a review. International Journal of Computer Science and Network Security, 8(7):12–20, 2008. [80] M. D. Riley. Time-frequency representations for speech signals. Technical report, 1987. [81] F. Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, 65(6):386– 408, 1958. [82] J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33(2):225–236, 2000. tel-00912566, version 1 - 2 Dec 2013 [83] F. Shafait and T. M. Breuel. Document image dewarping contest. International Workshop on Camera-Based Document Analysis and Recognition, 2007. [84] F. Shafait, D. Keysers, and T. M. Breuel. Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6):941–954, 2008. [85] F. Shafait, J. van Beusekom, D. Keysers, and T. M. Breuel. Background variability modeling for statistical layout analysis. 2008 19th International Conference on Pattern Recognition, pages 1–4, December 2008. [86] F. Shafait, J. Van Beusekom, D. Keysers, and T. M. Breuel. Structural mixtures for statistical layout analysis. In 2008 The Eighth IAPR International Workshop on Document Analysis Systems, pages 415–422. IEEE, September 2008. [87] Z. Shi, S. Setlur, and V. Govindaraju. Text extraction from gray scale historical document images using adaptive local connectivity map. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 794–798, 2005. [88] Z. Shi, S. Setlur, and V. Govindaraju. A steerable directional local profile technique for extraction of handwritten arabic text lines. 10th International Conference on Document Analysis and Recognition (ICDAR ’09), 2009. [89] R. Smith. An overview of the Tesseract OCR engine. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 2:629– 633, 2007. [90] T. Stafylakis, V. Papavassiliou, V. Katsouros, and G. Carayannis. Robust text-line and word segmentation for handwritten documents images. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 3393–3396, March 2008. 124

[91] M. Stamp. A revealing introduction to hidden Markov models. Technical report, Department of Computer Science San Jose State University, 2004. [92] T. Su, T. Zhang, and D. Guan. Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text. 9th International Conference on Document Analysis and Recognition (ICDAR ’07), 10(1):27–38, March 2007. [93] H. M. Sun. Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA. 8th International Conference on Document Analysis and Recognition (ICDAR ’05), pages 116–120, 2005. [94] C. Sutton and A. McCallum. An introduction to conditional random fields for relational learning. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning, volume 7 of Adaptive Computation and Machine Learning, chapter 4, page 93. The MIT Press, 2006. tel-00912566, version 1 - 2 Dec 2013 [95] M. S. Taylor, F. S. Brundick, and A. E. Brodeen. A statistical approach to the generation of a database for evaluating OCR software. International Journal on Document Analysis and Recognition, 4:170–176, 2002. [96] K. Tombre, S. Tabbone, and L. Pélissier. Text/graphics separation revisited. DAS ’02 Proceedings of the 5th International Workshop on Document Analysis Systems V, pages 200–211, August 2002. [97] A Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 13(2):260–269, 1967. [98] F. M. Wahl, K. Y. Wong, and R. G. Casey. Block segmentation and text extraction in mixed text/image documents. Computer Graphics and Image Processing, 20(4):375–390, December 1982. [99] B. Waked. Page segmentation and identification for document image analysis. PhD thesis, Concordia University, Montreal, Canada, 2001. [100] Y. Weiss. Correctness of local probability propagation in graphical models with loops. Neural Computation, 12(1):1–41, 2000. [101] K. Y. Wong, R. G. Casey, and F. M. Wahl. Document analysis system. IBM Journal of Research and Development, 26(6):647–656, 1982. [102] Y. Xiao and H. Yan. Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition, 36(3):799–809, March 2003. [103] F. Yin and C. Liu. Handwritten Chinese text line segmentation by clustering with distance metric learning. Pattern Recognition, 42(12):3146–3157, 2009. [104] L. A. Zadeh. A simple view of the Dempster-Shafer theory of evidence and its implication for the rule of combination. The AI Magazine, 7(2):85–90, July 1986. 125

[91] M. Stamp. A revealing introduction to hidden Markov models. Technical<br />

report, Department <strong>of</strong> Computer Science S<strong>an</strong> Jose State University, 2004.<br />

[92] T. Su, T. Zh<strong>an</strong>g, <strong>an</strong>d D. Gu<strong>an</strong>. Corpus-based HIT-MW database for<br />

<strong>of</strong>fline recognition <strong>of</strong> general-purpose Chinese h<strong>an</strong>dwritten text. 9th International<br />

Conference on Document Analysis <strong>an</strong>d Recognition (ICDAR<br />

’07), 10(1):27–38, March 2007.<br />

[93] H. M. Sun. Page segmentation for M<strong>an</strong>hatt<strong>an</strong> <strong>an</strong>d non-M<strong>an</strong>hatt<strong>an</strong> layout<br />

<strong>document</strong>s via selective CRLA. 8th International Conference on Document<br />

Analysis <strong>an</strong>d Recognition (ICDAR ’05), pages 116–120, 2005.<br />

[94] C. Sutton <strong>an</strong>d A. McCallum. An introduction to conditional r<strong>an</strong>dom fields<br />

for relational learning. In Lise Getoor <strong>an</strong>d Ben Taskar, editors, Introduction<br />

to Statistical Relational Learning, volume 7 <strong>of</strong> Adaptive Computation<br />

<strong>an</strong>d Machine Learning, chapter 4, page 93. The MIT Press, 2006.<br />

tel-00912566, version 1 - 2 Dec 2013<br />

[95] M. S. Taylor, F. S. Brundick, <strong>an</strong>d A. E. Brodeen. A statistical approach to<br />

the generation <strong>of</strong> a database for evaluating OCR s<strong>of</strong>tware. International<br />

Journal on Document Analysis <strong>an</strong>d Recognition, 4:170–176, 2002.<br />

[96] K. Tombre, S. Tabbone, <strong>an</strong>d L. Pélissier. Text/graphics separation revisited.<br />

DAS ’02 Proceedings <strong>of</strong> the 5th International Workshop on Document<br />

Analysis Systems V, pages 200–211, August 2002.<br />

[97] A Viterbi. Error bounds for convolutional codes <strong>an</strong>d <strong>an</strong> asymptotically<br />

optimum decoding algorithm. IEEE Tr<strong>an</strong>sactions on Information Theory,<br />

13(2):260–269, 1967.<br />

[98] F. M. Wahl, K. Y. Wong, <strong>an</strong>d R. G. Casey. Block segmentation <strong>an</strong>d<br />

text extraction in mixed text/image <strong>document</strong>s. Computer Graphics <strong>an</strong>d<br />

Image Processing, 20(4):375–390, December 1982.<br />

[99] B. Waked. Page segmentation <strong>an</strong>d identification for <strong>document</strong> image <strong>an</strong>alysis.<br />

PhD thesis, Concordia University, Montreal, C<strong>an</strong>ada, 2001.<br />

[100] Y. Weiss. Correctness <strong>of</strong> local probability propagation in graphical models<br />

with loops. Neural Computation, 12(1):1–41, 2000.<br />

[101] K. Y. Wong, R. G. Casey, <strong>an</strong>d F. M. Wahl. Document <strong>an</strong>alysis system.<br />

IBM Journal <strong>of</strong> Research <strong>an</strong>d Development, 26(6):647–656, 1982.<br />

[102] Y. Xiao <strong>an</strong>d H. Y<strong>an</strong>. Text region extraction in a <strong>document</strong> image based<br />

on the Delaunay tessellation. Pattern Recognition, 36(3):799–809, March<br />

2003.<br />

[103] F. Yin <strong>an</strong>d C. Liu. H<strong>an</strong>dwritten Chinese text line segmentation by clustering<br />

with dist<strong>an</strong>ce metric learning. Pattern Recognition, 42(12):3146–3157,<br />

2009.<br />

[104] L. A. Zadeh. A simple view <strong>of</strong> the Dempster-Shafer theory <strong>of</strong> evidence <strong>an</strong>d<br />

its implication for the rule <strong>of</strong> combination. The AI Magazine, 7(2):85–90,<br />

July 1986.<br />

125

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!