14.01.2014 Views

Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.6 Two <strong>document</strong>s that have obtained the lowest accuracy rate for<br />

text/graphics separation. . . . . . . . . . . . . . . . . . . . . . . 48<br />

3.7 Misclassified components, gathered from our own dataset . . . . 49<br />

tel-00912566, version 1 - 2 Dec 2013<br />

4.1 Long-dist<strong>an</strong>ce communication between image sites in CRFs. . . . 52<br />

4.2 Our two-dimensional conditionl r<strong>an</strong>dom fields model . . . . . . . 54<br />

4.3 Height <strong>an</strong>d width maps . . . . . . . . . . . . . . . . . . . . . . . 58<br />

4.4 A <strong>document</strong> image, its filled text <strong>an</strong>d graphical components, separated<br />

using our text/graphics separation method. . . . . . . . . 59<br />

4.5 Horizontal, vertical <strong>an</strong>d marginal run-length maps as features . . 61<br />

4.6 Results <strong>of</strong> applying different Gabor filters to a <strong>document</strong> image. . 63<br />

4.7 Comparison between Gabor filters with different kernel sizes . . . 64<br />

4.8 A toy example that shows why a normalized observation is better<br />

to appear in two feature functions instead <strong>of</strong> just one. . . . . . . 67<br />

4.9 Two sample pages from our training dataset along with their<br />

ground-truth <strong>images</strong>. . . . . . . . . . . . . . . . . . . . . . . . . . 73<br />

4.10 Number <strong>of</strong> misclassified sites per iteration using voted perceptron<br />

training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74<br />

4.10 Number <strong>of</strong> misclassified sites per iteration using voted perceptron<br />

training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75<br />

4.11 Several obtained results for text region detection without postprocessing.<br />

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77<br />

4.11 Several obtained results for text region detection without postprocessing.<br />

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78<br />

4.12 Results <strong>of</strong> text region detection after post-processing. . . . . . . . 79<br />

4.13 Non-textual holes <strong>an</strong>d penetrations . . . . . . . . . . . . . . . . . 82<br />

4.14 Title breakdown . . . . . . . . . . . . . . . . . . . . . . . . . . . 82<br />

4.15 Sl<strong>an</strong>ted text . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83<br />

4.16 Learning rate parameter . . . . . . . . . . . . . . . . . . . . . . . 84<br />

4.17 Overlapping ratio parameter . . . . . . . . . . . . . . . . . . . . . 85<br />

4.18 Maximum number <strong>of</strong> ICM cycles . . . . . . . . . . . . . . . . . . 86<br />

5.1 First steps in line detection to obtain initial lines <strong>an</strong>d gaps for a<br />

single <strong>document</strong> image. . . . . . . . . . . . . . . . . . . . . . . . 89<br />

5.2 First steps in line detection to obtain initial lines <strong>an</strong>d gaps for a<br />

single <strong>document</strong> image. . . . . . . . . . . . . . . . . . . . . . . . 90<br />

5.3 Initial text <strong>an</strong>d gap regions . . . . . . . . . . . . . . . . . . . . . 91<br />

5.4 Refined text <strong>an</strong>d gap regions . . . . . . . . . . . . . . . . . . . . 94<br />

5.5 Some steps to get from separators to text lines . . . . . . . . . . 96<br />

5.5 Some steps to get from separators to text lines . . . . . . . . . . 97<br />

6.1 A fictional text region including its text lines . . . . . . . . . . . 102<br />

6.2 Binary partition tree generated from couple <strong>of</strong> text lines. . . . . . 102<br />

6.3 Updated results for ICDAR2011 competition . . . . . . . . . . . 107<br />

6.4 Paragrah detection results (1) . . . . . . . . . . . . . . . . . . . . 108<br />

6.5 Paragrah detection results (2) . . . . . . . . . . . . . . . . . . . . 109<br />

v

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!