Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
3.6 Two <strong>document</strong>s that have obtained the lowest accuracy rate for<br />
text/graphics separation. . . . . . . . . . . . . . . . . . . . . . . 48<br />
3.7 Misclassified components, gathered from our own dataset . . . . 49<br />
tel-00912566, version 1 - 2 Dec 2013<br />
4.1 Long-dist<strong>an</strong>ce communication between image sites in CRFs. . . . 52<br />
4.2 Our two-dimensional conditionl r<strong>an</strong>dom fields model . . . . . . . 54<br />
4.3 Height <strong>an</strong>d width maps . . . . . . . . . . . . . . . . . . . . . . . 58<br />
4.4 A <strong>document</strong> image, its filled text <strong>an</strong>d graphical components, separated<br />
using our text/graphics separation method. . . . . . . . . 59<br />
4.5 Horizontal, vertical <strong>an</strong>d marginal run-length maps as features . . 61<br />
4.6 Results <strong>of</strong> applying different Gabor filters to a <strong>document</strong> image. . 63<br />
4.7 Comparison between Gabor filters with different kernel sizes . . . 64<br />
4.8 A toy example that shows why a normalized observation is better<br />
to appear in two feature functions instead <strong>of</strong> just one. . . . . . . 67<br />
4.9 Two sample pages from our training dataset along with their<br />
ground-truth <strong>images</strong>. . . . . . . . . . . . . . . . . . . . . . . . . . 73<br />
4.10 Number <strong>of</strong> misclassified sites per iteration using voted perceptron<br />
training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74<br />
4.10 Number <strong>of</strong> misclassified sites per iteration using voted perceptron<br />
training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75<br />
4.11 Several obtained results for text region detection without postprocessing.<br />
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77<br />
4.11 Several obtained results for text region detection without postprocessing.<br />
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78<br />
4.12 Results <strong>of</strong> text region detection after post-processing. . . . . . . . 79<br />
4.13 Non-textual holes <strong>an</strong>d penetrations . . . . . . . . . . . . . . . . . 82<br />
4.14 Title breakdown . . . . . . . . . . . . . . . . . . . . . . . . . . . 82<br />
4.15 Sl<strong>an</strong>ted text . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83<br />
4.16 Learning rate parameter . . . . . . . . . . . . . . . . . . . . . . . 84<br />
4.17 Overlapping ratio parameter . . . . . . . . . . . . . . . . . . . . . 85<br />
4.18 Maximum number <strong>of</strong> ICM cycles . . . . . . . . . . . . . . . . . . 86<br />
5.1 First steps in line detection to obtain initial lines <strong>an</strong>d gaps for a<br />
single <strong>document</strong> image. . . . . . . . . . . . . . . . . . . . . . . . 89<br />
5.2 First steps in line detection to obtain initial lines <strong>an</strong>d gaps for a<br />
single <strong>document</strong> image. . . . . . . . . . . . . . . . . . . . . . . . 90<br />
5.3 Initial text <strong>an</strong>d gap regions . . . . . . . . . . . . . . . . . . . . . 91<br />
5.4 Refined text <strong>an</strong>d gap regions . . . . . . . . . . . . . . . . . . . . 94<br />
5.5 Some steps to get from separators to text lines . . . . . . . . . . 96<br />
5.5 Some steps to get from separators to text lines . . . . . . . . . . 97<br />
6.1 A fictional text region including its text lines . . . . . . . . . . . 102<br />
6.2 Binary partition tree generated from couple <strong>of</strong> text lines. . . . . . 102<br />
6.3 Updated results for ICDAR2011 competition . . . . . . . . . . . 107<br />
6.4 Paragrah detection results (1) . . . . . . . . . . . . . . . . . . . . 108<br />
6.5 Paragrah detection results (2) . . . . . . . . . . . . . . . . . . . . 109<br />
v