Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Table 4.3:<br />
TION<br />
COUNT WEIGHTED SUCCESS RATES FOR REGION SEGMENTA-<br />
tel-00912566, version 1 - 2 Dec 2013<br />
Image Merge % Split % Miss % Partial miss % False detection % Overall success%<br />
9782738000064 0110 13.33 23.53 0 13.33 9.09 87.85<br />
9782738000064 0112 0 23.33 0 20.69 16.67 86.36<br />
9782738000064 0267 0 34.43 0 23.08 0 83.84<br />
9782738002082 0145 0 12.5 0 22.22 0 91.31<br />
9782738003041 0337 0 0 0 33.33 0 88.41<br />
9782738005151 0042 28.57 21.05 0 28.57 0 81.64<br />
37502016547384 00275 0 86.02 0 17.14 9.09 56.85<br />
37502043571332 00074 0 11.11 0 0 9.09 95.65<br />
37502043571332 00129 0 0 0 40 0 85<br />
37511000967896 00044 0 0 0 0 0 100<br />
37531011459550 00011 0 0 0 28.57 16.67 87.9<br />
37531011459550 00023 27.27 0 0 20 9.09 86.65<br />
37531011465763 00009 33.33 11.11 0 33.33 0 80.27<br />
37531011465979 00019 42.86 0 0 50 0 70.9<br />
37531017210387 00004 0 55.56 0 66.67 67.74 48.64<br />
37531021715306 00002 0 44.44 0 28.57 9.09 77.65<br />
37531022215512 00002 15.38 0 0 26.67 0 88.96<br />
37531022215512 00006 45.45 0 0 66.67 16.67 62.36<br />
37531022331848 00165 0 0 0 0 0 100<br />
37531022333950 00254 0 0 0 0 0 100<br />
37531023943955 00010 0 0 40 40 0 76<br />
37531023943955 00043 50 33.33 0 50 0 65.35<br />
37531027315911 00007 0 0 0 0 0 100<br />
37531032730088 00703 0 22.22 0 46.15 0 78.81<br />
37531032730096 00692 0 22.58 14.29 40 0 80.44<br />
37531032886898 00116 0 0 0 40 16.67 82.96<br />
CIDELOT 00002 000112 0 47.22 0 42.42 9.09 72.17<br />
SEPTLOT 00001 000547 12.9 6.9 0 12.9 0 93.22<br />
A slightly more serious problem is when the same problem occurs near the<br />
border <strong>of</strong> text regions, which leads to the second type <strong>of</strong> error, namely nontextual<br />
penetrations. Figure 4.13 (B) displays penetration <strong>of</strong> non-textual sites<br />
in text regions. This problem is also fixable in post-processing by isolating the<br />
text region <strong>an</strong>d applying morphological opening on the region while respecting<br />
the borders. Morphological opening should only be applied after isolating a text<br />
region, otherwise two text regions may merge incorrectly.<br />
The third problem is <strong>of</strong> broken titles. Figure 4.14 shows this problem in<br />
part <strong>of</strong> a <strong>document</strong> image. The title on this page is divided into two parts when<br />
there are enough empty space around. This problem is a serious problem <strong>an</strong>d<br />
we do not have a fix for that.<br />
The fourth problem is when sides notes are very close to main text <strong>an</strong>d the<br />
main text is slightly sl<strong>an</strong>ted to one side. Figure 4.15 illustrates this problem. Its<br />
clear from the <strong>images</strong> that Gabor filters are not able to capture gaps between<br />
side notes.<br />
4.8.1 Discussion on parameters<br />
The final matter that should be discussed is the choice <strong>of</strong> parameters <strong>an</strong>d their<br />
effects on the results <strong>of</strong> the region detector. For this purpose we train a CRF<br />
on a selected subset <strong>of</strong> the dataset containing 16 <strong>document</strong>s. The goal is to<br />
determine the effects <strong>of</strong> ch<strong>an</strong>ging parameters such as learning rate, overlapping<br />
ratio <strong>an</strong>d the maximum number <strong>of</strong> cycles <strong>of</strong> ICM on the training error. For<br />
this study, we consider the first 100 iterations <strong>of</strong> the training algorithm <strong>an</strong>d the<br />
81