Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
2 ∗ RA ∗ DR<br />
F − Measure =<br />
RA + DR<br />
where N is the number <strong>of</strong> lines in ground-truth <strong>an</strong>d M is the number <strong>of</strong> recognized<br />
lines. w 1 ,w 2 ,w 3 ,w 4 ,w 5 ,w 6 are predetermined weights that are set<br />
to 1, 0.25, 0.25, 1, 0.25, 0.25 respectively in [75] <strong>an</strong>d to 1, 0.75, 0.75, 1, 0.75, 0.75<br />
in [5]. The former is more generous towards errors in results.<br />
A.3 Scenario driven region correspondence<br />
tel-00912566, version 1 - 2 Dec 2013<br />
Scenario driven region correspondence [25] is <strong>an</strong>other perform<strong>an</strong>ce evaluation<br />
method implemented in Prima Layout Evaluation Tool. It is used as part <strong>of</strong> the<br />
perform<strong>an</strong>ce evaluation methods in [4] <strong>an</strong>d [6]. The essence <strong>of</strong> this method from<br />
a segmentation point <strong>of</strong> view is very similar to match counting. The method still<br />
computes merge, split, miss/partial miss <strong>an</strong>d false detection errors. However it<br />
also takes into account the reading order <strong>of</strong> the <strong>document</strong> <strong>an</strong>d assign different<br />
weights to errors in each entity depending on the situation in which <strong>an</strong> error<br />
occurs. For example, a merge between two text regions that belong to different<br />
text columns is more signific<strong>an</strong>t th<strong>an</strong> a merge between two regions that follow<br />
each other in a single text column. The weights are set due to the scenario<br />
that a evaluation is carried out. All perform<strong>an</strong>ce evaluations for paragraph detection<br />
are done using Prima Evaluation Tool under pure segmentation scenario.<br />
115