14.01.2014 Views

Segmentation of heterogeneous document images : an ... - Tel


$$\text{F-Measure} = \frac{2 \cdot RA \cdot DR}{RA + DR}$$

where $N$ is the number of lines in the ground truth and $M$ is the number of recognized lines. $w_1, w_2, w_3, w_4, w_5, w_6$ are predetermined weights that are set to 1, 0.25, 0.25, 1, 0.25, 0.25 respectively in [75] and to 1, 0.75, 0.75, 1, 0.75, 0.75 in [5]. The former is more generous towards errors in results.
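As a small illustration, the F-Measure above can be computed as follows. This is only a sketch: it assumes DR and RA are the two rates defined earlier and are already available as numbers in [0, 1]. The function name and the zero-denominator convention are choices made here, not part of the cited evaluation protocols.

```python
def f_measure(dr: float, ra: float) -> float:
    """Combine DR and RA into a single score: 2 * RA * DR / (RA + DR).

    Returning 0.0 when both rates are zero is an assumption made here
    to avoid a division by zero; the source does not specify this case.
    """
    total = ra + dr
    if total == 0:
        return 0.0
    return 2 * ra * dr / total


if __name__ == "__main__":
    # Example values chosen arbitrarily for illustration.
    print(f_measure(dr=0.8, ra=0.6))
```

Because the score is a harmonic mean, it stays low whenever either of the two rates is low, which is why it is preferred over a simple average.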

A.3 Scenario driven region correspondence<br />

tel-00912566, version 1 - 2 Dec 2013<br />

Scenario driven region correspondence [25] is another performance evaluation method, implemented in the Prima Layout Evaluation Tool. It is used as part of the performance evaluation methods in [4] and [6]. From a segmentation point of view, the essence of this method is very similar to match counting: it still computes merge, split, miss/partial miss and false detection errors. However, it also takes into account the reading order of the document and assigns different weights to errors in each entity depending on the situation in which an error occurs. For example, a merge between two text regions that belong to different text columns is more significant than a merge between two regions that follow each other in a single text column. The weights are set according to the scenario under which an evaluation is carried out. All performance evaluations for paragraph detection are done using the Prima Evaluation Tool under the pure segmentation scenario.
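The idea of context-dependent error weights can be sketched as below. This is a hypothetical illustration only: the weight values, the context labels and the area-based penalty are assumptions made for the example, and they do not reproduce the actual weights or formulas of the Prima tool or of [25].

```python
from dataclasses import dataclass

# Hypothetical weights, for illustration only: a merge across text columns
# is penalized more heavily than a merge of consecutive regions in one column.
MERGE_WEIGHTS = {
    "across_columns": 1.0,
    "consecutive_in_column": 0.25,
}


@dataclass
class MergeError:
    area: float   # size of the merged region (assumed unit: pixels)
    context: str  # one of the keys of MERGE_WEIGHTS


def weighted_merge_penalty(errors: list[MergeError]) -> float:
    """Sum the merge errors, each scaled by the weight of its context."""
    return sum(MERGE_WEIGHTS[e.context] * e.area for e in errors)


if __name__ == "__main__":
    errors = [
        MergeError(area=100.0, context="across_columns"),
        MergeError(area=100.0, context="consecutive_in_column"),
    ]
    print(weighted_merge_penalty(errors))
```

Switching to another evaluation scenario would, in this sketch, amount to swapping the weight table while leaving the error detection itself unchanged.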
