14.01.2014 Views

Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel

Segmentation of heterogeneous document images : an ... - Tel

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Appendix A<br />

Perform<strong>an</strong>ce evaluation<br />

methods<br />

tel-00912566, version 1 - 2 Dec 2013<br />

M<strong>an</strong>y segmentation algorithms have been proposed in the literature. However<br />

without a common perform<strong>an</strong>ce evaluation method, it is hard to compare the<br />

perform<strong>an</strong>ce <strong>of</strong> these methods. Care should be taken in using a perform<strong>an</strong>ce<br />

metric to predict how well a segmentation method will perform on a particular<br />

task.<br />

Three popular perform<strong>an</strong>ce evaluation methods are:<br />

1. Precision <strong>an</strong>d recall<br />

2. Match counting<br />

3. Scenario driven region correspondence<br />

A.1 Precision <strong>an</strong>d recall<br />

Precision <strong>an</strong>d recall are two measures that evaluate the results <strong>of</strong> a classification<br />

task. The precision for a class is the number <strong>of</strong> true positive 1 divided by<br />

the total number <strong>of</strong> objects that are labeled as belonging to the positive class.<br />

Recall is defined as the number <strong>of</strong> true positives divided by the total number <strong>of</strong><br />

objects that actually belong to the positive class. In simple terms, high recall<br />

me<strong>an</strong>s that a the classifier returned most <strong>of</strong> the relev<strong>an</strong>t results. High precision<br />

me<strong>an</strong>s that a classifier returned more relev<strong>an</strong>t results th<strong>an</strong> irrelev<strong>an</strong>t.<br />

In the context <strong>of</strong> text <strong>an</strong>d graphics separation, a positive class is text <strong>an</strong>d a<br />

negative class is graphics. We define four counts:<br />

• tp (true positive) is the number <strong>of</strong> correctly classified objects as text.<br />

• fp (false positive) is the number <strong>of</strong> unexpected text objects.<br />

• tn (true negative) is the number <strong>of</strong> correctly objects as graphics.<br />

1 the number <strong>of</strong> objects that are labeled correctly as belonging to the positive class<br />

112

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!