Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Segmentation of heterogeneous document images : an ... - Tel
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Appendix A<br />
Perform<strong>an</strong>ce evaluation<br />
methods<br />
tel-00912566, version 1 - 2 Dec 2013<br />
M<strong>an</strong>y segmentation algorithms have been proposed in the literature. However<br />
without a common perform<strong>an</strong>ce evaluation method, it is hard to compare the<br />
perform<strong>an</strong>ce <strong>of</strong> these methods. Care should be taken in using a perform<strong>an</strong>ce<br />
metric to predict how well a segmentation method will perform on a particular<br />
task.<br />
Three popular perform<strong>an</strong>ce evaluation methods are:<br />
1. Precision <strong>an</strong>d recall<br />
2. Match counting<br />
3. Scenario driven region correspondence<br />
A.1 Precision <strong>an</strong>d recall<br />
Precision <strong>an</strong>d recall are two measures that evaluate the results <strong>of</strong> a classification<br />
task. The precision for a class is the number <strong>of</strong> true positive 1 divided by<br />
the total number <strong>of</strong> objects that are labeled as belonging to the positive class.<br />
Recall is defined as the number <strong>of</strong> true positives divided by the total number <strong>of</strong><br />
objects that actually belong to the positive class. In simple terms, high recall<br />
me<strong>an</strong>s that a the classifier returned most <strong>of</strong> the relev<strong>an</strong>t results. High precision<br />
me<strong>an</strong>s that a classifier returned more relev<strong>an</strong>t results th<strong>an</strong> irrelev<strong>an</strong>t.<br />
In the context <strong>of</strong> text <strong>an</strong>d graphics separation, a positive class is text <strong>an</strong>d a<br />
negative class is graphics. We define four counts:<br />
• tp (true positive) is the number <strong>of</strong> correctly classified objects as text.<br />
• fp (false positive) is the number <strong>of</strong> unexpected text objects.<br />
• tn (true negative) is the number <strong>of</strong> correctly objects as graphics.<br />
1 the number <strong>of</strong> objects that are labeled correctly as belonging to the positive class<br />
112