Rotationally Invariant Descriptors using Intensity Order ... - IEEE Xplore

Rotationally Invariant Descriptors using Intensity Order ... - IEEE Xplore Rotationally Invariant Descriptors using Intensity Order ... - IEEE Xplore

vision.ia.ac.cn
from vision.ia.ac.cn More from this publisher
21.08.2013 Views

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTION ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 22 recall recall recall recall recall 1 0.9 0.8 0.7 0.6 0.5 graf 1 V 2 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 1 0.8 0.6 wall 1 V 2 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 1 0.8 0.6 boat 1 V 2 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 1 0.9 0.8 0.7 0.6 0.5 bikes 1 V 2 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.05 0.1 0.15 0.2 0.25 1−precision 0.3 0.35 0.4 0.94 DAISY(136) SIFT(128) 0.92 MROGH(192) MRRID(256) RIFT(32) 0.9 0 0.02 0.04 0.06 1−precision 0.08 0.1 recall 1 0.98 0.96 1 0.8 0.6 ubc 1 V 2 leuven 1 V 2 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 1−precision 0.4 recall recall recall recall recall 1 0.9 0.8 0.7 0.6 0.5 graf 1 V 3 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 0.5 1−precision 0.6 0.7 0.8 1 0.8 0.6 wall 1 V 3 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 1 0.8 0.6 boat 1 V 3 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 1 0.9 0.8 0.7 0.6 0.5 recall 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 DAISY(136) SIFT(128) MROGH(192) MRRID(256) RIFT(32) (a) viewpoint changes (graf) recall graf 1 V 4 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1−precision 1 0.8 0.6 (b) viewpoint changes (wall) recall wall 1 V 4 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.2 0.4 1−precision 0.6 0.8 1 0.8 0.6 boat 1 V 4 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.2 0.4 1−precision 0.6 (c) rotation and scale changes (boat) bikes 1 V 3 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.05 0.1 0.15 0.2 0.25 1−precision 0.3 0.35 0.4 0.9 DAISY(136) SIFT(128) MROGH(192) MRRID(256) RIFT(32) 0.85 0 0.05 0.1 1−precision 0.15 0.2 recall 1 0.95 1 0.8 0.6 ubc 1 V 3 leuven 1 V 3 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 1−precision 0.4 0.5 recall 1 0.9 0.8 0.7 0.6 0.5 (d) image blur (bikes) recall bikes 1 V 4 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 1−precision 1 0.95 0.9 (e) JPEG compression (ubc) ubc 1 V 4 0.85 DAISY(136) SIFT(128) MROGH(192) MRRID(256) RIFT(32) 0.8 0 0.05 0.1 1−precision 0.15 0.2 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 (f) illumination changes (leuven) recall 1 0.8 0.6 leuven 1 V 4 recall recall recall recall recall recall 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 DAISY(136) SIFT(128) MROGH(192) MRRID(256) RIFT(32) graf 1 V 5 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1−precision 1 0.8 0.6 0.4 0.2 DAISY(136) SIFT(128) MROGH(192) MRRID(256) RIFT(32) wall 1 V 5 0 0 0.2 0.4 0.6 0.8 1 1−precision 1 0.8 0.6 boat 1 V 5 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.2 0.4 0.6 1−precision 0.8 1 0.9 0.8 0.7 0.6 0.5 bikes 1 V 5 0.4 0.3 DAISY(136) SIFT(128) 0.2 MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 1−precision 1 0.9 0.8 0.7 ubc 1 V 5 0.6 DAISY(136) SIFT(128) 0.5 MROGH(192) MRRID(256) RIFT(32) 0.4 0 0.1 0.2 1−precision 0.3 0.4 1 0.8 0.6 leuven 1 V 5 0.4 DAISY(136) SIFT(128) 0.2 MROGH(192) MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 1−precision 0.5 0.6 Fig. 11. Experimental results under various image transformations in the Oxford dataset for Hessian-Affine Region. November 26, 2011 DRAFT

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTION ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 23 properties. They not only use multiple support regions to improve discriminative ability, but also combine local features (histogram of gradient orientation in MROGH and CS-LBP in MRRID) with ordinal information. Moreover, they are rotation invariant without relying on a reference orientation, further improving their robustness. Since MROGH uses a similar local feature to the one used in RIFT, the significant performance improvement of MROGH over RIFT demonstrates the effectiveness and advantage of our proposed feature pooling scheme, i.e. pooling by intensity orders is more informative than by rings. In most cases, MROGH performs better than MRRID. When images exhibit blur or illumination changes, MRRID is better, especially when encountering large illumination changes. 2) Performance on Images with Large Illumination Changes: In order to investigate the robustness of the proposed descriptors to monotonic intensity changes, we conducted image matching on three additional image pairs with large illumination changes as shown in the top of Fig. 12. They were captured by changing the exposure time of camera, the matching results are shown below the tested images. The Hessian-Affine detector was used for interest region detection. It can be seen from Fig. 12 that MRRID performs the best in such cases and achieves much higher performance than the other evaluated descriptors due to its invariance to monotonic intensity changes. The more severe the brightness change is, the larger the performance gap between MRRID and the other evaluated descriptors is. recall 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 DAISY(136) 0.2 SIFT(128) MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 1−precision 0.4 0.5 recall 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 DAISY(136) 0.2 SIFT(128) MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 0.4 0.5 0.6 1−precision recall 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 DAISY(136) 0.2 SIFT(128) MROGH(192) 0.1 MRRID(256) RIFT(32) 0 0 0.1 0.2 0.3 1−precision 0.4 0.5 0.6 Fig. 12. Matching results of images with large illumination changes, which were captured by changing the exposure time of camera. The more severe the brightness change is, the larger the performance gap between MRRID and the other evaluated descriptors is. November 26, 2011 DRAFT

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.<br />

<strong>IEEE</strong> TRANSACTION ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 23<br />

properties. They not only use multiple support regions to improve discriminative ability, but<br />

also combine local features (histogram of gradient orientation in MROGH and CS-LBP in<br />

MRRID) with ordinal information. Moreover, they are rotation invariant without relying on<br />

a reference orientation, further improving their robustness. Since MROGH uses a similar local<br />

feature to the one used in RIFT, the significant performance improvement of MROGH over RIFT<br />

demonstrates the effectiveness and advantage of our proposed feature pooling scheme, i.e. pooling<br />

by intensity orders is more informative than by rings. In most cases, MROGH performs better<br />

than MRRID. When images exhibit blur or illumination changes, MRRID is better, especially<br />

when encountering large illumination changes.<br />

2) Performance on Images with Large Illumination Changes: In order to investigate the<br />

robustness of the proposed descriptors to monotonic intensity changes, we conducted image<br />

matching on three additional image pairs with large illumination changes as shown in the top<br />

of Fig. 12. They were captured by changing the exposure time of camera, the matching results<br />

are shown below the tested images. The Hessian-Affine detector was used for interest region<br />

detection. It can be seen from Fig. 12 that MRRID performs the best in such cases and achieves<br />

much higher performance than the other evaluated descriptors due to its invariance to monotonic<br />

intensity changes. The more severe the brightness change is, the larger the performance gap<br />

between MRRID and the other evaluated descriptors is.<br />

recall<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

DAISY(136)<br />

0.2<br />

SIFT(128)<br />

MROGH(192)<br />

0.1<br />

MRRID(256)<br />

RIFT(32)<br />

0<br />

0 0.1 0.2 0.3<br />

1−precision<br />

0.4 0.5<br />

recall<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

DAISY(136)<br />

0.2<br />

SIFT(128)<br />

MROGH(192)<br />

0.1<br />

MRRID(256)<br />

RIFT(32)<br />

0<br />

0 0.1 0.2 0.3 0.4 0.5 0.6<br />

1−precision<br />

recall<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

DAISY(136)<br />

0.2<br />

SIFT(128)<br />

MROGH(192)<br />

0.1<br />

MRRID(256)<br />

RIFT(32)<br />

0<br />

0 0.1 0.2 0.3<br />

1−precision<br />

0.4 0.5 0.6<br />

Fig. 12. Matching results of images with large illumination changes, which were captured by changing the exposure time of<br />

camera. The more severe the brightness change is, the larger the performance gap between MRRID and the other evaluated<br />

descriptors is.<br />

November 26, 2011 DRAFT

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!