22.03.2013 Views

Curvature Scale Space Based Trade Mark Recognition

Curvature Scale Space Based Trade Mark Recognition

Curvature Scale Space Based Trade Mark Recognition

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Curvature</strong> <strong>Scale</strong> <strong>Space</strong> <strong>Based</strong> <strong>Trade</strong> <strong>Mark</strong> <strong>Recognition</strong><br />

1 S.China Venkateswarlu, 2 G.Nagendra, 3 S.RamaSubba Reddy, 4 G.Soma Sekhar<br />

1 Professor & HOD-CSE/ECE, HITS College of Engg.,, Hyderabad<br />

2 Assistant Professor –ECE-VJIT, Hyderabad<br />

3 Assistant Professor –ECE-VJIT, Hyderabad<br />

4 Research Scholar –Acharya Nagarju University,Guntur<br />

Abstract— Retrieval efficiency and accuracy are two important issues in designing a content-based database retrieval<br />

system. The efficient retrieval of the data base image is dependent on various factors such as the environment considered<br />

for query image, the reading source and the channel for communication. These factors affect the recognition of any given<br />

query image. To overcome the effect of environmental noises and rotation effect a new image recognition algorithm based<br />

on curvature scale space (CSS) is proposed. Database retrieval method based on shape information using curvature feature<br />

is proposed. The method evaluates the closed counter and process on the curvature smoothening using different level of<br />

Gaussian values. The proposed CSS based recognition algorithm found to be efficient in recognition for invariant features<br />

such as rotation and noise level. The proposed task is to be implemented on Matlab tool using the image processing toolbox<br />

for it’s realization..<br />

Keywords— Audio, <strong>Curvature</strong> <strong>Scale</strong> <strong>Space</strong>, Database, Image Retrieval, Speech, Processors<br />

I. INTRODUCTION<br />

Information is inherently multimodal. Humans can<br />

efficiently and effectively process information<br />

simultaneously in multiple dimensions. These multiple<br />

media, that aid effective communication, can be<br />

characterized into speech, audio, image, video, and<br />

textual data. Advances in computing and networking are<br />

generating a significant amount of interest in multimedia<br />

services and applications. Powerful processors, highspeed<br />

networking, high-capacity storage devices,<br />

improvements in compression algorithms, and advances<br />

in processing of audio, speech, image, and video signals<br />

are making multimedia systems technically and<br />

economically feasible.<br />

Multimedia systems suggest a wide variety of<br />

potential applications such as interactive entertainment,<br />

video news distribution, video rental services, and digital<br />

multimedia libraries. These services aim to provide the<br />

user with on-demand multimedia services. Any system<br />

providing these services will have to address the issue of<br />

representation, indexing, retrieval, and manipulation of<br />

the multimedia data. The multimedia objects are<br />

archived in a database. A feature extraction module<br />

extracts various features from the database, and<br />

represents the objects in terms of these features. The user<br />

interface provides the user with the capabilities to query,<br />

extract features, and insert multimedia objects.<br />

Image and video are an integral part of multimedia<br />

data. There are a number of applications where images<br />

need to be automatically retrieved by their Connell. This<br />

necessitates the need for powerful image processing and<br />

understanding tools. Various applications in digital<br />

libraries and image databases have been described in the<br />

literature.<br />

II. IMAGE RETRIEVAL SYSTEM<br />

Digital images are a convenient media for<br />

describing and storing spatial, temporal, spectral, and<br />

physical components of information contained in a<br />

variety of domains (e.g.. aerial/satellite images in remote<br />

sensing, medical images in telemedicine. fingerprints in<br />

forensics, museum collections in art history, and<br />

registration of trademarks and logos). These databases<br />

typically consist of thousands of images, taking up gigabytes<br />

of memory space.<br />

While advances in image compression algorithms<br />

have alleviated the storage requirement to some extent,<br />

the large volume of these images makes it difficult for a<br />

user to browse through the entire database. Therefore, an<br />

efficient and automatic procedure is required for<br />

indexing and retrieving images from databases.<br />

Figure 1: Traditional image retrieval model<br />

The proposed hierarchical content-based image<br />

retrieval algorithm is applied to a trademark image 1<br />

database. 1 with 1,100 images (See Appendix A for<br />

representative images belonging to this database).<br />

In the fast pruning stage, simple and easily<br />

calculated shape' features, including edge direction<br />

histograms and invariant moments are used to quickly<br />

browse' the- database for a small set of plausible<br />

matches. The retrieved images are then screened using a<br />

more elaborate, but costly deformable template matching<br />

process to remove false retrievals.<br />

III. METHODS USING A SINGLE CUE<br />

Traditional image retrieval systems use a single cue<br />

such as shape, texture, or color to represent the image<br />

and retrieval is based on the features that represent the<br />

chosen cue. Although color seems to be a highly reliable<br />

attribute for image retrieval, situations where color<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 225


information is not present in the images require the use<br />

of shape and/or texture attributes for image retrieval.<br />

Retrieval based on a single image attribute might<br />

lack sufficient discriminatory information and might not<br />

be able to accommodate large scale and orientation<br />

changes. For example, color-based approaches cannot<br />

distinguish between a red apple and a red Ferrari.<br />

Additional shape information can very easily distinguish<br />

the two.<br />

IV. TRADE MARK RECOGNITION<br />

<strong>Trade</strong>marks represent a gamut of pictorial data.<br />

There are over a million registered trademarks in the<br />

U.S. alone, and they represent a number of goods and<br />

products, which are sold by different manufacturers and<br />

service and other organizations. Most of the trademarks<br />

are an abstract representation of a concept in the world,<br />

like an abstract drawing of an animal, or a natural object<br />

(Sun. Moon, etc.). It is extremely challenging and<br />

instructive to study and address the issue of image<br />

database retrieval on this huge source of pictorial data. A<br />

trademark is either a word, phrase, symbol or design, or<br />

combination of words, phrases, symbols or designs,<br />

which identifies and distinguishes the source of goods or<br />

services of one party from those of others. A service<br />

mark is the same as a trademark except this it identifies<br />

and distinguishes the source of a service rather than a<br />

product. Thus, while a trademark appears on the product<br />

or it‘s packaging, the service mark appears in advertising<br />

for the services.<br />

V. TRADEMARK REGISTRATION<br />

<strong>Trade</strong>marks in the U.S. are registered with the<br />

USPTO (U.S. Patent and <strong>Trade</strong>mark Office). An<br />

applicant may apply for federal registration in three<br />

principle ways. An applicant can process a use<br />

application if the applicant has already commenced the<br />

use of the mark in products or services. This application<br />

is filed on the basis of the use of the mark. If the<br />

applicant has not yet used the mark but intends to use it<br />

in the future, the applicant can file an intent-to-use<br />

application on the basis of a bona fide intention to use<br />

the mark. The third option allows international<br />

applicants to apply for a mark that has already been<br />

applied or registered in another country. Thus, an<br />

applicant from outside the United States may file within<br />

the United States.<br />

VI. SEARCHES FOR CONFLICTING MARKS<br />

Before a mark is registered with the USPTO, an<br />

examining attorney conducts a search for conflicting<br />

marks. Usually, it is not necessary for an applicant to<br />

conduct a search for conflicting marks prior to filing an<br />

application. The application fee covers processing and<br />

search costs, and is not refunded in case a conflict is<br />

found and the mark cannot be registered.<br />

VII INVARIENT-MOMENT BASED METHOD<br />

Retrieval speed and accuracy are two main issues in<br />

designing image databases. System accuracy can be<br />

defined in terms of precision and recall rates. A<br />

precision rate can be defined as the percent of retrieved<br />

images similar to the query among the total number of<br />

retrieved images. A recall rate is defined as the percent<br />

of retrieved images which are similar to the query<br />

among the total number of images similar to the query in<br />

the database. It can be easily seen that both precision and<br />

recall rates are a function of the total number of retrieved<br />

images. In order to have a high accuracy, the system<br />

needs to have both a high precision and a high recall<br />

rate. Although, simple image features can be easily<br />

extracted, they lack sufficient expressiveness and<br />

discriminatory information to determine if two images<br />

have a similar content. Thus, there exists a trade-off<br />

between speed and accuracy. In order to build a system<br />

with both high speed and accuracy, we use a hierarchical<br />

two-level feature extraction and matching structure for<br />

image retrieval. Our system uses multiple shape features<br />

for the initial pruning stage. Retrievals based on these<br />

features are integrated for better accuracy and higher<br />

system recall rate. The second stage uses deformable<br />

template matching to eliminate the false retrievals<br />

present at the output of the first stage, thereby improving<br />

the precision rate of the system.<br />

VIII. CURVATURE SCALE SPACE (CSS) METHOD<br />

A useful general – purpose shape representation<br />

method in computational vision should make accurate<br />

and reliable recognition of an object possible. Therefore,<br />

such a representation should necessarily satisfy a<br />

number of criteria. The following is a list of such<br />

criteria. Note that when two planar curves are described<br />

as having the same shape, there exists a transformation<br />

consisting of uniform scaling, rotation, and translation,<br />

which will cause one of those curves to overlap the<br />

other.<br />

A. Invariance: If two curves have the same shape, they<br />

should also have the same representation. Uniqueness:<br />

If two curves have the same shape, they should also have<br />

the different representation.<br />

B .Stability: If two curves have a small difference, their<br />

representations should also have a small difference, and<br />

if two representations have small difference, the curves<br />

they represent should also have small shape difference.<br />

The importance of the invariance criterion is that it<br />

guarantees that all curves with the same shape will have<br />

the same representation. It will therefore be possible to<br />

conclude that two curves have different shapes by<br />

observing that they have different representations.<br />

Without the invariance criterion, two curves with the<br />

same shape may have different representations. The<br />

uniqueness criterion is important since it guarantees<br />

C. Efficiency: The representation should be efficient to<br />

compute and store. This is important since it may<br />

necessary for an object recognition system to perform<br />

real-time recognition. By efficient, we mean that the<br />

computational complexity should be a low-order<br />

polynomial in time and space (and in the number of<br />

processors if a parallel computing architecture is used)<br />

as a function of the size of the input curve.<br />

D. Ease of implementation: If two or more competing<br />

representations exist, it is advantageous to choose one of<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 226


those representations such that the implementation of the<br />

computer program that computes that representation<br />

requires the least time spent on programming and<br />

debugging.<br />

E. Computation of shape properties: It may be useful to<br />

be able to determine properties of the shape of a curve<br />

using its representation. For example if a curve has a<br />

symmetric shape, it may be desirable to be able to<br />

determine that fact from its representation (the symmetry<br />

criterion). Furthermore, if the shape of a whole curve or<br />

part of a curve is the same as the shape of part of another<br />

curve, it may be useful to be able to determine that<br />

relationship using their representations (the part/whole<br />

criterion).<br />

IX. THE CURVATURE SCALE SPACE IMAGE<br />

A planar curve is a set of points whose position<br />

vectors are the values of a continuous, vector-valued<br />

function. It can be represented by the parametric vector<br />

equation<br />

r (u) = (x (u), y(u)) (1)<br />

The function r (u) is a parametric representation of the<br />

curve. Planar curve has an infinite number of distinct<br />

parametric representations. A parametric representation<br />

in which the parameter is the arc length s is called a<br />

natural parameterization of the curve. A natural<br />

parameterization can be computed from an arbitrary<br />

parameterization using the following equation:<br />

(2)<br />

Where ŕ represents the derivative.i.e, ŕ = dr/dv. For any<br />

parameterization<br />

(3)<br />

Where t(u) and n(u) are the tangent and normal vectors<br />

at u, respectively. For any planar curve, the vectors t(u)<br />

and n(u) must satisfy the simplified serret-Frenet vector<br />

equations:<br />

Where k(s) is the curvature of the curve at s and is<br />

defined as<br />

Where Ф is the angle between t(s) and t (s+h). Now,<br />

observe that<br />

t(s) =dt/ds =(dt/du) (du/ds)<br />

Therefore<br />

dt/du = (ds / du ) kn =│ ŕ │kn<br />

Hence<br />

Differentiating the expression for t (u), we obtain<br />

It now follows that<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 227<br />

(4)<br />

X. CORNER DETECTION<br />

The algorithm of edge and boundary interpretation<br />

is based on operating with object's vertices, where each<br />

vertex is a corner with the line segments that are<br />

connected to it. Each vertex of the object according to<br />

the definition is located in corner. This is the reason we<br />

first needed to detect all the corners of the object. Corner<br />

detection is an important task in various computer vision<br />

and image-understanding systems [1]. Corner detection<br />

should satisfy a number of important criteria:<br />

_ All the true corners should be detected<br />

_ No false corners should be detected<br />

_ Corner points should be well localized<br />

_ Corner detector should be robust with respect to noise<br />

_ Corner detector should be efficient<br />

There are different methods for corner detection in<br />

literature. Each one is based on different principle. Each<br />

method addresses different problem in corner detection.<br />

For example, [2] uses a generalized Hough transform for<br />

edge detection, where the transform is needed to detect<br />

the edge lines (boundaries) of the object and the<br />

generalization is performed to cope with the corners that<br />

are not sharp. On the other hand analyses the curvature<br />

scale of object's contour and extracts the points with the<br />

maxima of absolute curvature. The method is based on<br />

CSS (The <strong>Curvature</strong> <strong>Scale</strong>-<strong>Space</strong> Technique) method.<br />

XI. CURVATURE SCALE SPACE (CSS)<br />

The CSS technique is suitable fro recovering<br />

invariant geometric features (curvature zero crossing<br />

points and/or extrema) of a planar curve at multiple<br />

scales. To compute it, the curve G is first parameterized<br />

by the arc length parameter u:<br />

(5)<br />

An evolved version Gs of G can then be computed.<br />

Where<br />

(6)<br />

(7)<br />

Where is the convolution operator and<br />

denotes a Gaussian of width . In order to find<br />

curvature zero – crossings or extrema from evolved<br />

versions of the input curve, one needs to compute<br />

curvature:<br />

Where,<br />

(8)<br />

(9)


The corners are defined as the local maxima of the<br />

absolute value of curvature. At a very fine scale, there<br />

exist many such maxima due to noise and the digital<br />

contour. As the scale is increased, the noise is smoothed<br />

away and only the maxima corresponding to the real<br />

corners remain. The CSS corner-detection method finds<br />

the corners at these local maxima<br />

The process of CSS image corner detection is as follows:<br />

Utilize the canny edge detector to extract edges<br />

from the original image.<br />

Extract the edge contours from the image:<br />

― Fill the gaps in the edge contours.<br />

― Find the T – junctions and mark them as T –<br />

corners.<br />

Compute the curvature at highest scale σhigh<br />

and determine the corner candidates by<br />

comparing the neighboring minima.<br />

Track the corners to the lowest scale to improve<br />

localization.<br />

Compare the T – corners to the corners found<br />

using the curvature procedure and remove<br />

corners which are very close.<br />

The following is an explanation of each stage of<br />

the CSS corner detector.<br />

Steps description:<br />

Here canny edge detection was used, but it may be<br />

replaced by any other edge detector<br />

The canny edge detector can cause gaps at T-junctions<br />

and the corners may not be found with the CSS method.<br />

― If the endpoint is nearly connected to another<br />

endpoint, fill the gap and continue the<br />

extraction<br />

―If the endpoint is nearly connected to an edge contour,<br />

but not to another endpoint,<br />

mark this point as a T-junction corner.<br />

Figure 2: Two cases of gaps in the edge contours<br />

.<br />

Figure 3: case where one corner is marked twice<br />

XII. DATABASE<br />

The image database used in this implementation was<br />

created by the collection of large number of trademarks<br />

consisting of 1001 images collected from different<br />

sources. A design mark is registered in the binary form<br />

at the USPTO. The trademarks are converted from gray<br />

level images to binary images.<br />

Figure 4: Image retrieval model<br />

The trademarks in our database were selected so<br />

that many of them have similar perceptual meaning to<br />

make the retrieval problem more challenging. These<br />

trademarks encompass a wide variety of objects; some<br />

are based on the alphabets of the English language,<br />

while others represent the Sun, Earth, humans, eyes,<br />

animals, etc.<br />

XIII. RESULT ANALYSIS<br />

Figure 5:Input interface for query reading<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 228


Figure 6: Menu option for selection of operation<br />

XIV. CSS BASED METHOD:<br />

Case-I<br />

Image Taken:<br />

Orientation: ‗0 Degree‘<br />

Noise level: 0<br />

Figure 7: Original Query image taken<br />

Figure 8: Taken image for testing the Implementation<br />

Figure 9: Extracted counter for given Query Image<br />

Figure 10: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />

Figure 11: Classified Images for Given Query Image<br />

Figure 12: Recognized image based on CSS method<br />

CASE-II<br />

Image Taken:<br />

Orientation: ‗30 Degree‘<br />

Noise level: 0<br />

Figure 13: Original Query image taken<br />

Figure 14: Taken image at 30 degree for testing the Implementation<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 229


Figure 15: Extracted Counter for given Query Image<br />

Figure 16: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />

Figure 17: Classified Images for Given Query Image<br />

Figure 18: Recognized image based on CSS method<br />

CASE-IV<br />

Image Taken:<br />

Orientation: ‗0 Degree‘<br />

Noise Level: mean = 0.1, Variance = 0.2.<br />

Noise Type: Gaussian<br />

Figure 19: Original Query image taken<br />

Figure 20: Taken noisy image for testing the Implementation<br />

Figure 21: Extracted Counter for given Query Image<br />

Figure 22: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />

Figure 23: Classified Images for Given Query Image<br />

Figure 24: Recognized image based on CSS method<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 230


XV. CONCLUSION<br />

An efficient shape-based retrieval algorithm has<br />

been developed to retrieve trademark images. Efficiency<br />

and accuracy of retrievals are achieved by designing a<br />

two stage hierarchical retrieval system. The proposed<br />

CSS based method shows promising technique for shape<br />

–based image database retrieval. The technique is robust<br />

under rotated, scaled, and noisy versions of the database<br />

images. The CSS based method is compared with the<br />

invariant based estimation algorithm for the evaluation<br />

of accuracy in estimation and classification. It is<br />

observed that the CSS based estimation outperforms the<br />

estimation with less classified symbols with accurate<br />

retreveation of trademark image as compared to the<br />

Invariant based method.<br />

The proposed pruning strategy is robust under noisy<br />

(hand drawn trademark images and faded master-marks),<br />

ar-bitrarily oriented (master-mark images) and scaled<br />

images (multiple trademarks in database with different<br />

scales as well as scale difference in master-marks in<br />

database with respect to real marks extracted from the<br />

bottom of the tankards).<br />

REFERENCES<br />

[1] H. Asada and M. Brady, ―The curvature primal sketch,‖ IEEE<br />

Trans, Pat, Anal. Machine Intell, vol, PAMI-8 pp. 2-14, 1986.<br />

[2] D.H. Ballard, ―Generating the Hough Transform to detect arbitary<br />

shapes‖ Patt Recogn., vol. 13 pp. 111-122, 1981.<br />

[3] D.H. Ballard and C.M. Brown, Computer Vision. Engle Cliffs, NI:<br />

Prentice –Hall, 1982.<br />

[4] H. Bluem, ―Biological shape and visual science(part I),‖ J.<br />

Therotical Biol., vol. 38, pp. 205-287, 1973.<br />

[5] P. E. Danielsson, ―A new shape factor,‖ CGIP, vol. 7, pp. 292-299,<br />

1978.<br />

[6] L. S. Davis, ―Understanding shape: Angles and sides,‖ IEEE Trans.<br />

Comput., vol. C-26, pp. 236-242, 1977.<br />

[7] R. O. Duda and P.E. Hart, ―Use of the Hough transformation to<br />

detect lines and curves in pictures,‖ Comm.ACM, vol.15, pp. 11-15,<br />

1972.<br />

[8] H. Freeman, ―Computer processing of line-drawing images,‖<br />

Comput Surveys, vol, 6, 1974.<br />

[9] M. Gage and R.S. Hamilton, ―The heat equation shrinking<br />

convex phanc curves,‖ J. Differential Geometry, vol. 23, pp.69-96,<br />

1986.<br />

[10] A. Goetz, Introduction to Differential Geometry. Reading,<br />

MA:Addison-Wesley, 1970.<br />

[11] R. M. Haralick, A. K. Mackworth, and S. L. Tanimoto,<br />

―Computer Vision Ipdate,‖ Tech.Rep, 89-12, Dept. Comput. Sci. Univ.<br />

British Columbia, Vancouver , 1989.<br />

[12] J. Hong and X. Tan, ―Recognize the similarity between shapes<br />

under affine transformation,‖ in Proc. ICCV(Tampa, FL), 1988, pp.<br />

489-493.<br />

[13] B. K. P. Horn and E. J. Weldon, ―Filtering closed curves,‖ IEEE<br />

Trans Patt. Anal. Machine Intell., vol. PAMI-8, pp. 665-668, 1986.<br />

[14] P. V. C. Hough, ―Method and means for reconginzing complex<br />

patterns,‖ U.S. patent 3 069 654, 1962.<br />

[15] R. A. Hummel, B. Kissia, and S. W. Zucker, ―Deburring<br />

Gaussian blur,‖CVGIP, vol. 38, pp. 66-80, 1987.<br />

[16] W. Kecs, The Convolution Product and Some Applications.<br />

Boston, MA: D. Reidel, 1982.<br />

[17] B. Kimia, A. Tannenbaum, and S.W. Zucker, ―Toward a<br />

computational theory of shape: An overview,‖ TR-CIM-89013, McGill<br />

Univ., 1989<br />

Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 231

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!