Curvature Scale Space Based Trade Mark Recognition
Curvature Scale Space Based Trade Mark Recognition
Curvature Scale Space Based Trade Mark Recognition
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Curvature</strong> <strong>Scale</strong> <strong>Space</strong> <strong>Based</strong> <strong>Trade</strong> <strong>Mark</strong> <strong>Recognition</strong><br />
1 S.China Venkateswarlu, 2 G.Nagendra, 3 S.RamaSubba Reddy, 4 G.Soma Sekhar<br />
1 Professor & HOD-CSE/ECE, HITS College of Engg.,, Hyderabad<br />
2 Assistant Professor –ECE-VJIT, Hyderabad<br />
3 Assistant Professor –ECE-VJIT, Hyderabad<br />
4 Research Scholar –Acharya Nagarju University,Guntur<br />
Abstract— Retrieval efficiency and accuracy are two important issues in designing a content-based database retrieval<br />
system. The efficient retrieval of the data base image is dependent on various factors such as the environment considered<br />
for query image, the reading source and the channel for communication. These factors affect the recognition of any given<br />
query image. To overcome the effect of environmental noises and rotation effect a new image recognition algorithm based<br />
on curvature scale space (CSS) is proposed. Database retrieval method based on shape information using curvature feature<br />
is proposed. The method evaluates the closed counter and process on the curvature smoothening using different level of<br />
Gaussian values. The proposed CSS based recognition algorithm found to be efficient in recognition for invariant features<br />
such as rotation and noise level. The proposed task is to be implemented on Matlab tool using the image processing toolbox<br />
for it’s realization..<br />
Keywords— Audio, <strong>Curvature</strong> <strong>Scale</strong> <strong>Space</strong>, Database, Image Retrieval, Speech, Processors<br />
I. INTRODUCTION<br />
Information is inherently multimodal. Humans can<br />
efficiently and effectively process information<br />
simultaneously in multiple dimensions. These multiple<br />
media, that aid effective communication, can be<br />
characterized into speech, audio, image, video, and<br />
textual data. Advances in computing and networking are<br />
generating a significant amount of interest in multimedia<br />
services and applications. Powerful processors, highspeed<br />
networking, high-capacity storage devices,<br />
improvements in compression algorithms, and advances<br />
in processing of audio, speech, image, and video signals<br />
are making multimedia systems technically and<br />
economically feasible.<br />
Multimedia systems suggest a wide variety of<br />
potential applications such as interactive entertainment,<br />
video news distribution, video rental services, and digital<br />
multimedia libraries. These services aim to provide the<br />
user with on-demand multimedia services. Any system<br />
providing these services will have to address the issue of<br />
representation, indexing, retrieval, and manipulation of<br />
the multimedia data. The multimedia objects are<br />
archived in a database. A feature extraction module<br />
extracts various features from the database, and<br />
represents the objects in terms of these features. The user<br />
interface provides the user with the capabilities to query,<br />
extract features, and insert multimedia objects.<br />
Image and video are an integral part of multimedia<br />
data. There are a number of applications where images<br />
need to be automatically retrieved by their Connell. This<br />
necessitates the need for powerful image processing and<br />
understanding tools. Various applications in digital<br />
libraries and image databases have been described in the<br />
literature.<br />
II. IMAGE RETRIEVAL SYSTEM<br />
Digital images are a convenient media for<br />
describing and storing spatial, temporal, spectral, and<br />
physical components of information contained in a<br />
variety of domains (e.g.. aerial/satellite images in remote<br />
sensing, medical images in telemedicine. fingerprints in<br />
forensics, museum collections in art history, and<br />
registration of trademarks and logos). These databases<br />
typically consist of thousands of images, taking up gigabytes<br />
of memory space.<br />
While advances in image compression algorithms<br />
have alleviated the storage requirement to some extent,<br />
the large volume of these images makes it difficult for a<br />
user to browse through the entire database. Therefore, an<br />
efficient and automatic procedure is required for<br />
indexing and retrieving images from databases.<br />
Figure 1: Traditional image retrieval model<br />
The proposed hierarchical content-based image<br />
retrieval algorithm is applied to a trademark image 1<br />
database. 1 with 1,100 images (See Appendix A for<br />
representative images belonging to this database).<br />
In the fast pruning stage, simple and easily<br />
calculated shape' features, including edge direction<br />
histograms and invariant moments are used to quickly<br />
browse' the- database for a small set of plausible<br />
matches. The retrieved images are then screened using a<br />
more elaborate, but costly deformable template matching<br />
process to remove false retrievals.<br />
III. METHODS USING A SINGLE CUE<br />
Traditional image retrieval systems use a single cue<br />
such as shape, texture, or color to represent the image<br />
and retrieval is based on the features that represent the<br />
chosen cue. Although color seems to be a highly reliable<br />
attribute for image retrieval, situations where color<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 225
information is not present in the images require the use<br />
of shape and/or texture attributes for image retrieval.<br />
Retrieval based on a single image attribute might<br />
lack sufficient discriminatory information and might not<br />
be able to accommodate large scale and orientation<br />
changes. For example, color-based approaches cannot<br />
distinguish between a red apple and a red Ferrari.<br />
Additional shape information can very easily distinguish<br />
the two.<br />
IV. TRADE MARK RECOGNITION<br />
<strong>Trade</strong>marks represent a gamut of pictorial data.<br />
There are over a million registered trademarks in the<br />
U.S. alone, and they represent a number of goods and<br />
products, which are sold by different manufacturers and<br />
service and other organizations. Most of the trademarks<br />
are an abstract representation of a concept in the world,<br />
like an abstract drawing of an animal, or a natural object<br />
(Sun. Moon, etc.). It is extremely challenging and<br />
instructive to study and address the issue of image<br />
database retrieval on this huge source of pictorial data. A<br />
trademark is either a word, phrase, symbol or design, or<br />
combination of words, phrases, symbols or designs,<br />
which identifies and distinguishes the source of goods or<br />
services of one party from those of others. A service<br />
mark is the same as a trademark except this it identifies<br />
and distinguishes the source of a service rather than a<br />
product. Thus, while a trademark appears on the product<br />
or it‘s packaging, the service mark appears in advertising<br />
for the services.<br />
V. TRADEMARK REGISTRATION<br />
<strong>Trade</strong>marks in the U.S. are registered with the<br />
USPTO (U.S. Patent and <strong>Trade</strong>mark Office). An<br />
applicant may apply for federal registration in three<br />
principle ways. An applicant can process a use<br />
application if the applicant has already commenced the<br />
use of the mark in products or services. This application<br />
is filed on the basis of the use of the mark. If the<br />
applicant has not yet used the mark but intends to use it<br />
in the future, the applicant can file an intent-to-use<br />
application on the basis of a bona fide intention to use<br />
the mark. The third option allows international<br />
applicants to apply for a mark that has already been<br />
applied or registered in another country. Thus, an<br />
applicant from outside the United States may file within<br />
the United States.<br />
VI. SEARCHES FOR CONFLICTING MARKS<br />
Before a mark is registered with the USPTO, an<br />
examining attorney conducts a search for conflicting<br />
marks. Usually, it is not necessary for an applicant to<br />
conduct a search for conflicting marks prior to filing an<br />
application. The application fee covers processing and<br />
search costs, and is not refunded in case a conflict is<br />
found and the mark cannot be registered.<br />
VII INVARIENT-MOMENT BASED METHOD<br />
Retrieval speed and accuracy are two main issues in<br />
designing image databases. System accuracy can be<br />
defined in terms of precision and recall rates. A<br />
precision rate can be defined as the percent of retrieved<br />
images similar to the query among the total number of<br />
retrieved images. A recall rate is defined as the percent<br />
of retrieved images which are similar to the query<br />
among the total number of images similar to the query in<br />
the database. It can be easily seen that both precision and<br />
recall rates are a function of the total number of retrieved<br />
images. In order to have a high accuracy, the system<br />
needs to have both a high precision and a high recall<br />
rate. Although, simple image features can be easily<br />
extracted, they lack sufficient expressiveness and<br />
discriminatory information to determine if two images<br />
have a similar content. Thus, there exists a trade-off<br />
between speed and accuracy. In order to build a system<br />
with both high speed and accuracy, we use a hierarchical<br />
two-level feature extraction and matching structure for<br />
image retrieval. Our system uses multiple shape features<br />
for the initial pruning stage. Retrievals based on these<br />
features are integrated for better accuracy and higher<br />
system recall rate. The second stage uses deformable<br />
template matching to eliminate the false retrievals<br />
present at the output of the first stage, thereby improving<br />
the precision rate of the system.<br />
VIII. CURVATURE SCALE SPACE (CSS) METHOD<br />
A useful general – purpose shape representation<br />
method in computational vision should make accurate<br />
and reliable recognition of an object possible. Therefore,<br />
such a representation should necessarily satisfy a<br />
number of criteria. The following is a list of such<br />
criteria. Note that when two planar curves are described<br />
as having the same shape, there exists a transformation<br />
consisting of uniform scaling, rotation, and translation,<br />
which will cause one of those curves to overlap the<br />
other.<br />
A. Invariance: If two curves have the same shape, they<br />
should also have the same representation. Uniqueness:<br />
If two curves have the same shape, they should also have<br />
the different representation.<br />
B .Stability: If two curves have a small difference, their<br />
representations should also have a small difference, and<br />
if two representations have small difference, the curves<br />
they represent should also have small shape difference.<br />
The importance of the invariance criterion is that it<br />
guarantees that all curves with the same shape will have<br />
the same representation. It will therefore be possible to<br />
conclude that two curves have different shapes by<br />
observing that they have different representations.<br />
Without the invariance criterion, two curves with the<br />
same shape may have different representations. The<br />
uniqueness criterion is important since it guarantees<br />
C. Efficiency: The representation should be efficient to<br />
compute and store. This is important since it may<br />
necessary for an object recognition system to perform<br />
real-time recognition. By efficient, we mean that the<br />
computational complexity should be a low-order<br />
polynomial in time and space (and in the number of<br />
processors if a parallel computing architecture is used)<br />
as a function of the size of the input curve.<br />
D. Ease of implementation: If two or more competing<br />
representations exist, it is advantageous to choose one of<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 226
those representations such that the implementation of the<br />
computer program that computes that representation<br />
requires the least time spent on programming and<br />
debugging.<br />
E. Computation of shape properties: It may be useful to<br />
be able to determine properties of the shape of a curve<br />
using its representation. For example if a curve has a<br />
symmetric shape, it may be desirable to be able to<br />
determine that fact from its representation (the symmetry<br />
criterion). Furthermore, if the shape of a whole curve or<br />
part of a curve is the same as the shape of part of another<br />
curve, it may be useful to be able to determine that<br />
relationship using their representations (the part/whole<br />
criterion).<br />
IX. THE CURVATURE SCALE SPACE IMAGE<br />
A planar curve is a set of points whose position<br />
vectors are the values of a continuous, vector-valued<br />
function. It can be represented by the parametric vector<br />
equation<br />
r (u) = (x (u), y(u)) (1)<br />
The function r (u) is a parametric representation of the<br />
curve. Planar curve has an infinite number of distinct<br />
parametric representations. A parametric representation<br />
in which the parameter is the arc length s is called a<br />
natural parameterization of the curve. A natural<br />
parameterization can be computed from an arbitrary<br />
parameterization using the following equation:<br />
(2)<br />
Where ŕ represents the derivative.i.e, ŕ = dr/dv. For any<br />
parameterization<br />
(3)<br />
Where t(u) and n(u) are the tangent and normal vectors<br />
at u, respectively. For any planar curve, the vectors t(u)<br />
and n(u) must satisfy the simplified serret-Frenet vector<br />
equations:<br />
Where k(s) is the curvature of the curve at s and is<br />
defined as<br />
Where Ф is the angle between t(s) and t (s+h). Now,<br />
observe that<br />
t(s) =dt/ds =(dt/du) (du/ds)<br />
Therefore<br />
dt/du = (ds / du ) kn =│ ŕ │kn<br />
Hence<br />
Differentiating the expression for t (u), we obtain<br />
It now follows that<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 227<br />
(4)<br />
X. CORNER DETECTION<br />
The algorithm of edge and boundary interpretation<br />
is based on operating with object's vertices, where each<br />
vertex is a corner with the line segments that are<br />
connected to it. Each vertex of the object according to<br />
the definition is located in corner. This is the reason we<br />
first needed to detect all the corners of the object. Corner<br />
detection is an important task in various computer vision<br />
and image-understanding systems [1]. Corner detection<br />
should satisfy a number of important criteria:<br />
_ All the true corners should be detected<br />
_ No false corners should be detected<br />
_ Corner points should be well localized<br />
_ Corner detector should be robust with respect to noise<br />
_ Corner detector should be efficient<br />
There are different methods for corner detection in<br />
literature. Each one is based on different principle. Each<br />
method addresses different problem in corner detection.<br />
For example, [2] uses a generalized Hough transform for<br />
edge detection, where the transform is needed to detect<br />
the edge lines (boundaries) of the object and the<br />
generalization is performed to cope with the corners that<br />
are not sharp. On the other hand analyses the curvature<br />
scale of object's contour and extracts the points with the<br />
maxima of absolute curvature. The method is based on<br />
CSS (The <strong>Curvature</strong> <strong>Scale</strong>-<strong>Space</strong> Technique) method.<br />
XI. CURVATURE SCALE SPACE (CSS)<br />
The CSS technique is suitable fro recovering<br />
invariant geometric features (curvature zero crossing<br />
points and/or extrema) of a planar curve at multiple<br />
scales. To compute it, the curve G is first parameterized<br />
by the arc length parameter u:<br />
(5)<br />
An evolved version Gs of G can then be computed.<br />
Where<br />
(6)<br />
(7)<br />
Where is the convolution operator and<br />
denotes a Gaussian of width . In order to find<br />
curvature zero – crossings or extrema from evolved<br />
versions of the input curve, one needs to compute<br />
curvature:<br />
Where,<br />
(8)<br />
(9)
The corners are defined as the local maxima of the<br />
absolute value of curvature. At a very fine scale, there<br />
exist many such maxima due to noise and the digital<br />
contour. As the scale is increased, the noise is smoothed<br />
away and only the maxima corresponding to the real<br />
corners remain. The CSS corner-detection method finds<br />
the corners at these local maxima<br />
The process of CSS image corner detection is as follows:<br />
Utilize the canny edge detector to extract edges<br />
from the original image.<br />
Extract the edge contours from the image:<br />
― Fill the gaps in the edge contours.<br />
― Find the T – junctions and mark them as T –<br />
corners.<br />
Compute the curvature at highest scale σhigh<br />
and determine the corner candidates by<br />
comparing the neighboring minima.<br />
Track the corners to the lowest scale to improve<br />
localization.<br />
Compare the T – corners to the corners found<br />
using the curvature procedure and remove<br />
corners which are very close.<br />
The following is an explanation of each stage of<br />
the CSS corner detector.<br />
Steps description:<br />
Here canny edge detection was used, but it may be<br />
replaced by any other edge detector<br />
The canny edge detector can cause gaps at T-junctions<br />
and the corners may not be found with the CSS method.<br />
― If the endpoint is nearly connected to another<br />
endpoint, fill the gap and continue the<br />
extraction<br />
―If the endpoint is nearly connected to an edge contour,<br />
but not to another endpoint,<br />
mark this point as a T-junction corner.<br />
Figure 2: Two cases of gaps in the edge contours<br />
.<br />
Figure 3: case where one corner is marked twice<br />
XII. DATABASE<br />
The image database used in this implementation was<br />
created by the collection of large number of trademarks<br />
consisting of 1001 images collected from different<br />
sources. A design mark is registered in the binary form<br />
at the USPTO. The trademarks are converted from gray<br />
level images to binary images.<br />
Figure 4: Image retrieval model<br />
The trademarks in our database were selected so<br />
that many of them have similar perceptual meaning to<br />
make the retrieval problem more challenging. These<br />
trademarks encompass a wide variety of objects; some<br />
are based on the alphabets of the English language,<br />
while others represent the Sun, Earth, humans, eyes,<br />
animals, etc.<br />
XIII. RESULT ANALYSIS<br />
Figure 5:Input interface for query reading<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 228
Figure 6: Menu option for selection of operation<br />
XIV. CSS BASED METHOD:<br />
Case-I<br />
Image Taken:<br />
Orientation: ‗0 Degree‘<br />
Noise level: 0<br />
Figure 7: Original Query image taken<br />
Figure 8: Taken image for testing the Implementation<br />
Figure 9: Extracted counter for given Query Image<br />
Figure 10: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />
Figure 11: Classified Images for Given Query Image<br />
Figure 12: Recognized image based on CSS method<br />
CASE-II<br />
Image Taken:<br />
Orientation: ‗30 Degree‘<br />
Noise level: 0<br />
Figure 13: Original Query image taken<br />
Figure 14: Taken image at 30 degree for testing the Implementation<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 229
Figure 15: Extracted Counter for given Query Image<br />
Figure 16: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />
Figure 17: Classified Images for Given Query Image<br />
Figure 18: Recognized image based on CSS method<br />
CASE-IV<br />
Image Taken:<br />
Orientation: ‗0 Degree‘<br />
Noise Level: mean = 0.1, Variance = 0.2.<br />
Noise Type: Gaussian<br />
Figure 19: Original Query image taken<br />
Figure 20: Taken noisy image for testing the Implementation<br />
Figure 21: Extracted Counter for given Query Image<br />
Figure 22: <strong>Curvature</strong> <strong>Scale</strong> space (CSS) plot for the given query image<br />
Figure 23: Classified Images for Given Query Image<br />
Figure 24: Recognized image based on CSS method<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 230
XV. CONCLUSION<br />
An efficient shape-based retrieval algorithm has<br />
been developed to retrieve trademark images. Efficiency<br />
and accuracy of retrievals are achieved by designing a<br />
two stage hierarchical retrieval system. The proposed<br />
CSS based method shows promising technique for shape<br />
–based image database retrieval. The technique is robust<br />
under rotated, scaled, and noisy versions of the database<br />
images. The CSS based method is compared with the<br />
invariant based estimation algorithm for the evaluation<br />
of accuracy in estimation and classification. It is<br />
observed that the CSS based estimation outperforms the<br />
estimation with less classified symbols with accurate<br />
retreveation of trademark image as compared to the<br />
Invariant based method.<br />
The proposed pruning strategy is robust under noisy<br />
(hand drawn trademark images and faded master-marks),<br />
ar-bitrarily oriented (master-mark images) and scaled<br />
images (multiple trademarks in database with different<br />
scales as well as scale difference in master-marks in<br />
database with respect to real marks extracted from the<br />
bottom of the tankards).<br />
REFERENCES<br />
[1] H. Asada and M. Brady, ―The curvature primal sketch,‖ IEEE<br />
Trans, Pat, Anal. Machine Intell, vol, PAMI-8 pp. 2-14, 1986.<br />
[2] D.H. Ballard, ―Generating the Hough Transform to detect arbitary<br />
shapes‖ Patt Recogn., vol. 13 pp. 111-122, 1981.<br />
[3] D.H. Ballard and C.M. Brown, Computer Vision. Engle Cliffs, NI:<br />
Prentice –Hall, 1982.<br />
[4] H. Bluem, ―Biological shape and visual science(part I),‖ J.<br />
Therotical Biol., vol. 38, pp. 205-287, 1973.<br />
[5] P. E. Danielsson, ―A new shape factor,‖ CGIP, vol. 7, pp. 292-299,<br />
1978.<br />
[6] L. S. Davis, ―Understanding shape: Angles and sides,‖ IEEE Trans.<br />
Comput., vol. C-26, pp. 236-242, 1977.<br />
[7] R. O. Duda and P.E. Hart, ―Use of the Hough transformation to<br />
detect lines and curves in pictures,‖ Comm.ACM, vol.15, pp. 11-15,<br />
1972.<br />
[8] H. Freeman, ―Computer processing of line-drawing images,‖<br />
Comput Surveys, vol, 6, 1974.<br />
[9] M. Gage and R.S. Hamilton, ―The heat equation shrinking<br />
convex phanc curves,‖ J. Differential Geometry, vol. 23, pp.69-96,<br />
1986.<br />
[10] A. Goetz, Introduction to Differential Geometry. Reading,<br />
MA:Addison-Wesley, 1970.<br />
[11] R. M. Haralick, A. K. Mackworth, and S. L. Tanimoto,<br />
―Computer Vision Ipdate,‖ Tech.Rep, 89-12, Dept. Comput. Sci. Univ.<br />
British Columbia, Vancouver , 1989.<br />
[12] J. Hong and X. Tan, ―Recognize the similarity between shapes<br />
under affine transformation,‖ in Proc. ICCV(Tampa, FL), 1988, pp.<br />
489-493.<br />
[13] B. K. P. Horn and E. J. Weldon, ―Filtering closed curves,‖ IEEE<br />
Trans Patt. Anal. Machine Intell., vol. PAMI-8, pp. 665-668, 1986.<br />
[14] P. V. C. Hough, ―Method and means for reconginzing complex<br />
patterns,‖ U.S. patent 3 069 654, 1962.<br />
[15] R. A. Hummel, B. Kissia, and S. W. Zucker, ―Deburring<br />
Gaussian blur,‖CVGIP, vol. 38, pp. 66-80, 1987.<br />
[16] W. Kecs, The Convolution Product and Some Applications.<br />
Boston, MA: D. Reidel, 1982.<br />
[17] B. Kimia, A. Tannenbaum, and S.W. Zucker, ―Toward a<br />
computational theory of shape: An overview,‖ TR-CIM-89013, McGill<br />
Univ., 1989<br />
Published in International Journal of Advanced Engineering & Application, Jan 2011 Issue 231