12.07.2015

Dictionaries and Tolerant Retrieval


Introduction to Information Retrieval (Sec. 3.4)

Soundex – typical algorithm
• Turn every token to be indexed into a 4-character reduced form
• Do the same with query terms
• Build and search an index on the reduced forms
  • (when the query calls for a soundex match)
• http://www.creativyst.com/Doc/Articles/SoundEx1/SoundEx1.htm#Top
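The steps above can be sketched in Python. This is a minimal illustration, not the slide's own code: it assumes one common Soundex variant (keep the first letter, map consonants to the digits 1 to 6, treat vowels plus H, W and Y as a placeholder 0, collapse runs of identical digits, drop the zeros, pad to four characters). Other variants handle H and W differently. The names `soundex`, `build_soundex_index` and `soundex_lookup` are hypothetical helpers introduced only for this example.

```python
from collections import defaultdict

# Consonant groups and their digits (assumed: the common Soundex grouping).
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(token):
    """Reduce a token to a 4-character form: one letter plus three digits."""
    letters = [c for c in token.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""  # nothing to encode (e.g. a purely numeric token)
    # Vowels and H, W, Y are absent from _CODES and become the placeholder "0".
    digits = [_CODES.get(c, "0") for c in letters]
    # Collapse runs of identical digits into a single digit.
    collapsed = [digits[0]]
    for d in digits[1:]:
        if d != collapsed[-1]:
            collapsed.append(d)
    # The first position is represented by the letter itself; drop placeholders.
    tail = [d for d in collapsed[1:] if d != "0"]
    # Pad with trailing zeros and keep the first four characters.
    return (letters[0] + "".join(tail) + "000")[:4]


def build_soundex_index(tokens):
    """Map each reduced form to the set of original tokens that produce it."""
    index = defaultdict(set)
    for token in tokens:
        code = soundex(token)
        if code:
            index[code].add(token)
    return index


def soundex_lookup(index, query_term):
    """Reduce the query term the same way and return the matching tokens."""
    return sorted(index.get(soundex(query_term), ()))


index = build_soundex_index(["Hermann", "Herman", "Robert", "Rupert", "Salton"])
print(soundex("Hermann"))                # H655
print(soundex_lookup(index, "Herrman"))  # ['Herman', 'Hermann']
```

Because index terms and query terms pass through the same `soundex` function, a misspelled query such as "Herrman" lands in the same bucket as "Hermann" and "Herman". In a real system this lookup would run only when the query explicitly asks for a soundex match, as the slide notes.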
