Dictionaries and Tolerant Retrieval
Dictionaries and Tolerant Retrieval
Dictionaries and Tolerant Retrieval
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Introduction to Information <strong>Retrieval</strong>Sec. 3.4Soundex – typical algorithm• Turn every token to be indexed into a 4‐characterreduced form• Do the same with query terms• Build <strong>and</strong> search an index on the reduced forms• (when the query calls for a soundex match)• http://www.creativyst.com/Doc/Articles/SoundEx1/SoundEx1.htm#Top42