Introduction to Information Retrieval
Introduction to Information Retrieval
Introduction to Information Retrieval
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Introduction</strong> <strong>to</strong> <strong>Information</strong> <strong>Retrieval</strong><br />
Document frequency<br />
• We want high weights for rare terms like ARACHNOCENTRIC.<br />
• We want low (positive) weights for frequent words like<br />
GOOD, INCREASE and LINE.<br />
• We will use document frequency <strong>to</strong> fac<strong>to</strong>r this in<strong>to</strong><br />
computing the matching score.<br />
• The document frequency is the number of documents in<br />
the collection that the term occurs in.<br />
22