23.06.2015 Views

Introduction to Information Retrieval

Introduction to Information Retrieval

Introduction to Information Retrieval

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Introduction</strong> <strong>to</strong> <strong>Information</strong> <strong>Retrieval</strong><br />

Document frequency<br />

• We want high weights for rare terms like ARACHNOCENTRIC.<br />

• We want low (positive) weights for frequent words like<br />

GOOD, INCREASE and LINE.<br />

• We will use document frequency <strong>to</strong> fac<strong>to</strong>r this in<strong>to</strong><br />

computing the matching score.<br />

• The document frequency is the number of documents in<br />

the collection that the term occurs in.<br />

22

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!