23.06.2015 Views

Introduction to Information Retrieval

Introduction to Information Retrieval

Introduction to Information Retrieval

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Introduction</strong> <strong>to</strong> <strong>Information</strong> <strong>Retrieval</strong><br />

Instead of raw frequency: Log frequency<br />

weighting<br />

• The log frequency weight of term t in d is defined as follows<br />

• tf t,d → w t,d :<br />

0 → 0, 1 → 1, 2 → 1.3, 10 → 2, 1000 → 4, etc.<br />

• Score for a document-query pair: sum over terms t in both q<br />

and d:<br />

tf-matching-score(q, d) = t∈q∩d (1 + log tf t,d )<br />

<br />

• The score is 0 if none of the query terms is present in the<br />

document.<br />

17

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!