Introduction to Information Retrieval
Introduction to Information Retrieval Introduction to Information Retrieval
Introduction to Information Retrieval Query-document matching scores • How do we compute the score of a query-document pair? • Let’s start with a one-term query. • If the query term does not occur in the document: score should be 0. • The more frequent the query term in the document, the higher the score • We will look at a number of alternatives for doing this. 8
Introduction to Information Retrieval Take 1: Jaccard coefficient • A commonly used measure of overlap of two sets • Let A and B be two sets • Jaccard coefficient: • JACCARD (A, A) = 1 • JACCARD (A, B) = 0 if A ∩ B = 0 • A and B don’t have to be the same size. • Always assigns a number between 0 and 1. 9
- Page 1 and 2: Introduction to Information Retriev
- Page 3 and 4: Introduction to Information Retriev
- Page 5 and 6: Introduction to Information Retriev
- Page 7: Introduction to Information Retriev
- Page 11 and 12: Introduction to Information Retriev
- Page 13 and 14: Introduction to Information Retriev
- Page 15 and 16: Introduction to Information Retriev
- Page 17 and 18: Introduction to Information Retriev
- Page 19 and 20: Introduction to Information Retriev
- Page 21 and 22: Introduction to Information Retriev
- Page 23 and 24: Introduction to Information Retriev
- Page 25 and 26: Introduction to Information Retriev
- Page 27 and 28: Introduction to Information Retriev
- Page 29 and 30: Introduction to Information Retriev
- Page 31 and 32: Introduction to Information Retriev
- Page 33 and 34: Introduction to Information Retriev
- Page 35 and 36: Introduction to Information Retriev
- Page 37 and 38: Introduction to Information Retriev
- Page 39 and 40: Introduction to Information Retriev
- Page 41 and 42: Introduction to Information Retriev
- Page 43 and 44: Introduction to Information Retriev
- Page 45 and 46: Introduction to Information Retriev
- Page 47 and 48: Introduction to Information Retriev
- Page 49 and 50: Introduction to Information Retriev
- Page 51 and 52: Introduction to Information Retriev
- Page 53: Introduction to Information Retriev
<strong>Introduction</strong> <strong>to</strong> <strong>Information</strong> <strong>Retrieval</strong><br />
Query-document matching scores<br />
• How do we compute the score of a query-document pair?<br />
• Let’s start with a one-term query.<br />
• If the query term does not occur in the document: score<br />
should be 0.<br />
• The more frequent the query term in the document, the<br />
higher the score<br />
• We will look at a number of alternatives for doing this.<br />
8