Introduction to Information Retrieval
Introduction to Information Retrieval Introduction to Information Retrieval
Introduction to Information Retrieval Use angle instead of distance • Rank documents according to angle with query • Thought experiment: take a document d and append it to itself. Call this document d′. d′ is twice as long as d. • “Semantically” d and d′ have the same content. • The angle between the two documents is 0, corresponding to maximal similarity . . . • . . . even though the Euclidean distance between the two documents can be quite large. 38
Introduction to Information Retrieval From angles to cosines • The following two notions are equivalent. • Rank documents according to the angle between query and document in decreasing order • Rank documents according to cosine(query,document) in increasing order • Cosine is a monotonically decreasing function of the angle for the interval [0 ◦ , 180 ◦ ] 39
- Page 1 and 2: Introduction to Information Retriev
- Page 3 and 4: Introduction to Information Retriev
- Page 5 and 6: Introduction to Information Retriev
- Page 7 and 8: Introduction to Information Retriev
- Page 9 and 10: Introduction to Information Retriev
- Page 11 and 12: Introduction to Information Retriev
- Page 13 and 14: Introduction to Information Retriev
- Page 15 and 16: Introduction to Information Retriev
- Page 17 and 18: Introduction to Information Retriev
- Page 19 and 20: Introduction to Information Retriev
- Page 21 and 22: Introduction to Information Retriev
- Page 23 and 24: Introduction to Information Retriev
- Page 25 and 26: Introduction to Information Retriev
- Page 27 and 28: Introduction to Information Retriev
- Page 29 and 30: Introduction to Information Retriev
- Page 31 and 32: Introduction to Information Retriev
- Page 33 and 34: Introduction to Information Retriev
- Page 35 and 36: Introduction to Information Retriev
- Page 37: Introduction to Information Retriev
- Page 41 and 42: Introduction to Information Retriev
- Page 43 and 44: Introduction to Information Retriev
- Page 45 and 46: Introduction to Information Retriev
- Page 47 and 48: Introduction to Information Retriev
- Page 49 and 50: Introduction to Information Retriev
- Page 51 and 52: Introduction to Information Retriev
- Page 53: Introduction to Information Retriev
<strong>Introduction</strong> <strong>to</strong> <strong>Information</strong> <strong>Retrieval</strong><br />
From angles <strong>to</strong> cosines<br />
• The following two notions are equivalent.<br />
• Rank documents according <strong>to</strong> the angle between query and<br />
document in decreasing order<br />
• Rank documents according <strong>to</strong> cosine(query,document) in<br />
increasing order<br />
• Cosine is a mono<strong>to</strong>nically decreasing function of the angle<br />
for the interval [0 ◦ , 180 ◦ ]<br />
39