03.01.2015 Views

Combining Information from Multiple Internet Sources

Combining Information from Multiple Internet Sources

Combining Information from Multiple Internet Sources

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

4. Tests of the three approaches<br />

This chapter presents the tests of the three approaches: Game theory, Auction and<br />

Consensus. There were three queries issued for the testing purposes: consensus decision<br />

making, consensus decision making for conflict solving and is<br />

consensus decision making for conflict solving good enough or<br />

maybe Game theory or auction is better. The idea was to take three queries which<br />

relate to the same topic; however first was to be simple, second more complex and third was to be<br />

very complex, while retaining coherence.<br />

There were 5 search engines queried. Four of them were English-language-based: Google,<br />

Ask.com, Live, Yahoo! and one of Polish origin – Interia, which in fact is a Google based engine;<br />

however very often it produces results which differ <strong>from</strong> its parent engine. Search engines were set<br />

up to return 20 results for each query. This means that as input to tested algorithms there were 5<br />

result sets provided; each comprising of 20 URLs. This allowed for fast algorithm processing.<br />

The first phase of result evaluation was to compare the result sets of each of the three tested<br />

approaches against result sets produced by each search engine individually. There are two measures<br />

of comparison: Set Coverage and URL to URL coverage. Set Coverage measures how many URLs<br />

<strong>from</strong> the result of the algorithm is contained in the result set returned by the search engine<br />

regardless of the position of the URL. URL to URL measures how many URLs were at the same<br />

position in both results – of the algorithm and that of the search engine. Those measures however;<br />

were taken only for the 10 top results returned by each search engine. This means that in the<br />

algorithms result sets there may be answers which are not shown in the result set of any search<br />

engine. Afterwards, algorithms, for each query, were compared with each other and then with<br />

MySpiders system [11].<br />

42

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!