28.06.2013 Views

Papers in PDF format

Papers in PDF format

Papers in PDF format

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

MENTOR: Internet Search Advisor and In<strong>format</strong>ion Retrieval System<br />

Introduction<br />

A. A. Ts<strong>in</strong>akos and K. G. Margaritis<br />

Department of In<strong>format</strong>ics<br />

University of Macedonia<br />

54006 Thessaloniki<br />

GREECE<br />

Tel : +30-31- 891 891<br />

E-mail: ts<strong>in</strong>akos/kmarg@kirki.it.uom.gr<br />

Internet provides access to hundreds of gigabytes of <strong>in</strong><strong>format</strong>ion through a variety of wide-area fil<strong>in</strong>g,<br />

<strong>in</strong><strong>format</strong>ion retrieval, publish<strong>in</strong>g and library access systems. This rapidly grow<strong>in</strong>g data volume and diversity <strong>in</strong><br />

Internet has created significant problems related to the efficiency and accuracy of the <strong>in</strong><strong>format</strong>ion retrieval. To<br />

make effective use of this wealth of <strong>in</strong><strong>format</strong>ion, user needs means to locate <strong>in</strong><strong>format</strong>ion. Additionally,<br />

<strong>in</strong><strong>format</strong>ion <strong>in</strong> exist<strong>in</strong>g Internet repositories is heterogeneous, <strong>in</strong>consistent and sometimes <strong>in</strong>complete<br />

[Bowman and Danzig and Manber and Schwartz 1994]. This fact <strong>in</strong>creases the difficulty of the above<br />

mentioned problem. In the past few years, a number of such resource discovery tools have been created such<br />

as: 1) Internet Brows<strong>in</strong>g and Explor<strong>in</strong>g systems, such as Gopher, Hytelnet, Global Network Navigator, 2)<br />

Subject - Oriented Search systems, such as WWW, Virtual Library, Yahoo, USENET Frequently Asked<br />

Questions Archive, 3) Word - Oriented Search systems, such as Lycos, Web Crawler, Knowbot, Archie, WAIS<br />

and have ga<strong>in</strong>ed wide popular acceptance <strong>in</strong> the Internet.<br />

Further models orig<strong>in</strong>ally developed for Artificial Intelligence research, have been applied to In<strong>format</strong>ion<br />

Retrieval lead<strong>in</strong>g to the development and evaluation of <strong>in</strong>telligent retrieval models for text documents, such as<br />

those found <strong>in</strong> bibliographic databases. These retrieval models specify strategies for evaluat<strong>in</strong>g documents with<br />

respect to a given query, typically result<strong>in</strong>g <strong>in</strong> a ranked output. Hypertext researchers, on the other hand have<br />

emphasized flexible organizations of multimedia "nodes" through connection made with user-specified l<strong>in</strong>ks<br />

and <strong>in</strong>terfaces that facilitate brows<strong>in</strong>g <strong>in</strong> this network of l<strong>in</strong>ks. A number of approaches to the <strong>in</strong>tegration of<br />

query-based retrieval strategies and brows<strong>in</strong>g <strong>in</strong> hypertext networks have been proposed. The I3R system<br />

[Croft and Thompson 1987] and the medical handbook system described by Frisse, for example, use query<br />

based retrieval strategies to form a ranked list of candidate "start<strong>in</strong>g po<strong>in</strong>ts" for hypertext brows<strong>in</strong>g.<br />

F<strong>in</strong>ally a number of probabilistic retrieval models for hypertext have been proposed [Frisse and Cous<strong>in</strong>s 1989]<br />

[Savoy and Desbois, 1991]. These models view hypertext l<strong>in</strong>ks as specify<strong>in</strong>g important dependencies between<br />

hypertext nodes. The aim of the retrieval strategies based on these models are to improve the effectiveness of<br />

retrieval and to provide better start<strong>in</strong>g po<strong>in</strong>ts for brows<strong>in</strong>g [Croft and Turtle, 1993].<br />

In this paper we present a propose-and-revise system which automates the construction of a search strategy<br />

(<strong>in</strong> a specific doma<strong>in</strong>) for Internet based <strong>in</strong><strong>format</strong>ion retrieval, <strong>in</strong> order to help novice or non novice Internet<br />

users to access and retrieve <strong>in</strong><strong>format</strong>ion us<strong>in</strong>g a variety of Internet search eng<strong>in</strong>es and <strong>in</strong><strong>format</strong>ion resources.<br />

Overview description of MENTOR system<br />

MENTOR system can be analysed <strong>in</strong> five levels 1) User level: The system can be accessed by one or multiple<br />

users and this is the start<strong>in</strong>g po<strong>in</strong>t of user - system <strong>in</strong>teraction. 2) Data Input-Output level: The front - end<br />

<strong>in</strong>terface of the system, implemented <strong>in</strong> HTML, <strong>in</strong> which the user is allowed as first step, to <strong>in</strong>sert his<br />

selections <strong>in</strong> a dialogue box and additionally to receive the results both of the proposed search strategy and the<br />

Internet search. 3) MENTOR's Advisor level: At this level the system implements and comb<strong>in</strong>es user <strong>in</strong>puts<br />

with the expert suggestions (us<strong>in</strong>g the pre-stored Librarian, Internet and Doma<strong>in</strong> expert knowledge) <strong>in</strong> order to<br />

report the preferable search strategy, that is suggested to be followed, back to the user. A detailed description<br />

of this level is given <strong>in</strong> the follow<strong>in</strong>g section. 4) Query Trans<strong>format</strong>ion level: This level is responsible for the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!