28.06.2013 Views

Papers in PDF format

Papers in PDF format

Papers in PDF format

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Introduction<br />

Web Conceptual Space<br />

Robert E. Kent<br />

School of EECS<br />

Wash<strong>in</strong>gton State University<br />

USA<br />

rekent@eecs.wsu.edu<br />

Christian Neuss<br />

Fachgebiet Praktische In<strong>format</strong>ik<br />

Technische Hochschule Darmstadt<br />

Germany<br />

neuss@isa.<strong>in</strong><strong>format</strong>ik.th-darmstadt.de<br />

Abstract: In this paper various tools and techniques from the mathematical theory of conceptual<br />

knowledge process<strong>in</strong>g [Wille 1982] [Wille 1992] are applied to the conceptual universe of the<br />

World Wide Web. For the efficient and economical management of conceptual knowledge, the new<br />

idea of conceptual space is <strong>in</strong>troduced. For the conceptual empowerment of the Web user, a new<br />

technique called conceptual brows<strong>in</strong>g [Kent & Neuss 1996a] is advocated. Conceptual brows<strong>in</strong>g,<br />

which browses over conceptual space along conceptual l<strong>in</strong>ks, is dual mode (extentional versus<br />

<strong>in</strong>tentional) and dual scope (global versus local).<br />

Discovery of relevant resources is still one of the biggest problem <strong>in</strong> the cont<strong>in</strong>uously grow<strong>in</strong>g <strong>in</strong>frastructure of the<br />

World Wide Web. In order to allow the Web to become a true electronic library, it needs a service that identifies and<br />

locates documents belong<strong>in</strong>g to a given category, based on descriptive document attributes such as author, title, or<br />

subject. In traditional library science, resources are identified through catalog <strong>in</strong><strong>format</strong>ion, which is organized as a<br />

hierarchy of restricted subject categories. Searches are conducted upon this explicitly encoded descriptive <strong>in</strong><strong>format</strong>ion,<br />

and can easily be supported with relatively simple retrieval programs. Although traditional library science ma<strong>in</strong>ta<strong>in</strong>s<br />

<strong>in</strong>dexes manually, the dynamic nature of electronic documents requires a certa<strong>in</strong> automation of this task. Not only<br />

does the pure vastness of the document space forbid manual <strong>in</strong>dex<strong>in</strong>g, but many of the electronic documents are also<br />

subject to frequent change. Currently, the popular searchable <strong>in</strong>dexes of World Wide Web resource <strong>in</strong><strong>format</strong>ion are<br />

based upon Web robots: programs that connect to a remote site and recursively retrieve documents over the Network,<br />

usually <strong>in</strong> order to build an <strong>in</strong>dex. S<strong>in</strong>ce the HTML markup language provides relatively little support for encod<strong>in</strong>g<br />

document structure and semantic <strong>in</strong><strong>format</strong>ion [Nicol 1995], automatic tools have to search the document text and<br />

employ heuristic methods <strong>in</strong> order to judge document relevance.<br />

In order to address this problem, recent research aims at provid<strong>in</strong>g self-identify<strong>in</strong>g document structure [Barnard et al.<br />

1995], standards for the representation of metadata [Weibel et al. 1995] [Bowman & Schwartz 1996], and scaleable,<br />

distributed directory services for locat<strong>in</strong>g <strong>in</strong><strong>format</strong>ion resources such as Harvest [Bowman et al. 1994a] or the<br />

Synopsis File System [Bowman 1996]. Provided that sufficient descriptive <strong>in</strong><strong>format</strong>ion about documents is available<br />

for resource discovery tools, the question arises how the <strong>in</strong><strong>format</strong>ion acquired can best be digested and manipulated.<br />

One approach is to employ a graphical metaphor to represent <strong>in</strong><strong>format</strong>ion structures, and provide users with a maplike<br />

view of the underly<strong>in</strong>g <strong>in</strong><strong>format</strong>ion space [Robertson 1993]. These <strong>in</strong><strong>format</strong>ion visualization techniques can<br />

successfully be applied to the World Wide Web [Mukherja & Foley 1995]. However, creation of an effective view is<br />

difficult, as the available descriptive <strong>in</strong><strong>format</strong>ion lacks many useful semantic attributes [ibid.]. In general, visualization<br />

<strong>in</strong>volves a two- or three-dimensional projection from a multidimensional attribute space, and thus requires an<br />

<strong>in</strong>terpretation of the raw attribute data.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!