28.06.2013 Views

Papers in PDF format


semantic space comes from exploring relations which are relatively weak. These relations are captured in the denser network. The user typically identifies a single document or document region by browsing and traversing links from a known document in the sparse network, and then selects to have the weaker links of a node or node set become visible. These are identified by a differing color, and the point at which the selection is made is identified by a marker or signpost which is relatively large and can be easily zoomed back to during the course of exploration. Most simply, the user can select a node which represents a single document or term and expand or collapse connected nodes. Again, color is used to mark the course of browsing and network exploration.
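The interplay between the sparse network and the weaker links of the denser network can be sketched as follows. This is only an illustration under stated assumptions: the class `NetworkView`, its method names, and the color labels `"strong"` and `"weak"` are hypothetical and do not come from the system described here.

```python
class NetworkView:
    """Hypothetical view state: sparse links are always shown, while weak
    links appear only around nodes the user has chosen to expand."""

    def __init__(self, sparse_links, dense_links):
        # Links are undirected, so each one is stored as a frozenset of two nodes.
        self.sparse = {frozenset(link) for link in sparse_links}
        # Weak links are those present only in the denser network.
        self.weak = {frozenset(link) for link in dense_links} - self.sparse
        self.expanded = set()
        self.signposts = []  # expansion points, in the order they were made

    def expand(self, node):
        """Make the weak links of a node visible and leave a signpost there."""
        if node not in self.expanded:
            self.expanded.add(node)
            self.signposts.append(node)

    def collapse(self, node):
        """Hide the weak links of a node again and remove its signpost."""
        if node in self.expanded:
            self.expanded.discard(node)
            self.signposts.remove(node)

    def visible_links(self):
        """Return a mapping from each visible link to its color label."""
        shown = {link: "strong" for link in self.sparse}
        for link in self.weak:
            if link & self.expanded:  # link touches an expanded node
                shown[link] = "weak"
        return shown


view = NetworkView(
    sparse_links=[("a", "b")],
    dense_links=[("a", "b"), ("a", "c"), ("c", "d")],
)
view.expand("c")
for link, color in view.visible_links().items():
    print(sorted(link), color)
```

Keeping the signposts in a list preserves the order in which expansions were made, which is one simple way to record the course of exploration.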

Several navigational tools are available to the user which provide mechanisms for returning to previous viewing points, or supply information about the context of the current position. Visual bookmarks can be set by the user at any point while moving through the network. Simply selecting the bookmark returns the user to the viewpoint at the time the bookmark was set. As mentioned above, visual anchors can be set which remain visible from long viewing distances and can be returned to at later times. Other display mechanisms include link and node colorings indicating parts of the networks of high similarity to the user’s query and user-positioned navigational signposts pointing to the closest overview nodes or bookmarks. All navigation maintains fluid movement in the space, always zooming, rather than jumping, to new viewpoints.
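A minimal sketch of bookmarks combined with zooming rather than jumping is given below. It assumes a viewpoint described by a position and a viewing distance, and it uses plain linear interpolation between viewpoints; the names `Viewpoint`, `zoom_path`, and `Navigator` are hypothetical, and the actual system may animate movement differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Viewpoint:
    x: float
    y: float
    distance: float  # viewing distance from the network plane (assumed model)


def zoom_path(start, target, steps=30):
    """Return the intermediate viewpoints of a move, ending exactly at target."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    path = []
    for i in range(1, steps):
        t = i / steps  # fraction of the move completed
        path.append(Viewpoint(
            start.x + (target.x - start.x) * t,
            start.y + (target.y - start.y) * t,
            start.distance + (target.distance - start.distance) * t,
        ))
    path.append(target)  # finish on the target itself, avoiding rounding drift
    return path


class Navigator:
    """Tracks the current viewpoint and the bookmarks set by the user."""

    def __init__(self, start):
        self.current = start
        self.bookmarks = {}

    def set_bookmark(self, name):
        # A bookmark records the viewpoint at the time it is set.
        self.bookmarks[name] = self.current

    def move_to(self, target, steps=30):
        path = zoom_path(self.current, target, steps)
        self.current = path[-1]
        return path

    def return_to(self, name, steps=30):
        # Returning to a bookmark is also a zoom, never a jump.
        return self.move_to(self.bookmarks[name], steps)


nav = Navigator(Viewpoint(0.0, 0.0, 10.0))
nav.set_bookmark("overview")
nav.move_to(Viewpoint(4.0, 2.0, 1.0), steps=4)
for frame in nav.return_to("overview", steps=4):
    print(frame)
```

Each frame in the returned path would be rendered in turn, so the user sees continuous motion through the space and keeps a sense of where the new viewpoint lies relative to the old one.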

3. Query Formulation

Although the system’s most novel features center on its visual representations to support browsing and structure perception, it is useful for information retrieval systems to supply multiple paths of access to information items. For the type of minimum cost networks used in this system, there is a close relation to various clustering algorithms. Using the visual document space representation to browse documents can be characterized as a form of user directed, cluster based search. The system also supplies conventional vector space retrieval as an adjunct, supported by direct manipulation techniques for forming queries.
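One well-known instance of the relation between minimum cost networks and clustering is that removing the most costly links of a minimum spanning tree yields the clusters of single-link clustering. The sketch below illustrates this under the assumption that the network is a minimum spanning tree over pairwise document dissimilarities; the network actually used by the system may be of a different minimum cost type, and the function names and sample costs are hypothetical.

```python
def _find(parent, a):
    """Union-find root lookup with path halving."""
    while parent[a] != a:
        parent[a] = parent[parent[a]]
        a = parent[a]
    return a


def minimum_spanning_tree(n, edges):
    """Kruskal's algorithm. edges is a list of (cost, i, j) over nodes 0..n-1."""
    parent = list(range(n))
    tree = []
    for cost, i, j in sorted(edges):
        ri, rj = _find(parent, i), _find(parent, j)
        if ri != rj:  # the edge joins two components, so it creates no cycle
            parent[ri] = rj
            tree.append((cost, i, j))
    return tree


def single_link_clusters(n, tree, k):
    """Drop the k-1 most costly tree edges and return the remaining components.

    For a connected graph this gives exactly k clusters.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    kept = sorted(tree)[: max(0, len(tree) - (k - 1))]
    parent = list(range(n))
    for _, i, j in kept:
        parent[_find(parent, i)] = _find(parent, j)
    groups = {}
    for node in range(n):
        groups.setdefault(_find(parent, node), []).append(node)
    return sorted(groups.values())


# Hypothetical dissimilarities between five documents (lower means more similar).
edges = [
    (0.10, 0, 1), (0.20, 1, 2), (0.30, 0, 2),
    (0.15, 3, 4), (0.80, 0, 3), (0.90, 2, 3), (0.95, 1, 4),
]
tree = minimum_spanning_tree(5, edges)
print(tree)
print(single_link_clusters(5, tree, k=2))
```

Browsing along the links of such a network therefore keeps the user inside a cluster until a costly link is crossed, which is one way to read the characterization of browsing as cluster based search.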

In a typical use of the system the user enters a natural language statement of information need to begin the browsing and retrieval process. The system converts the natural language to a weighted vector representation and uses conventional weighted vector search to form a sequence of documents matching a vector representation of the query. The user can then select a document from this list to serve as the entry point in the network of documents, i.e., the viewpoint is positioned near that document and the document is distinguished by color.
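The retrieval step described above can be illustrated with a small weighted vector search. The text does not state which term weighting or similarity measure the system uses, so this sketch assumes TF-IDF weights and cosine similarity, a common choice in vector space retrieval; the function names and sample documents are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic terms."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector per document, plus the IDF table."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    # Assumed weighting: the added 1 keeps terms found in every document nonzero.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: c * idf[t] for t, c in Counter(toks).items()}
               for toks in tokenized]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def rank(query, docs):
    """Return indices of matching documents, best match first."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms absent from the collection cannot match and are ignored.
    q = {t: c * idf[t] for t, c in Counter(tokenize(query)).items() if t in idf}
    scores = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    ordered = sorted(scores, key=lambda p: (-p[0], p[1]))
    return [i for s, i in ordered if s > 0]


docs = [
    "visual network of documents",
    "vector space retrieval of documents",
    "navigation bookmarks and signposts",
]
print(rank("documents network", docs))
```

The first index in the returned list would be a natural candidate for the entry point into the document network.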

The availability of an association map of keyword relations, a term space, assists the user in formulating queries. The relations among terms as portrayed in an association map are typically quite different from those found in a conventional thesaurus and serve as an alternative ordering of term relations. To support query formulation the system provides direct manipulation facilities for constructing queries. A visual, manipulable representation of the user’s query is displayed together with the natural language from which it was originally derived. This visual representation can be manipulated by the user by moving terms from the term network into the query window and connecting them to the query graph. The vector space retrieval is based on the query graph and is performed by converting it to a weighted vector representation. This functionality could be provided by other interaction mechanisms, yet the visual graph manipulation technique encourages active interaction generally, and allows the use of network representations throughout the system.
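The text does not say how a query graph is converted to a weighted vector, so the following is purely a hypothetical illustration of one possible rule: every term in the graph starts with a weight of 1, each link adds its strength to both of its end terms, and the result is normalized to unit length. The function name, the weighting rule, and the sample terms are all assumptions.

```python
import math


def query_graph_to_vector(terms, links):
    """Convert a query graph to a unit-length weighted term vector.

    terms: iterable of term strings (the graph nodes).
    links: iterable of (term_a, term_b, strength) tuples (the graph edges).
    Assumed rule: base weight 1.0 per term, plus the strength of every link
    attached to it, so well-connected terms weigh more.
    """
    weights = {t: 1.0 for t in terms}
    for a, b, strength in links:
        if a not in weights or b not in weights:
            raise KeyError(f"link refers to a term outside the query: {a}, {b}")
        weights[a] += strength
        weights[b] += strength
    norm = math.sqrt(sum(w * w for w in weights.values()))
    return {t: w / norm for t, w in weights.items()} if norm else {}


# The user has dragged "browsing" into the query window but not yet linked it.
vector = query_graph_to_vector(
    terms=["network", "document", "browsing"],
    links=[("network", "document", 0.5)],
)
for term, weight in sorted(vector.items()):
    print(f"{term}: {weight:.3f}")
```

Under this rule, connecting a newly added term to the query graph raises its weight and that of its neighbor, so graph edits by the user translate directly into changes in the retrieval vector.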

4. Acknowledgments

Work on this project has been supported by NASA grants NAG9-551 and NAG9-842.

5. Conclusion

The Document Explorer supplies visualization, browsing, and query formulation mechanisms based on the semantic content of WWW documents. Users can view and interact with a visually displayed network of documents based on content similarity in a WWW information space that is an alternative to link-based representations. Relationships among individual keywords in the documents are also displayed visually to support query formulation by direct manipulation and to convey information about the keyword set. Navigation and orientation tools facilitate interaction and enhance perception of the document set and term collection structures.
