28.06.2013 Views

Papers in PDF format

Papers in PDF format

Papers in PDF format

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>in</strong> our system is quite large, and provides sufficient, language <strong>in</strong>dependent <strong>in</strong><strong>format</strong>ion to potential buyers and<br />

sellers of products that can be traded with<strong>in</strong> the system.<br />

We currently have no scientific criteria to show that our selection of topics and vocabularies is <strong>in</strong>deed<br />

necessary and sufficient to provide mean<strong>in</strong>gful communication <strong>in</strong> specific section. Our approach here has been to<br />

make empirical <strong>in</strong>vestigations of classified ads <strong>in</strong> newspapers as well as on-l<strong>in</strong>e services, and <strong>in</strong>clude the most<br />

commonly found topics and their related vocabularies. We are consider<strong>in</strong>g us<strong>in</strong>g computerized <strong>in</strong><strong>format</strong>ion<br />

extraction from on-l<strong>in</strong>e services, such as <strong>in</strong> the FRUMP system, [DeJong 79] and [DeJong 82].<br />

To cont<strong>in</strong>ue the above example, the section Automobile may have the follow<strong>in</strong>g def<strong>in</strong>itions:<br />

-> Topics : (Vehicle Type, Manufacturer, Price)<br />

-> Vocabularies : ((Sedan, Hard Top/Coupe, Open Convertible, Sports Car ... etc.) ( Acura, Alfa, Audi, Aust<strong>in</strong>,<br />

BMW ... etc.) ( 1 .. 20,000))<br />

The user can now be more specific <strong>in</strong> his formulation: "I would like to search the sell ad list<strong>in</strong>gs for an<br />

automobile which is a sedan or sports car made by Audi, cost<strong>in</strong>g no more than $ 10,000."<br />

The mathematical foundation for the approach <strong>in</strong> 3.1 and 3.2 above is a simple application of comb<strong>in</strong>ation theory.<br />

The system is a comb<strong>in</strong>ation of well-formed sentences which can be gotten out of the formula:<br />

Actions * Intentions * Sections<br />

where each Section has the follow<strong>in</strong>g number of possible comb<strong>in</strong>ations:<br />

Vocabulary-Topic1 * Vocabulary-Topic2 * ... * Vocabulary-TopicN<br />

Dur<strong>in</strong>g the design of the system we have come to talk about our enumerated language as a "pickup-language".<br />

This is because we provide a graphical, ma<strong>in</strong>ly menu-based <strong>in</strong>terface for submitt<strong>in</strong>g and trad<strong>in</strong>g of goods <strong>in</strong> each<br />

section, as opposed to newspapers which require more human process<strong>in</strong>g, especially for ad submissions. For<br />

example, if someone wants to sell a car, he/she <strong>in</strong>puts <strong>in</strong><strong>format</strong>ion by pick<strong>in</strong>g up menu choices for maker, brand,<br />

model etc. Display of <strong>in</strong><strong>format</strong>ion to potential buyers is then based on menu selections, so that unsuitable<br />

<strong>in</strong><strong>format</strong>ion neither can be <strong>in</strong>- or output from the system.<br />

3.3 Mapp<strong>in</strong>g Between Equal Vocabularies -- to Handle Multiple Languages<br />

So far, we have mostly talked about <strong>in</strong>put of <strong>in</strong><strong>format</strong>ion <strong>in</strong>to the system, us<strong>in</strong>g our "pickup-language". However,<br />

for communication to occur, there must also be output. All <strong>in</strong>put to the system is stored as English "pickup-words".<br />

When someone browses the system, this <strong>in</strong><strong>format</strong>ion is then presented <strong>in</strong> tabular form, as shown <strong>in</strong> Figure 1. This<br />

tabular presentation is easily readable and has the advantage that it is highly language neutral and does not<br />

depend on other syntactic rules, such as word order, plural forms, etc. A mapp<strong>in</strong>g for each word is provided so<br />

that the user can browse the system <strong>in</strong> his/her language, regardless of what the <strong>in</strong>put language orig<strong>in</strong>ally was<br />

[Bjorn and Kim 96].<br />

Figure 1: Tabular representation of the "pickup-language"<br />

The same is also true for <strong>in</strong>put: For each menu choice, a mapp<strong>in</strong>g from a base language to each supported<br />

language is provided, so that the user can <strong>in</strong>teract with the system <strong>in</strong> his/her preferred language but still make the<br />

<strong>in</strong><strong>format</strong>ion available to speakers of other languages. A mapp<strong>in</strong>g (from the topic level down) for the above

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!