28.06.2013 Views

Papers in PDF format

Papers in PDF format

Papers in PDF format

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

the worst case, we can make a rough judgement on the value of resource transparently for users based on how<br />

often it has been retrieved by users.<br />

We feel it would be a good idea to also <strong>in</strong>clude at least “Could not get” check box and “Submit” button <strong>in</strong>to<br />

result forms of other search eng<strong>in</strong>es. This way would enable the search eng<strong>in</strong>es to clean their <strong>in</strong>dexes.<br />

How It Works<br />

Each broker consists of a database and software that performs follow<strong>in</strong>g functions:<br />

1. Ma<strong>in</strong>ta<strong>in</strong>, <strong>in</strong>dex and search the local cache of the abstracts..<br />

2. Process <strong>in</strong>com<strong>in</strong>g queries locally and send results back to clients.<br />

3. If a query can not be processed locally or the user is not satisfied with the results of local process<strong>in</strong>g, make<br />

decisions on rout<strong>in</strong>g the query to other sources of meta-<strong>in</strong><strong>format</strong>ion.<br />

4. Make decisions on cach<strong>in</strong>g (replication) of remote query results (which are sets of abstracts) locally.<br />

5. Publish (make available for queries) the abstracts of local and/or authorised users.<br />

6. Periodically recommend the most popular abstracts to other brokers that might be <strong>in</strong>terested <strong>in</strong> gett<strong>in</strong>g a copy<br />

of the abstract for replication.<br />

7. Make decisions on cach<strong>in</strong>g locally abstracts recommended by other brokers.<br />

Data<br />

The way we build the broker’s cache is similar to what happens <strong>in</strong> the process of human learn<strong>in</strong>g. We (human<br />

beens) collect relatively small bits of knowledge about every possible topic we face <strong>in</strong> our life, from road traffic<br />

rules to rumours about crimes. The more it is <strong>in</strong>terest<strong>in</strong>g for us, the more often we use this knowledge, the<br />

longer we remember it. We also have knowledge (often vague) about possible sources where we can get more<br />

<strong>in</strong><strong>format</strong>ion on a given topic, if necessary. This part of memory is our knowledge cache and knowledge map.<br />

Most of people are experts <strong>in</strong> a narrow area. If they are recognised as the experts, their expertise is used by<br />

other people directly or <strong>in</strong>directly. If one is a known expert <strong>in</strong> some area, we ask them for <strong>in</strong><strong>format</strong>ion on this<br />

area and sometime we suggest a new <strong>in</strong><strong>format</strong>ion for their judgement or if we want this <strong>in</strong><strong>format</strong>ion to be<br />

known to other people through them or just to contribute to their expertise.<br />

The broker’s database conta<strong>in</strong>s three groups of abstracts.<br />

- Abstracts which are popular among the local users and form a corporate user profile. We call this group the<br />

“wide view group”. It is knowledge cache and knowledge map of the broker.<br />

- A specialised group of abstracts on some topic, prevail<strong>in</strong>g <strong>in</strong> the wide view group. We call this group the<br />

“expertise” group.<br />

- Abstracts that are published by local users. We call this group the “local group”.<br />

Additional <strong>in</strong><strong>format</strong>ion is stored for each abstract. This <strong>in</strong>cludes expiry date, date last used (for resource<br />

retrieval or query rout<strong>in</strong>g), source broker, popularity rat<strong>in</strong>g, cost of retrieval, user profiles.<br />

It makes sense to store abstracts of the wide view group because the <strong>in</strong><strong>format</strong>ion that is <strong>in</strong>terest<strong>in</strong>g for one user<br />

with<strong>in</strong> the organisation, is very likely to be <strong>in</strong>terest<strong>in</strong>g for the others. Second, this group of abstracts forms a<br />

knowledge map of the outside world of <strong>in</strong><strong>format</strong>ion sources. It conta<strong>in</strong>s a dynamic set of abstracts cover<strong>in</strong>g a<br />

wide range of topics. This <strong>in</strong><strong>format</strong>ion is used to choose the best sources for given search topic.<br />

Storage of the expertise group makes sense because we need to concentrate knowledge of a topic as much as<br />

possible to reduce need for query distribution. In other words, we have to achieve specialisation of a broker <strong>in</strong><br />

some topic. If we do this, only a few brokers need to be searched for a particular topic, or a comb<strong>in</strong>ation of<br />

topics, <strong>in</strong>stead of the millions of brokers as <strong>in</strong> the case of dispersed <strong>in</strong><strong>format</strong>ion.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!