21.11.2013 Views

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

MEETING <strong>OF</strong> THE COUNCIL WORKING PARTY<br />

ON LEGAL DATA PROCESSING<br />

Both tools aim at supporting the user and both tackle certain language issues<br />

related to the database.<br />

2.1. The Eurovoc thesaurus<br />

the irst edition of the Eurovoc thesaurus dates back to 1984. today this<br />

controlled vocabulary is available in version 4.2 in 18 oficial languages (exceptions<br />

are maltese and Estonian) and comes with approximately 6 400 descriptors,<br />

an almost equivalent number of reciprocal hierarchical relationships (broader–<br />

narrower term) and around 3 500 reciprocal associative relations (related term).<br />

the added value that this tool provides for EUR-Lex users cannot immediately<br />

be derived from these purely statistical igures. But queries based on natural<br />

language search terms already come with certain weaknesses when applied<br />

to a monolingual database. Cases of homonymy and synonymy can be referred<br />

to as blurring the results of that kind of query. Nevertheless, quite a few users<br />

tend to perform this kind of search irst — and sometimes only this kind.<br />

the natural language search term ‘electricity’ was<br />

found in the title of 573 documents (<strong>EN</strong> advanced search<br />

query [tI=electricity]). for comparison, in the Italian language<br />

version of the database, the search in the title for<br />

the synonyms ‘corrente elletrica’ AND ‘elletricità’, combined<br />

with a Boolean OR, produced only 278 hits (It<br />

advanced search query [tI=corrente elletrica OR<br />

tI=elletricità]) and applying ‘Strom’ OR ‘Elektrizität’ in<br />

the German language (<strong>DE</strong> advanced search query<br />

[tI=Strom OR tI=Elektrizität]) gathers only 149 references. One reason for the<br />

comparably low number of hits in the German example is the availability of<br />

numerous composites in the German language which are not retrieved by that<br />

example of a query.<br />

Consulting the Eurovoc thesaurus results, for all ive terms that were used<br />

for the above queries, in only one common descriptor, which is ‘electrical energy’<br />

in English, ‘energia elettrica’ in Italian and the corresponding ‘elektrische<br />

Energie’ in German. the terms previously used for the search in the title all<br />

serve as non-descriptors in the controlled vocabulary and provide a reference<br />

to the descriptor.<br />

Applying the descriptor for a search exploiting the Eurovoc ield in the<br />

EUR-Lex database (e.g. [DC=electrical energy]) results, for all the three language<br />

versions mentioned, in the same number of hits (which was 1 123).<br />

62 | 63<br />

01_2007_5222_txt_ML.indd 63 6-12-2007 15:13:42

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!