06.06.2013 Views

Patentee Name Harmonisation - ecoom.be

Patentee Name Harmonisation - ecoom.be

Patentee Name Harmonisation - ecoom.be

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Impact<br />

From 438,069 unique names to 438,061 unique names, an additional reduction of 8 names, or a<br />

total reduction of 5,661 names (1.3%).<br />

1.2.4 Remove non-alphanumerical characters at the <strong>be</strong>ginning of a name<br />

Description<br />

A name is expected to <strong>be</strong>gin with a letter, a digit, or some relevant character but not with a<br />

character such as “.” or “,”.<br />

Non-alphanumerical characters at the <strong>be</strong>ginning of a name that are not relevant should <strong>be</strong><br />

removed.<br />

Analysis<br />

<strong>Name</strong>s that <strong>be</strong>gin with an irrelevant non-alphanumerical character are identified by querying the<br />

data for names where the first character does not <strong>be</strong>long to the following set of letters, digits<br />

and other relevant characters: A-Z; 0-9; “””; “@”; “(“; “’”; “#”; “!”; “*”; “/”.<br />

23 names were found to contain an irregular first character although only 18 actually <strong>be</strong>gin with<br />

an irrelevant character.<br />

The following characters were identified for removal if they appear at the <strong>be</strong>ginning of a name:<br />

“.”; “-“; “?”; “:”; “_”.<br />

Implementation<br />

All occurrences of “.”; “-“; “?”; “!”; “*”; “:”; “_” are removed from the <strong>be</strong>ginning of a name by<br />

executing an update query on the data.<br />

As the removal of irrelevant characters at the <strong>be</strong>ginning of a name can lead to leading spaces,<br />

names have to <strong>be</strong> checked for and trimmed of leading spaces after the removal of irrelevant<br />

characters at the <strong>be</strong>ginning of a name.<br />

The removal of characters at the <strong>be</strong>ginning of a name can also lead to a new irregular <strong>be</strong>ginning<br />

of a name, so this step has to <strong>be</strong> executed several times until no further irregularities are found.<br />

Result<br />

Irrelevant non-alphanumerical characters at the <strong>be</strong>ginning of a name have <strong>be</strong>en removed in 18<br />

names.<br />

Impact<br />

From 438,061 unique names to 438,052 unique names, an additional reduction of 9 names, or a<br />

total reduction of 5,670 names (1.3%).<br />

1.2.5 Remove non-alphanumerical characters at the end of a name<br />

Description<br />

A name is expected to end with a letter, a digit or some relevant character, but not with a<br />

character like “:” or “;”.<br />

Non-alphanumerical characters at the end of a name that are not relevant should <strong>be</strong> removed.<br />

Analysis<br />

<strong>Name</strong>s that end with an irrelevant non-alphanumerical character are identified by querying the<br />

data for names where the last character does not <strong>be</strong>long to the following set of letters, digits<br />

and other relevant characters: A-Z; 0-9; “.”; “’”; “””; “)”.<br />

1,528 names were found containing an irregular end.<br />

668 end with “ DITE:”, “ DITE,” or “ DITE :”. Normally, this should <strong>be</strong> followed by an acronym<br />

or nickname. Since this is not the case here, the 668 occurrences of “DITE:”, “ DITE,” and “DITE<br />

:” can <strong>be</strong> removed.<br />

24

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!