01.02.2015 Views

Free/Open-Source Machine Translation: a three-day tutorial ... - CNGL

Free/Open-Source Machine Translation: a three-day tutorial ... - CNGL

Free/Open-Source Machine Translation: a three-day tutorial ... - CNGL

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

(a) Description<br />

• Background (interNOSTRUM, Universia)<br />

• Rationale<br />

• Apertium as a shallow-transfer machine translation platform:<br />

engine, data, tools (Armentano et al. 2006).<br />

• Language-pair data<br />

• Funding<br />

• The Apertium community as an example of FOSS development<br />

(repositories: the trunk, the incubator; roles).<br />

• Apertium tools: apertium-dixtools, apertium-transfertools,<br />

apertium-tagger-training-tools.<br />

• Apertium-based applications (Tinylex, Wordpress plugin,<br />

Pidgin plugin, <strong>Open</strong>Office.org plugins, etc.)<br />

• Apertium as a research platform (Sánchez-Martínez and<br />

Forcada 2007, Sánchez-Martínez et al. 2008).<br />

(b) Laboratory: installing and modifying Apertium<br />

• Installing Apertium from the latest sources on a virtual<br />

Linux machine.<br />

• Changing the data for a language pair: vocabularies and,<br />

optionally, transfer rules.<br />

4. Corpus-based FOSMT: Moses (*)<br />

(a) Statistical machine translation<br />

• Statistical MT (SMT)<br />

• The data: sentence-aligned corpora.<br />

• Training: the statistical models<br />

• The SMT engine or “decoder”<br />

(b) Description of Moses and related software:<br />

• Training: Giza++<br />

• Language models: irstlm<br />

• Tuning: MERT<br />

• Decoding: Moses<br />

(c) Laboratory: installing and running Moses.<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!