22.01.2015 Views

Military Communications and Information Technology: A Trusted ...

Military Communications and Information Technology: A Trusted ...

Military Communications and Information Technology: A Trusted ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

270 <strong>Military</strong> <strong>Communications</strong> <strong>and</strong> <strong>Information</strong> <strong>Technology</strong>...<br />

There are various NLP technologies that can support intelligence with the processing<br />

of text. A selection of those is presented below:<br />

• <strong>Information</strong> retrieval (IR): <strong>Information</strong> retrieval is the technology underlying<br />

modern search engines, i.e., it is concerned with efficiently <strong>and</strong> effectively<br />

searching for documents. By the application of professional search methods<br />

we are able to find specific texts from document collections like databases<br />

or the World Wide Web.<br />

• Document classification: Document classification (or text categorization)<br />

is an NLP method to assign documents to one or more classes or categories.<br />

This may also mean the classification of documents relevance. As document<br />

classification is usually used to enhance information retrieval, it is<br />

sometimes regarded as a sub-field of IR.<br />

• <strong>Information</strong> extraction (IE): <strong>Information</strong> extraction describes the extraction<br />

of specifically defined information from text. For this, different text<br />

processing is performed. Critical information is identified <strong>and</strong> extracted<br />

<strong>and</strong> may also be structured <strong>and</strong> combined.<br />

• Text mining: Text mining refers to the process of deriving text content. The aim<br />

is to discover previously unknown information that is relevant for a particular<br />

purpose. In the course of text mining different NLP technology may be applied<br />

such as, for example, the related task of IE to extract specific items.<br />

• Opinion mining: Opinion mining, which is also called sentiment analysis,<br />

involves the analysis of subjective information from text. It is used to try to<br />

determine the attitude of a writer with respect to some topic or the overall<br />

contextual polarity of a document.<br />

• Machine translation (MT): The automatic translation of text is done by<br />

machine translation. There are different approaches to generate a target<br />

language translation from a source language text. Which approach is best<br />

depends on the application <strong>and</strong> various circumstances. Most modern research<br />

is performed in the area of statistical machine translation (SMT).<br />

In section 4, SMT is described in more detail.<br />

The technologies presented can be applied to assist the human intelligence<br />

analyst. Through automatic document processing, the intelligence process can be<br />

improved in terms of efficiency as well as effectiveness. The computational processing<br />

of text is much faster than human processing. This means that time-critical information<br />

will reach the mission’s comm<strong>and</strong> <strong>and</strong> control in a much shorter time. Many<br />

intelligence tasks cannot possibly be performed by humans (such as the manual<br />

search for information in the web). Only by the use of technology are we able to<br />

fully exploit the information that we have access to nowadays. In the following, we<br />

illuminate one possible approach of supporting the intelligence process by the application<br />

of NLP technology (Figure 2).<br />

a) Collection: Intelligence involves the collection of data that are relevant for<br />

the mission. There are different sources of relevant textual data. For example,

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!