28.01.2013 Views

SAP HANA Developer Guide - Get a Free Blog


| Column ID | Key | Description | Data Type |
|---|---|---|---|
| | | ... given processing type (hence the previous attribute as key). | |
| TA_TOKEN | - | Term or entity, depending on the processing type. | NVARCHAR(250) |
| TA_LANGUAGE | - | Usually, the document's language is stated in the source table. In the rare cases where it is not, the language code is stored here. As there is no support for multi-language documents, the language code is identical for all result records of a document. | NVARCHAR(2) |
| TA_TYPE | - | The linguistic or semantic type of the token; for instance "noun" (if option = LINGANALYSIS_*) or "company" (if option = EXTRACTION_*). | NVARCHAR(100) |
| TA_NORMALIZED | - | Stores a normalized representation of the token. This is relevant, for example, for German umlauts or ß/ss. Normalizing capitalization alone would not be important enough to justify this column. | NVARCHAR(250) |
| TA_STEM | - | Stores the linguistic stemming information, e.g. the singular nominative for nouns, or the indicative for verbs. If text analysis yields several stems, only the first stem is stored, on the assumption that it is the best match. | NVARCHAR(300) |
| TA_PARAGRAPH | - | Stores the number of all paragraphs in the document. | INTEGER |
| TA_SENTENCE | - | Stores the number of all sentences in a document. | INTEGER |
| TA_CREATED_AT | - | Stores the creation time. Used purely as administrative information, e.g. for reorganization purposes. | TIMESTAMP |

The $TA table can be modified like any other table, but it cannot be partitioned.

You can add columns to this table, e.g. with a statement like ALTER TABLE "TA"."$TA_SOURCE_TEXT_TA" ADD(C INTEGER GENERATED ALWAYS AS 1). In this example, the value 1 is written to the new column C for existing rows only. The new column is not filled automatically in delta updates.
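As an illustration of how the $TA columns described above might be used, the following is a minimal sketch rather than an official recipe. The schema "TA" and table "$TA_SOURCE_TEXT_TA" are taken from the example statement in the text. The literal 'noun' follows the TA_TYPE description, and the NULL check on column C rests on an assumption the guide does not state.

```sql
-- Sketch only: schema "TA" and table "$TA_SOURCE_TEXT_TA" come from the
-- example statement in the text; adjust both to your own full-text index.

-- Most frequent noun stems. The literal 'noun' follows the TA_TYPE example
-- in the column description; the exact type strings may differ by system.
SELECT TA_STEM, COUNT(*) AS OCCURRENCES
  FROM "TA"."$TA_SOURCE_TEXT_TA"
 WHERE TA_TYPE = 'noun'
 GROUP BY TA_STEM
 ORDER BY OCCURRENCES DESC;

-- Add the extra column, using the statement quoted in the text.
ALTER TABLE "TA"."$TA_SOURCE_TEXT_TA" ADD (C INTEGER GENERATED ALWAYS AS 1);

-- Assumption (not stated in the source): rows written by later delta
-- updates show NULL in C, so this count reveals how many were not filled.
SELECT COUNT(*) AS UNFILLED_ROWS
  FROM "TA"."$TA_SOURCE_TEXT_TA"
 WHERE C IS NULL;
```

If the last query returns a non-zero count after new documents have been indexed, that would be consistent with the statement that the added column is not filled in delta updates.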

SAP HANA Developer Guide

Enabling Search

PUBLIC

© 2012 SAP AG. All rights reserved. 245
