25.10.2016 Views

SAP HANA Predictive Analysis Library (PAL)

sap_hana_predictive_analysis_library_pal_en

sap_hana_predictive_analysis_library_pal_en

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

BPNN performs three steps as follows:<br />

1. Build the network structure and initialize the network.<br />

2. Feed the training data repeatedly to train the network.<br />

3. Make prediction with the trained network.<br />

Training<br />

During the training, BPNN adjusts the connection weights by comparing its output and expected target to<br />

make the output more accurate. The following training styles are available:<br />

●<br />

●<br />

Batch training: weights updating is based on the error of the entire package of training patterns. Thus, in<br />

one round the weights are updated once.<br />

Stochastic training: weights updating is based on the error of a single training pattern. Thus, in one round<br />

the weights are updated for each pattern.<br />

Support for Categorical Attributes<br />

If an attribute is of category type, it will be converted to a binary vector and then be used as numerical<br />

attributes. For example, in the below table, "Gender" is of category type.<br />

Table 87:<br />

Customer ID Age Income Gender<br />

T1 31 10,000 Female<br />

T2 27 8,000 Male<br />

Because "Gender" has two distinct values, it will be converted into a binary vector with two dimensions:<br />

Table 88:<br />

Customer ID Age Income Gender_1 Gender_2<br />

T1 31 10,000 0 1<br />

T2 27 8,000 1 0<br />

Prerequisites<br />

For training data:<br />

●<br />

●<br />

●<br />

The data are of integer, varchar, nvarchar, or double data type and do not contain null value. Otherwise the<br />

algorithm will issue errors.<br />

If it is for classification, then the last column is considered as the label column and is of integer, varchar, or<br />

nvarchar type.<br />

If it is for regression, then you should specify how many last columns are considered as target values, and<br />

they are of integer or double type.<br />

For predicted data:<br />

●<br />

●<br />

●<br />

The data are of integer, varchar, nvarchar, or double data type and does not contain null value. Otherwise<br />

the algorithm will issue errors.<br />

The first column is ID column and should be of integer type.<br />

The column order and column number of the predicted data are the same as the order and number used in<br />

model training.<br />

126 P U B L I C<br />

<strong>SAP</strong> <strong>HANA</strong> <strong>Predictive</strong> <strong>Analysis</strong> <strong>Library</strong> (<strong>PAL</strong>)<br />

<strong>PAL</strong> Functions

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!