25.10.2016 Views

SAP HANA Predictive Analysis Library (PAL)

sap_hana_predictive_analysis_library_pal_en

sap_hana_predictive_analysis_library_pal_en

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.2.13 Random Forest<br />

The ransom forests algorithm is an ensemble learning method for classification or regression. It grows many<br />

CART decision trees and outputs the class (classification) that is voted by majority of individual trees or mean<br />

prediction (regression) of the individual trees.<br />

The algorithm uses both bagging and random feature selection techniques. Each new training set is drawn<br />

with replacement from the original training set, and then a tree is grown on the new training set using random<br />

feature selection.<br />

The random forest algorithm generates an internal unbiased estimate (out-of-bag error) of the generalization<br />

error as the forest building processes. It gives estimates of what variables are important from nodes’ splitting<br />

process. It also has an effective method for estimating missing data. If the m th variable is numerical, the<br />

method computes the median of all values of this variable in class j or computes the most frequent nonmissing<br />

value in class j, and then it uses this value to replace all missing values of the m th variable in class j.<br />

Prerequisite<br />

The target column of the training data must not have null values, and other columns should have at least one<br />

valid value (not null).<br />

RANDOMFORESTTRAIN<br />

This function is used for classification or regression.<br />

Procedure Generation<br />

CALL SYS.AFLLANG_WRAPPER_PROCEDURE_CREATE (‘AFL<strong>PAL</strong>’, ‘RANDOMFORESTTRAIN’,<br />

‘’, '', );<br />

The signature table should contain the following records:<br />

Table 159:<br />

Position Schema Name Table Type Name Parameter Type<br />

1 <br />

IN<br />

2 IN<br />

3 <br />

OUT<br />

4 <br />

OUT<br />

<strong>SAP</strong> <strong>HANA</strong> <strong>Predictive</strong> <strong>Analysis</strong> <strong>Library</strong> (<strong>PAL</strong>)<br />

<strong>PAL</strong> Functions P U B L I C 219

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!