25.10.2016 Views

SAP HANA Predictive Analysis Library (PAL)

sap_hana_predictive_analysis_library_pal_en

sap_hana_predictive_analysis_library_pal_en

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>PAL</strong>_CLUSTER_ASSIGNED_TBL:<br />

Related Information<br />

K-Means [page 63]<br />

DBSCAN [page 47]<br />

Self-Organizing Maps [page 103]<br />

3.1.5 DBSCAN<br />

DBSCAN (Density-Based Spatial Clustering of Applications with Noise) is a density-based data clustering<br />

algorithm. It finds a number of clusters starting from the estimated density distribution of corresponding<br />

nodes.<br />

DBSCAN requires two parameters: scan radius (eps) and the minimum number of points required to form a<br />

cluster (minPts). The algorithm starts with an arbitrary starting point that has not been visited. This point's<br />

eps-neighborhood is retrieved, and if the number of points it contains is equal to or greater than minPts, a<br />

cluster is started. Otherwise, the point is labeled as noise. These two parameters are very important and are<br />

usually determined by user.<br />

<strong>PAL</strong> provides a method to automatically determine these two parameters. You can choose to specify the<br />

parameters by yourself or let the system determine them for you.<br />

Prerequisites<br />

●<br />

●<br />

No missing or null data in the inputs.<br />

The data is numeric, not categorical.<br />

<strong>SAP</strong> <strong>HANA</strong> <strong>Predictive</strong> <strong>Analysis</strong> <strong>Library</strong> (<strong>PAL</strong>)<br />

<strong>PAL</strong> Functions P U B L I C 47

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!