13.08.2022 Views

advanced-algorithmic-trading

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 18

Tree-Based Methods

In this chapter a group of algorithms that make use of an underlying technique called a Decision

Tree will be considered. Decision Trees (DTs) are a supervised learning technique that predict

values of responses by learning decision rules derived from features. They can be used in both

a regression and a classification context. For this reason they are sometimes also referred to as

Classification And Regression Trees (CART).

DT/CART models are an example of a more general area of machine learning known as

adaptive basis function models. These models learn the features directly from the data,

rather than being prespecified, as in some other basis expansions. Unlike linear regression these

models are not linear in the parameters. This means that it is only possible to compute a locally

optimal maximum likelihood estimate (MLE) for the parameters[71], rather than a guaranteed

global optimum.

DT/CART models work by partitioning the feature space into a number of simple rectangular

regions divided up by axis parallel splits. In order to obtain a prediction for a particular

observation the mean or mode of the rectangle in feature space to which the observation belongs

is used.

One of the primary benefits of using an individual DT/CART is that by construction it

produces interpretable if-then-else decision rulesets, which are akin to graphical flowcharts. The

main disadvantage of a decision tree lies in the fact that it is often uncompetitive with other

supervised techniques such as support vector machines or deep neural networks in terms of

prediction accuracy.

However they become extremely competitive when used in an ensemble setting such as with

bootstrap aggregation ("bagging"), Random Forests or boosting.

In quantitative finance ensembles of DT/CART models are used in forecasting both future

asset prices and liquidity of certain instruments.

18.1 Decision Trees - Mathematical Overview

Under a probabilistic adaptive basis function specification the model f(x) is given by[71]:

M∑

f(x) = E(y | x) = w m φ(x; v m ) (18.1)

m=1

243

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!