
• P(θ|D) is the posterior. This is the (refined) strength of our belief in θ once the evidence D has been taken into account. After seeing 4 heads out of 8 flips, say, this is our updated view on the fairness of the coin.

• P(D|θ) is the likelihood. This is the probability of seeing the data D as generated by a model with parameter θ. If we knew the coin was fair, this would tell us the probability of seeing a given number of heads in a particular number of flips.

• P(D) is the evidence. This is the probability of the data, determined by summing (or integrating) across all possible values of θ, weighted by how strongly we believe in those particular values of θ. If we held multiple views of the fairness of the coin (but didn't know for sure), this would tell us the probability of seeing a certain sequence of flips across all possibilities of our belief in the coin's fairness. (See the numerical sketch after this list.)
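To make these three quantities concrete, here is a minimal numerical sketch in Python. It evaluates Bayes' rule on a discrete grid of candidate θ values for the hypothetical case of 4 heads in 8 flips; the grid resolution and the flat prior are illustrative assumptions, not part of the text above.

import numpy as np
from scipy.stats import binom

thetas = np.linspace(0.0, 1.0, 101)          # candidate fairness values
prior = np.ones_like(thetas) / len(thetas)   # flat prior belief over theta
likelihood = binom.pmf(4, 8, thetas)         # P(D|theta): 4 heads in 8 flips
evidence = np.sum(likelihood * prior)        # P(D): weighted sum over theta
posterior = likelihood * prior / evidence    # P(theta|D) via Bayes' rule

# The posterior peaks at theta = 0.5, matching the observed frequency.
print(thetas[np.argmax(posterior)])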

The entire goal of Bayesian inference is to provide us with a rational and mathematically sound procedure for combining our prior beliefs with any evidence at hand, in order to produce an updated posterior belief. What makes it such a valuable technique is that posterior beliefs can themselves be used as prior beliefs when new data are generated. Hence Bayesian inference allows us to continually adjust our beliefs as new data arrive, by repeatedly applying Bayes' rule.
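As a sketch of this sequential use of Bayes' rule, the posterior after each batch of flips simply becomes the prior for the next; the batch sizes below are hypothetical, chosen purely for illustration.

import numpy as np
from scipy.stats import binom

thetas = np.linspace(0.0, 1.0, 101)
belief = np.ones_like(thetas) / len(thetas)   # begin with a flat prior

# Each (heads, flips) pair is a hypothetical batch of new data.
for heads, flips in [(2, 4), (3, 5), (4, 8)]:
    likelihood = binom.pmf(heads, flips, thetas)
    belief = likelihood * belief              # the old posterior acts as the new prior
    belief /= belief.sum()                    # normalise by the evidence P(D)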

There was a lot of theory to take in within the previous two sections, so I'm now going to provide a concrete example using the age-old tool of statisticians: the coin-flip.

2.3 Coin-Flipping Example

In this example we are going to consider multiple flips of a coin with unknown fairness. We will use Bayesian inference to update our beliefs about the fairness of the coin as more data (i.e. more coin flips) become available. The coin will actually be fair, but we won't learn this until the trials are carried out. At the start we have no strong prior view on the fairness of the coin; that is, we can say that any level of fairness is equally likely.

In statistical language we are going to perform N repeated Bernoulli trials with θ = 0.5. We will use a uniform probability distribution to characterise our prior uncertainty about the fairness. This states that we consider each level of fairness (or each value of θ) to be equally likely.
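A minimal sketch of how the data for such an experiment might be generated, assuming NumPy and an arbitrary choice of N = 50 trials:

import numpy as np

rng = np.random.default_rng(42)         # seeded so the trials are reproducible
theta = 0.5                             # the coin really is fair
N = 50                                  # number of Bernoulli trials (arbitrary)
flips = rng.binomial(1, theta, size=N)  # 1 = heads, 0 = tails
print(flips.sum(), "heads out of", N, "flips")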

We are going to use a Bayesian updating procedure to move from our prior beliefs to posterior beliefs as we observe new coin flips. This is carried out using a mathematically succinct device known as conjugate priors. We won't go into much detail on conjugate priors within this chapter, as they will form the basis of the next chapter on Bayesian inference. They will, however, provide us with the means of explaining how the coin-flip example is carried out in practice.

The uniform distribution is actually a special case of another probability distribution, known as the Beta distribution. Conveniently, under the binomial model, if we use a Beta distribution for our prior beliefs it leads to a Beta distribution for our posterior beliefs. This is an extremely useful mathematical result, as Beta distributions are quite flexible in modelling beliefs. However, I don't want to dwell on the details of this too much here, since we will discuss it in the next chapter.
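To preview the result, here is a minimal sketch of the conjugate update, assuming SciPy and illustrative data of 4 heads in 8 flips: a Beta(α, β) prior combined with z heads in N flips yields a Beta(α + z, β + N − z) posterior, and Beta(1, 1) is exactly the uniform prior used above.

from scipy.stats import beta

alpha_prior, beta_prior = 1.0, 1.0    # Beta(1, 1): the uniform prior
z, N = 4, 8                           # hypothetical data: 4 heads in 8 flips

alpha_post = alpha_prior + z          # conjugate update of alpha
beta_post = beta_prior + (N - z)      # conjugate update of beta

posterior = beta(alpha_post, beta_post)
print(posterior.mean())               # posterior mean of theta is 0.5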
