100% found this document useful (1 vote)

99 views31 pages

Finance With Python and MPT

The document discusses using machine learning techniques for modern portfolio theory and efficient frontiers. It generates random portfolio weights and calculates returns and volatility to plot the efficient frontier. Features and targets are created from the portfolio data to train a random forest regressor model to predict optimal weights. Hypothetical backtesting shows the model outperforms buying and holding QQQ.

Uploaded by

ravinyse

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

100% found this document useful (1 vote)

99 views31 pages

Finance With Python and MPT

Uploaded by

ravinyse

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

You are on page 1/ 31

DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Modern portfolio theory

(MPT); efficient frontiers

Nathan George
Data Science Professor
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python

Joining data
stocks = ['AMD', 'CHK', 'QQQ']
full_df = pd.concat([amd_df, chk_df, qqq_df], axis=1).dropna()
full_df.head()

AMD CHK QQQ

Date
1999-03-10 8.690 0.904417 45.479603
1999-03-11 8.500 0.951617 45.702324
1999-03-12 8.250 0.951617 44.588720
1999-03-15 8.155 0.951617 45.880501
1999-03-16 8.500 0.951617 46.281398
DataCamp Machine Learning for Finance in Python

Calculating returns
# calculate daily returns of stocks
returns_daily = full_df.pct_change()

# resample the full dataframe to monthly timeframe

monthly_df = full_df.resample('BMS').first()

# calculate monthly returns of the stocks

returns_monthly = monthly_df.pct_change().dropna()

print(returns_monthly.tail())

AMD CHK QQQ

Date
2018-01-01 0.023299 0.002445 0.028022
2018-02-01 0.206740 -0.156098 0.059751
2018-03-01 -0.101887 -0.190751 -0.020719
2018-04-02 -0.199160 0.060714 -0.052971
2018-05-01 0.167891 0.003367 0.046749
DataCamp Machine Learning for Finance in Python

Covariances
# daily covariance of stocks (for each monthly period)
covariances = {}
for i in returns_monthly.index:
rtd_idx = returns_daily.index
# mask daily returns for each month (and year) and calculate covariance
mask = (rtd_idx.month == i.month) & (rtd_idx.year == i.year)
covariances[i] = returns_daily[mask].cov()

print(covariances[i])

AMD CHK QQQ

AMD 0.000257 0.000177 0.000068
CHK 0.000177 0.002057 0.000108
QQQ 0.000068 0.000108 0.000051
DataCamp Machine Learning for Finance in Python

Generating portfolio weights

for date in covariances.keys():
cov = covariances[date]
for single_portfolio in range(5000):
weights = np.random.random(3)
weights /= np.sum(weights)
DataCamp Machine Learning for Finance in Python

Calculating returns and volatility

portfolio_returns, portfolio_volatility, portfolio_weights = {}, {}, {}

# get portfolio performances at each month

for date in covariances.keys():
cov = covariances[date]
for single_portfolio in range(5000):
weights = np.random.random(3)
weights /= np.sum(weights)

returns = np.dot(weights, returns_monthly.loc[date])

volatility = np.sqrt(np.dot(weights.T, np.dot(cov, weights)))

portfolio_returns.setdefault(date, []).append(returns)
portfolio_volatility.setdefault(date, []).append(volatility)
portfolio_weights.setdefault(date, []).append(weights)
DataCamp Machine Learning for Finance in Python

Plotting the efficient frontier

date = sorted(covariances.keys())[-1]

# plot efficient frontier

plt.scatter(x=portfolio_volatility[date],
y=portfolio_returns[date],
alpha=0.5)
plt.xlabel('Volatility')
plt.ylabel('Returns')
plt.show()
DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Calculate MPT portfolios!

DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Sharpe ratios; features and

targets

Nathan George
Data Science Professor
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python

Getting our Sharpe ratios

# empty dictionaries for sharpe ratios and best sharpe indexes by date
sharpe_ratio, max_sharpe_idxs = {}, {}

# loop through dates and get sharpe ratio for each portfolio
for date in portfolio_returns.keys():
for i, ret in enumerate(portfolio_returns[date]):
volatility = portfolio_volatility[date][i]
sharpe_ratio.setdefault(date, []).append(ret / volatility)

# get the index of the best sharpe ratio for each date
max_sharpe_idxs[date] = np.argmax(sharpe_ratio[date])
DataCamp Machine Learning for Finance in Python

Create features
# calculate exponentially-weighted moving average of daily returns
ewma_daily = returns_daily.ewm(span=30).mean()

# resample daily returns to first business day of the month

ewma_monthly = ewma_daily.resample('BMS').first()

# shift ewma 1 month forward

ewma_monthly = ewma_monthly.shift(1).dropna()
DataCamp Machine Learning for Finance in Python

Calculate features and targets

targets, features = [], []

# create features from price history and targets as ideal portfolio

for date, ewma in ewma_monthly.iterrows():
# get the index of the best sharpe ratio
best_idx = max_sharpe_idxs[date]
targets.append(portfolio_weights[date][best_idx])
features.append(ewma)

targets = np.array(targets)
features = np.array(features)
DataCamp Machine Learning for Finance in Python

Re-plot efficient frontier

# latest date
date = sorted(covariances.keys())[-1]

cur_returns = portfolio_returns[date]
cur_volatility = portfolio_volatility[date]

plt.scatter(x=cur_volatility,
y=cur_returns,
alpha=0.1,
color='blue')

best_idx = max_sharpe_idxs[date]

plt.scatter(cur_volatility[best_idx],
cur_returns[best_idx],
marker='x',
color='orange')

plt.xlabel('Volatility')
plt.ylabel('Returns')
plt.show()
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Get Sharpe!
DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Machine learning for MPT

Nathan George
Data Science Professor
DataCamp Machine Learning for Finance in Python

Make train and test sets

# make train and test features
train_size = int(0.8 * features.shape[0])
train_features = features[:train_size]
train_targets = targets[:train_size]

test_features = features[train_size:]
test_targets = targets[train_size:]

print(features.shape)

(230, 3)
DataCamp Machine Learning for Finance in Python

Fit the model

from sklearn.ensemble import RandomForestRegressor

# fit the model and check scores on train and test

rfr = RandomForestRegressor(n_estimators=300, random_state=42)
rfr.fit(train_features, train_targets)

print(rfr.score(train_features, train_targets))
print(rfr.score(test_features, test_targets))

0.8382262317599827
0.09504859048985377
DataCamp Machine Learning for Finance in Python

Evaluate the model's performance

# get predictions from model on train and test
test_predictions = rfr.predict(test_features)

# calculate and plot returns from our RF predictions and the QQQ returns
test_returns = np.sum(returns_monthly.iloc[train_size:] * test_predictions,
axis=1)

plt.plot(test_returns, label='algo')
plt.plot(returns_monthly['QQQ'].iloc[train_size:], label='QQQ')
plt.legend()
plt.show()
DataCamp Machine Learning for Finance in Python
DataCamp Machine Learning for Finance in Python

Calculate hypothetical portfolio

cash = 1000
algo_cash = [cash]

for r in test_returns:
cash *= 1 + r
algo_cash.append(cash)

# calculate performance for QQQ

cash = 1000 # reset cash amount
qqq_cash = [cash]
for r in returns_monthly['QQQ'].iloc[train_size:]:
cash *= 1 + r
qqq_cash.append(cash)

print('algo returns:', (algo_cash[-1] - algo_cash[0]) / algo_cash[0])

print('QQQ returns:', (qqq_cash[-1] - qqq_cash[0]) / qqq_cash[0])

algo returns: 0.5009443507049591

QQQ returns: 0.5186775933696601
DataCamp Machine Learning for Finance in Python

Plot the results

plt.plot(algo_cash, label='algo')
plt.plot(qqq_cash, label='QQQ')
plt.ylabel('$')
plt.legend() # show the legend
plt.show()
DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Train your model!

DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Final thoughts

Nathan George
Data Science Professor
DataCamp Machine Learning for Finance in Python

Toy examples

Tools for bigger data:

Python 3 multiprocessing
Dask
Spark
AWS or other cloud solutions
DataCamp Machine Learning for Finance in Python

Get more and better data

Data in this course:

From Quandl.com/EOD (free subset available)

Alternative and other data:

satellite images
sentiment analysis (e.g. PsychSignal)
analyst predictions
fundamentals data
DataCamp Machine Learning for Finance in Python

MACHINE LEARNING FOR FINANCE IN PYTHON

Be careful, and Godspeed!

Machine Learning and Data Mining For Sports Analytics: Ulf Brefeld Jesse Davis Jan Van Haaren Albrecht Zimmermann
No ratings yet
Machine Learning and Data Mining For Sports Analytics: Ulf Brefeld Jesse Davis Jan Van Haaren Albrecht Zimmermann
206 pages
Entegra Integrated Flight Display System Installation Manual
No ratings yet
Entegra Integrated Flight Display System Installation Manual
142 pages
Stock Market Analysis Project
No ratings yet
Stock Market Analysis Project
23 pages
Time Series Analysis With R
No ratings yet
Time Series Analysis With R
6 pages
Read & Download (PDF Kindle)
No ratings yet
Read & Download (PDF Kindle)
5 pages
Eran15.0 Lte TDD Clock Synchronization Detection: Huawei Technologies Co., LTD
No ratings yet
Eran15.0 Lte TDD Clock Synchronization Detection: Huawei Technologies Co., LTD
33 pages
C Piscine: Abstract: This Document Is The Subject For Day03 of The C Piscine at 42
No ratings yet
C Piscine: Abstract: This Document Is The Subject For Day03 of The C Piscine at 42
15 pages
Hands-On AI: Building ML Models with Python
From Everand
Hands-On AI: Building ML Models with Python
Anand Vemula
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Pattern Recognition and Machine Learning Errata and Additional Comments
0% (1)
Pattern Recognition and Machine Learning Errata and Additional Comments
7 pages
ECON2125/8013 Maths Notes: John Stachurski March 4, 2015
No ratings yet
ECON2125/8013 Maths Notes: John Stachurski March 4, 2015
162 pages
Unsupervised Machine Learning in Python
100% (1)
Unsupervised Machine Learning in Python
89 pages
The Matplotlib User's Guide
No ratings yet
The Matplotlib User's Guide
868 pages
Slide - Python - Statistical Simulation in Python
No ratings yet
Slide - Python - Statistical Simulation in Python
107 pages
Adaptive Filtering
No ratings yet
Adaptive Filtering
10 pages
How To Calculate Precision, Recall, and F-Measure For Imbalanced Classification
No ratings yet
How To Calculate Precision, Recall, and F-Measure For Imbalanced Classification
19 pages
Vector Autoregression (VAR) - Comprehensive Guide With Examples in Python - ML
0% (1)
Vector Autoregression (VAR) - Comprehensive Guide With Examples in Python - ML
41 pages
Complete Guide To Create A Time Series Forecast (With Codes in Python) PDF
100% (4)
Complete Guide To Create A Time Series Forecast (With Codes in Python) PDF
18 pages
Machine Learning-Algorithmic Trading-Python
No ratings yet
Machine Learning-Algorithmic Trading-Python
6 pages
Python For Finance
No ratings yet
Python For Finance
289 pages
Mathematical Tools For Data Science
No ratings yet
Mathematical Tools For Data Science
9 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
H2o Training Day
No ratings yet
H2o Training Day
180 pages
Getting Started C++ API
No ratings yet
Getting Started C++ API
142 pages
Numpy-User-1 10 1
No ratings yet
Numpy-User-1 10 1
107 pages
IJERT Data Analysis Using Python
No ratings yet
IJERT Data Analysis Using Python
6 pages
XG Boost
100% (1)
XG Boost
4 pages
Practical Linear Algebra
No ratings yet
Practical Linear Algebra
253 pages
Czekanowski Index-Based Similarity As Alternative Correlation Measure in N-Asset Portfolio Analysis
No ratings yet
Czekanowski Index-Based Similarity As Alternative Correlation Measure in N-Asset Portfolio Analysis
1 page
Flask Restplus
No ratings yet
Flask Restplus
86 pages
Intro To Machine Learning With PyTorch
No ratings yet
Intro To Machine Learning With PyTorch
48 pages
Test Driven Development Simplified in 5 Steps: Pete Heard
100% (1)
Test Driven Development Simplified in 5 Steps: Pete Heard
24 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
2 pages
Data Mining Slides
No ratings yet
Data Mining Slides
43 pages
Getting Started - TensorFlow
0% (1)
Getting Started - TensorFlow
14 pages
How To Take All Math Classes You Need
100% (1)
How To Take All Math Classes You Need
5 pages
Python Machine Learning Tutorial With Scikit-Learn
No ratings yet
Python Machine Learning Tutorial With Scikit-Learn
16 pages
StockMarket Forecasting Using Hidden Markov Model A New Approach
No ratings yet
StockMarket Forecasting Using Hidden Markov Model A New Approach
5 pages
7 Time Series Datasets For Machine Learning
No ratings yet
7 Time Series Datasets For Machine Learning
8 pages
CSCI933 Machine Learning Algotithms and Applications
No ratings yet
CSCI933 Machine Learning Algotithms and Applications
19 pages
Simple Libraries in Python
No ratings yet
Simple Libraries in Python
12 pages
A Brief Introduction To Mathematica: The Very Basics
No ratings yet
A Brief Introduction To Mathematica: The Very Basics
27 pages
Complete Guide To Parameter Tuning in XGBoost (With Codes in Python) PDF
No ratings yet
Complete Guide To Parameter Tuning in XGBoost (With Codes in Python) PDF
20 pages
Building Python Real-Time Applications With Storm - Sample Chapter
No ratings yet
Building Python Real-Time Applications With Storm - Sample Chapter
18 pages
Natural Language Processing With Java - Sample Chapter
100% (1)
Natural Language Processing With Java - Sample Chapter
33 pages
Quantitative Analysis and Modelling
No ratings yet
Quantitative Analysis and Modelling
3 pages
A Practical ImplementationOfHJM
No ratings yet
A Practical ImplementationOfHJM
336 pages
Scikit Learn
No ratings yet
Scikit Learn
25 pages
[Ebooks PDF] download Python Projects for Kids 1st Edition Ingrassellino full chapters
100% (3)
[Ebooks PDF] download Python Projects for Kids 1st Edition Ingrassellino full chapters
65 pages
Portfolio Optimization Using Particle Swarm Optimization
No ratings yet
Portfolio Optimization Using Particle Swarm Optimization
6 pages
Stavely Python Ebook PDF
No ratings yet
Stavely Python Ebook PDF
260 pages
Prologue: 0.1 Books and Algorithms
No ratings yet
Prologue: 0.1 Books and Algorithms
9 pages
Stochastic Modelling & Its Applications
No ratings yet
Stochastic Modelling & Its Applications
19 pages
Regression Analysis - ISYE 6414
No ratings yet
Regression Analysis - ISYE 6414
3 pages
Study Guide For STA3701
No ratings yet
Study Guide For STA3701
325 pages
Machine Learning Resource Guide
No ratings yet
Machine Learning Resource Guide
11 pages
Role of Machine Learning in The Field of Fiber Reinforced Polymer
No ratings yet
Role of Machine Learning in The Field of Fiber Reinforced Polymer
6 pages
DeepThought FinML
No ratings yet
DeepThought FinML
124 pages
Rsa - TCR PDF
No ratings yet
Rsa - TCR PDF
89 pages
OceanofPDF - Com Python Machine Learning The Beginners Gu - Lilly Trinity
No ratings yet
OceanofPDF - Com Python Machine Learning The Beginners Gu - Lilly Trinity
115 pages
Maths in Daily Life
No ratings yet
Maths in Daily Life
5 pages
Fast Sequential Monte Carlo Methods for Counting and Optimization
From Everand
Fast Sequential Monte Carlo Methods for Counting and Optimization
Reuven Y. Rubinstein
No ratings yet
Python Unleashed: Mastering the Art of Efficient Coding
From Everand
Python Unleashed: Mastering the Art of Efficient Coding
James Livingston
No ratings yet
Relational Databases
No ratings yet
Relational Databases
88 pages
Global Superstore 2016
No ratings yet
Global Superstore 2016
6,865 pages
Bag of Words
No ratings yet
Bag of Words
32 pages
Gradient Descent Algorithm
No ratings yet
Gradient Descent Algorithm
5 pages
Air Quality UCI
No ratings yet
Air Quality UCI
540 pages
Assign
100% (1)
Assign
11 pages
Value Weight Required Rate of Return
No ratings yet
Value Weight Required Rate of Return
3 pages
Testing Class
No ratings yet
Testing Class
10 pages
Banna Leisure 111
No ratings yet
Banna Leisure 111
2 pages
Programmes Offered by Ksou: A Under Graduate Programmes - (05) Sl. No. Proogrammes Duration of The Programme
No ratings yet
Programmes Offered by Ksou: A Under Graduate Programmes - (05) Sl. No. Proogrammes Duration of The Programme
3 pages
Weighted Average Cost of Capital (WACC) - 2017 Value Weight Required Rate of Return
No ratings yet
Weighted Average Cost of Capital (WACC) - 2017 Value Weight Required Rate of Return
4 pages
Plagiarism - Report
No ratings yet
Plagiarism - Report
49 pages
Internatiional Financial Management: Unit I
No ratings yet
Internatiional Financial Management: Unit I
51 pages
All About Stock Market - Read It
No ratings yet
All About Stock Market - Read It
53 pages
TECHNOLOGY
No ratings yet
TECHNOLOGY
3 pages
1.3.1 (Regular Languages and Regular Expressions)
No ratings yet
1.3.1 (Regular Languages and Regular Expressions)
21 pages
A Comparative Study of Cybercrime Law Between The United States and The Philippines
No ratings yet
A Comparative Study of Cybercrime Law Between The United States and The Philippines
5 pages
Daily QA Report Somatom Go All
No ratings yet
Daily QA Report Somatom Go All
17 pages
(SIMPLE) My $10K - Day TikTok Ads Strategy in 2023 (Dropshipping & Ecom)
No ratings yet
(SIMPLE) My $10K - Day TikTok Ads Strategy in 2023 (Dropshipping & Ecom)
12 pages
Mitre Attack 4
No ratings yet
Mitre Attack 4
16 pages
Technogym Plus - Faq
No ratings yet
Technogym Plus - Faq
3 pages
Teseo Liv3fl
No ratings yet
Teseo Liv3fl
34 pages
CSCI5273 PS2 KiranJojare
No ratings yet
CSCI5273 PS2 KiranJojare
14 pages
My Resume
No ratings yet
My Resume
3 pages
S7-1500 ET200MP-21UKEX0008X-Iss0 NN Frei
No ratings yet
S7-1500 ET200MP-21UKEX0008X-Iss0 NN Frei
7 pages
L3 Seo
No ratings yet
L3 Seo
19 pages
Digital Business Strategy
No ratings yet
Digital Business Strategy
85 pages
Data Analyst (1) (11168)
No ratings yet
Data Analyst (1) (11168)
399 pages
NetflixOSS - A Cloud Native Architecture - Slides PDF
No ratings yet
NetflixOSS - A Cloud Native Architecture - Slides PDF
86 pages
2021 Test 2 Industrial Engineering
No ratings yet
2021 Test 2 Industrial Engineering
3 pages
Top 10 Uses of Computer in Our Daily Life
100% (7)
Top 10 Uses of Computer in Our Daily Life
5 pages
21-2M-120Ohm Cable
No ratings yet
21-2M-120Ohm Cable
2 pages
Bahasa Arab Online Exercise For 7 - Live Worksheets
No ratings yet
Bahasa Arab Online Exercise For 7 - Live Worksheets
3 pages
How To Understand A Renko Chart
No ratings yet
How To Understand A Renko Chart
18 pages
Course Information: Instructors
No ratings yet
Course Information: Instructors
7 pages
SmartRay ECCO With Matrox Design Assistant
No ratings yet
SmartRay ECCO With Matrox Design Assistant
7 pages
ML Engineer Learning Resources
No ratings yet
ML Engineer Learning Resources
9 pages
PAL & BOP SPRO Configuration
No ratings yet
PAL & BOP SPRO Configuration
9 pages
Are You Solving The Right Problems?
No ratings yet
Are You Solving The Right Problems?
25 pages
Complete Guide on Google Analytics 4
No ratings yet
Complete Guide on Google Analytics 4
24 pages
Design of Aders
No ratings yet
Design of Aders
20 pages