There are a lot of interesting applications of convex optimization; in this post I’ll explore an application of convex optimization in finance. I’ll walk through using convex optimization to allocate a stock portfolio so that it maximizes return for a given risk level. We’ll use real data for a mock portfolio, and solve the problem using Python. All of the code can be found on GitHub – the code shown here is from portfolio_opt.py and uses code in stocks.py, which pulls stock data from Yahoo Finance.
Motivation
Let’s say you want to invest some money in the stock market. You choose a set of stocks and have a sum of money to invest. How should you distribute the money into the different stocks? There is a general tradeoff between risk and return; with higher potential return we often face higher risk. If we have a goal return in mind, then we should choose the portfolio allocation that minimizes the risk for that return. How can we do this?
Borrowing ideas from modern portfolio theory, we can view the return of each stock as a random variable, and estimate the variable’s parameters – namely the mean return and covariance – with past data. Then we solve an optimization problem to find the combination of stocks that maximizes expected return for a given risk level.
We’ll choose different assets, viewing the portfolio as a vector Each will represent the percentage of our budget invested in asset . As a running example, we’ll have different stocks, identified by their ticker symbols:
symbols = ['GOOG', 'AIMC', 'GS', 'BH', 'TM', 'F', 'HLS', 'DIS', 'LUV', 'MSFT']
So for instance, would mean that we invested 14% of our budget in Google. A naive allocation could be investing equal amounts (10%) into each stock, so that . Our goal is to do better, and choose the best possible .
Looking to the Past
First we need an estimate of the expected returns and covariance for the portfolio, which will be used in our optimization. A simple way of estimating a stock’s expected return is to look to its past performance; we’ll use average yearly return from the past four years. Four is somewhat arbitrary, but the emphasis here is illustrating the optimization approach rather than the estimation of a stock’s return. The average historical return will be an dimensional vector , where is the average return of asset i.
Using avg_return()
from stocks.py, we have:
start = '1/1/2010' end = '1/1/2014' # average yearly return for each stock r_avg = map(lambda s: stocks.avg_return(s, start, end, 'y'), symbols)
Similarly, we can find the portfolio’s covariance using past data; the covariance of asset returns is . Using cov_matrix()
from stocks.py:
# covariance of asset returns sigma = numpy.array(stocks.cov_matrix(symbols, start, end, 'y'))
The last parameter is our goal return threshold, :
# minimum expected return threshold r_min = 0.10
With these quantities in mind, we can now formulate a convex optimization problem to find the optimal portfolio allocation; that is, the portfolio that achieves the lowest amount of risk while meeting our return goal .
The problem
We can use the quantity as a measure of risk for a given portfolio allocation with covariance . Our objective is to minimize . This objective function is a convex function, meaning that we’re able to formulate a convex optimization problem, specifically a quadratic program (QP), to find its minimum. To start out, we have the problem:
minimize
We can build in our goal return as a constraint . Since we want each to be a percentage (of the budget), we can also add the constraints and . These are simple linear constraints, maintaining convexity of the new problem:
minimize
subject to
Solving with Python
Now it’s time to translate the math into code. In order to setup and solve the problem in Python, we’ll use the CVXOPT library. CVXOPT allows us to solve a convex optimization problem as long as we can put it into the proper form. First, we convert the covariance and average return arrays into CVXOPT matrices:
r_avg = matrix(r_avg) sigma = matrix(sigma) # that was easy
Since the portfolio allocation problem is a quadratic program, we need to put our problem into the form:
minimize
subject to
In our case, P = sigma, and q = 0:
P = sigma q = matrix(numpy.zeros((n, 1)))
The inequality constraints and are captured using :
# inequality constraints Gx <= h # captures the constraints (avg_ret'x >= r_min) and (x >= 0) G = matrix(numpy.concatenate(( -numpy.transpose(numpy.array(avg_ret)), -numpy.identity(n)), 0)) h = matrix(numpy.concatenate(( -numpy.ones((1,1))*r_min, numpy.zeros((n,1))), 0))
And the equality constraint is captured using :
# equality constraint Ax = b; captures the constraint sum(x) == 1 A = matrix(1.0, (1,n)) b = matrix(1.0)
We’re ready to solve! Throw it into the solver and brace yourself for optimality:
sol = solvers.qp(P, q, G, h, A, b)
The Results and the Expansions
Here’s the result:
Optimal solution found. [0.0, 0.0, 0.0, 0.7, 0.16, 0.0, 0.0, 0.0, 0.0, 0.13]
Surprisingly, we should only invest in the 3 of the stocks! Keep in mind that the model that we’ve used here contains many simplifying assumptions; the emphasis here is outlining the approach to casting the problem as a convex optimization problem using real stock data. The great thing is that we can easily change the estimation of expected return, use a different objective function, or introduce new constraints that better reflect our goals, and more generally, the real-world.
Credits
This post was originally inspired by content from Stephen Boyd’s great book Convex Optimization. Boyd is also teaching an ongoing online-course called CVX 101 if you are interested in learning more about convex optimization.