The Capital Asset Pricing Model
In finance, the capital asset pricing model (CAPM) was the first theory to measure systematic risk. The CAPM argues that there is a single type of risk, market risk. I derive the CAPM from the mean–variance framework of modern portfolio theory.
When I first learned about the capital asset pricing model (CAPM) (Sharpe, 1964), I thought the main result was non-obvious. To explain my initial confusion, let me first state this result. Let be the return of the market. For example, this might be approximated by the return of the S&P 500 index. Let $R_i$ be the return of asset . And let be the risk-free rate, meaning a rate of return that is achievable without any risk. No such risk-free asset truly exists, but the canonical example is a United States Treasury bill. Under some assumptions which we will discuss, we can prove the following equation:
where is the variance of the market and is the cross-covariance between the market and asset . The coefficient in Equation is often called “beta” and denoted , giving us
This is because in a simple linear regression, the linear coefficient is the ratio of the cross-covariance to the variance. (See Equation in this post for details on this claim.) I assume that in Equation gets its name and notation from the standard notation for least squares regression. An attentive reader might notice that the regression’s intercept, often denoted , is zero in Equation . We will discuss what this means later.
Now let me explain what is non-obvious to me. What Equation say is that there is a linear relationship between the expected excess rate of return of asset (excess w.r.t. the risk free rate) and the expected excess rate of return of the market. This means that if we had a bunch of historical data with which to estimate and , we could simply fit a linear regression to Equation to estimate , which would tell us whether asset is equivalent to the market (), better than the market (), or worse than the market ().
In my mind, if the above were true, it would be surprising. However, we can prove it is true with some assumptions (albeit simplistic ones). The goal of this post is to re-derive Equation from first principles, to better understand which assumptions induce this surprising result.
Mean–variance analysis
To understand the CAPM, one must understand modern portfolio theory and the efficient frontier. I will briefly review these topics here, but see those two posts for details.
Recall that modern portfolio theory posits that reward can be defined as the expected return of a portfolio, and its associated risk can be defined as the standard deviation on that return. Let be the return of portfolio , and let denote the portfolio weight for the -th asset. Then one can show that
which implies that the relationship between risk and reward is a quadratic. This quadratic relationship is sometimes called the Markowitz bullet (named after the founder of modern portfolio theory, Harry Markowitz), and it is typically diagrammed with on the -axis and on the -axis (Figure , dashed blue line). This kind of plot is sometimes called the risk-reward spectrum. Each point on the plane represents a portfolio (possibly a single-asset portfolio) with a given risk and associated reward.
Modern portfolio theory assumes that the rational investor should seek higher reward for the same risk. Since for a given portfolio variance, there are two portfolios with different expected returns, the rational investor should prefer portfolios on the top half of the Markowitz bullet (Figure , solid blue line). This region is called the efficient frontier, because any portfolio that is inside of or on the lower half of the Markowitz bullet is inefficient in that an investor could have a higher expected return for the same risk.
Finally, modern portfolio theory proves that if we can construct a portfolio with risky assets (each is a random variable) and a single risk-free asset (the non-random risk-free rate ), the efficient frontier is no longer a quadratic curve but a straight line (Figure , red line). Furthermore, any portfolio along this line can be represented as a convex combination of a portfolio containing just the risk free rate (Figure , triangle) and the tangency portfolio, which is the portfolio that sits at the intersection of the linear and quadratic efficient frontiers (Figure , circle).
In my mind, understanding the linear efficient frontier is critical to understanding the CAPM. This is so important that if it does not make sense now, I strongly suggest you read this section on the efficient frontier with a risk-free asset before proceeding.
Capital market line
The reason understanding the linear efficient frontier is so important for understanding the CAPM is because, at least in my mind, the CAPM is primarily an interpretation of this frontier. First, the CAPM argues that the tangency portfolio is really the market portfolio, or a portfolio containing all the assets in the market. (Market weight might be chosen, for example, based on volume trades of asset .) Why? Recall that the tangency portfolio is the portfolio of risky assets that maximizes the Sharpe ratio, defined as
See this section for details. The geometric interpretation of the Sharpe ratio is that it is the reward per unit of risk on the risk-return spectrum, and so the tangency portfolio is the portfolio that is maximally efficient in the sense that it you cannot get more reward for the same amount of risk. The CAPM argues that the market must be efficient, because if it were not, investors would arbitrage any mispricings and thereby make the market efficient. So all rational investors must hold this maximally efficient portfolio, which is also the tangency portfolio.
So the “tangency portfolio” is now the “market portfolio”. Second, the CAPM interprets the linear efficient frontier as the “capital market line”. I am not sure why, but I would guess that the name is meant to suggest that the capital markets “live” on that linear efficient frontier, since all investors, being rational, want a portfolio that is a combination of the maximally efficient portfolio (tangency or market portfolio) and the risk-free rate. Where investors want to be on this convex combination depends on their risk preferences. So rather than Figure , the CAPM uses Figure .
Deriving the CAPM
We are now ready to derive Equation . Consider a portfolio that holds a combination of the market portfolio and a single asset . Let be the market weight and be the single asset weight. Clearly, the expected return of this portfolio is
where is the rate of return of the market. This is just an accounting identity. The variance is a bit more tedious, but it’s just
You can verify that when , the mean and variance are just the mean and variance for asset , and when , the mean and variance are just the mean and variance for the market.
Notice that these two equations also imply a quadratic relationship. Since a portfolio with a single risky asset must be either on the efficient frontier or inside of it, and since any portfolio with just the market portfolio must be at the tangency portfolio, then Equations and imply a quadratic equation that is inside the Markowitz bullet and yet touches the tangency portfolio (Figure , black line). We can think of this curve as asset ’s idiosyncratic frontier. (This is not standard terminology, but it is how I think about it.)
Now what is the slope (Sharpe ratio) of this curve at the tangency or market portfolio? We know it must be the Sharpe ratio of the tangency portfolio. But we could also derive the slope of the idiosyncratic frontier at the tangency portfolio by representing its slope as a function of , taking its derivative, and then plugging in . Let’s do that.
Let be the function that defines the black curve on the risk-reward spectrum. Then the slope of curve is simply the first derivative, . We would like to compute this derivative but in terms of the weight . This way we can plug in to get to the slope of the black curve at the tangency or market portfolio. We can do that by expressing the mean and standard deviations as functions of . Define such functions and as
These are functions of because of Equations and . Then the function can be written as
This is the function that traces the black curve on the risk-reward spectrum, as a function of the weight . Then derivative of as a function of is
So we simply need to compute the ratio of the derivatives of and at to compute the desired slope. The derivative of is
So is not a function of . The derivative of is more tedious because of the square root, but it is straightforward. It is
What happens when we plug into ? Most of the terms cancel, and we get
So the derivative of the function that traces out the black curve, at the point where the curve touches the market portfolio, is
But we also know that the slope of linear efficient frontier (the capital market line) is the Sharpe ratio of the tangency portfolio. So it must be the case that
And this is just Equation :
And we’re done. This is why I said at the beginning that the linear relationship between individual asset’s return and the market’s return is really an artifact of the linear efficient frontier. Equations and simply fall out of this fact.
Security market line
The main graphical representation of the CAPM is the security market line (Figure ). To understand this visualization, let’s think about what happens when . When , the return of asset equals the return of the market, or . And when , the return of asset equals the risk-free rate . If we plot on the -axis and expected return on the -axis, we can draw a straight line between the two points defined above. This line is the security market line. Then we can estimate and for any asset in the market and plot the value against the security market line. An overvalued asset should be above the security market line, and an undervalued asset should be below it.
Alpha
As I mentioned in the introduction, we could estimate the expectations in Equation from historical data, and then fit a linear regression to estimate the correlation between the asset and the market, the term
Notice that the linear regression should have an intercept and an error term, i.e.:
In Equation , the intercept is and the slope is . The standard ordinary least squares assumption is that the error terms are zero mean, or . However, given the mathematical fact of Equation —and it is a mathematical fact if the assumptions are true, then it must be true that
This is another way of saying that the market is efficient. In expectation, there is no vertical offset for risky assets on the risk–reward spectrum, no risk-free reward in the market. The only way to a bigger reward is to take on a bigger risk. And captures how much reward you get per unit of risk, for a given asset .
Conclusion
In finance, the CAPM was the first theory to measure systematic risk (Grinold & Kahn, 2000). The CAPM argues that there is a single type of risk, market risk. The risk of asset is captured by the parameter , as defined by Equation , and the CAPM argues that, in expectation, there is no , no idiosyncratic reward that requires no risk. To earn a higher reward, you must expose yourself to the risks of the market. The framework is an extension of modern portfolio theory with some additional assumptions about market efficiency.
- Grinold, R. C., & Kahn, R. N. (2000). Active portfolio management.