The Greeks
In finance, the "Greeks" refer to the partial derivatives of an option pricing model with respect to its inputs. They are important for understanding how an option's price may change. I discuss the Black–Scholes Greeks in detail.
An option pricing model is a function that takes the price of the underlying asset, or spot price, and other market inputs and then outputs an option’s fair value :
An important way to understand a particular option-pricing model is to think about the output price’s sensitivity to the model’s inputs. For example, what happens to the option price when a particular parameter changes? To do this, we can take partial derivatives of with respect to the inputs. For example, arguably the most important sensitivity to understand is how the option price changes as a function of spot price or
We can visualize this sensitivity as the slope at a point along a price curve, defined by a particular option pricing model . In an instant in time, when spot is valued at , this partial derivative evaluated at is a line representing the rate of change of the option price as a function of the spot price (Figure ). Clearly, a steeper slope means the option is more sensitive to the underlying’s price.
In the options-pricing literature, these partial derivatives are called the Greeks, so-named because they are denoted with Greek letters. For example, the sensitivity in Equation is called delta ().
However, without assuming a particular option pricing model, we cannot specify delta or the other Greeks further. We could have a high-level discussion about the Greeks without defining , but any practical discussion requires that we do so. The most common choice for is the Black–Scholes equation (Black & Scholes, 1973), which gives the fair price of a European-style option under assumptions such as the stock price following an Itô process and that the stock does not pay dividends. Inputs to Black–Scholes are the spot price , the option’s strike price , the option’s time to expiry , the stock’s volatility , and the risk-free interest-rate . Thus, throughout this post, I will refer to various Greeks with the understanding that these are the Black–Scholes Greeks. However, it is important to keep in mind that the basic idea is more general: an option’s price is sensitive to the inputs of its pricing model, and this model is not necessarily Black–Scholes. One could, for example, use the Bachelier model (Bachelier, 1900) or the Black model (Black, 1976) or the Cox-Ross-Rubinstein model (Cox et al., 1979; Rendleman, 1979) to price options.
We will stick with Black–Scholes. Now let denote the price of a call option. The Black–Scholes price of a European-style call is
where is the cumulative distribution function (CDF) of the standard normal distribution:
Since is the CDF of the standard normal distribution, its derivative is the probability density function (PDF) of the standard normal distribution. I’ll denote this with or
As a preliminary, see A1 for computing this derivative when the input to is itself a function of , i.e. for computing for some function . This fact will be used repeatedly when deriving the Greeks.
The intuition for Equation is that the Black–Scholes formula is the weighted difference between the spot and strike (Figure , left). The weights account for the time-value of money and the probability of the option ending in-the-money under the risk-neutral measure. See my post on the binomial options-pricing model for a detailed yet intuitive explanation of Black–Scholes using a simpler yet analogous model. Note that we assume the stock does not pay dividends, and therefore there is no adjustment to . In a separate paper, (Merton, 1973) derived Black–Scholes with stochastic interest rates, dividend-paying stocks, and American options; and thus Black–Scholes is sometimes called Black–Scholes–Merton to give credit to Merton.
The Black–Scholes formula for a European-style put with price is
Here, we can see that the payoff is roughly flipped (Figure , right). We’ll dig into some more subtle differences later.
However, for deriving the Greeks, we will find it useful to represent a call in terms of a put via put-call parity:
Intuitively, this equation says: if you have the right to buy a stock plus the money to do so, you’re in the same financial position as if you had the right to sell the stock short plus you own the stock to sell. If you’re unconvinced, simply write down two portfolios, one for the left-hand side containing a call and a zero-coupon bond paying at time and one for the right-hand side with a put and a stock; then convince yourselves that these have the same payout when or .
Throughout this post, I’ll always derive the Black–Scholes Greek for a call option and then use put-call parity to more easily derive the Greek for a put. This approach is especially nice in light of the linearity of differentiation. Why does this work? Put-call parity is a principle of efficient markets, and thus an option pricing formula that admits no arbitrage should adhere to put-call parity. And we can convince ourselves that Black–Scholes adheres to put-call parity by deriving the Greeks for puts using both Equations and and observing that we get the same result either way. I will do this in appendices.
Now let’s dig into five of the most important Greeks: delta (), gamma (), rho (), theta (), and vega (). See (Hull, 1993) and (Cox & Rubinstein, 1985) for textbook discussions of this topic.
Delta
As we saw above, delta () captures the sensitivity, or more precisely the instantaneous rate of change, of the option price to the spot price. The Black–Scholes delta for a call option is
where and are defined in Equations and . See A2 for a key lemma that will be used throughout the derivations of the Greeks and see A3 for a derivation of .
To understand delta, let’s start with an example. Suppose we have a stock with price , a call option with a price of , and a delta between the two of . (The prices do not actually matter here, but they will in later examples.) Now imagine that we sell calls for shares of this stock, and that we want to hedge our resulting short position. We can use delta to compute the appropriate size of our hedge! We should buy shares of the underlying stock. Why? This is just a direct application of Equation . First, let’s replace the differentials (e.g. ) with small moves in the assets (e.g. ). This is a fine thing to do because Equation says that is the linear approximation of how the Black–Scholes price of our calls will change. We can write this as
Now let denote the number of shares of the stock associated with the calls we sold (here ). Then by Equation , we have
Our proposed hedge above is , and this works because if the stock price changes by , then the call moves by , and we will make on our hedge and lose on our calls. In practice, this calculation is made slightly more tedious because options are typically sold with a contract multiplier indicating the number of shares per contract, but this is just a matter of bookkeeping. Thus, while the options we sold have a delta of , our actual portfolio position has a delta of zero. We say that our portfolio is delta neutral, and this means that if spot changes a little, the value of our options position does not change significantly.
Dollar delta
In the previous example, the raw values of the stock and call did not matter, and we had no sense of our total notional exposure. Thus, in practice, mathematical delta (Equation ) is often converted to dollar delta, sometimes called notional delta, because this quantity tells us how much notional exposure we have to the underlying. Dollar delta is just delta times spot or
For example, our notional exposure in the previous example was the number of shares times dollar delta or
To hedge, we should buy of stock, which is obviously the same as saying we should buy shares of stock at .
One benefit of thinking in a notional amount is that now we can multiply dollar delta by a percent change in the stock. In contrast, since delta represents a dollar change in stock move per dollar change in option move, we cannot directly multiply mathematical delta by a relative move such as a return or percent return. To see this, consider the returns in our running example. Let denote the stock price after move . Then the return on is
while the return on to a new price is
And clearly
So it is nonsensical to multiply mathematical delta by a relative move. We must operate on it in a way that makes sense given Equation .
However, we can multiply dollar delta by a relative move since
So dollar delta is nice because it tells us both our notional exposure to the underlying and because we can multiply it by a relative move in the underlying to get a dollar change in the option.
Portfolio delta
Imagine we had a portfolio of options. We can represent the value of our portfolio, , as a weighted sum of these assets,
where the portfolio weights represent the quantity of each option, and where is the value of the -th call or put. By the linearity of differentiation, we can easily compute the delta of this portfolio:
Thus, the delta of a portfolio can be computed by adding up the deltas of the individual options.
For example, recall our short position in call options from the example above. Now imagine that we additionally have a long position in call options worth of shares of the same underlying, where the delta of each option is . Then the delta of our portfolio in shares of stock is
Thus, our portfolio has a dollar delta of , which we can make delta-neutral by buying of stock. This makes sense. Before we had to notional hedge of . Now we have a notional hedge that is roughly half that, given we have a long position in calls that behave roughly like half a stock.
Delta hedging
In our leading example, we hedged our delta from our option position, a short position in calls. Now that we understand dollar delta and portfolio delta, let’s think a little bit about a more general way to formulate the problem. Imagine we have a portfolio with a delta of . And imagine that another asset has a delta of . Then to be delta neutral, we want to buy of this asset such that
Solving for , we get
But what asset should we use here? Well, we could find an option with a delta of , but a simpler thing to do would be to buy the underlying asset, which has a delta of one (). So our hedge in shares or dollars should be proportional to , which is precisely what we see in Equation . We can easily think about this in terms of shares or dollars by multiplying both sides of Equation as desired.
Functional form
Now that we have some sense of how delta is used, let’s dig into the precise definition of the Black–Scholes delta (Equation ). This is a wonderful result, but it may not be immediately obvious what it means. I’ll discuss a common interpretation in a moment. But first, note that since is the CDF of a symmetric distribution, we can immediately guess that shape of delta is roughly a sigmoid function, which it is (Figure )! This also makes sense if we visualize the slope of the tangent line changing as we move along the Black–Scholes price curve in Figure . Clearly, delta for a call must initially be nearly zero when , must increase as approaches and passes , and finally must flatten off to nearly one when .
Notice that an at-the-money (ATM) option has a delta of slightly greater than . This is because while is symmetric with respect to its input , the variable is a logarithmic function of . Confusingly, some people will claim that the delta of an ATM option is . For example, Investopedia says
An ATM option has a delta of , positive if it is a call, negative for a put.
But this is demonstrably false. Just consider the full definition of delta for a call when :
Since is the CDF of the standard normal distribution, we can see that only when , assuming interest rates and volatility are nonzero. But in my mind, it does not make much sense to talk about the delta of an ATM option at expiry. I’ll discuss this a bit more when discussing delta’s sensitivities to other parameters. Anyway, in most realistic scenarios (positive interest rates, nonzero volatility, and nonzero time to expiry), the term is greater than zero when , and thus the Black–Scholes delta of an ATM option is slightly greater than .
A deep in-the-money (ITM) option has a delta close to one, meaning that the option behaves like spot. And a deep out-of-the-money (OTM) option has a delta close to zero, meaning that the option is insensitive to the underlying’s price changes. I think it’s worth observing that many of these points should hold true for any option pricing model. It does not make sense to have an option pricing model whose delta is zero for an ITM option, for example. So the coherency of Black–Scholes Greeks with reality is one justification for using this particular model.
As a practical matter, we often refer to an option’s delta in percentage terms, so an option with a delta of is sometimes called a “-delta option”. And of course, since a -delta option is often considered ATM, this phrase is often used to describe ATM options.
It makes sense that the delta for a call option is in the range , since it is the slope of the relationship between the spot and option (Figure , left). Using put-call parity, we can immediately derive the delta for a put:
Since ranges in , clearly is in the range . And this makes sense given the Black–Scholes price curve for a put (Figure , right). So the delta for a call and put have the exact same shape for all input parameters, but are just shifted by one. See A3 for a derivation.
Moneyness
At this point, we have enough of an understanding of delta to investigate a common interpretation of delta, namely that it captures the moneyness of an option, how likely it is that the option ends ITM.
To understand this, we need to understand and from the Black–Scholes equation. has a clean interpretation. It is the probability that the option ends ITM in a risk-neutral world. If this is not clear to you, then I would recommend interpreting by mapping it onto the appropriate term in the binomial options-pricing formula, which is discrete-time analog of Black–Scholes (see the right term in brackets in Equation in this post).
But since the moneyness interpretation is true of , it is only true of to the extent that . How are these terms different? They differ by the volatility of spot:
What this reasoning suggests is that the moneyness interpretation of delta is approximately true when volatility or time to expiry are low but is less true when volatility or time to expiry are high. When we compare and for varying volatilities, this is precisely the relationship we see (Figure ). For example, the delta and moneyness of a option are closer in value than the delta and moneyness of a option.
Delta sensitivities
As we have seen already, the precise shape of delta changes based on the other Black–Scholes inputs. Just as we can understand Black–Scholes by fixing all parameters but one, we can understand a single Greek such as delta by fixing all other parameters but one.
First, let’s look at how delta changes while holding all other parameters fixed as the time to expiry decreases (Figure ). I like to plot this with decreasing since this is means the graph can be read left-to-right as physical time moves forward. We can see that all options, regardless of moneyness, have more similar deltas when the expiration date is far enough out. Then as expiration approaches, ITM, ATM, and OTM options converge to deltas of , , and respectively for calls and , , and respectively for puts. In Figure , I have only plotted these curves for , since the graph for is identical except for a shift in the -axis.
Another way to see this is to visualize how the Black–Scholes price curve (for calls) changes as a function of time. In the left subplot of Figure , I plotted the Black–Scholes price curve for a call as the time to expiry decreases to zero. As we can see, when is large (here, years), the option’s delta is roughly over a large range of stock values. However, as the time to expiry approaches, the price curve becomes more hockey-stick shaped until the moment before expiry, at which point it is essentially a piecewise linear function with a discontinuity at the strike price. What does this tell us? Long before expiry, OTM and ITM calls have a similar delta, but these two deltas will converge to zero and one by expiry (all other values being fixed). And what happens to the delta of an ATM call? Well, its delta becomes increasingly sensitive to the precise value of the stock.
We can also think about how delta changes with respect to volatility (Figure ). When volatility decreases, OTM calls become less sensitive to the underlying, while ITM calls become more sensitive to the underlying.
This makes sense because in the absence of volatility, the world is predictable, and the delta function becomes a step function. Either the stock is worth more than the strike, in which case the option moves like a stock, or the stock is worth less than the strike, in which case the option is insensitive to the stock price. And in a world without volatility, it does not make sense to talk about the delta of ATM options, and we can see from Equation that this value is undefined. The only reason the discontinuity is not at is because of a nonzero interest rate. As with Figure , the graph for puts is identical except for a shift in the -axis.
Changing delta
Finally, it is important to remember that delta is a derivative and is therefore the slope of the tangent line to the Black–Scholes price curve at a particular spot price! When spot changes in value from to , our original estimate of delta is no longer correct. It should be a new value , representing the slope of the line tangent to the Black–Scholes price curve at (Figure ).
Thus, hedging an options position by trading the underlying requires continuous rebalancing of the hedged position, due to this continuously accumulating hedging error. One way to visualize this ever-changing error is by visualizing the gap between the linear equations and , where and are the -intercepts their respective deltas (Figure , pink region). What would increase this gap between the two points? More curvature to the Black–Scholes price curve! Thus, in order to account for this hedging error, we could add a term that accounts for the curvature of the Black–Schole’s model a particular point. To do this, we use a second-order Greek called gamma.
Gamma
The Greek gamma () is an option’s sensitivity to delta. In other words, delta is a first-order Greek, while gamma is a second-order Greek, since gamma is a second partial derivative:
The Black–Scholes gamma for a call option is
where is defined in Equation . See A4 for a derivation.
Using our running example so far, the gamma of a call with inputs , , , , and is roughly . This means that delta changes by for every change in spot. As with delta, we can replace the differentials above with moves in spot and delta to get our linear approximation of curvature:
For example, in Figure spot changed by . So our estimate of the change in delta using gamma estimated at is . We can see that the actual change in delta was . We overestimated the change in delta because the curvature at point is greater than the curvature at point .
Dollar gamma
Much like delta, gamma is often converted to a notional value, called dollar gamma. While dollar delta is easy—it is just delta times spot or Equation —dollar gamma is made complicated by the fact that there are many different definitions of it. See this Stack Exchange for an example of this confusion. I am going to use this definition:
I like Equation because, as mentioned in that Stack Exchange post, it matches the definition of dollar gamma used by Peter Carr—see around Equation in (Carr & Madan, 2001)—and because it is analogous to dollar delta. An alternative definition of dollar gamma is to halve Equation , so
In my mind, this is also a reasonable definition.
To see why these definitions make sense, let’s consider an analogy with physics. If we drive a car miles per hour for hours, we have gone miles. But what if we start at exactly miles per hour while accelerating at a constant mile per hours squared? This means that our velocity is increasing by miles per hour per hour. So after two hours, our total distance traveled is not miles but rather miles. Our change in position (distance) from baseline velocity is still just miles, but we also changed position due to acceleration. And acceleration times hours gives us our instantaneous velocity due to acceleration at each time point. If we multiply this by time again, we get the distance traveled due to these instantenous velocities. Of course, this relationship is captured in a classic equation of motion that most people learned in high school physics:
where is position, is time, is velocity, and is acceleration. My argument here is that is analogous to the dollar change in our options position, is analogous to dollar delta ( and ), and finally is analogous to dollar gamma. The former captures how much the notional value of our position has changed due to a first-order change in the underlying, while the latter captures how much the notional value of our position has changed due a second-order change in the underlying:
And if we substitute our definitions of dollar delta and dollar gamma into Equation , we get
And we can visualize this nicely. In Figure , dollar delta is the blue region representing how much our portfolio has changed due to some notional change in spot price, while dollar gamma is the pink region representing how much our portfolio has changed due to the same notional change in spot price. If we add these two regions together, we get the total notional change in our portfolio.
You might be asking: aren’t we just doing a second-order Taylor approximation? Yes! To see this formally, let denote the Black–Scholes pricing function. If we abuse notation a little bit and let and denote the first and second partial derivatives of evaluated at , the first-order Taylor approximation of around the point is
I visualize this in Figure . To me this, really justifies Carr’s definition of as dollar gamma. The is only from the Taylor expansion. If this section doesn’t make sense, see this excellent 3Blue1Brown video on the Taylor series, particularly around the mark.
Finally, yet another alternative definition of dollar gamma is to divide Equation by , so
(Note that yet another definition of dollar gamma I have seen is dividing it by ; see here.) There is a motivation for this definition, and I think it’s worth understanding. Consider dollar delta divided by :
Since dollar delta has units of dollars per return in , the units of Equation are dollars per percent. For example, if we have a notional long position in -delta calls, then we have a dollar delta of and a dollar-delta-per-percent of per . What this represents is a change in dollar delta per a change in stock. Put differently, for every change in the underlying stock price, we earn in our option position. For example, if the stock changed by , then our dollar delta would change by . Particularly if we scale these values by the number of contracts to get our portfolio’s notional position, we can see that Equation is very useful for quickly estimating our notional risk from a percent change in the underlying.
The gamma analog to Equation is thus
This tells us how much our mathematical delta (Equation ) changes a percent change in the underlying. But if we’re interested in dollar delta, then we would multiply Equation by , giving us the definition of dollar gamma in Equation . In other words, Equation captures how dollar delta changes per a percent change in the underlying. In my mind, this definition is useful from a trading perspective but is less mathematically coherent. So I prefer to refer to Equation as dollar gamma, although Equation is useful in its own way.
Gamma hedging
As with delta, we may want to hedge our gamma exposure or make our portfolio gamma neutral. This means our position would be insensitive to large price shocks. However, it may not be immediately obvious how to do this, since the underlying asset has zero gamma:
So unlike delta, we cannot simply buy some amount of the underlying to make our portfolio gamma neutral.
What we need instead is to gamma hedge with options. Imagine that our portfolio is delta neutral but has a gamma exposure of , and that an option has a gamma of . We would like to buy of the option such that
This means we want to buy
of the option. After we do this, however, our delta hedge will be wrong. That’s because we just bought of an option, which means we have new delta exposure to the underlying. Since deltas add (see Equation ), the total unhedged delta of our portfolio is
We have already hedged , and can be delta neutral if we hedge . This means we buy of the underlying (Equation ). Since we hedge our delta with spot and spot has no gamma exposure, we’re done.
Functional form
Now that we have some sense of how gamma can be used, let’s look at its Black–Scholes definition in detail.
Based on Equation , we can immediately infer that gamma take values in , should be roughly symmetric with respect to , and should decrease as , or increase. Again, I say “roughly” because while the normal distribution is symmetric with respect to its input, here the input to the PDF is not the stock price but , which is logarithmic with respect to . So much like delta, which is a slightly asymmetric sigmoid function, gamma is slightly asymmetric normal distribution (Figure ).
We can derive the gamma for a put option using put-call parity:
which matches gamma for a put if we derive it directly from Black–Scholes (see A4). So gamma is the same for puts and calls!
I think there’s a nice way to visualize this equivalence. Consider Figure . I have plotted the Black–Scholes price curve (solid black line) for calls (left) and puts (right) as well as the delta approximation at a given point (blue line). (I chose an off-center point to underscore the point being made here.) We can visualize the curvature of by thinking about the gap between the blue tangent line and . We can see that for calls and puts, this gap is the same, suggesting that the curvature is the same. It is not a proof, and the gap itself is not gamma, but might give some sense for why gamma might be the same for calls and puts.
Finally, based on the lemma in A2, we could also write Equation as
This is useful to know since sometimes different authors write gamma in different ways.
Gamma sensitivities
As with delta, it is useful to visualize gamma changing as we vary one other parameter. As we already mentioned, it should be clear that gamma is small when an option is ITM or OTM, and it is bigger when the option is ATM. We can observe the first fact in Figure (gamma decreases as increases). This makes intuitive sense, as the option will become less sensitive to moves in the underlying the more ITM the option is. This is a different way of saying that delta “levels off” as increases, as it does in Figure .
And as with delta, we can see how gamma changes as the time to expiry decreases (Figure ). We can see that the gammas for ITM and OTM options ultimately trend to zero. This makes sense, as these options are increasingly less sensitive to changes in delta. But ATM options become very sensitive to changes in delta as they near expiry. This also makes sense, as the Black–Scholes price curve becomes a hockey stick with a kink at (Figure ), and gamma is measuring this increasing curvature. In fact, gamma becomes so high close to expiry that I stopped plotting the ATM curve in Figure well before . Otherwise, the plot would be unreadable.
Lastly, we can see how gamma changes as changes (Figure ). In my mind, this result is fairly intuitive. If volatility increases, then for the same stock price, the option’s sensitivity to changes in the stock price is lower.
Rho
At this point, we have a good understanding of how an option’s price changes with respect to spot. Now let’s change tack and look at first-order Greeks with respect to other parameters. In the remainder of this post, we’ll look at theta (), rho (), and vega (). I’ll start with two easier ones, rho and theta, and leave vega—and option’s sensitivity to volatility—for last.
The Greek rho () measures an option’s sensitivity to interest rates. The Black–Scholes rhos for calls and puts are
See A5 for derivations. I placed the two definitions of above side-by-side because it is clear that they have opposite signs, and I think it’s useful to think about why this might be the case. Recall our justification for put-call parity. A call option plus a zero-coupon bond paying at time —and is thus equal to today—is equivalent to a put option plus the underlying stock. The key thing to think about here is how the bond is subject to interest rates. As interest rates increase, the present value of the strike (bond) decreases—we need less money today in order for it to be worth the same amount at time —and thus becomes relatively more expensive to . Thus, with zero interest rates, ATM puts and calls with the same inputs are worth the amount (Figure , left). With zero interest rates and with , the difference between calls and puts is equal to (Figure , middle and right). In my mind, this is why calls and puts have sensitivities to rates that are the opposite sign. As interest rates increase, the fair price of a call increases, while the fair price of a put decreases.
Now that we have some understanding for how calls and puts change with respect to interest rates, let’s look at an example of using the Greek rho. Let’s use our rolling example of an ATM call with , time to expiry of years, annualized volatility of , and an annualized interest rate of . Using these values, the price of a Black–Scholes call option is roughly (you can sanity check this number with Figure ). If we plug these values into Equation above, we get
That’s a pretty big number, but what does it mean? Well, since Black–Scholes interest rates are annualized, then this means that a move in annualized interest rates will result in a move in our call option. So rather than the fair price being , it will now be . And if this example was for a put option, the logic would be the same but the sign of rho would flip: rho for a put with the same inputs would be . This tracks with what we see in Figure .
We can sanity check these changes in option values with an example. Imagine we bought shares of the underlying stock at its current price . So we need to pay today. However, alternatively, we could buy calls for shares of stock, meaning we get exposure to shares of stock for . Thus, as a buyer of calls, we have the same potential upside as the buyer of stock but we have surplus dollars that we can put into zero-coupon bonds. At annualized interest, we would earn in one year. If interest rates increased by , then we would earn more dollars. So to a first approximation, the value of the calls going up seems reasonable.
Functional form
Finally, let’s look at the functional form of rho with respect to spot price (Figure ). We can see that rho for a call looks a lot like delta (Figure ), which makes sense since rho for a call is the CDF of standard normal , scaled by . And the graph for a put would be similar but shifted along the -axis.
Given this similarity to delta, it seems straightforward to quickly approximate rho for an ATM option. For example, imagine that an option expires in one week or , that and that . Then , which means that —think about the first-order Taylor approximation if this does not make sense—and so . So should be close to unity. The true value is approximately . This value is significantly smaller than our previous example (Equation ) due a much smaller time to expiry.
Rho hedging
To my knowledge, rates hedging is less widely discussed in the derivatives-pricing literature. I suspect this is because are interest rates are relatively constant and because changes in rates are predictable in the sense that they happen after central bank announcements. Thus, an options dealer simply needs to re-hedge their rates exposure around these events. Furthermore, one can hedge rates without options via fixed-income assets such as bonds.
As with gamma, we cannot hedge our rates exposure with the underlying asset, which has zero rho. But we need to find an asset with rates exposure such that
We could hedge our rates exposure with an option, but another idea would be to use a fixed-income asset. This underscores an important idea, which is that we can think of other types of assets such as futures or bonds as having Greeks as well. Here, we can think of the rho of a bond as a linear approximation of how the bond price changes per a change in rates.
Theta
The Greek theta () captures the sensitivity of the option price to the passage of time. The Black–Scholes theta for a call option is
See A6 for a derivation. This Greek is often referred to as time decay because an option’s value typically (but not always!) decreases over time if all other variables are held constant. Why? My intuition is this: imagine that an option is fairly priced today but then time passes without any change to the interest rate, the underlying, and so forth. Then the option is worth less because the optionality, which was priced in, has decreased. For example, a call option is a bet on the stock going up, and now the stock has less time for the stock to go up.
One easy way to verify this is to simply plot the Black–Scholes price of an option as approaches (Figure ). We can see that the value of calls is decreasing for both OTM and ITM options, but it decays to zero for OTM options and to nonzero values for ITM options. To be clear, theta is measuring how fast each option’s value is decaying, so options that start at a higher value but still end in zero must have a faster time decay.
As before, let’s start with an example. Imagine an ATM call option has a spot and strike price of , has an implied volatility of , and expires in a quarter of the year (). And imagine that the annual interest rate is . Then the option’s theta is
How do we interpret this number? As with the other Greeks, we can multiply it by a change in time to get a linear approximation of the change in our option’s value. For example, to approximate how our option has changed in value after one week has passed, we multiply theta by a change in time (annualized):
So all else being constant, this call option’s value will decay by per week. Note that we can convert theta to a daily value by simply dividing by the number of days. So here, the daily theta is
Now we can simply multiply this quantity by a number of days to get how our option’s value decays in notional terms over that period.
Negative sign
Before looking at the functional form, let’s ask a more basic question. Why does theta have a negative sign in Equation ? In this post so far, we have denoted the time to expiry as , and we imagine that is decreasing. But we typically think of time as increasing. So we need a negative sign to capture this inversion between the derivative and our model of time. If you’re unconvinced, you could write the time to expiry as
where is a fixed constant, and is physical time, which is now increasing. So before, was the time to expiry but decreased, and now is the time to expiry and decreases as increases. To compute theta here, we take partial derivatives with respect to , and the chain rule gives us the negative sign, e.g.
So the negative sign is a convention due to how we think about time, and it is coherent across different ways of modeling the problem.
Functional form
Now let’s dig into the Black–Scholes functional form of theta.
Broadly, we might expect theta to be often negative, but at least for me, Equation is a bit more complicated to intuitive than the other Greeks. So let’s just plot theta as a function of the stock price and see what we get (Figure ). What we find is that theta for a call is non-positive for OTM, ATM, and ITM options, with a small negative peak around the strike. And I think this makes sense. A deep OTM option is worthless and must decay to zero at expiry; therefore, its time decay is close to zero. An ITM call has a nonzero value at expiry, and thus will have a smaller time decay. Finally, an ATM call is worth somewhere between an OTM and ITM call today, but its value at expiry is zero. Thus, an ATM call has the fastest time decay. As with the other Greeks, the peak is not precisely when . We can also see this in Figure .
As before, we can use put-call parity to derive theta for a put:
Again, see A6 for a derivation. At a high level, the key idea is that the partial derivative with respect to results in a nonzero interest-rate term:
As we can see, theta for a put is equivalent to theta for a call, plus some non-negative offset assuming non-negative interest rates. In fact, this observation is why I put this section on theta after the section on rho. Equation does not make intuitive sense unless we first appreciate how calls and puts behave as interest rates change. As interest rates go up, calls become more expensive, and puts become cheaper. And if an option is worth more (less) today but has the same value at expiry, then it must have a faster (slower) time decay.
For calls with non-negative interest rates, theta is non-positive across OTM, ATM, and ITM options (Figure ). However, for puts with non-negative interest rates, theta is negative when the option is ATM or OTM, but it is positive when the option is ITM (Figure ). This means that for an ITM put, it’s value is increasing with time! This may be surprising but it is true (Figure , right). Why? One way to think about it is that present value of the eventual payoff for a put is
So as the time to expiry decreases, the term increases. So an ITM put is increasing in value with the passage of time. We see the opposite effect with calls. This effect only occurs when . When , then a deep ITM put is simply worth across time (Figure , left).
Theta hedging
While is a random variable—and the other Black–Scholes parameters are also often modeled as random variables rather than market-specified constants—time to expiry is non-random. Therefore, while it’s important to understand theta conceptually, it is not a Greek that is typically hedged. We could hedge our time exposure, of course, by finding an asset with time decay such that
However, this does not offer any utility, as our position is not subject to unpredictable “time shocks”.
Vega
Finally, the Greek vega () measures an option’s sensitivity to the standard deviation of the underlying asset, or the underlying’s volatility:
We can easily see via put-call parity that vega will be the same for calls and puts. See A7 for a derivation. As an aside, “vega” is not the name of a Greek letter, and I’m unsure of the etymology. See this Stack Exchange for a discussion.
As a caveat, Black–Scholes assumes that the volatility of the underlying stock is constant. So it is a bit conceptually inconsistent to use the Black–Scholes vega since the entire derivation of Black–Scholes assumes constant volatility. However, in practice, this is done as a useful first approximation. Since the volatility of the stock is unobservable, is often referred to as the implied volatility, since we can observe only the market-implied volatility from the price of options.
Let’s start with an example. Again, we’ll use values , , years, and . With these values, the Black–Scholes vega of either a call or a put is
So if the annualized implied volatility increases from to , vega approximates the option will change by . The actual price change using Black–Scholes is .
Dollar vega
We multiplied delta by spot to get our notional exposure to spot. And we multiplied gamma by spot squared since it is the second-partial derivative with respect to spot. Using similar logic, we can compute a dollar vega by multiplying our vega by an implied volatility :
Here, is he market-implied volatility using Black–Scholes. Alternatively, we could think about a percentage vega or
This represents our notional vega per a move in annualized implied volatility. This is a more practical number than raw vega. For example, in our example above, the percent vega would be . This is interpretable since it says how much our position changes in dollar terms per a change in implied volatility. And since it is in notional terms, we can multiply it by shares to the percent vega in terms of our portfolio.
Vega hedging
As with gamma, rho, and theta, a position in spot has zero vega exposure, since
What this means is that if our portfolio position has vega , we cannot neutralize the risk by taking a position in the underlying. Again, we need to find an asset with vega and achieve vega neutrality by taking a position in the asset such that
Again, we could use a options, or any other asset with appropriately matched vega, to construct a portfolio which is vega neutral. My understanding is that an options market maker will keep their book vega neutral by simply selling vega to other clients via selling particular options.
As with gamma hedging, however, vega hedging might re-introduce exposures that will need to be re-hedged. This should make intuitive sense. First, we need to find a linear combination of options such that our portfolio’s gamma and vega exposures are both zero. Then we can zero out our delta, since doing so does not effect the other Greeks. See section on vega in (Hull, 1993) for more details.
Functional form
The functional form of vega is, like gamma, a scaled PDF of the normal distribution (Figure ). Intuitively, an option’s sensitivity to changes in implied volatility should decrease as either the spot price or time to expiry decrease. It also makes sense that vega should peak around the ATM point, when . Why? Imagine an option is deep OTM. Even if the stock has a very high realized volatility—imagine a meme stock that whose price is whipping up and down while still remaining far below the strike—the fair price of the option does not need to change. Thus, vega should be small for deep OTM or deep ITM options.
As with gamma, the lemma in A2 allows us to write vega in terms of the strike price today, , rather than in terms of the spot price :
This equation is nice because it has a clean interpretation. Recall that is the probability that an option ends ITM under the risk-neutral measure. And is the present value of the strike price (at time ) or perhaps the fair price of a zero-coupon bond. So we can see that vega is like an expectation! The option either ends ITM and has a fair price of with probability or ends OTM and is worthless with probability , or
I have not read anyone write about vega this way, but in my mind, it seems like one view of vega is a kind expected value of the option.
Finally, note that given Equations and , we can easily express gamma in terms of vega or vice versa:
So if we are an option’s trader and know our gamma risk and volatility, you can do some quick math to estimate vega.
Vega sensitivities
We can see from Equation that vega increases as either the spot price or time to expiry increase. This is pretty intuitive. For example, if an option expires in one year and the implied volatility increases, there is a larger range in final outcomes for the option than if it expired in one day (Figure , left).
When implied volatility decreases, vega for a particular spot price will decrease (Figure , right). This effect is a bit subtle, and I’ve drawn the Figure slightly differently to highlight this. And of course, it should be clear that vega is highest when an option is roughly ATM. As with other Greeks, this makes intuitive sense, since this is when the option’s value could go either way. In this case, the option is particularly sensitive to its inputs.
Unifying view
As we have seen, the Greeks are complex topic. It can be tricky to reason about how a high-dimensional, nonlinear price surface changes as its various inputs change. So in conclusion, let me try to unify this into a coherent story.
First, a mathematical story. Recall that Taylor’s theorem for univariate functions can be generalized to higher-dimensional functions. If we view our derivatives-pricing function as one such multidimensional function—rather than as a univariate function of just spot—, then the Greeks are the partial derivatives in the multidimensional Taylor expansion! Using the Greeks in this post, we could write the Taylor expansion as
As we add higher-order derivatives for a particular input, our approximation along that dimension will get better provided higher-order derivatives are nonzero. And as the various inputs change, this multidimensional function representing the fair price of the option changes. If we want to hedge a particular sensitivity or Greek, then we are constantly rebalancing our portfolio to re-neutralize our position. So this is one coherent view of the Greeks.
Second, a financial story. One view of Equation is that is really the profit and loss (P&L) from our option position. This means that each term in the approximation is a separate P&L: we have a delta P&L, a rho P&L, a vega P&L, and so forth. And since we can hedge individual or combinations of Greeks, we could use a portfolio of options to trade one of the inputs to Black–Scholes! We would hedge out all the Greeks except one, and when that input moved, our position in options would change in value, capturing the relevant P&L.
But why would you do this? First, most of the inputs to Black–Scholes can either be traded directly, such as delta via stocks or interest rates via fixed-income products like bonds, or they cannot be traded, such as time. However, there is one input to Black–Scholes that is unobservable, difficult to trade, and impacts our total P&L: implied volatility! And vega P&L is the key. We can construct a portfolio that has a vega exposure, allowing us to convert changes in implied volatility into P&L in our options. This is why volatility trading is so intimately tied to options trading (Carr & Madan, 2001).
But there’s a second problem with this approach. Imagine that we try to trade volatility by buying a basket of options that has limited exposure to all Greeks but vega. And now imagine that the stock price is extremely volatile. Do we make money? The answer, as we can see in Figure , is that it depends. It depends because vega is only nonzero for options within a certain moneyness range. Far OTM and far ITM options have effectively zero vega, and therefore even though the stock price might be volatile, we do not necessarily capture a meaningful change in option prices if vega is negligible. So options via the Greeks allow us to trade the inputs to our derivative-pricing model, but they do not necessarily let us trade those inputs cleanly.
A natural evolution in this line of thinking is to construct derivatives that have constant exposure to vega. This is the essential idea of a variance swap, which is a financial contract designed to have exposure to pure volatility (Demeterfi et al., 1999). And if we want to trade variance swaps, it makes sense to want an index that tracks the fair price of such a swap. This is the idea behind the VIX (Whaley, 1993; Whaley, 2009), which is essentially the fair price of a volatility swap.
And so we can trace a thread from an old idea to a new one. Trading options is at least several hundred years old (Haug & Taleb, 2011), while trading volatility is a relatively modern idea, one which arguably started with the observation that options imply a volatility (Latane & Rendleman, 1976). And at the center of these two seemingly distinct ideas is the Greeks.
Appendix
A1. Deriving
Let denote the CDF of a random variable with PDF . We can write in terms of :
Using the fundamental theorem of calculus, it is easy to see that the CDF must be the anti-derivative of the PDF:
But I was curious: what is the derivative of when is a function of another function ? Consider this case:
To compute its derivative, we can decompose the integral into two parts:
for some constant . The derivative is thus
Note that the left integral is zero because the term is constant with respect to . So we have
Let be a constant. We can then invoke Leibniz integral rule to compute
But the middle and right terms are both zero because their respective derivatives are zero, since each function is constant with respect to . This gives us
Mapping this onto Black–Scholes, we can see that we can take partial derivatives such as
The reasoning above holds in general, i.e. for other partial derivatives.
A2. Risk neutrality
Under the risk-neutral measure, the expected following equality must hold:
Intuitively, this equation states that the stock price is equal to the strike price, while adjusting for factors such as volatility, interest rates, and the probability of the option ending in-the-money. We want to prove that this equation holds under Black–Scholes, as we will use this result repeatedly.
Observe that by the definition of , the PDF of the standard normal distribution, we have
And by the definition of , note that
Putting this together, we get
or that
as desired.
Intuitively, this means that the Black–Scholes model adheres to the notional of risk-neutrality or that it satisfies the no-arbitrage condition.
As a final note, observe that since is the PDF of symmetric distribution, we know that . So we can write this lemma as do
If you would like, you can convince yourself that this is true by plugging in rather than in Equation . You’ll see this symmetry is enforced by the square.
A3. Deriving delta
From A1, we know we need to compute the partial derivative of with respect to . This is
We can then use the definition to compute
Now let us complete the derivation:
Now applying the lemma in A2 to Equation , we get:
as desired.
The derivation for delta for a put is essentially the same:
Applying Equation and the lemma in A2, we get
Since the normal distribution is symmetric, we can write this as
as desired.
A4. Deriving gamma
After deriving delta, deriving gamma is easy. It is
where we have used the fact that
from the derivation of delta. We can see from the definition of delta for a put (Equation ) that gamma for a put should be the same as gamma for a call:
Alternatively, we could use put-call parity to derive Equation .
A5. Deriving rho
Using the chain and product rules, we have
Terms cancel in step because of the lemma in A2 and since
The derivation for a put is nearly identical:
Alternatively, we could use put-call parity:
In the last step, we have used the symmetry of the normal distribution:
A6. Deriving theta
The derivation of theta is a bit tedious since we will need to use the product rule. So let us break it into parts. First, observe that:
And for brevity, let’s define a function as
which implies that
Finally, observe
We can now put these observations together to derive :
In step , we use the lemma in A2. Negate the result above to get Equation .
Now let’s derive theta for a put from the Equation . As before, let’s address the term with the product rule separately. We will see that it is:
Putting this all together, we get
Step is the subtle one. On this step, we both apply the lemma in A2 and use the fact that the standard normal distribution is symmetric and thus .
If we negate this and align terms with , we get
We can verify that we get the same result if we derive theta for a put using put-call parity:
Since theta for a put is negative as well, we have
In the last step, we used the symmetry of the normal distribution (Equation )
A7. Deriving vega
Observe
Much like the delta derivation, we can use the lemma in A2 to ge
The derivation for a put is essentially the same. We just use the fact that since the normal distribution is symmetric:
So . And we can easily see that we would get the same result using put-call parity.
- Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654.
- Bachelier, L. (1900). Théorie de la spéculation. Annales Scientifiques De l’École Normale Supérieure, 17, 21–86.
- Black, F. (1976). The pricing of commodity contracts. Journal of Financial Economics, 3(1-2), 167–179.
- Cox, J. C., Ross, S. A., & Rubinstein, M. (1979). Option pricing: A simplified approach. Journal of Financial Economics, 7(3), 229–263.
- Rendleman, R. J. (1979). Two-state option pricing. The Journal of Finance, 34(5), 1093–1110.
- Merton, R. C. (1973). Theory of rational option pricing. The Bell Journal of Economics and Management Science, 141–183.
- Hull, J. (1993). Options, futures, and other derivative securities (Vol. 7). Prentice Hall Englewood Cliffs, NJ.
- Cox, J. C., & Rubinstein, M. (1985). Options markets. Prentice Hall.
- Carr, P., & Madan, D. (2001). Towards a theory of volatility trading. Option Pricing, Interest Rates and Risk Management, Handbooks in Mathematical Finance, 22(7), 458–476.
- Demeterfi, K., Derman, E., Kamal, M., & Zou, J. (1999). A guide to volatility and variance swaps. The Journal of Derivatives, 6(4), 9–32.
- Whaley, R. E. (1993). Derivatives on market volatility. The Journal of Derivatives, 1(1), 71–84.
- Whaley, R. E. (2009). Understanding the VIX. The Journal of Portfolio Management, 35(3), 98–105.
- Haug, E. G., & Taleb, N. N. (2011). Option traders use (very) sophisticated heuristics, never the Black–Scholes–Merton formula. Journal of Economic Behavior & Organization, 77(2), 97–106.
- Latane, H. A., & Rendleman, R. J. (1976). Standard deviations of stock price ratios implied in option prices. The Journal of Finance, 31(2), 369–381.