Arbitrage pricing theory
Автор: Tarasova O.
Журнал: Экономика и социум @ekonomika-socium
Статья в выпуске: 4-1 (13), 2014 года.
Бесплатный доступ
Arbitrage, asset pricing model, factor model6 econometrics, apt, assets, arbitrage pricing theory, mean variance efficiency, specification of factors
Короткий адрес: https://sciup.org/140124529
IDR: 140124529
Текст статьи Arbitrage pricing theory
The Arbitrage Pricing Theory (APT) was developed primarily by Ross (1976a, 1976b). It is a one-period model in which every investor believes that the stochastic properties of returns of capital assets are consistent with a factor structure. Ross argues that if equilibrium prices offer no arbitrage opportunities over static portfolios of the assets, then the expected returns on the assets are approximately linearly related to the factor loadings. (The factor loadings, or betas, are proportional to the returns’ covariances with the factors.)
Ross’ (1976a) heuristic argument for the theory is based on the preclusion of arbitrage. Ross’ formal proof shows that the linear pricing relation is a necessary condition for equilibrium in a market where agents maximize certain types of utility. The subsequent work, which is surveyed below, derives either from the assumption of the preclusion of arbitrage or the equilibrium of utilitymaximization. A linear relation between the expected returns and the betas is tantamount to an identification of the stochastic discount factor (SDF).
The APT is a substitute for the Capital Asset Pricing Model (CAPM) in that both assert a linear relation between assets’ expected returns and their covariance with other random variables. (In the CAPM, the covariance is with the market portfolio’s return.) The covariance is interpreted as a measure of risk that investors cannot avoid by diversification. The slope coefficient in the linear relation between the expected returns and the covariance is interpreted as a risk premium. Such a relation is closely tied to mean-variance efficiency.
An empirical test of the APT entails a procedure to identify at least some features of the underlying factor structure. Merely stating that some collection of portfolios (or even a single portfolio) is mean-variance efficient relative to the mean-variance frontier spanned by the existing assets does not constitute a test of the APT, because one can always find a mean-variance efficient portfolio. Consequently, as a test of the APT it is not sufficient to merely show that a set of factor portfolios satisfies the linear relation between the expected return and its covariance with the factors portfolios. Unfortunately, the APT does not necessarily preclude arbitrage opportunities over dynamic portfolios of the existing assets. Therefore, the applications of the APT in the evaluation of managed portfolios are contradictory at least in spirit to the APT, which obtains price restrictions by assuming the absence of arbitrage.
A Formal Statement
The APT assumes that investors believe that the n × 1 vector, r, of the single- period random returns on capital assets satisfies the factor model r = μ + βf + e where e is an n × 1 vector of random variables, f is a k × 1 vector of random variables (factors), μ is an n × 1 vector and β is an n × k matrix. With no loss of generality, normalize (1) to make E[f] = 0 and E[e] = 0, where E[·] denotes expectation and 0 denotes the matrix of zeros with the required dimension. The
The mathematical proof of the APT requires restrictions on □ and the covariance matrix □ = E[ee^]. An additional customary assumption is that E[e|f] = 0, but this assumption is not necessary in some of the APT's developments.
The number of assets, n, is assumed to be much larger than the number of factors, k. In some models, n is infinity or approaches infinity. In this case, representation (1) applies to a sequence of capital markets; the first n assets in the (n + 1)st market are the same as the assets in the nth market and the first n rows of the matrix β in the (n + 1)st market constitute the matrix β in the nth market.
The APT asserts the existence of a constant a such that, for each n, the inequality
(μ - Xλ)Z-1(μ - Xλ) ≤ a holds for a (k+1)×1 vector λ, and an n×n positive definite matrix Z. Here, X = (ı, β), in which ı is an n × 1 vector of ones. Let λ0 be the first component of λ and λ1 consists of the rest of the components. If some portfolio of the assets is risk-free, then □о is the return on the risk-free portfolio. The positive definite matrix Z is often the covariance matrix E[ee’]. Exact arbitrage pricing obtains if (2) is replaced by
μ=Xλ=ıλ0+βλ1 .
The vector λ1 is referred to as the risk premium, and the matrix β is referred to as the beta or loading on factor risk. The interpretation of (2) is that each component of μ depends approximately linearly on the corresponding row of β. This linear relation is the same across assets.
The approximation is better, the smaller the constant a; if a = 0, the linear relation is exact and (3) obtains.
Intuition
The intuition behind the model draws from the intuition behind Arrow-Debreu security pricing. A set of k fundamental securities spans all possible future states of nature in an Arrow-Debreu model. Each asset’s payoff can be described as the payoff on a portfolio of the fundamental k assets. In other words, an asset’s payoff is a weighted average of the fundamental assets’ payoffs. If market clearing prices allow no arbitrage opportunities, then the current price of each asset must equal the weighted average of the current prices of the fundamental assets.
The Arrow-Debreu intuition can be couched in terms of returns and expected re- turns rather than payoffs and prices. If the unexpected part of each asset’s return is a linear combination of the unexpected parts of the returns on the k fundamental securities, then the expected return of each asset is the same linear combination of the expected returns on the k fundamental assets.
To see how the Arrow-Debreu intuition leads from the factor structure (1) to exact arbitrage pricing (3), set the idiosyncratic term e on the right-hand side of (1) equal to zero. Translate the k factors on the right-hand side of (1) into the k fundamental securities in the Arrow-Debreu model. Then (3) follows immediately.
The presence of the idiosyncratic term e in the factor structure (1) makes the model more general and realistic. It also makes the relation between (1) and (3) more tenuous. Indeed, ‘no arbitrage’ arguments typically prove the weaker (2). Moreover, they require a weaker definition of arbitrage (and therefore a stronger definition of no arbitrage) in order to get from (1) to (2).
The proofs of (2) augment the Arrow-Debreu intuition with a version of the law of large numbers. That law is used to argue that the average effect of the idiosyncratic terms is negligible. In this argument, the independence among the components of e is used. Indeed, the more one assumes about the (absence of) contemporaneous correlations among the component of e, the tighter the bound on the deviation from exact APT.
Mean-Variance Efficiency
The APT was developed as a generalization of the CAPM, which asserts that the expectations of assets’ returns are linearly related to their covariances (or betas, which in turn are proportional to the covariances) with the market portfolio’s return. Equivalently, the CAPM says that the market portfolio is meanvariance efficient in the investment universe containing all possible assets. If the factors can be identified with traded assets then exact arbitrage pricing says that a portfolio of these factors is mean-variance efficient in the investment universe consisting of the assets r.
Huberman and Kandel (1985b), Jobson and Korkie (1982, 1985) and Jobson (1982) note the relation between the APT and mean-variance efficiency. They propose likelihood- ratio tests of the joint hypothesis that a given set of random variables are factors in model (1) and that exact arbitrage pricing (3) obtains. Kan and Zhou (2001) point out a crucial typographical error in Huberman and Kandel (1985b). Peñaranda and Sentana (2004) study the close relation between the Huberman and Kandel’s spanning approach and the celebrated volatility bounds in Hansen and Jagannathan (1991).
Even when the factors are not traded assets, (3) is a statement about meanvariance efficiency: Grinblatt and Titman (1987) assume that the factor structure (1) holds and that a risk-free asset is available. They identify k traded assets such that a portfolio of them is mean-variance efficient if and only if (3) holds. Huberman, Kandel and Stambaugh (1986) extend the work of Grinblatt and Titman by characterizing the sets of k traded assets with that property and show that these assets can be described as portfolios if and only if the global minimum variance portfolio has nonzero systematic risk. To find these sets of assets, one must know the matrices UUU and E[ee’]. If the latter matrix is diagonal, factor analysis produces an estimate of it, as well as an estimate of UUU.
The interpretation of (3) as a statement about mean-variance efficiency contributes to the debate about the testability of the APT. (Shanken (1982, 1985) and Dybvig and Ross (1985), however, discuss the APT’s testability without mentioning that (3) is a statement about mean-variance efficiency.) The theory’s silence about the factors’ identities renders any test of the APT a joint test of the pricing relation and the correctness of the factors. As a mean-variance efficient portfolio always exists, one can always find ‘factors’ with respect to which (3) holds. In fact, any single portfolio on the frontier can serve as a ‘factor.’
Thus, finding portfolios which are mean-variance efficient — or failure to find them — neither supports nor contradicts the APT. It is the factor structure (1) which, combined with (3), provides refutable hypotheses about assets’ returns. The factor structure (1) imposes restrictions which, combined with (3), provide refutable hypotheses about assets’ returns. The factor structure suggests looking for factors with two properties: (i) their time series movements explain a substantial fraction of the time series movements of the returns on the priced assets, and (ii) the unexplained parts of the time series movements of the returns on the priced assets are approximately uncorrelated across the priced assets.
Specification of Factors
The tests outlined above are joint tests that the matrix β is correctly estimated and that exact arbitrage pricing holds. Estimation of the factor loading matrix β entails at least an implicit identification of the factors. The three approaches listed below have been used to identify factors.
The first consists of an algorithmic analysis of the estimated covariance matrix of asset returns. For instance, Roll and Ross (1980), Chen (1983) and Lehman and Modest (1985a, 1985b) use factor analysis and Chamberlain and Rothschild (1983) and Connor and Korajczyk (1985, 1986) recommend using principal component analysis.
The second approach is one in which a researcher starts at the estimated covariance matrix of asset returns and uses his judgment to choose factors and subsequently estimate the matrix β. Huberman and Kandel (1985a) note that the correlations of stock returns of firms of different sizes increases with a similarity in size. Therefore, they choose an index of small firms, one of medium size firms and one of large firms to serve as factors. In a similar vein, Fama and French (1993) use the spread between the stock returns of small and large firms as one of their factors. Echoing the findings of Rosenberg, Reid, and Lanstein (1984), Chan, Hamao and Lakonishok (1991) and Fama and French (1992) observe that expected stock returns and their correlations are also related to the ratio of book-to-market equity. Based on these observations, Fama and French (1993) add the spread between stock returns of value and growth firms as another factor.
The third approach is purely judgmental in that it is one in which the researcher primarily uses his intuition to pick factors and then estimates the factor loadings and checks if they explain the cross-sectional variations in estimated expected returns (i.e., he checks (3)). Chan, Chen and Hsieh (1985) and Chen, Roll and Ross (1986) select financial and macroeconomic variables to serve as factors. They include the following variables: the return on an equity index, the spread of short and long-term interest rates, a measure of the private sector’s default premium, the inflation rate, the growth rates of industrial production and the aggregate consumption. Based on economic intuition, researchers continue to add new factors, which are too many to enumerate here.
The first two approaches are implemented to conform to the factor structure un- derlying the APT: the first approach by the algorithmic design and the second because researchers check that the factors they use indeed leave the unexplained parts of asset returns almost uncorrelated. The third approach is implemented without regard for the factor structure. Its attempt to relate the assets’ expected returns to the covariance of the assets’ returns with other variables is more in the spirit of Merton’s (1973) intertemporal CAPM than in the spirit of the APT.
The empirical work cited above examines the extent to which the exact APT (with whatever factors are chosen) explains the cross-sectional variation in assets’ mean re- turns better than the CAPM. It also examines the extent to which other variables — usually those that include various firm characteristics — have marginal explanatory power beyond the factor loadings to explain the crosssection of assets’ mean returns. The results usually suggest that the APT is a useful model in comparison with the CAPM. (Otherwise, they would probably have gone unpublished.) However, the results are mixed when the alternative is firm characteristics. Researchers who introduce factors tend to report results supporting the APT with their factors and test portfolios. Nevertheless, different tests and construction of portfolios often reject the proposed APT. For example, Fama and French (1993) demonstrate that exact APT using their factors holds for portfolios constructed by sorting stocks on firm size and book-to-market ratio, whereas Daniel and Titman (1997) demonstrate that the same APT does not hold for portfolios that are constructed by sorting stocks further on the estimated loadings with respect to the Fama and French’s factors.
The APT often seems to describe the data better than competing models. It is wise to recall, however, that the purported empirical success of the APT may well be due to the weakness of the tests employed. Some of the questions that come to mind are: Which factors capture the data best? What is the economic interpretation of the factors? What are the relations among the factors that different researchers have reported? As any test of the APT is a joint test that the factors are correctly identified and that the linear pricing relation holds, a host of competing theories exist side by side under the APT’s umbrella. Each fails to reject the APT but has its own factor identification procedure. The number of factors, as well as the methods of factor construction, is exploding. The multiplicity of competing factor models indicates ignorance of the true factor structure of asset returns and suggests a rich and challenging research agenda.
Applications
The APT lends itself to various practical applications due to its simplicity and flexibility. The three areas of applications critically reviewed here are: assetallocation, the computation of the cost of capital, and the performance evaluation of managed funds.
The application of the APT in asset allocation is motivated by the link between the factor structure (1) and mean-variance efficiency. Since the structure with k fac- tors implies the existence of k assets that span the efficient frontier, an investor can construct a mean-variance efficient portfolio with only k assets. The task is especially straightforward when the k factors are the payoffs of traded securities. When k is a small number, the model reduces the dimension of the optimization problem. The use of the APT in the construction of an optimal portfolio is equivalent to imposing the restriction of the APT in the estimation of the mean and covariance matrix involved in the mean-variance analysis. Such a restriction increases the reliability of the estimates because it reduces the number of unknown parameters.
If the factor structure specified in the APT is incorrect, however, the optimal port- folio constructed from the APT will not be mean-variance efficient. This uncertainty calls for adjusting, rather than restricting, the estimates of mean and covariance matrix by the APT. The degree of this adjustment should depend on investors’ prior belief in the model. Pastor and Stambaugh (2000) introduce the Bayesian approach to achieve this adjustment. Wang (2005) further shows that the Bayesian estimation of the return distribution results in a weighted average of the distribution restricted by the APT and the unrestricted distribution matched to the historical data.
The proliferation of APT-based models challenges an investor engaging in asset allocation. In fact, Wang (2005) argues that investors averse to model uncertainty may choose an asset allocation that is not mean-variance efficient for any probability distributions estimated from the prior beliefs in the model.
Being an asset pricing model, the APT should lend itself to the calculation of the cost of capital. Elton, Gruber and Mei (1994) and Bower and Schink (1994) used the APT to derive the cost of capital for electric utilities for the New York State Utility Commission. Elton, Gruber and Mei specify the factors as unanticipated changes in the term structure of interest rates, the level of interest rates, the inflation rate, the GDP growth rate, changes in foreign exchange rates, and a composite measure they devise to measure changes in other macro factors. In the meantime, Bower and Schink use the factors suggested by Fama and French (1993) to calculate the cost of capital for the Utility Commission. However, the Commission did not adopt any of the above- mentioned multi-factor models but used the CAPM instead (see DiValentino (1994).)
Other attempts to apply the APT to compute the cost of capital include Bower, Bower and Logue (1984), Goldenberg and Robin (1991) who use the APT to study the cost of capital for utility stocks, and Antoniou, Garrett and Priestley (1998)
who use the APT to calculate the cost of equity capital when examining the impact of the European exchange rate mechanism. Different studies use different factors and consequently obtain different results, a reflection of the main drawback of the APT — the theory does not specify what factors to use. According to Green, Lopez and Wang (2003), this drawback is one of the main reasons that the U.S. Federal Reserve Board has decided not to use the APT to formulate the imputed cost of equity capital for priced services at Federal Reserve Banks.
The application of asset pricing models to the evaluation of money managers was pioneered by Jensen (1967). When using the APT to evaluate money managers, the managed funds’ returns are regressed on the factors, and the intercepts are compared with the returns on benchmark securities such as Treasury bills. Examples of this application of the APT include Busse (1999), Carhart (1997), Chan, Chen, Lakonishok (2002), Cai, Chan and Yamada (1997), Elton, Gruber and Blake (1996), Mitchell and Pulvino (2001), and Pastor and Stambaugh (2002).
The APT is a one-period model that delivers arbitrage-free pricing of existing assets (and portfolios of these assets), given the factor structure of their returns. Applying it to price derivatives on existing assets or to price trading strategies is problematic, because its stochastic discount factor is a random variable which may be negative. Negativity of the SDF in an environment which permits derivatives leads to a pricing contradiction, or arbitrage. Consider, for instance, the price of an option that pays its holder whenever the SDF is negative. Being a limited liability security, such an option should have a positive price, but applying the SDF to its payoff pattern delivers a negative price. (The observation that the stochastic discount factor of the CAPM may be negative is in to Dybvig and Ingersoll (1982) who also studied some of the implications of this observation.)
Trading and derivatives on existing assets are closely related. Famously, Black and Scholes (1972) show that dynamic trading of existing securities can replicate the payoffs of options on these existing securities. Therefore, one should be careful in interpreting APT-based excess returns of actively managed funds because such funds trade rather than hold on to the same portfolios. Examples of interpretations of asset management techniques as derivative securities include Merton (1981) who argues that market- timing strategy is an option, Fung and Hsieh (2001) who show that hedge funds using trend-following strategies behave like a look-back straddle, and Mitchell and Pulvino (2001) who demonstrate that merger arbitrage funds behave like an uncovered put.
Motivated by the challenge of evaluating dynamic trading strategies, Glosten and Jagannathan (1994) suggest replacing the linear factor models with the Black- Scholes model. Wang and Zhang (2005) study the problem extensively and develop an econometric methodology to identify the problem in factor-based asset pricing models. They show that the APT with many factors is likely to have large pricing errors over actively managed funds, because empirically these models deliver SDFs which allow for arbitrage over derivative-like payoffs.
It is ironic that some of the applications of the APT require extensions of the basic model which violate its basic tenet — that assets are priced as if markets offer no arbitrage opportunities.