Factor investing, also labeled smart beta, has recently become a well-known alternative to market capitalized index investing. Apart from market risk, factor investing tilts portfolio weights towards other specific risk factors, such as value or momentum; and aims to outperform the corresponding capitalization-weighted benchmark.

In practice, factor exposures are realized with simple scoring approaches based on predefined firm characteristics. The cost-effectiveness of these rule-based methodologies is one of the main advantages of factor investing strategies. Nevertheless, an overly simplistic adaptation of factor investing strategies may lead to many misunderstandings. Amenc et al. (2016a) discuss ten misconceptions and separate these into groups, such as performance drivers, investability hurdles and strategy choices.Footnote 1 One of the strategy choice misconceptions is represented by the concentration fallacy, stating that a good factor index should provide a strong tilt to the desired factor. Although diversification is an essential cornerstone of modern portfolio theory, this misconception illustrates that implementing diversified factor investing strategies is still a challenging task.

The benefits of diversified factor investing strategies have been proposed by Amenc et al. (2016b) in a single factor environment and by Amenc et al. (2017) for multifactor strategies. The research results imply higher performance outcomes for diversified factor portfolios than for concentrated factor portfolios. Within the security selection approaches of the multifactor strategies, the authors used a simplified factor construction method with equally weighted firm characteristics. Since equal treatment of firm characteristics neglects potential differences in their informational content, the proposed procedure can lead to biased results. In particular, Heinrich and Zurek (2019) have shown, that naive stock selection models can be significantly improved by considering differences in the informational content between firm characteristics. Since a poorly calibrated stock selection model can reduce the performance of pure firm characteristic-based strategies, our study analyzes the benefits of diversified versus concentrated multifactor portfolios taking into account the full range of interaction effects between firm characteristics.

We present factor investing strategies from the perspective of the optimal orthogonal portfolio (OOP) formally introduced by MacKinlay and Pástor (2000). From the OOP perspective, it is possible to resemble the practical view of an investor who aims to beat a given benchmark portfolio in terms of mean-variance efficiency. Moreover, the OOP perspective provides a motivation for both strategies: A pure firm characteristic-based approach and a diversified strategy that additionally considers the covariance structure of the returns of the selected securities. More specifically, MacKinlay and Pástor (2000) use a strong assumptionFootnote 2 regarding the covariance structure of the residual returns to show that the OOP can be weighted proportional to the securities’ alphas without explicit consideration of the covariance matrix. As firm characteristics can be transformed into alphas, the OOP perspective motivates purely characteristic-based factor strategies.

In the diversified strategy, we weaken the strong assumption regarding the covariance structure and account for diversification effects by considering a nonzero covariance structure of the residual returns. Moreover, we keep our analysis contemporary and use two efficient state-of-the-art covariance estimation methods. Both estimators are more stable than a sample estimator, especially in a high-dimensional scenario. Due to its popularity among practitioners, the well-known linear shrinkage estimator of Ledoit and Wolf (2004) with a constant correlation structure of returns is adopted as the first estimator. The second estimator follows Amenc et al. (2012) as well as Amenc et al. (2011) and incorporates a factor-based approach to covariance estimation using principal component analysis (PCA) to determine the underlying risk factors. This method is chosen to keep the results comparable to the literature on factor strategies considering diversification effects.

To study the benefits of diversified versus concentrated strategies, we apply a horse race between a pure alpha-based weighting method and a diversification-based strategy. Both strategies are compared considering a comprehensive multifactor model. A multifactor perspective is especially important for practice since in multifactor portfolios and even in single-factor portfolios, many investors try to diversify their factor investing risk by examining several firm characteristics. With regard to the practical applicability of our study, the approaches are analyzed with factors defined according to an industry standard of the MSCI factor portfolios. Overall, the entire multifactor portfolio contains 16 firm characteristics. Moreover, to validate the robustness of the results, two different datasets consisting of stocks from the S&P 500 and from STOXX Europe 600 are included.

Our empirical findings show that applying the residual covariance matrix improves the out-of-sample performance in long-only multifactor strategies. Diversification strategies manage to both increase excess returns and reduce portfolio risk. By overweighting low-volatility stocks, the low-volatility factor exposure is strongly increased compared to other factors with less relevance. Since the low-volatility factor comprises of firm characteristics with a significant positive informational content, the increase in low-volatility factor exposure provides an important explanatory perspective that has not been recognized in previous research.

In the long-short portfolios, the low-volatility factor exposure in the alpha concentration approach is similar to that in the diversification approaches. As a result, no significant difference in performance is observed. Hence, the informational content structure of the applied firm characteristics and, in particular, the informational content of the low-volatility firm characteristics, can be identified as an important performance driver. Since our alpha forecasting model considers the differences in informational content between the firm characteristics, it allows for an appropriate comparison between the two strategies.

The remainder of the paper is organized as follows. Section 1 introduces the OOP perspective and the factor investing strategies. Section 2 presents the data set and the applied firm characteristics, whereas Sect. 3 reports the findings of the empirical backtest. This includes the results of the long-only and long-short portfolios. Finally, Sect. 4 contains some concluding remarks.

Factor investing strategies

In practice, factor investing addresses the case of an investor who aims to beat a given benchmark. Therefore, we follow Treynor and Black (1973) by separating the investment universe into an active portfolio \({\mathbb {A}}\) with \(i=1,\ldots ,N\) securities and a passive benchmark portfolio \({\mathbb {B}}\).

The main goal of our analysis is to compare the performance of a pure characteristic-based weighting approach with a scenario that considers diversification effects. Thus, we focus on the investment decisionFootnote 3 for \({\mathbb {A}}\) and assume the following linear relation between the securities’ returns in \({\mathbb {A}}\) and \({\mathbb {B}}\):

$$\begin{aligned} R^*_{\mathbb {A}},t&= \alpha + \beta \, {R^*_{{\mathbb {B}},t}} + \epsilon _{ t} \nonumber \\ \text{ E }[\epsilon _{ t}]&= {{0}} \nonumber \\ \text{ E }[\epsilon _{ t}\, \epsilon _{ t}']&= {\Sigma } \nonumber \\ \text{ Cov }[R^{*}_{{\mathbb {B}},t}, \epsilon _{t}]&= {{0}} , \end{aligned}$$
(1)

where the (\(N \times 1\))-vector \({R}^*_{{\mathbb {A}},t}\) denotes the N security returns in \({\mathbb {A}}\) at time t, and \(R^*_{{\mathbb {B}},t}\) denotes the return on the benchmark portfolio. Throughout the article, the term “return” on a given security denotes the return in excess of the risk-less rate.Footnote 4 The (\(N \times 1\))-vector \({\beta }\) represents the risk sensitivities of the securities returns towards the benchmark returns and the (\(N\times 1\))-vector \( {\epsilon }_{ t}\) represents the resulting residuals assumed to have zero-mean and a nonzero covariance, denoted as the (\(N\times N\))-matrix \({{\Sigma }}\).

Since MacKinlay and Pástor (2000) define the (\(N \times 1\))-vector \(\alpha \) as the vector of mispricings, we refer to this model as an incomplete factor model. Investors implementing factor strategies believe that factor model (1) can be complemented by an additional factor portfolio \({\mathbb {A}}\), so that mispricing disappears with \( \alpha = {\mathbf {0}}\). Accordingly, the complemented model is then defined as:

$$\begin{aligned} {{{R}}^*_{{\mathbb {A}},t}}&= \beta \, R^*_{{\mathbb {B}},t} + \beta _{{\mathbb {A}}} {R^*_{{\mathbb {A}},t}} + {{\upsilon }}_{t} \nonumber \\ \text{ E }[{\upsilon }_{ t}]&= {\mathbf {0}} \nonumber \\ \text{ E }[{\upsilon }_{ t}\, {{\upsilon }}_{ t}']&= {{\Omega }} \nonumber \\ \text{ Cov }[R^{*}_{{\mathbb {B}},t}, {{\upsilon }}_{t}]&= {\mathbf {0}} \nonumber \\ \text{ Cov }[R^{*}_{{\mathbb {A}},t}, {{\upsilon }}_{t}]&= {\mathbf {0}} . \end{aligned}$$
(2)

Assuming that the returns of \({\mathbb {A}}\) and \({\mathbb {B}}\) are uncorrelated, \({\mathbb {A}}\) represents the OOP, which corresponds to the missing risk-factor portfolio. The weights for the optimal orthogonal portfolio \({{a}}_{{\mathbb {A}}}\) can be determined by:

$$\begin{aligned} {{a}}_{{\mathbb {A}}}&= \frac{{\Sigma }^{-1} \alpha }{ {\mathbf {1}}_{N}' {\Sigma }^{-1} \alpha } . \end{aligned}$$
(3)

In accordance with Fama–French factors, the securities of \({\mathbb {A}}\) are separated into long and short portfolios. Stocks with alphas above (below) a specified quantile \(\alpha ^{+}_{qt}\) (\(\alpha ^{-}_{qt}\)) are assigned to the long (short) portfolio. Since \(\alpha ^{+}_{qt}\) and \(\alpha ^{-}_{qt}\) determine the number of selected securities, these are referred to as selection thresholds. Formally this division is expressed by

$$\begin{aligned} {{a}}_{{\mathbb {A}},\text{ long }} = \frac{{{\Sigma }}^{-1}_{+} \alpha ^{+}}{{1}' \, {{\Sigma }}^{-1}_{+} \, \alpha ^{+}} \quad \text {and} \quad {{a}}_{{\mathbb {A}},\text{ short }} = \frac{{{\Sigma }}^{-1}_{-} \alpha ^{-}}{{1}' \, {{\Sigma }}^{-1}_{-} \, \alpha ^{-}} . \end{aligned}$$
(4)

In \({{a}}_{{\mathbb {A}},\text{ long }}\), the alpha vector \( \alpha ^{+}\) and the covariance matrix \({{\Sigma }}_{+}\) hold the values for the respective securities in the long portfolio. Equivalently, \( \alpha ^{-}\) and \({{\Sigma }}_{-}\) hold the values for the securities in the short portfolio \({{a}}_{{\mathbb {A}},\text{ short }}\). In addition to long-short (LS) portfolios, we construct long-only (LO) portfolios that consider only the long portfolio. Within the diversification approach, we further need to include no-short-sale constraints to ensure the separation of securities into long and short portfolios. The weighting strategy for \({\mathbb {A}}\) can be applied to build bottom-up multifactor portfolios. The OOP perspective has been applied by Zurek and Heinrich (2021) to show the benefits of bottom-up versus top-down factor strategies. According to this approach, we can apply multiple factor-specific firm characteristics and build multifactor portfolios within a one-step approach.

To obtain the alpha forecasts \(\hat{\alpha }\), we use the same linear model as described by Heinrich and Zurek (2019). In this model, the assumption is made that the security returns in \({\mathbb {A}}\) follow a multivariate conditional normal distribution. The M different types of firm characteristics, hereinafter referred to as signals, can be observed for each of the N companies and represent the necessary conditioning information for each security. For simplification purposes, it is assumed that these signals only have informational value for their specific company, which means that signals from different companies do not correlate with each other or with returns from other companies. Further, we assume that signals are uncorrelated with benchmark returns, so that benchmark timing can be disregarded. Under these conditions, it is possible to apply alpha forecasts with the following linear model

$$\begin{aligned} {\hat{\alpha }}_{i,t}&= \hat{{{k}}} \,\, \widehat{{{C}}}^{-1} \, {{z}}_{i,t-1}, \end{aligned}$$
(5)

with

$$\begin{aligned} \hat{{{k}}}&= \left( \widehat{\text{ Corr }}\left[ R^*_{t}, g_{1, t-1}\right] \cdots \widehat{\text{ Corr }}\left[ R^*_{t}, g_{M, t-1}\right] \right) , \nonumber \\ {{z}}_{i,t-1}&= \left( \begin{array}{c} \frac{\left( g_{i,1, t-1} - \widehat{\text{ E }}\left[ g_{1, t-1}\right] \right) }{{\widehat{\sigma }}_{g_{1,t-1}}} \\ \vdots \\ \frac{\left( g_{i,M, t-1} -\widehat{\text{ E }}\left[ g_{M, t-1}\right] \right) }{{\widehat{\sigma }}_{g_{M, t-1}}} \end{array} \right) , \end{aligned}$$
(6)

where \(R^*_{i,t}\) denotes the return of security i at time t. \({{g}}_{i, t-1}\) is the (\(M \times 1\))-vector of signals observations. The correlations between the lagged signals and returns in the (\(1 \times M\))-vector \(\hat{{{k}}} \) are called information coefficients (ICs) and measure the signals’ informational content. The inverse of the signals’ correlation matrix \(\widehat{{{C}}}^{-1}\) ensures that highly correlated signals have a lower impact on the alpha forecast and vice versa. Representing the companies’ standardized signal observations, the values in the (\(M \times 1\))-vector \({{z}}_{i,t-1}\) are referred to as z scores. Because factor investing strategies are based on cross-sectional anomalies, parameter estimation is conducted with regard to cross-sectional observations. Therefore, \(\hat{{{k}}}\) and \(\widehat{{{C}}}\) are the same for all securities. This is equivalent to the assumption that a specific stock characteristic incorporates the same informational content and linear co-movement with other signal types for all securities. To obtain ex-ante estimates, we estimate the ICs and C from the time-series averages of their cross-sectional estimates. The applied linear form is strongly related to a Fama–MacBeth regression. Nevertheless, we deviate from the well-known Fama–MacBeth beta-notation by segregating betas into information coefficients, the characteristics’ interaction effects and z scores. By ignoring differences in the firm characteristics’ information coefficients and interaction effects, alpha forecasts can be linked directly to the firms’ z scores. This naive method of alpha forecasting is frequently applied in practice. However, Heinrich and Zurek (2019) emphasized the important role of ICs in factor investing strategies. Moreover, ICs are one of the main determinants in the fundamental law of active management, which is a simplified framework to explain a portfolio’s information ratio.Footnote 5 Therefore, it is straightforward to refrain from using naive alpha forecasts of equally weighted z scores and instead consider the full range of linear interaction effects between the firm characteristics.

Besides alpha forecasts, implementing diversification-based factor strategies requires the estimation of the residual returns’ covariance matrix \({\Sigma }\). Since the most popular estimator—the sample covariance—leads to high estimation errors, especially in the presence of high-dimensional data, most risk-based investment strategies choose more biased, but less variable estimators for the covariance matrix. We utilize two different covariance estimation methods, which are more stable than the sample estimator \(\widehat{{\Sigma }}_S\).

The first method considers the statistical shrinkage procedure applied to a structured target matrix \(\widehat{{{\Sigma }}}_ T \),

$$\begin{aligned} \widehat{{{\Sigma }}}_ {LS} = s\widehat{{{\Sigma }}}_ T + \left( 1- s\right) \widehat{{{\Sigma }}}_ S , \end{aligned}$$
(7)

where the constant \(s\in [0,1]\) controls the shrinkage intensity. As a compromise between the highly unstable sample estimator and the highly biased target matrix, the convex combination of both uses the bias-variance trade-off to enhance the out-of-sample performance. Implementing this linear shrinkage method in the context of financial time series requires the target matrix to be chosen with reference to an assumed correlation structure of the underlying returns. The analysis focuses on the linear shrinkage method of Ledoit and Wolf (2004), which assumes identical pairwise correlations among all N assets and is referred to hereinafter as the Ledoit Wolf constant correlation (LW-CC) estimator. While the variances are kept as their original sample values, the off-diagonal entries of the target matrix are estimated with a constant average sample correlation \({\bar{\rho }}\). This results in \(\widehat{{{\Sigma }}}_{T,ij} =\sqrt{{\widehat{\sigma }}_ ii {\widehat{\sigma }}_ jj {\bar{\rho }}}.\) For the concrete implementation of this estimation method, we use the optimal shrinkage intensity \(s^*\), as derived by Ledoit and Wolf (2004).

Finally, to establish a benchmark for the results of combining characteristics- and diversification-based methods, we follow Amenc et al. (2012) and additionally implement a robust factor-based covariance matrix estimator. Using factor models, the covariance matrix of investment returns can be estimated as

$$\begin{aligned} \widehat{{{\Sigma }}}_ {FM} = {{{B}}}\widehat{{{\Sigma }}}_ F {{{B}}}' + \widehat{{{\Sigma }}}_ u \end{aligned}$$
(8)

where B is the matrix of factor loadings, \({{\Sigma }}_ F \) is the sample covariance matrix of the common factors, and \(\widehat{{{\Sigma }}}_ u \) is the residual covariance matrix.Footnote 6

Instead of assuming a concrete underlying factor model, as described in Fan et al. (2008), Amenc et al. (2011) choose to use PCA to determine the factors directly from the data.Footnote 7 In data analysis, PCA is a dimension reduction method based on the spectral decomposition of the sample covariance matrix. First introduced by Hotelling (1933), the central idea of PCA is to reduce the dimensionality of a data set and at the same time to retain as much as possible of the present variation between the individual entries (Ledoit and Wolf 2015). In particular, if the first K factors (or sample eigenvectors) govern most of the variability of asset returns, i.e., if \(\frac{\sum _{k=0}^{K}\sigma ^2_{F,l}}{\sum _{k=0}^{N}\sigma ^2_{F,l}}\approx 1\,\), then the last \(N-K\) factors can be dropped without losing too much information on the underlying covariances. The respective covariance estimator becomes

$$\begin{aligned} \widehat{{{\Sigma }}}_ {PCA} = {{{V}}_K}\widehat{{\Lambda }}_ K {{{V}}_K}' + \widehat{{{\Sigma }}}_ u , \end{aligned}$$
(9)

where \({{V}}_K\) is the N-asset by K-factor matrix of factor loadings (first K eigenvectors), \(\Lambda _K\) is the \(K\times K\) diagonal matrix of the first K eigenvalues, and \(\widehat{{{\Sigma }}}_ u \) is the \(N\times N\) diagonal matrix with the sample variances of idiosyncratic components not explained by the first K factors. The application of PCA is strongly motivated by the observation that most of the information within a covariance matrix is loaded within the first K largest eigenvalues while the rest of the information is mostly noise. However, determining the number of factors K remains an important question in the literature.Footnote 8

Following Amenc et al. (2011), we refer to the contributions from random matrix theory as in Laloux et al. (2000), Bouchaud and Potters (2009), Bun et al. (2017). In particular, when assuming i.i.d. returns, the eigenvalues of the true random covariance matrix are described by the Marčenko-Pastur (MP) distribution with the upper bound \(\lambda _{max} = \sigma ^2(1+\sqrt{N/T})^2\) and \(\sigma ^2\) corresponding to the average estimated variance across the N assets (Laloux et al. 2000). It is therefore possible to detect which eigenvalues actually carry valuable information and which eigenvalues are random and consist of pure noise. As in Amenc et al. (2011), the number of principal components K for the method in Eq. (9) is set to be the number of eigenvalues with essential information, namely those larger than \(\lambda _{max}\). The described fully data-driven covariance estimation method produces a stable estimator based only on the actually relevant information from the returns and is expected to perform well in a high-dimensional data setting, such as within factor investing strategies.

Instead of explicitly applying covariance estimates with \(\widehat{{{\Sigma }}}_{LS} \) or \(\widehat{{{\Sigma }}}_{PCA} \), it is possible to adopt a simplifying structure of the correlation of the residual returns that allows us to rely only on the alpha predictions of the securities. As an alternative to the strong model assumption of \(\widehat{{\Sigma }}_I = I\) with I representing the identity matrix, we apply a weaker model assumption on the covariance structure of the residuals \({\Omega }\) from the complete factor model (2). Within the OOP perspective MacKinlay and Pástor (2000) provide the following relation for the residual covariance matrix \({\Sigma }\):

$$\begin{aligned} {\Sigma } = \alpha \alpha ' \frac{1}{\left( \frac{\alpha _{{\mathbb {A}}}}{\sigma _{{\mathbb {A}}}}\right) ^2} +{\Omega }. \end{aligned}$$
(10)

Here, the expression \(\left( \frac{\alpha _{{\mathbb {A}}}}{\sigma _{{\mathbb {A}}}}\right) ^2\) is the squared information ratio of the optimal orthogonal portfolio and determines its performance contribution to the overall mean variance portfolio. Interestingly, Eq. (10) reveals that with an assumption for \({\Omega }\) the vector of mispricings is linked to \({\Sigma }\). By utilizing the Sherman–Morrison formula for the inverse of \({\Sigma }\), this relation can be inserted in the optimal portfolio weight Eq. (3):

$$\begin{aligned} {{a}}_{{\mathbb {A}}}&= \frac{{\Omega }^{-1} \alpha }{ {\mathbf {1}}_{N}' {\Omega }^{-1} \alpha }. \end{aligned}$$
(11)

It becomes apparent that the residual covariance matrix \({\Omega }\) from model (2) replaces the covariance matrix \({\Sigma }\) from model (1). Moreover, under a restricted form for the residual covariance of \({\Omega } = \sigma ^2 I\) the OOP weights are proportional to the securities’ alpha parameters:

$$\begin{aligned} {{a}}_{{\mathbb {A}}} = \frac{\alpha }{{1}'\alpha } \end{aligned}$$
(12)

This result motivates building factor portfolios that exhibit weights, proportional to \(\alpha \) and are therefore solely determined by their firm characteristics.Footnote 9 Since securities with the highest alpha forecasts will reach the highest weights, the demonstrated weighting approach is referred to as alpha concentration (Alpha-Con).

The assumption—\({\Omega } = \sigma ^2 I\)—referred to as a “strong form” by MacKinlay and Pástor (2000) might appear very restrictive. However, it is shown empirically and in a simulation that the assumed structure improves the out-of-sample portfolio performance. Since the simple approach of assuming zero-correlated residuals leads to a stable but also biased solution, the question regarding whether the concentrated approach can be advantageous compared to diversified approaches remains open. To study the results within a multifactor setting, we conduct a horse race between the Alpha-Con approach and the two diversification approaches utilizing the LW-CC and PCA estimators, respectively. As an additional alternative to the Alpha-Con approach, we also investigate an equally weighted multifactor portfolio. In the equally weighted multifactor portfolio, the forecasted alphas are applied only in the security selection step, while the number of P selected securities is equally weighted by 1/P.

Data

The S&P 500 (SPX) and STOXX Europe 600 (STX) indices are used as representative data sets for the US and European stock markets, respectively. Compared to the SPX, the STX comprises different currency areas. To ensure that currency effects do not influence the result, returns are measured in local currency. In particular, the assumption is made that the currencies are hedged, whereby hedging costs are not taken into account. As proxy for the risk free rate, we applied the one-month T-bill rate for the US market and the three-month Euro Government Bond rate for the European market.

The datasetFootnote 10 includes firm characteristics and stock returns from the beginning of 2002 until the end of 2019. 2002 was chosen as a starting point, because we observed a higher number of significant predictors of monthly U.S. returns prior to 2002. Due to limited space, we do not consider time-varying predictability. However, we partially address the problem of misspecification of the factor model by also applying factors with a small number of significant predictors. In addition, we have noted a significantly lower data availability for the STX index prior to 2002.

With value, growth, momentum, quality and low-volatility, our study includes five well-known factors which corresponds to the factor setting of Zurek and Heinrich (2021). Each factor is based on several firm characteristics, with the multifactor portfolio comprising overall 16 firm characteristics.Footnote 11 The factor composition is based on MSCIFootnote 12 factor portfolios, as these are widely accepted in the industry. In the case of the low-volatility factor, we have chosen to deviate from the MSCI specification. The MSCI Low-Volatility factor uses a minimum variance approach to reduce the overall risk of the portfolio and thus differs from the other factors, which are based purely on firm characteristics. Therefore, the low-volatility factor is applied by using the characteristic-based method of Chow et al. (2014). This approach fits well with the construction methods of the other factors.

For our empirical out-of-sample analysis, we apply daily trailing data observations. This assures that the firm characteristics always resort to the most actual data points and consider the ongoing fluctuations in price-dependent data. Firm characteristics for which a higher z score does not reflect a higher expected return are multiplied by minus one. In our case, this applies to earnings variability and debt to equity. Statistical outliers are adjusted by the method of DeMiguel et al. (2020) by setting equal all observations greater (less) than the defined threshold to the third (first) quartile plus (minus) three times the interquartile range. Zurek and Heinrich (2021) have shown that the Alpha-Con approach tilts the factor exposure towards those factors that exhibit firm characteristics with the highest informational content. Since the diversification approaches can have a disrupting effect on the factor exposures, it is important to compare the ICs of each firm characteristic. Besides ICs, Zurek and Heinrich (2021) also discuss the effects of the cross-sectional correlations between the firm characteristics.Footnote 13 However, as the majority of correlation coefficients are small, the overall impact on the alpha forecast is limited. Table 1 shows the time series means of the 216 cross-sectional IC realizations within the selected test period. Bootstrap standard errors (SEs) and 90% (95%) confidence intervals are calculated for 100,000 bootstrap resamples to quantify the uncertainties in the inferences. The sample standard deviation of the average ICs across bootstrap samples is used as an estimate of the standard error. Furthermore, the 5% (2.5%) and 95% (97.5%) percentiles of the bootstrap samples are calculated for the mean ICs to obtain the lower and upper bounds of the confidence intervals, respectively. The mean ICs where the confidence interval does not cross zero from the 5 (2.5) and 95 (97.5) percentiles of the 90% (95%) confidence interval are marked as significant.

The majority of signals show positive mean ICs, which are close to zero. It is striking that the informational content of the individual characteristics varies considerably between the samples. In terms of the number of significant signals within the individual samples, two scenarios with different predictability can be identified. While the SPX exhibits only five signals with significant ICs, in the STX most ICs are significant with a total of 11 signals. Therefore, the SPX multifactor model represents a case in which an investor has identified a moderate number of ex ante predictors, and thus an accurate forecast is unlikely. In contrast, the multifactor model of the STX sample represents a well-defined factor model.

Furthermore, a strong difference in ICs within and between factors can be observed. The low-volatility factor appears to be the only factor with characteristics showing significant positive ICs in both samples. The large differences in IC structure across samples and factors underscore the importance of applying alpha predictions that account for these differences. In addition, it also demonstrates the need to compare strategies in different scenarios to achieve robust results.

Table 1 Mean ICs and bootstrap results of the SPX, and STX sample

Out-of-sample backtest

The backtest framework is chosen to closely resemble the realistic investment behavior of an institutional investor, with the main objective of outperforming an underlying cap-weighted benchmark portfolio. This objective requires considering weighting constraints, rebalancing costs, commonly used rebalancing frequencies and representative data sets. The results are analyzed for the SPX and STX samples using the multifactor settings described in Sect. 2. In order to create an appropriate comparison with the parent index, the investable securities consist of the securities that are part of the parent index at the time of rebalancing. Regarding the dataset, the backtest uses only point-in-time data,Footnote 14 i.e., only firm characteristics that were available in the database at the current time. For the estimation period, a five-year rolling window, corresponding to 60 monthly observations, is used. The out-of-sample evaluation interval begins in 2007 and ends in 2019. The portfolio is rebalanced at monthly intervals. Trading costs caused by rebalancing are considered by quantifying the cost-relevant volume with the portfolio turnover rate (\(\text{ PTR}_t = \sum _{i = 1}^N |a_{i,t-1} - a_{i,t}|\)). The PTR determines the percentage of the portfolio that causes trading costs. In this context, a PTR of 30% signifies that 15% of the old portfolio is sold. Subsequently, the incoming liquidity has to be reinvested, and trading costs are incurred for 30% of the portfolio. Following Frazzini et al. (2018), the costs are assumed to be ten basis points per traded volume. The calculated costs are demarcated on the day they arise. Consequently, these costs lead to a direct reduction in the return on the rebalancing day.

Backtest results

To ensure robust results and to investigate a wide range of portfolio sizes, we apply three different quantiles of \(qt^+ = \{90\%,80\%,70\%\}\) in the selection threshold for the long portfolio \(\alpha ^{+}_{qt}\) and \( qt^- = \{10\%,20\%,30\%\}\) in the selection threshold for the short portfolio \(\alpha ^{-}_{qt}\). Table 2 depicts the corresponding maximum (max.) portfolio weights and the normalized (norm.) Herfindahl index (HI) results for the LO strategies.Footnote 15

Since the unconstrained diversification approaches lead to high weight concentrations, weight constraints.Footnote 16 have been applied within the diversification approaches, to allow for a meaningful comparison with the Alpha-Con approach. The weight bounds are determined by the average max. weights of the Alpha-Con portfolios. Therefore, the max. weights of the diversification approaches are the same for the LW-CC and the PCA diversification approaches. Since the LW-CC and PCA approaches reach the max. weight in almost every rebalancing, the average max. weights of the diversified approaches are nearly identical to those of the Alpha-Con approach.

Almost all weights of the portfolios are below 5%. Consequently, the strategy weights are in line with the usual weighting rules applied to institutional investment funds. The higher HI values in the diversification approaches indicate that the Alpha-Con portfolios are in fact less concentrated. Moreover, the HI results of the Alpha-Con portfolios are very close to zero, which equals the HI value of the 1/N strategy. This indicates that the predicted alphas of the selected securities in the Alpha-Con portfolios are very similar to each other.

Table 2 Weight concentrations of the LO portfolios in the SPX and STX samples

To examine the portfolio performance compared to the associated cap-weighted benchmark portfolios, in Tables 3 and 4, we report three important performance measures—the annualized alpha parameters, tracking errors, and information ratios for each strategy for the LO portfolios and LS portfolios, respectively. The time series of monthly portfolio returns is used to compute the out-of-sample portfolio alphas, which are estimated from a regression of the portfolio returns against the associated benchmark returns. To determine the strategies’ information ratios, the portfolios’ alpha parameters are divided by the tracking errors, calculated from the standard deviations of the regression residuals.

Table 3 Backtest LO results of the SPX and STX samples
Table 4 Backtest LS results of the SPX and STX samples

Regarding the SPX sample, the only significant alpha performance result within the Alpha-Con portfolios can be observed for the 90% quantile portfolio. This outcome is not surprising since the SPX alpha forecasting model contains a moderate number of firm characteristics with significant mean ICs. However, it is remarkable that the two diversification approaches achieve significant positive alpha results in all three SPX samples. The alpha concentration approaches of the STX sample obtain significant alphas in the 80% and 70% portfolios, which can be explained by the large number of significant predictors. In comparison, the diversification approaches manage to significantly outperform the STX index in all three samples. The qualitative comparison demonstrates that the diversification approaches definitely achieve a higher alpha in all LO portfolios compared to Alpha-Con portfolios.

Due to the low concentrated portfolio weights of the Alpha-Con approach, its performance is similar to the performance of the 1/N portfolios. In the SPX sample, the 1/N approach performs slightly worse, whereas in the STX sample, it performs slightly better. Compared to each other, both diversification approaches lead to similar alpha results.

As a measure of active risk, the tracking errors reveal the extent to which the diversified strategies are able to reduce the portfolio risk. In all samples, the LW-CC and PCA approaches obtain a lower tracking error compared to the Alpha-Con approach. The 1/N portfolios are also able to reduce portfolio risk, but to a lesser extent than the diversification approaches. One can therefore conclude that the diversification approaches indeed reduce portfolio risk. However, in both indices, the tracking error differences between the applied strategies are not significant.

The combination of higher alpha results with lower tracking errors within the diversification approaches leads to much better information ratios compared to the Alpha-Con approach. To determine the inference in the differences between the information ratios of the Alpha-Con and the diversification portfolios, a Ledoit and Wolf (2008) test is applied. The examination includes tests for statistically significant differences at the 1%, 5% and 10% levels. In all samples, at least one diversification approach yields significantly better information ratio results compared to the Alpha-Con portfolios.

Table 4 shows the performance results for the LS portfolios. Due to the short portfolios, the corresponding market risk exposures are reduced, resulting in alphas that are multiple times greater than those of the LO portfolios. With minor exceptions for the 1/N portfolios, in almost all samples the alpha performance results are significantly positive. Especially in comparison to the LO results, it is noticeable that nearly all alphas of the Alpha-Con approach achieve higher values than those in the diversification approaches. Consistent with the LO portfolio results, the tracking errors of the diversification approaches are all smaller compared to the Alpha-Con portfolios.

In contrast to the LO portfolios, the information ratio results of the LS portfolios show very limited differences in the performance between the approaches. Therefore, no significant improvements can be observed for the diversification approaches. The results for the LS portfolios provide reasonable doubt regarding the proposition that diversified portfolio weights lead to overall performance advantages over alpha concentrated portfolio weights.

To analyze these findings in more detail, Tables 5 and 6 present the corresponding average factor exposures of the individual factors for the LO and LS portfolios. The factor exposures are measured by the weighted z scores, consistent with Ghayur et al. (2018). Our results show that the exposure distribution among the individual factors is very heterogeneous. The large factor exposure variation can be mainly explained by the ICs of the characteristics within the factors. For instance, the value factor of the STX sample contains two significantly negative ICs, which causes a negative value factor exposure. In contrast, the value factor of the SPX sample also contains characteristics with negative ICs, but these are closer to zero and not significant. In consequence, the value factor exposure of the SPX sample remains positive. This example shows that an exposure statement can be derived in particular from the significant ICs. Since low-volatility is one of the factors with the highest mean ICs among the significant firm characteristics, the exposure of this factor is particularly high.

Considering that our alpha forecasting model takes into account the full range of interaction effects between the firm characteristics, in addition to the ICs, the cross-correlations between the characteristics have an impact on the exposures. Momentum and low-volatility consist of two highly correlated characteristics, whereas the correlations of the characteristics within the quality factor are very low and in some cases even negative. While in the case of momentum and low-volatility the characteristics increase their factor exposure, the quality characteristics can cancel each other out and thus reduce the quality exposure. In particular, the low or even negative quality exposure of the SPX sample indicates that a factor with a low number of significant characteristics in combination with uncorrelated firm characteristics is affected by a loss of factor exposure. Although the quality factor exposure of the STX sample is also affected by the low correlation structure, the factor includes two highly significant characteristics. This leads to a higher quality exposure in the STX sample compared to the SPX.

Table 5 Average factor exposures of LO strategies
Table 6 Average factor exposures of LS strategies

The Alpha-Con approach tilts the factor exposure towards the factors with significant firm characteristics. In contrast, in the diversification approaches, considering the residual covariances leads to disrupting effects on the factor exposures. The disrupting effects can especially be observed in the low-volatility factor. With regard to the results, it is evident that the low-volatility factor exposure of the diversification approaches in the LO portfolios is on average almost twice as high as in the Alpha-Con portfolios. This outcome can be explained by the fact that the low-volatility stocks in the LO portfolios are weighted higher by taking covariances into account. As a result, an increasing effect on the low-volatility factor exposure occurs. In the case of the STX and SPX indices, the low-volatility factor is represented by firm characteristics with high informational content. Since the informational content describes the relationship between firm characteristics and excess returns, it is a measure of the predictive power of a firm characteristic. Higher exposures in firm characteristics with high informational content lead to a higher ability to generate alpha. As a result, LO diversified portfolios benefit from both performance drivers, additional alpha and lower risk.

In the short portfolios of the LS strategies, stocks with low-volatility are also weighted higher because of diversification effects. However, since in the LS portfolio the short portfolio is subtracted from the long portfolio, the low-volatility stocks obtain higher negative weights. Therefore, in the LS strategies, the increasing low-volatility exposure effect of the LO portfolios is canceled out between the long and the short portfolios. This canceling effect leads to a similar low-volatility exposure of the diversification approaches and the Alpha-Con portfolios. As a result, compared to the Alpha-Con approach, the diversified factor portfolios achieve smaller alpha performance values than in the LO results. Without the additional alpha performance, in the LS portfolios, the performance results of the diversification approaches are not significantly different compared to the Alpha-Con approach.

Our empirical results reveal that the Alpha-Con approach is an efficient multifactor strategy. Therefore, we are able to confirm the findings of MacKinlay and Pástor (2000), which present the strong form assumption for the residual covariance matrix \({\Omega } = \sigma ^2 I\) to be an appropriate assumption. Besides the consideration of ICs within the alpha forecast model, an additional value driver of the Alpha-Con approach can also be identified by its deconcentrated portfolio weights, which are similar to the 1/N portfolio. Although deconcentration is not as efficient as diversification, we can verify the “naive” weighting approach to be a reasonable alternative to the more complex diversification methods.

According to the diversification approaches, at least for the LO portfolios, performance improvements in comparison to the Alpha-Con approach can be observed. However, the alpha performance benefits of the diversification approaches depend to a large extent on the informational content of the firm characteristics in the low-volatility factor. In the case of the STX and SPX indices, the low-volatility factor is represented by firm characteristics with high informational content. The LO portfolios benefit from the diversification approach because the additional low-volatility exposure contributes significantly to higher alpha performance results. In contrast, the LS diversified portfolios are not able to increase the low-volatility exposure relative to the Alpha-Con approach, and consequently, the performance results are not significantly different.

Figure 1 depicts the relative strength result charts for the STX \(80\%\) quantile LO portfolios and the \(20\%/80\%\) quantile LS portfolios.Footnote 17 Relative strength is calculated as the ratio of price changes based on the log returns of the strategy portfolios compared to the benchmark portfolios,Footnote 18 scaled to an initial value of 100. The LW-CC and PCA diversification approaches are represented by blue and green lines, respectively, whereas the Alpha-Con approach and the 1/N approach are depicted by dashed and dotted lines, respectively. The charts for the LO portfolios show that the LW-CC and PCA diversification approaches achieve higher relative strength compared to the Alpha-Con and 1/N approaches. For the 80% quantiles, we observe that the PCA approach performs better than the LW-CC approach. However, this observation is not confirmed in the results for the other quantiles. Compared to the LO portfolios, there is no particular difference between the strategies in the LS portfolios. The visualized results further confirm our performance results from Tables 3 and 4.

Fig. 1
figure 1

Relative strength results of long-only (\(qt^+ = 80\%\)) and long-short (\(qt^- = 20\%\), \(qt^+ = 80\%\)) STX multifactor portfolios. The relative strength results of all approaches in relation to their underlying indices are displayed

Conclusion

This article contributes to the current literature on the advantages of diversified versus concentrated multifactor portfolios. We present factor investing from the optimal orthogonal portfolio perspective and apply a linear alpha forecasting model with the extension of two efficient and popular covariance estimation methodologies. Our empirical findings indicate that a pure characteristics-based approach is an efficient multifactor strategy. Moreover, the additional application of the residual covariance matrix improves the out-of-sample performance in long-only strategies. The high informational content of the firm characteristics in the low-volatility factor represents a relevant advantage for diversified approaches. Since informational content is a measure of the predictive power of a firm characteristic, higher exposures to firm characteristics with high informational content lead to a higher ability to generate alpha. In long-only portfolios, diversification strategies are able to provide additional exposure to the low-volatility factor, leading to an increase in excess returns and a reduction in portfolio risk. Regarding the long-short portfolios, the additional low-volatility exposure from the long-only portfolios is canceled out due to a higher weight of low-volatility stocks within the short portfolio. Without the additional low-volatility factor exposure, the performance difference between the pure firm-characteristic-based approach and the diversification approaches is not significant.

Our findings exhibit important practical implications. Since in diversification approaches, the informational content of the low-volatility factor represents an essential performance determinant, it is important to assess the informational content of low-volatility firm characteristics in the respective stock markets. In markets where low-volatility is a factor with significant informational content, increasing exposure to the low-volatility factor provides the opportunity of performance benefits. Furthermore, the performance comparison between long-only and long-short strategies underlines the relevance for the investigation of diversification approaches in terms of factor exposure. In this context, the allocation of exposure to the relevant factors is a fundamental source of portfolio performance. Therefore, it is also essential to use an alpha forecasting approach that is able to consider differences in informational content. A well-calibrated alpha forecasting model represents the foundation for efficient stock selection and an appropriate portfolio weighting strategy.