Identification in a fully nonparametric transformation model with heteroscedasticity

https://doi.org/10.1016/j.spl.2020.109018Get rights and content

Abstract

An identification result for nonparametrically transformed location scale models is proven. The result is constructive in the sense that it provides an explicit expression of the transformation function.

Introduction

Let dXN. The underlying question of this article can be formulated quite easily: Given some real valued random variable Y and some RdX-valued random variable X fulfilling the heteroscedastic transformation model h(Y)=g(X)+σ(X)εwith some error term ε independent of X and fulfilling E[ε]=0 and Var(ε)=1, are the model components h:RR,g:RdXR,σ:RdX(0,) and the error distribution uniquely determined if the joint distribution of (Y,X) is known? This uniqueness is called identification of a model.

Over the last years, transformation models have attracted more and more attention since they are often used to obtain desirable properties by first transforming the dependent random variable of a regression model. Applications for such transformations can reach from reducing skewness of the data to inducing additivity, homoscedasticity or even normality of the error terms. Already Box and Cox (1964), Bickel and Doksum (1981) and Zellner and Revankar (1969) introduced some parametric classes of transformation functions. Horowitz (1996) developed an identification result for a linear regression function g and homoscedastic errors. Later, these ideas were extended by Ekeland et al. (2004) to general smooth regression functions g. The arguably most general identification results so far were provided by Chiappori et al. (2015) and Vanhems and Van Keilegom (2019), who considered general regression functions and even allowed heteroscedastic errors to some extend, but required conditional homoscedasticity of the error and some exogenous regressors, conditional on endogenous regressors or some control variable, respectively. Unfortunately, all of these approaches cannot be applied to the heteroscedastic model in (1.1) without further assumptions so that different methods are needed. Despite their practical relevance (e.g. in duration models, see Khan et al., 2011), results allowing heteroscedasticity are rare. While Zhou et al. (2009) showed identifiability in a single-index model with a linear regression function g and a known variance function σ2, Neumeyer et al. (2016) simply required identifiability implicitly in their assumptions. In contrast to the approaches mentioned above, it is tried here to avoid the assumption of the presence of some exogenous regressors and any parametric assumption on h,g or σ. Note that the validity of the model is unaffected by linear transformations, that is, Eq. (1.1) still holds when replacing h, g and σ by h̃(y)=ah(y)+b,g̃(x)=ag(x)+b,σ̃(x)=aσ(x)for some a,bR. Consequently, at least two conditions for fixing a and b are needed. These will be called location and scale constraints in the following.

The remainder is organized as follows. First, some assumptions are listed before the main identification result for heteroscedastic transformation models is motivated and stated. Afterwards, a short conclusion is given in Section 3. The proof can be found in the supplementary material.

Section snippets

The idea and the result

The assumptions which mostly concern validity of model (1.1) and continuity of its model components are given in the first section of the supplement. In the following, the functions h,g and σ as well as the density fε of ε are used to show their uniqueness and consequently identification of the model.

Conclusion and outlook

The so far most general identification result in the theory of transformation models has been provided. While doing so, the techniques of Ekeland et al. (2004) and Chiappori et al. (2015) have been modified to reduce the problem of identifiability to that of solving an ordinary differential equation. Most of the previous results are contained as special cases. The main contribution consists in allowing heteroscedastic errors without assuming parametric restrictions or the presence of exogenous

Acknowledgements

This work was supported by the DFG, Germany (Research Unit FOR 1735 Structural Inference in Statistics: Adaptation and Efficiency).

Moreover, I would like to thank Natalie Neumeyer and Ingrid Van Keilegom as well as the editor and two anonymous referees for their very helpful suggestions and comments on the project.

References (12)

  • BickelP.J. et al.

    An analysis of transformations revisited

    J. Amer. Statist. Assoc.

    (1981)
  • BoxG.E.P. et al.

    An analysis of transformations

    J. R. Stat. Soc. Ser. B Stat. Methodol.

    (1964)
  • ChiapporiP.-A. et al.

    Nonparametric identification and estimation of transformation

    J. Econometrics

    (2015)
  • EkelandI. et al.

    Identification and estimation of hedonic models

    J. Political Econ.

    (2004)
  • HorowitzJ.L.

    Semiparametric estimation of a regression model with an unknown transformation of the dependent variable

    Econometrica

    (1996)
  • HorowitzJ.L.

    Semiparametric and Nonparametric Methods in Econometrics

    (2009)
There are more references available in the full text version of this article.

Cited by (0)

View full text