Some Practical Problems of Recent Nonparametric Procedures

Anuncio
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
•
.
SI
1.
Some Practical Problems of Recent
Nonparametric Procédures: Testing, Estimation,
and Application.
Jorge Barrientos-Marin
Advisor: Stefan Sperlich
Quantitative Economies Doctorale
Departamento de Fundamentos del Análisis Económico
Universidad de Alicante
January 2007
mff' /',*.
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
i~
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
To my wife and my family.
1
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Agradecimientos
Los artículos que componen esta tesis es el resultado de cinco años de trabajo
continuo. Pero sin duda, esto no habría sido posible sin la colaboración de muchas
personas. Quiero expresar mi gratitud para con todas ellas. Sí alguno se queda sin
mencionar, lo más posible es que mi memoria, como es usual, me juegue una mala
pasada. Quiero entonces expresar mi reconocimiento a los miembros del departamento de Fundamentos del Análisis Económico, a mis profesores y especialmente a
mis condiscípulos, ellos hicieron estos cinco años soportables lejos de casa. Especial
reconocimiento entre profesores merecen Antonio Villar, quien confió en mi siempre
y fue consejero en momentos difíciles, a Juan Mora por proveerme ánimo y ratos
agradables discutiendo resultados y teoremas, al igual que Javier Alvarez y a Lola,
quienes con sus excelentes cursos me animaron a seguir el camino de la econometría.
Entre mis condiscípulos agradezco a Alicia quien siempre ha sido una amiga.
Agradezco a Ricardo su ayuda e innumerables favores (muchos de ellos pecuniarios)
y a Paco, Silvio y Szaby su compañía placida y su amistad sincera. A Fafael López,
su buen humor e intelegiencia fueron un reto para mi. Agradezco a José Maria su
gran aprecio para conmigo, algo que es mutuo, y su generosidad, estos años habrían
sido menos divertidos y algunas navidade tristes sin su amistad.
No puedo dejar de mencionar al personal administrativo (Mercedes, Mariló, Julio,
Carlos y Lourdes) siempre estuvieron atentos a ayudarme y tuvieron paciencia para
mis innumerables solicitudes.
Agradezco también a Frédéric Ferraty y a Philippe Vieu su dedicación, ellos me
proveyeron la mejor atmósfera para hacer uno de los capítulos que componen esta
tesis. Aquí merece mención Juan y Mónica, quienes me acogieron en su casa y
siempre fueron compañía, además de introducirme, en modo nada superficial, en los
aspectos de la vida francesa.
Agradezco a mi familia, en especial a Patricia, mi esposa, quien me ha poyado
todos estos años de semi-soledad a la espera de que esto acabara, siempre con paciencia y optimismo. A mi madre, quien sé que mi ausencia siempre la entristeció.
A mis tías, para quienes soy un orgullo. A José y Leticia Restrepo por ayudar a
Patricia a llevar la carga de la soledad.
Agradezco a mis amigos en Colombia, a Mauricio Alviar y a Pedro, quienes
desde un comienzo creyeron que esto era posible de alcanzar. Menciono también a
Alejandro Gaviria, quien continua enseñándome a pensar como un economista, me
dio además consejos acertados en el momento justo.
Finalmente, un reconocimiento especial merece Stefan Sperlich, quien me ha enseñado mucho de econometría semi y noparametrica. Estaré siempre agradecido con
él, porque se preocupó de que esto terminara bien y ha sido un director excepcional
aún desde la distancia.
2
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Contents
Agradecimientos
Introduction and Summary
Introducción y Resumen en Español
1
2
5
8
The Size Problem of Kernel Based Bootstrap Tests when the Nuil
is Nonparametric
12
1.1 Introduction
12
1.2 Statistical Methods: Estimators and Test Statistics
14
1.2.1 Estimators
14
1.2.2 Test Statistics
15
1.3 Resampling and Choice of Parameters
17
1.3.1 Bootstrap Tests
18
1.3.2 The Choice of Bandwidths h
19
1.3.3 The Choice of Bandwidths k
19
1.3.4 The Choice of Bootstrap Residuals
20
1.3.5 An Alternative: Subsampling
21
1.3.6 The Choice of Bootstrap Bandwidth/i;,
23
1.4 Simulation Results
23
1.5 Conclusions
27
Références
29
2 Estimating and Testing An Additive Partially Linear Model in a
System of Engel Curves
37
2.1 Introduction
37
2.2 Additive Partially Linear Model and Testing Hypothesis
40
2.3 The Shape of Engel Curves and Spécification Testing
45
2.3.1 Data Used in this Application
48
2.3.2 Some Pictures of the Expenditure expenditure-Log Total Expenditure Relationship
49
2.3.3 Spécification Testing
56
2.4 Conclusions and Future Research
59
Références
61
3
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
3 Locally Modelled Régression and Functional Data
3.1 Introduction
3.2 Position of the Problem
3.3 Functional locally modeled régression
3.3.1 The p-dimensional case
3.3.2 The infinite-dimensional case: the functional setting
3.4 FFLM kernel-type estimator: asymptotic behavior
3.5 FFLM régression in action
3.6 Conclusions
Références
4
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
64
64
67
68
68
69
73
77
80
88
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Introduction and Summary
This thesis is composed of three chapters, in which we focus on three related, but
différent, issues regarding testing, estimation and theoretical developments1. More
precisely, in Chapter 1, "The Size Problem of Kernel Based Bootstrap Tests when
the Nuil is Nonparametric", we are interested in clioosing an appropriate smoothing parameter, a problem that is fundamental for the reasonable use of non- and
semiparametric methods. In particular for testing, we make note the this problem
is not équivalent to the one in régression. At least from a theoretical point of view,
the optimal smoothing parameter for testing has différent rates from those which
are optimal for estimation.
While there exists an increasing literature on how to find a proper smoothing
parameter for the nonparametric alternative, almost nothing is known on how to
choose a smoothing parameter in practice for the nuil hypothesis if it is also semi- or
nonparametric. We do know that at least asymptotically oversmoothing is necessary
in the pre-estimation of the nuil model for generating the bootstrap samples, see
Hardie and Marron (1990,1991). However, in practice this knowledge is of little
help. The same can be said about various parameters and procédures to be chosen
in practice when performing such tests. In this Chapter, we discuss ail thèse choice
questions. In particular we study the problem of bandwidth choice for the preestimation to genérate bootstrap samples. As an alternative, we also discuss briefly
the possibility of subsampling. 2 .
In Chapter 2, "Estimating and Testing An Additive Partially Linear Model in
a System of Engel Curves ", we focuses on an application of additive partial linear
model and some ideas extracted from applications on Chapter 1. Our main goal is
to make an application to consumer theory. More exactly, to Engel curves Systems.
The form of the Engel curve has long been a subject of discussion in applied econoChapter 1 is a joint work with Stefan Sperlich and Chapter 3 is a joint work with Frédéric
Ferraty and Philippe Vieu.
2
The authors gratefully acknowledge financial support from the Spanish DG de Investigación
del Ministerio de Ciencia y Tecnología. SEJ2004-04583/ECON.
5
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
metrics and until now there has no been definitive conclusion about its form. In this
Chapter an additive partially linear model is used to estimate semiparametrically
the effect of total expenditure in this context. Additionally, we consider the nonparametric inclusion of some regressors which traditionally have a non linear effect
such as age and schooling. To that end we compare an additive partially linear
model with the fully nonparametric one using recent popular test statistics. Because of inference in nonparametric regression can take place in a number of ways,
the most natural is to use nonparametric regression as an alternative against a fully
parametric or semiparametric null hypothesis. Then, for investigating purpose we
check whether an additive PLM provides a reasonable adjustment to our data using
different resampling schemes to obtain critical (p-values) computed by bootstrap
and subsampling schemes for the proposed test statistics.
Additionally, in this Chapter, we dealing with a well-known problem very common in the context of Engel curves, it is that total expenditure may well be jointly determined with expenditure on different goods. Therefore, endogeneity problem may
arise. In order to solve this problem we are interested in applying nonparammetric
constructed regressors as instrumental variables. In particular, we use the nonparametric two step with generated regressors and constructed variables (NP2SCV) due
to Sperlich (2005). Our feeling is that a generated variables approach in combination
with additive PLM can help us to overcome to some extent any possible endogeneity
problem and that is exactly the procedure implemented in this Chapter.
In Chapter 3, "Locally Modelled Regression and Functional Data"3, we are interested in extend nonparametric methods when the regressors are functions (i.e.
one observation could be curve, surface or any other object lying into an infinite
dimensional space). From a statistical pint of view, this corresponds to a functional
regression setting because on wishes to predict a response Y from an explanatory
functional variable X. In addition, only regularity conditions on regression operator
J
A c k n o w l e d g e m e n t . The authors thank gratefully the members of the working group STAPH
(http : //www.lsp.ups — tlse.fr/staph)
for their helpful comments and discussions. In addition, the
first author acknowledges financial support from the Spanish Ministry of Education and Science,
under project BEC2001-0535
6
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
are assumed. Then, this leads us to the nonparametric context. So, the problematic
of this work deal with the nonparametric functional regression. Recently, there are
several works dealing with the nonparametric functional regression (see for instance
Ferraty and Vieu (2002, 2005)). This nonparametric functional regression method is
essentially based on an extension of the well-known Nadaraya(1964)-Watson(1964)
kernel regression estimator of the regression, to the case of functional explanatory
variable. On the other hand, local linear ideas have been developed in the regression
context for univariate and multivariate explanatory variable, see Wand and Jones
(1995) for an overview of this topic. Therefore, our work can be considered as an
extension, which is a combination, of the nonparametric local constant method with
the ideas of functional variable. So, the aim of this setting does not make easy both
the asymptotic study and the implementation of a natural generalization of the multivariate local linear method. Therefore, one focuses on a simpler and faster local
approach. Asymptotic properties are stated, and a functional dataset illustrates the
good behavior of this fast functional local modelled regression method.
7
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Introducción y Resumen en Español
Esta tesis esta compuesta por tres capítulos, los cuales se centran en tres diferentes
problemas, aunque relacionados, estos van desde estimación y contrastes de hipótesis
hasta desarrollos teóricos. Más exactamente, en el Capítulo 1, " The Size Problem of
Kernel Based Bootstrap Tests when the Null is Nonparametric", nosotros estamos
interesados en la selección apropiada de un parámetro de suavización, un problema
que es fundamental para un razonable use de los métodos semi y noparamétricos.
En particular, para contrastes de hipótesis, nosotros notamos que este problema no
es equivalente aquel que se presenta en análisis de regresión, esto es en la simple
estimación. Al menos desde un punto de vista teórico, la selección del parámetro
para contrastes de hipótesis tiene tasas (de convergencia) diferents a las que se
supone debe tener los parámetros que son óptimos para la estimación.
Mientras que existe una creciente literatura sobre el modo de hallar un parámetro
apropiado para la hipótesis alternativa, casi nada es sabido sobre como elegir un
parámetro de suavización en la práctica para la hipótesis nula, si esta es también
semiparamétrica o incluso noparamétrica. Solo sabemos que asintóticamente una
parámetro sobresuavizado es necesario en la preestimación del modelo bajo la nula
para generar las muestras bootstrap, ver al respecto Hárdle and Marrón (1990,1991).
Sin embargo, en la práctica este conocimiento es de poca ayuda. Lo mismo puede
decirse acerca de varios parámetros y procedimientos a ser elegidos en la práctica
cuando hacemos un uso de un procedimiento de contraste. En este Capítulo entonces, nosotros discutimos estas cuestiones acerca de la selección. En particular,
nosotros estudiamos el problema de la selección del parámetro de suavizado en la
pre-estimación para generar las muestras bootstrap. Como alternativa, también
discutimos brevemente la posibilidad de submuestras.
En el Capítulo 2, "Estimation and Testing An Additive Partially Linear Model
in a System of Engel Curves", nosotros nos centramos en la aplicción de modelos
aditivos parcialemente lineales basados en algunas ideas del Capítulo 1. Nuestra
meta es hacer una aplicación en teoría del consumidor. Específicamente a sistemas
8
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
de curvas de Engel. La forma de la curva de Engel ha sido por mucho tiempo objeto
de investigación en econometría aplicada y hasta el momento no hay conclusiones
definitivas sobre su forma. En este capítulo un modelo parcialmente aditivo es usado para estimar semiparametricamente el efecto del gasto total. Adicionalmente,
consideramos la inclusión noparamétrica de algunos regresores que tradicionalmente
tienen un efecto no-lineal como la edad y la escolaridad. Para llevar a cabo este
trabajo, comparamos un modelo aditivo parcialmente lineal con un modelo plenamente noparamétrico usando algunos estadísticos de contraste recientemente desarrollados. Puesto que infererencia en regresión noparamétrica puede ser hecha de
varias maneras, lo más natural es usar la regresión noparamétrica como hipótesis
alternativa contra una hipótesis nula semiparametrica. Entonces, para propósito
de investigación nosotros chequeamos si un modelo PLM proporciona un razonable
ajuste a los datos usando diferentes métodos de reemuestreo para obtener valores
críticos calculados con bootstrap y submuestras de los mencionados estadísticos de
contraste.
En este capítulo, nosotros también tratamos un problema común el contexto
de las curvas de Engel, y es que el gasto total esta conjuntamente determinado
con el gasto en los diferentes bienes. Por ello existe una endogenidad potencial.
Para resolver este problema usamos regresores construidos como variables instrumentales, en adición a variables en otras bases de datos. En particular, nosotros
usamos el método desarrollado por Sperlich (2005) llamado regresores noparametricamente generados o construidos en dos pasos (NP2SCV). Nuestra sensación es que
ciertamente (NP2SCV) en combinación con modelos aditivos parcialmente lineales
ayudan a eliminar la endogeneidad en la estimación de las curvas de Engel.
En el Capítulo 3, "Locally Modelled Regression and Functional Data", nosotros
estamos interesados en extender los métodos noparametricos cuando los regresores
son funciones (i.e. una observación podría ser una curva, una superficie o cualquier
otro objeto perteneciente a un espacio de dimensión infinita).
Desde un punto
de vista estadístico, esto corresponde a una regresión funcional, porque deseamos
predecir un^ variable respuesta Y de una variable explicativa funcional X. Además,
9
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
solo condiciones de regularidad con impuestas al operador de regresión son asumidas.
Esto conduce entonces a un contexto noparamétrico. Así que la problemática de
este trabajo trata de la regresión funcional noparamétríca. Recientemente, varios
artículos tratan con la regresión funcional noparamétríca (ver por ejemplo Ferraty
and Vieu (2002, 2005)). Estos consiste esencialmente en la extensión de estimador
kernel Nadaraya(l964)-Watson(1964) a el caso de variable explicativa funcional. De
otro lado, ideas de regresión local han sido desarrolladas en el contexto de regresión
univariante y multivariante, ver Wand and Jones (1995). Por tanto, nuestro método
es una extensión, que es una combinación de los métodos de regresión locales con
las ideas actuales de variables funcionales. Así pues, la meta nos es fácil en cuanto
al estudio asintótico y la implementación de una más que natural generalización
del método lineal local multivariante. Por tanto, nos centramos en una más simple
y rápida aproximación local. Las propiedades asintóticas son establecidas y datos
funcionales ilustran el buen comportamiento de este método rápido de regresión
local.
10
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Ir al índice/Tornar a l'índex
REFERENCES
Ferraty, F and P. Vieu (2004). Nonparametric Models For Functional Data, with
Applications in Regression, Time Series Prediction and Curve Discrimination. NonParametric Statistics, 16, 1-2, 111-125.
Ferraty, F and P. Vieu (2006). Nonparametric Modelling for Functional Data Analysis. Theory and Practice. Springer, New York (In print).
Hardle, W and J.S Marrón (1990) Semiparametric Comparison of Regression Curves.
Annals of Statistics, 18, 63-89.
Hardle, W and J.S Marrón (1991) Bootstrap Simultaneous Bars For Nonparametric
Regression. Annals of Statistics, 19, 778-796.
Sperlich, S. (2005). A Note on Nonparametric Estimation with Constructed Variables and Generated Regressors. Working Paper. Universidad Carlos III.
Wand, M. P and M. C. Jones (1995) Kernel Smoothing. Monographs on Statistics
and Applied Probability, 60. Chapman & Hall.
Watson, G. S (1964) Smooth Regression Analysis. Sankhya Ser. A 26.
11
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based
Bootstrap Tests when t h e Null is
Nonpar ametric
1.1
Introduction
IN BOTH A P P L I E D AND MATHEMATICAL STATISTICS, N O N - AND S E M I P A R A M E T R I C
SPECIFICATION TESTING is still quite a popular research field. Any internet search
engine can find several hundred papers dealing with this topic even when looking
at the last five years only. Therefore, it is surprising that so few of them study
the problem of choosing an appropriate smoothing parameter, a problem that is
fundamental for the reasonable use of these methods. Unfortunately, for testing this
problem is not equivalent to the one in regression. It is well known that, at least
from a theoretical point of view, the optimal smoothing parameter for testing has
different rates from those which are optimal for estimation.
In the last couple of years there has been a growing amount of literature on
adaptive testing. In most cases, the adaptiveness refers to the smoothness of the
alternative and deals with the choice of smoothness parameter for the alternative, or
the test statistic, see e.g. Ledwina (1994), Spokoiny (1996,1998), Kallenberg & Ledwina (1995), Hardle et al (2001), Horowitz & Spokoiny (2001), Guerre & Lavergne
(2005). Even though these methods have so far had little direct impact in the sense
that we could not find published papers using these methods (in practice or in the12
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
ory), they have been useful in determining a better understanding of the problem.
However, to our knowledge, all these papers concentrate on testing problems where
the null hypothesis is fully parametric. It is not clear to what extend these methods help if the null hypothesis is semi- or nonparametric. This is not such a rare
situation, since additivity tests already belong to this family. When bootstrap is
used to determine the critical value, these tests entail at least one more parameter
choice problem: pre-estimating the model under the null hypothesis to later generate the bootstrap samples. This is necessary as in most cases the bandwidths
for the estimation and the bootstrap should have different rates, see e.g. Hardle &
Marrón (1990,1991). Although these authors have already mentioned the problem
of choosing an appropriate bandwidth, in practical applications this problem has
hardly been addressed. As a consequence, in most published procedures for testing or constructing confidence bands with a semi- or nonparametric null hypothesis,
there is no guarantee that the test holds the level, or the bands the nominal coverage
probability. This has recently been confirmed in the work of Dette et al. (2005)
and Rodríguez-Póo et al (2004). However, in the former it is not referred to as a
bandwidth problem but rather as a problem of correlated designs and dimensionality because the size distortion is much smaller for uncorrelated design. In the
latter paper the problem is avoided by using subsampling instead of bootstraps. It
should also be mentioned that in that simulation study, the authors face basically a
parametric bootstrap drawing the bootstrap errors from a distribution known up to
a certain parameter. Although that unknown parameter depends on nonparametric
nuisance parameters, knowledge of distribution greatly mitigates the impact of the
bandwidth on the critical value.
To study the problem outlined in more detail we concentrate on the problem
of testing additivity. We limit ourselves to test statistics proposed in Dette et al.
(2005) and Rodriguez-Poo et al. (2004) but we try different modifications, methods
of bandwidth choice, and subsampling. The aim is not to find the most efficient
additivity test or to propose new ones. Our focus is only directed at finding a
method that guarantees that the level will be held by non trivial power when the
13
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
null hypothesis and the resampling method are non- or semiparametric. So, after
a review of the additivity tests considered here, we study different procedures for
bandwidth choice. Unfortunately, we have not found a generally valid method. Our
conclusion is basically that further research is necessary.
The rest of the paper is organized as follows. In the next section we review the
estimation and testing procedures considered in this work. In Section 1.3 we discuss
the different, scenarios from which the practitioner has to make his choice, including
modifications of test statistics, and resampling methods. Section 1.4 summarizes
the main findings from our simulation results, and Section 1.5 concludes.
1.2
1.2.1
Statistical Methods: Estimators and Test Statistics
Estimators
We consider the following model:
Yi = m(Xi)+ui
'(1.1)
e Md xK i.i.d., m : Ud -» K the unknown function of interest,
with {{XtYl)}"=1
m[x) = E(Y\X
i = l,2:....n,
= x), and IÍ¿ i.i.d. random errors with E[u{] — 0 and finite variance.
The internalized Nadaraya-Watson estimator is defined as
n
-i
mk(x) = ] T vk{x, Xi)Yi, with vk{x, Xz) = (/ f c (X ; ))
where fk{Xj)
= ^ J^"=l Kk(Xj
Nadaraya-Watson, here Í fk{Xt)
Kk(x - X,)
(1.2)
— Xr) is a kernel density estimator (unlike standard
j
appears internally to the summation, see Jones
et.al (1994)), and Kfc(u) = \\da=l Kk (u) a product kernel with Kk{u) =
k^Kiuk"1).
Commonly, the kernel is assumed to be Lipschitz continuous with compact support
and / \K(x)\dx
< oo, / K{x)dx = 1. Furthermore, k is the bandwidth, assumed to
go to zero for sample size n going to infinity, but nk^ going to infinity. Let Vk be
the n xn matrix whose (j,i) element is vk{Xj,Xi)1
then rhk{x) = VkY, where Y
and mi; (•) are n x 1 vectors with rhk(Xj) and Yj is its jth entry respectively.
14
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
.Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
We are interested in the additive model, which we write in terms of
d
= x{; + J2m«
E (Y\X = x) = ms(x)
( x «) ,
(1-3)
a=l
where we set Exa {irLa(Xa)}
= J ma(x)fa(x)dx
= 0 Va for identification. Here,
ma, a = 1,. . ., d are the marginal impact functions for each regressor. Therefore,
^ is a constant equal to the unconditional expectation of Y.
ma(Xa)+m_a(X_a)
Xa, i.e. X^a
Writing m{X)
=
where X_a is the vector X of all explanatory variables without
= (Xii,...
,X¿(Q._i), Xi(a+i),...
,Xid) we can use the identification
condition directly to estimate ma. The so called marginal integration idea is based
on the fact that for xa fix we have
Ex-a [m {xa,X-a)\
= I m (xa, x_Q) /_„ (x_ Q ) dx^a = i< + ma (xa) .
Substituting for m(-) a nonparametric pre-estimator such as the one given in (1.2), a
sample average for the expectation, and for ip simply ip = - Y17-1 V* &ves (neglecting
the constant for a moment for the sake of simplicity):
n
fTla\%a)
/
t
l^ah
y^at
-A-ia) *i
j
¿=1
where
wh (xa, Xai) = Kh (xa - Xia) j - ^
Finally, we set rhs(Xj) = ip+^2a=1 rha(Xja)
l
sz£iL .
(1.4)
for each j = 1, 2,..., n. Note that defin-
ing Wh •= J2a=i Wah (xa) with Wah (xa) being the nxn
matrices with wah (Xj, X{)
as elements, one has rhs (x) — ip + Wh (x) Y. For more details see Dette el al (2005).
Some of the test statistics we will consider here are also introduced and discussed
there.
1.2.2
Test Statistics
As mentioned above, we do not introduce new testing procedures but rather study
two statistics which have already been studied in Dette et al (2005) together, with
15
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
other additivity tests, and which have turned out to perform best. We add a new
test statistic motivated by one that was introduced recently by Rodríguez-Poó et al
(2005), and which performed excellently in the study by Roca-Pardiñas & Sperlich
(2006). For more details on the test statistics readers are referred to these papers.
The null hypothesis of interest is Ho : rrc(-) = ms(-) versus Hi : m(-) ^
ms{-).
We consider the following two test statistics from Dette et al (2005) :
n
=
T2
=
(J m ( * i ) - m s ( * i ) ) M * í ) ,
nÍ /¿—
¿=i
1 "
nn
where é¿ = Y¿ — rhs(Xi),
Yi — m(Xi),
-S^ei{rh{Xi)-rns(Xl))w{Xl),
¿—•*
¿=1
i.e. the residuals under the null hypothesis, and ñ¿ =
the residuals without restrictions. Obviously, T\ calculates directly the
integrated squared difference between the null and alternative models. Alternatively,
T2 seeks to mitigate the bias problem inherited from the estimate m, which suffers
from the curse of dimensionality. In Dette et al (2005) it is proved that for all r¿,
the nkz (jj — /¿-) converge under the null to a normal variable with mean zero
and variances v\ for j — 1,2 with
= —^ / a2(x)w(x)dx
/ K2(x)dx
¡ix =
EH0{TI}
¡i2 =
EHo {r 2 } =-r-j a
<r2{x)w{x)dx K(0) + o
nk J
+
d
nk
and
v\
=
VarHo { n } = 2 I a4(x)w2(x)dx
/ (K * K ) 2 (x)dx ,
u|
-
Vor//,, {r 2 } = 2 / o-4(x)u;2(x)dx / K 2 (x)dx .
The statistic of Rodríguez-Poó et al (2005) is defined to mitigate the above
mentioned bias problem still further:
T3
2
K
lY, nk¿£ *(*-*;)to-^ra)
d
n
16
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
w(Xi) ,
(1.5)
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based .Bootstrap Tests when the Null is Nonparametric
where for ease of presentation and implementation K (•) is the same kernel function
as in the last subsection, and k again its bandwidth. It is straightforward to derive
from the above mentioned paper that nkz (T 3 — fi3) converges under the null to a
normal variable with mean zero and variance v\ for
¡d3 = EH0{TÍ}—
I (K * K) (x) dx I
a2(x)f2(x)w(x)dx
All tests have been proven to be consistent in the sense that under the alternative
they converge with n to infinity.
Finally let us mention that we have also studied other test statistics, e.g. those
given in Dette et al (2005) but not presented here. These, however, showed even
less satisfactory performance, so we have skipped them in our presentation.
1.3
Resampling and Choice of Parameters
As is well known, asymptotic expressions are of little help in practice, for calculating
the
exact critical value, for several reasons: bias and variance contain unknown expressions which have to be estimated nonparametrically, and the convergence rate
is quite slow for large d. For this reason it is common to use resampling methods
to approximate the critical value for the particular sample statistic. These can be
bootstrap methods or subsampling procedures. Unfortunately, unlike subsampling,
for the bootstrap it is not known how to choose the smoothing parameter in practice
for the pre-estimation of the model that is used to generate the bootstrap samples.
From theory it is known that one should somewhat oversmooth (see for instance
Hardle and Marrón (1991) and discussion below). For the choice of k (when estimating the alternative), some procedures are provided in the literature (see our brief
discussion of adaptive tests in the introduction). We will come back to this point
later in this section.
17
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
1.3.1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Bootstrap
Tests
We give the general procedure first and then discuss some details:
1. With bandwidth ft-, calculate the estimate rhs under the null hypothesis of
additivity and its resulting residuals é¿, i = 1,.. ., n.
2. With bandwidth k, calculate the estimator m for the conditional expectation
without the additivity restriction, and the corresponding residuals ü¿, i =
1,.. ., n.
3. With the results from step 1 and 2 we can calculate our test statistics TI, T2,
and T3.
4. Repeat step 1 but now with a bandwidth hb which depends on h from step 1. We
call the outcome rhbs, respectively e¿ = Yi—rhbs(Xi), i = 1,. . . , n. Draw random
variables e* with E[(e*Y] = u\ (respectively e\ or e¿, see discussion below) for
j = 1,2,3 (respectively j = 1,2, see below again). Set Y* = rhbs(Xi) + e*,
i = 1 , . . . , n. Repeat this B times. This defines B different bootstrap samples
{{Xi,Y;fi)}Z=1,b=l,...,B.
5. For each bootstrap sample from step 4 calculate the test statistics r*' , j — 1, 2, 3,
b = 1,... ,B.
Then, for each test statistic r¿, j — 1,2,3, the critical value
is approximated by the corresponding quantiles of the distribution of the B
bootstrap analogues: F*(ü) = j¡ Ylb=i ^iT*j'
— ^ } - R-ecaH that they are
generated under the null hypothesis.
This procedure is well known, has proved to be consistent for many test statistics and has therefore been applied, certainly with slight modifications, to many
non- or semiparametric testing problems. However, several questions of practical
importance remain open: bandwidth choice h in step 1., bandwidth choice k in step
2., how to generate the bootstrap residuals e* in step 4. (see above), and how to
choose hb. Finally, how many bootstrap samples are necessary to get a reasonable
18
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
approximation of the distribution in step 5. In this paper we will discuss all these
questions except the last one.
1.3.2
The Choice of Bandwidths h
The problem of finding an optimal h is somewhat different from that of finding the
optimal smoothing parameter k which is directly linked to the optimal rate of the
test statistic. In that case it is clear that a theoretical optimal choice depends on the
optimal rate at which the test can detect a deviation from the null hypothesis. For
further details see the next subsection. In most cases, the estimator of the null model
can have faster convergence rates than that of the alternative, so the asymptotics
of the test statistics provide no theoretical guideline for an optimal choice of h. In
other words, we have to rely on practical issues.
As there are exist data adaptive methods for finding the optimal bandwidth k for
the alternative (compare next subsection) one could argue that h should be chosen
according to k. This way one could guarantee that the same smoothness is imposed
on the regression function regardless of whether it is estimated under the null hypothesis or not. However, it is not clear whether this is always wanted. Moreover,
we will see later that on the one hand the adaptive choice of k is computationally
intensive, and on the other hand /i¡, depends on h. For k one needs a grid search
which then has to be extended to the choice of h (as it then depends on k) and thus
to the choice of h¡,. Altogether we would get a procedure that is computationally
quite unattractive.
Intuitively, it seems to be desirable to look for a reasonable estimation of the null
model. This is only guaranteed with a reasonable bandwidth choice of h beforehand.
We therefore recommend cross validation or plug-in methods.
1.3.3
The Choice of Bandwidths k
It is known that a bandwidth k which is optimal for estimation is usually suboptimal
for testing. More specifically, for testing the optimal smoothing parameter has faster
convergence rates, i.e. we should undersmooth. As for regression, cross validation
19
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
bandwidths have a tendency to undersmooth in practice, and they are also quite
popular for nonparametric testing.
As an alternative, let us consider the adaptive testing approach introduced e.g.
in Spokoiny (1996,1998). It has been extended by Rodríguez-Poó et al (2004) to
nonparametric testing problems such as those we consider here. The method is the
same for each of our three test statistics, so we can skip the index j of Tj, j — 1,2, 3
in this subsection. Adapted to our problem it works as follows:
We consider simultaneously a family of tests {rfc, k 6 &}, where 8. — {fcj, ¿2,...., kp)
is a finite set of reasonable bandwidths. The theoretical maximal number P depends
on n but is of no practical relevance, for details see Horowitz & Spokoiny (2001).
Define
rk - Eo[rk}
Tmax
=
t
m a x
>
.
w h e r e
keK Varl/2[Tk]
EQ[] indicates the expectation under HQ. This studentizing under the null is only
to correct for the deviations in distribution caused by the different bandwidths k.
Therefore, instead of Varl^2[rk] we could take something proportional to it without
loosing consistency, as long as it corrects for the standard deviation caused by the
different k — k\,...,
kp.
A particularity of the bootstrap analogues of rmax is that one first needs to calculate the bootstrap statistics (rfc)*'6 for all k E 8. to afterwards get ( T max )*. 6 . Note
that for each k, the empirical moment of the bootstrap statistics (jk)*'b (average,
respectively standard deviation) can be used as a substitute for EQ [rh\, respectively
Var1//2[Tk], in practice. This is what we do in our simulation study.
1.3.4
T h e C h o i c e of B o o t s t r a p R e s i d u a l s
From a theoretical point of view, wild bootstrap errors should be drawn from the
residuals of the alternative model, i.e. t¿¿ should be used in Subsection 1.3.4 instead
of e¿ or é¿. It is clear that this should maximize the power as the variance of e¿ (and
é¿) can increase greatly with increasing distance between HQ and the true model.
Arguments in favor of using e¿ exist only under practical aspects: often the size
20
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
distortion in bootstrap tests is worse when using ui or é¿; when using adaptive
procedures as described in Subsection 1.3.3, then it is not that clear which of the
Ui to use or whether the t¿¿ should even be estimated independently of the fc-choice
for the test; at least in the study of Dette et al (2005) to which our study comes
closest, the power loss is negligible so the size argument is decisive.
We conclude so far that if no adaptive choice of k is made, it would be desirable
to use m as long as one can control for the size distortion.
The second question is what kind of distribution for generating the random errors
should be used. In step 4 of the bootstrap procedure described in Subsection 1.3.4 a
distribution is often taken that gives e* with E[(e*y] = ej for j = 1 up to 3 (or even
more). The so called golden-cut wild bootstrap is also quite popular, see e.g. Hardle
& Mammen (1993). More recently, in the context of size distortion of bootstrap tests,
Davidson & Flachaire (2001) argue that for problems with moderate sample size
the disadvantages of the higher-order-moment adapting bootstraps outweigh their
(asymptotic) advantages. We therefore compare different methods in our simulations
(see Section 1.4).
1.3.5
An Alternative: Subsampling
A more and more popular alternative to bootstrapping is the subsampling procedure, see Politis et al (1999). To date, as subsampling is commonly believed to
converge slower in practice than bootstrapping, it has been used almost exclusively
when the bootstrap fails, i.e. has been proven not to converge. See Neumeyer &
Sperlich (2006) as an example in a purely nonparametric testing context. However,
Rodríguez-Poó et al (2004) introduce subsampling in the context that we discuss
here, although the bootstrap is consistent, because of the size distortion their bootstrap test suffered from (until the sample size was huge). In both papers subsampling
works well. The former also studies the automatic choice of subsample size m (with
m < n) which turns out to work in their simulations. As this method might be
remodeled to serve as a procedure for finding hb, we briefly introduce subsampling
and the automatic choice of the subsample size m:
21
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kerne] Based Bootstrap Tests when the Null is Nonparametric
Let y = {{X.¿, Y¡) \i — 1,..,, n) be the original sample, and denoted by r (y) the
original statistic calculated from this sample, leaving aside index j = 1,2,3 for a
moment. To determine the critical values we need to approximate
Q{z)
= P (nVi?T Q>) < z\ .
(1.6)
Recall that under HQ this distribution converges to an iV(/¿,t>2), for ¡i and v see
Subsection 1.2.2. For finite sample size n, drawing B subsamples y¡, - each of size
m - we can approximate Q under HQ by
¿W
:==
B
1
QT,I{myñ^Tkm^m)<z)
•
(1.7)
¡>=1
Note that the awkward notation comes from the fact that we have to adjust all
bandwidths for the new sample size m. For example, imagine k = ko • n's for fco
being constant. Then, Tfcm is calculated like T but with bandwidth km — konsm'6.
Certainly, under the alternative Hi, not only nVk^T (y) but also m^/k^jkm
(ym)
converges to infinity. When demanding m/n —> 0 guarantees that ny/k^r (y) converges (much) faster to infinity than the subsample analogues. Then, Q underestimates the quantiles of Q which yields the rejection of HQThe problem here is to find a proper subsample size m. Actually, the optimal
m is a function of the level a. Again we apply resampling methods: Draw some
pseudo sequences y*>1, i = 1 , . . . , L of y of size n with the same distribution as JA
For the desired level a, test HQ : m(x) ~ ms{x) = rh{x) — rhs{x) the same way as
you want to test HQ : m(x) = m-s(x), i.e. applying your particular test statistic to
HQ and using subsampling. From the L repetitions you can determine the empirical
rejection level (estimated size) for your given a.
Now find an m such that this
empirical rejection level is ^ a. In practice, you choose from a grid of possible m
the one whose estimated rejection level for HQ is closest to a from below. Note that
HQ is always true up to an estimation error that should be almost the same as in your
original test. The only drawback of this procedure is the enormous computational
effort. For further details and examples see Politís et al (1999), Delgado et al (2001),
and Neumeyer & Sperlich (20P6).
22
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
1.3.6
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
T h e C h o i c e of B o o t s t r a p B a n d w i d t h hi
In general, for many test statistics one could repeat the arguments outlined in Hardle
& Marrón (1990,1991): For the mean of fhh(x) — m(x) under the conditional distribution of Yi,..., Yn\Xi, ...,Xn, respectively of rh*h(x) — rhhb(x) under the conditional
distribution of Yf, ...,Y*\X\, ...,Xn, we know from Rosenblatt (1969) that
EY\x{mh(x)-m{x))
Er(m-h{x)-mg(x))
where fj,(K) = J u2K(u)du.
«
h2^-m"{x)
«
h^^-m^x)
,
(1.8)
,
(1.9)
Obviously, we need that vnl'h (x) — m"(x) •—> 0. The
optimal bandwidth /i6 for estimating the second derivative is known to be much
larger (in rates) than the optimal h for estimating the function itself. We can even
give the optimal rate. For example, the optimal rate to estimate ras" is of the order
n - 1 ' 9 (instead of n~ 1//5 ), an observation we make use of in our simulation studies in
Section 1.4.
As will be seen once more in Section 1.4, the typical comment that /ib has to be
oversmoothing, is unhelpful in practice. We therefore try the following automatic
bandwidth choice: apply the same procedure used for the automatic choice of a
proper subsample size m (last subsection) to find an adequate hi, for a given level a.
This is what we explain in more detail and afterwards try in our simulation study.
1.4
Simulation Results
To study all the points listed in the last section, we perform a huge number of
simulations. We give here only a summary of them; for example, limiting the presentation to Tj, j = 1, 2, 3, one particular model, one specific (random) design, and
sample size n = 100.
The model considered is as follows: We consider the same data generating process
23
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
as Dette et al (2005). That is, we draw i.i.d. three dimensional explanatory variables
Xi ~ N(0, Ex)
/ 1 0.2 0.4 \
with Ex = 0 . 2 1 0.6
\ 0.4 0.6 1 /
and i.i.d. error terms e¿ ~ iV(0, al) to generate
Yi = Xhi + Xli + 2 sin(7rX3ii) + aX2,iX3s + eu
i=
l,...,n
with a = 0 to generate an additive separable model, or a = 2 for the alternative.
Recall that the target is a test for additivity. Unless otherwise indicated, we set
ae — 1. Dette et al (2005) show that for the rather unrealistic situation that if
Ex is the identity matrix (i.e. with an uncorrelated design), then the problem is
greatly simplified, whereas a (much) stronger correlated design than ours leads to
identification problems for moderate sample sizes.
All results in the tables are calculated from 250 replications using 200 bootstrap
samples (or subsamples respectively). For real data applications 200- bootstrap samples are certainly very few; but in our simulations the results differed little when we
increased the number to 500. We used the (multiplicative) quartic kernel throughout.
In all three test statistics we use the weighting function w(-) for different trimming: we cut the outer 10%, 5% or 0% of the sample, where "outer" refers to the
tails of the explanatory variables. This is done to get rid of the boundary effects in
the statistics. The tables only give results for 5% and 0% as the boundary effects
turned out not to be a major problem.
To further speed up our simulation studies, we first looked for an average cross
validation bandwidth k, which turned out to be kopt = 0.78. Then we did all our
simulations for the non-adaptive tests (compare Subsection 1.3.3) with kopt. This
was done not only for computational reasons but also because otherwise the size of
the tests would also depend on the randomness induced by the estimation of k. For
the adaptive test procedure, k ran over a grid of 10 bandwidths placed around kopt.
We verified that in most cases Tmax did not refer to the boundary, i.e. to kmin or
h
^rnax •
24
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Nul) is Nonparametric
As discussed above, the bandwidth choice problem is different for h. Here, the
parameter responsible for the size, /ifr, depends on both a (the level) and h. Altogether, it is no problem here that h is chosen by cross validation in each simulation
run as recommended in Subsection 1.3.2. For the internalized marginal integration
estimator, cross validation bandwidths were introduced by Kim et ai (1999). For
the nuisance directions X-a (see equation (1.4) in Section 1.2.1) we used h_ a = 6 • h
as recommended in Dette et al (2005) and Hengartner & Sperlich (2005).
We tried different bootstrap residuals (compare Subsection 1.3.4). Our simulations mainly seem to confirm the findings discussed above. Therefore, below we
report only results referring to e* = £¿e¿, where the e¿ are i.i.d., drawn either from
the golden-cut distribution
€i
f - ( \ / 5 + l)/2
~ \ (\/5 + l ) / 2
with probability
with probability
p = (>/5 + l ) / ( 2 v 5 )
1- p
or from the Gaussian normal N(0,1). However, we admit that it may be interesting
to try more, different automatic choice procedures for h¡,, in order to study again
what effect the choice of residuals taken has (ult ¿, or ¿ ; ).
Probably the most interesting and challenging point is the choice of h¡,. We first
give the results for several choices of /i¡, with different bootstrap generating methods,
/c-adaptive and non adaptive procedures. To have h¡, as a function of h, to take also
into account h/hb —* 0, and perhaps validate the rate n~ l//9 (motivated in Subsection
1.3.6) we set hb = /in 1 / 5 - 1 /" and try different tc < 9.
Table 1.1 shows the results for the non-adaptive golden-cut bootstrap test. These
results basically i) confirm the statements of Dette et al (2005) for our context;
and ii) show that the problem is not solved simply by different smoothing in the
pre-estimation. Undersmoothing, as generally stated from a theoretical point of
view, seems to go in the wrong direction. In particular, the hope that the results
of Rosenblatt (1969) (see equations (1.8) and (1.9)) might give us a hint or even
provide a rule of thumb for the choice of h^, is not confirmed here. T 3 , introduced
by Rodríguez-Poó et al (2004) clearly outperforms the others in this study (as it
does in the following).
25
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
The results for the fc-adaptive analogues, see Table 1.2, show hardly any improvement. In particular, the problem of choosing h^ or, in other words, the size
problem is only mitigated for r%.
Following to some extent the findings of Davidson & Flachaire (2001) we then
repeated these two studies but with the Gaussian bootstrap, see above. Though
there is some improvement in both, size and power, the results in Table 1.3 and 1.4
give us hope only for test statistic T%. Note that the observation that a slight undersmoothing is produces much better results than oversmoothing has not changed
over the four different trials.
Next, for comparison we also provide a small simulation study where the critical
values are approximated by subsampling, trying several subsample sizes m. The
results are given in Table 1.5 for non-adaptive tests, and in Table 1.6 for fc-adaptive
tests. We tried more sizes m for the non-adaptive test but got reasonable results
only for T 3 . In contrast, looking at the A;-adaptive versions, ryax,
T™ax seem to
work, too - though with a rather weak power. Table 1.6 unfortunately is misleading
concerning r™ax\ one needs a much smaller m to get reasonable results here. A small
simulation study evaluating the automatic choice of m seems to indicate that this
procedure might work and therefore should be tried for what is our main focus: the
automatic choice of hbTherefore, we adjusted the automatic choice of the subsample size to find an
adequate hb (see Subsection 1.3.6). This was done as follows, described here in
detail for r 3 . Let {Y*, £*}™=1 := 3^* be a member of the pseudo sequence introduced
in Subsection 1.3.5. Then, for testing HQ : m{x) — ms(x) = rh(x) — rhs(x) with
sample 3^*, an analogue to T3 would be
1
^ = IT,
2
-K^X'-XAiYj-msiXj)}
26
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
w{X¡) .
(1.10)
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Other statistics are thinkable certainly, e.g.
2
- ¿ K , ( X t - X*){Y* ~ rhs(X;)} - ¥Lh{Xt - X3){Y0- - ms(X3)} w(X-
n /—1 nkd
but they should all be asymptotically equivalent to (1.10). The procedure was
performed with only L = 100 pseudo samples 3^*- As the results varied widely
we were forced either to enlarge L considerably or to reduce ae considerably. For
computational reasons we decided on the second option and repeated the study with
a e = 0.1.
Some results are summarized in Table 1.7. As can be seen, this time we emphasize
the possibility of undersmoothing much more. You first have to look at T\ to find
the K giving the rejection level closest to a = 5% from below. Here, this is always
K = 3. Note that this might also change depending on the trimming, a, sample size,
etc. It is important to understand that the lines of T^ can always be calculated, i.e.
without knowing the true data generating process. Therefore we call this method
fully automatic. Now look at the lines for T%, the test of interest. Obviously, K — 3
is indeed the best possible choice; it holds the level and has strongest power of
any K respecting the level. This could be taken as indicating that our suggestion
for selecting /ib works. Unfortunately, this method does not work that well for all
possible a; specifically, it becomes quite incorrect for a > 10%. We repeated this
study also for j \ and T-I- The results were always somewhat worse than for T3 so
they do not change our conclusion that this procedure seems to be an interesting
and promising approach but further research is necessary.
1.5
Conclusions
We discuss the choices of all "parameters" a practitioner has to use when facing a
kernel based specification test where the null hypothesis is non- or semiparametric.
We have set parameters in quotation marks because we refer here also to questions
such as how to generate bootstrap errors, etc. However, our main focus is the bootstrap and its size distortion in practice when the sample size is small or moderate.
27
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
These points are illustrated along the popular problem of additivity testing. Naturally, one looks for an optimal trade-off between controlling for size under the null
hypothesis HQ and maximizing power. Even though these problems have already
been discussed and studied in theory, as yet, it is unclear how to set these parameters in practice. We show that theory is not just unhelpful here; at present, a
reasonable application of tests of these kinds is questionable.
We try and compare many modifications that can be found in the literature
without finding any clue to an optimal - or even a reasonable - parameter choice.
While there are different suggestions for singular problems such us which residuals
to take for the bootstrap or an adaptive choice of k, combining them gives puzzling
results. Sometimes, in practice, combining these suggestions, the power goes down
where it should increase or size becomes less precise where it should come closer to
the level.
Altogether, we have recommend certain procedures for particular test statistics.
However, the main open question seems to be how to find an automatic choice of
lib- We suggest a new procedure, taken from subsampling theory, that seems to
be a good way to go. However, further research is necessary to provide reliable
procedures for the nonparametric testing problems considered here.
28
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
REFERENCES
Davidson, R. and Flachaire, E. (2001) The Wild Bootstrap, Tamed at Last, Working
Papers 1000, Queen's University, Department of Economics.
Delgado, M. A., Rodriguez-Poó, J. M. & Wolf, M. (2001). Subsampling Cube Root
Asymptotics with an Application to Manski's MSE. economics letters, 73, 241-250.
Dette, H., von Lieres und Wilkau, C , and Sperlich, S. (2005) A Comparison of Different Nonparametric Method for Inference on Additive Models. J. Nonparametric
Statistics, 17, 57-81.
Guerre, E. and Lavergne, P. (2005). Data-driven rate-optimal specification testing
in regression models. Annals of Statistics, 33(2), 840-870.
Hardle, W and J.S Marrón (1990) Semiparametric Comparison of Regression Curves.
Annals of Statistics, 18, 63-89.
Hardle, W and J.S Marrón (1991) Bootstrap Simultaneous Bars For Nonparametric
Regression. Annals of Statistics, 19, 778-796.
Hardle, W. and E. Mammen (1993) Comparing Nonparametric Versus Parametric
Regression Fits. Annals of Statistics, 21, 1926-1947.
Hardle, W., Sperlich, S., and Spokoiny, V. (2001) Structural tests in additive regression. J. Am. Statist. Assoc, 96, 1333-1347.
Hengartner, N.W. and Sperlich, S. (2005) Rate Optimal Estimation with the Integration Method in the Presence of Many Covariates. Journal of Multivariate Analysis,
95, 246-272.
Horowitz, J.L. and Spokoiny, V. (2001) An Adaptive, Rate-optimal Test of Parametric Mean-Regression Model Against A Nonparametric Alternative.
Econometrica,
69, No. 3, 599-631.
Jones, M., C , Davies, S.,J and B. U. Park. (1994) Versions of Kernel-Type Regression Estimators. Journal of the American Statistical Association, Vol 89, 825-832.
29
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Kallenberg, W.C.M. and Ledwina, T. (1995), Consistency and Monte-Carlo simulatioins of a data driven version of smooth goodness-of-fit tests, Annals of Statistics,
23, 1594-1608.
Kim, W., Linton, O.B., and Hengartner, N. (1999) A computationally efficient oracle
estimator of additive nonparametric regression with bootstrap confidence intervals.
The J. of Computational and Graphical Statistics, 8, 278-297
Ledwina, T. (1994), "Data-driven version of Neyman's smooth test of fit," J. Amer.
Stat. Ass., 89, 1000-1005
Neumeyer, N. and S. Sperlich, S. (2006) Comparison of Separable Components in
Different Samples. Forthcoming in the Scandinavian Journal of Statistics
Politis, D.N., Romano, J.P., and Wolf, M. (1999) Sub sampling. Springer Series in
Statistics. Springer.
Roca-Pardiñas, J. and Sperlich, S. (2006) Testing the link when the index is semiparamtric - A comparison study. Working Paper Universidad de Vigo, Spain.
Rodriguez-Póo, J.M., Sperlich, S., and Vieu, P. (2004) And Adaptive Specification
Test For Semiparametric Models. Working Paper Carlos III de Madrid, Spain.
Rosenblatt, M. (1969) Conditional Probability Density and Regression estimators.
Multivariate Analysis II, 25-31.
Spokoiny, V. (1996) Adaptive hypothesis testing using wavelets. Annals of Statistics,
24, 2477-2498.
Spokoiny, V. (1998) Adaptive and spatially adaptive testing of a nonparametric
hypothesis. Math. Methods of Statist, 7, 245-273
30
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Trimming
a{%)
K
0%
5
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
10
5%
5
10
under HQ a=0.0
T\
T2
r3
.000
.040
.068
.128
.176
.256
.024
.068
.120
.184
.272
.344
.012
.060
.108
.172
.284
.340
.040
.084
.168
.288
.364
.440
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.008
-008
.008
.012
.012
.012
.032
.024
.024
.024
.020
.020
.008
.008
.008
.012
.012
.012
.024
-020
.024
.024
.020
.020
under H}
a=2.0
T\
T2
T3
.000
.004
.012
.016
.028
.028
.004
.008
.020
.032
.036
.056
.016
.020
.028
.040
.064
.080
.024
.036
.044
.064
.076
.116
.032
.012
.012
.012
.012
.024
.060
.028
.020
.024
.028
.032
.052
.032
.028
.028
.032
.032
.112
.076
.052
.048
.052
.056
.248
.184
.184
.196
.228
.252
.448
.312
.292
.300
.304
.340
.248
.192
.184
.184
.228
.244
.448
.308
.284
.292
.308
.332
Table 1.1: Rejection levels of the three original test statistics with and without
trimming. Critical values are determined with golden-cut wild bootstrap, using
hb = hn1^"1^ for the pre-estimation.
31
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kerne] Based Bootstrap Tests when the Null is Nonparametric
Trimming
a(%)
K
0%
5
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
10
5%
5
10
under Ho a—0.0
T\
T2
r3
.004
.004
.000
.000
.000
.000
.016
.012
.008
.000
.000
.000
.008
.000
.000
.000
.000
.000
.020
.008
.004
.000
.000
.000
.004
.004
.000
.000
.000
.000
.012
.008
.004
.000
.000
.000
.004
.000
.000
.000
.000
.000
.012
.004
.000
.000
.000
.000
.028
.020
.012
.000
.000
.000
.076
.072
.056
.028
.008
.008
.016
.016
.008
.004
.004
.004
.080
.064
.040
.024
.008
.008
under Hi
T\
r2
.044
.064
.048
.036
.036
.032
.096
.140
.132
.104
.072
.064
.080
.068
.036
.016
.008
.008
.136
.120
.116
.100
.084
.056
.032
.056
.036
.032
.012
.008
.072
.120
.092
.052
.044
.036
.052
.024
.016
.012
.008
.004
.120
.096
.060
.036
.024
.016
a=2.0
r3
.176
.204
.204
.196
.196
.188
.316
.308
.296
.316
.296
.284
.196
.184
.188
.184
.200
.192
.328
.296
.296
.292
.284
.288
Table 1.2: Rejection levels of the three ¿-adaptive test statistics with and without
trimming. Critical values are determined with golden-cut wild bootstrap, using
h¡, — hn1^5"1^ for the pre-estimation.
32
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Trimming
a(%)
K
0%
5
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
7
8
9
10
5%
5
10
under HQ a=0.0
T\
r2
r3
.004
.036
.080
.132
.188
.260
.020
.072
.116
.196
.276
.352
.012
.052
.116
.176
.268
.352
.028
.088
.164
.252
.380
.436
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.008
.012
.012
.012
.012
.012
.044
.044
.032
.028
.016
.020
.008
.012
.012
.012
.012
.012
.048
.032
.024
.020
.016
.020
under H]
a=2.0
T\
T2
r3
.000
.004
.008
.016
.028
.032
.012
.012
.020
.036
.044
.068
.008
.008
.028
.040
.064
.096
.036
.036
.048
.060
.092
.120
.036
.024
.012
.016
.012
.016
.064
.036
.024
.032
.032
.032
.080
.036
.036
.028
.028
.032
.172
.092
.072
.056
.048
.064
.340
.236
.216
.224
.240
.248
.560
.380
.336
.332
.344
.372
.324
.236
.212
.220
.244
.260
.556
.372
.332
.328
.340
.376
Table 1.3: Rejection levels of the three original test statistics with and without
trimming. Critical values are determined with Gaussian bootstrap, using hi, =
^ n i/5-i/K £ or t k e p r e-estimation.
33
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
Trimming
a (%)
K,
0%
5
4
5
6
7
8
9
4
5
6
7
8
9
4
5
6
10
5%
5
7
10
8
9
4
5
6
7
8
9
under i í 0 c1=0.0
Tl
r3
T2
.000
.000
.000
.000
.000
.000
.028
.020
.004
.004
.004
.000
.004
.000
.000
.000
.000
.000
.012
.004
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.008
.004
.000
.000
.000
.000
.004
.000
.000
.000
.000
.000
.012
.000
.000
.000
.000
.000
.036
.028
.020
.008
.008
.008
.096
.088
.056
.032
.024
.016
.024
.024
.012
.008
.008
.008
.096
.072
.048
.040
.020
.012
under Hi
T\
.048
.048
.052
.032
.024
.016
.124
.184
.172
.116
.092
.076
.064
.048
.032
.016
.016
.016
• 136
.124
.100
.072
.052
.040
T7
a=2.0
T3
.028 .220
.048 .204
.032 .216
.016 .200
.008 .204
.008 .200
.096 .364
.156 .340
.124 .328
.072 .324
.048 .296
.032 .304
.036 .220
.020 .200
.012 .196
.004 .200
.004 .200
.004 .204
.100 .368
.092 .332
.048 .300
.032 .312
.012 .292
.012 .296
Table 1.4: Rejection levels of the three fc-adaptive test statistics with and without
trimming. Critical values are determined with Gaussian bootstrap, using h>, =
hnl/5~1/K for the pre-estimation.
34
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 1
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
under Ho a =0.0
Trimming
0%
i
a{%)
m
5
50
40
50
40
50
40
50
40
10
5%
5
10
.000
.000
.004
.028
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.000
under Hi a =2.0
7-3
T\
T2
.000
.040
.020
.224
.000
.052
.028
.240
.004
-004
.004
,004
.000
.000
.000
.000
.000
.004
.004
.004
.000
.000
.000
.000
T
3
.028
.212
.248
.744
.032
.202
.232
.732
Table 1.5: Rejection levels of the three original test statistics with and without
trimming. Critical values are determined with subsampling, using subsamples of
sizes m.
under HQ a=0.0
under Hx a==2.0
Trimming
a(%)
m
T\
T2
r3
T¡
T2
0%
5
90
80
70
60
90
80
70
60
90
80
70
60
90
80
70
60
.000
.000
.056
.244
.000
.028
.208
.584
.000
.000
.008
.060
.000
.008
.048
.196
.000
.000
.088
.336
.000
.072
,328
.680
.000
.000
.016
.084
.000
.012
.096
.304
.000
.000
.000
.000
.000
.000
,000
.000
.000
.000
.000
.000
.000
.000
.000
.000
.140
.148
-156
.196
.192
.192
.276
.416
.080
.080
.076
,064
.128
.140
.132
.136
148
160
168
236
196
208
308
484
104
104
088
076
152
148
160
168
10
5%
5
10
000
000
000
000
000
000
000
000
000
000
000
000
000
000
000
000
Table 1.6: Rejection levels of the three ¿--adaptive test statistics with and without
trimming. Critical values are determined with subsampling, using subsamples of
sizes m.
35
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Ir al índice/Tornar a l'índex
Chapter ]
The Size Problem of Kernel Based Bootstrap Tests when the Null is Nonparametric
K
Trimming
#0
0%
(a = 0)
T
3
T3
5%
T3
Hi
( a = 2)
0%
5%
T
3
T3
1
2
3
4
5
6
7
.012
.680
.012
.676
.001
.972
.001
.968
.063
.392
.062
.380
.019
.932
.019
.936
.028
.032
.028
.024
.042
.632
.042
.620
030
012
030
012
022
380
023
368
.032
.012
.032
.012
.015
.272
.015
.260
.031
.012
.031
.012
.011
.260
.011
.252
.029
.016
.029
.020
.009
.264
.010
.264
Table 1.7: Rejection levels of T?, and T\ for a = 5%, with and without trimming,
using Gaussian bootstrap with hb = /in 1 / 5 - 1 /* for the pre-estimation.
36
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 2
Estimating and Testing An
Additive Partially Linear Model in
a System of Engel Curves
2.1
Introduction
T H E SPECIFICATION OF E N G E L CURVES IN EMPIRICAL MICROECONOMICS
has
been an important problem since the early studies of Working (1943) and Leser
(1963) and the well-known work of Deaton and Muellbauer (1980a), in which they
developed parametric structures such as the Almost Ideal and Translog demand
model. Many Microeconomic examples are provided in Deaton and Muellbauer
(1980b) in which a separable structure is convenient for analysis and important
for interpretability. However, there is increasing empirical evidence pointing to the
conclusion that a sort of nonlinearity is present in the specification of Engel curves.
An alternative way of investigating nonlinear effects is to model consumer behavior by means of semi- and nonparametric additive structures. Moreover, non and
semiparametrie regression provides an alternative to standard parametric regression,
allowing the data to determine the local shape of the conditional mean.
From an economic point of view there are many reasons why it is interesting to
recover a correct specification of Engel curves. Firstly, a correct specification allows
us to examine the nature of the effect of changes in indirect tax reforms. Secondly,
it is important to specify the response of consumers in the face of changes in total
37
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Mode! in a System of Engel Curves
income. Changes of this kind allow us to assess the impact on consumers' welfare.
Consumer demand has become a very important field for applying non and semiparametric methods. An interesting analysis of the cross-sectional behavior of consumers in the context of a fully nonparametric model can be found in Bierens and
Pott-Buter (1990). Papers which consider the implementation of semiparametric
methods in empirical analysis of consumer demand include Banks, Blundell and
Lewbel (1997) and Blundell, Duncan and Pendakur (1998). This latter paper is of
special interest because its analysis regression is based on semi- and nonparametric
specifications of Engel curves. It also tests Working-Leser and Piglog's null hypothesis against the well-known partial linear model in which budget expenditures are
linear in the log of total expenditure. In this paper we estimate the Engel curves
directly as in Lyssiotou, Pashardes and Stengos (2003) among others.
We estimate an additive partially linear model (PLM) in order to investigate
consumer behavior using individual household data drawn from the Spanish Expenditure Survey (SES) and use the result obtained from semiparametric analysis to
examine the modelling-of age, schooling and expenditure in a system of Engel curves.
The importance of using an additive PLM models lies in the fact that in the context
of this model the effects of expenditure, the age and schooling on consumer demand
can be investigated simultaneously in the semiparametric context 1 . There are several
ways to get estimations of nonparametric additive structure, and we mention only
the most important: smooth backfitting, series estimators and marginal integration.
In this paper we use internalized marginal integration to estimate nonparametric
components in the additive PLM mainly because at the present time there is no
applied or theoretical study on the testing procedure using smooth backfitting.
Most of the papers that investigate consumer behavior in a nonparametric context are focused on the appropriate way of modeling the form of the Engel curves.
Those focused on the unidimensional nonparametric effect of log total expenditure on
budget expenditures, taking in to account some parametric indexes to reflect demo1
Analysis of consumer behavior can he carried out with fully nonparametric models. However,
for sake of interpretability and implementation, additive models overcome the well-known problems
coming from multidimensional Nadaraya-Watson and Local Polynomial regression estimators.
38
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Mode! in a System of Engel Curves
graphic composition include Blundeli, Browning and Crawford (2003) and references
therein. In this paper we investigate consumer behavior in semi and -nonparametric
terms focused on the nonparametric effect of total expenditure the age and the
schooling. In this study, unless stated otherwise, the effect of age and schooling
refer to the age and schooling of the household head. There is evidence suggesting
that these have deeper effect than generally assumed in parametric demand analysis
(see Lyssiotou, Pashardes and Stengos (2001)). In fact, it is common practice to include the square of age and/or schooling as well as their higher terms in parametric
models to capture possible nonlinear effects.
Inference in nonparametric regression can take place in a number of ways. The
most natural is to use nonparametric regression as an alternative against a fully
parametric or semiparametric null hypothesis. With this in mind, we investigate
whether an additive PLM provides a reasonable adjustment to our data using different resampling schemes to obtain critical values of the test statistics. In this paper
we are interested in applying some recently developed test statistics which are very
popular in the literature about testing semiparametric hypotheses against nonparametric alternatives. These test statistics are in the spirit of Hardle and Mammen
(1993) and Gózalo and Linton (2001), among others. On the other hand there is a
growing interest in the so called adaptive testing methods, in which the test statistics are adaptive to the unknown smoothness of the alternative, see among others
Horowitz and Sponkoiny (2001) and Rodrigue2-Poo, Sperlich and Vieu (2005). in
this paper we adapt their ideas with some differences, where are considered kernel
smoother for our problem.
It should be remarked that a problem that we may well have to consider is the
endogeneity of regressors. Note that in the context of Engel curves total expenditure
may well be jointly determined with expenditure on different goods. The approach
used to solve this problem is instrumental variable estimation.
We remark two
recently developed procedures in the context of nonparametric regression to tackle
the problem of endogenous regressors. The so called nonparametric two step least
square (NP2SLS) due to Newey and Powell (2003), and the nonparametric two
39
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
step with generated regressors and constructed variables (NP2SCV) due to Sperlich
(2005). Newey and Powell (2003) 's approach is a cumbersome procedure involving
the choice of basis expansion in the first step. However, Sperlich's approach only
requires a non, semi or even parametric construction of regressors of interest in the
first step. Our feeling is that a generated variables approach in combination with
additive PLM can help us to overcome to some extent any possible endogeneity
problem and that is exactly the procedure implemented in this paper.
The contribution of this work can be summarized as follows. Firstly, we are the
first (to our knowledge) to carry out an exploratory analysis of consumer behavior
with data drawn from the Family Expenditure Survey for Spain using semiparametric models. Second, we apply recently developed methods to estimate, test (various model specifications) and correct for possible endogeneity of total expenditure.
Third, our estimations of the additive model are accompanied by a reasonable measurement of discrepancy between the fully nonparametric model and the additive
estimation. An adequate model check is necessary whenever estimations of additive
models are carried out (Dette, von Lieres and Sperlich (2004)). Additionally, our
measure of discrepancy adapts to the unknown smoothness of the non-parametric
model and this constitutes a novelty in empirical economics.
The rest of the paper is organized as follows. In Section 2 we provide some background to understand both the estimating and the testing procedures. In Section
3, we discuss the shape of Engel curves and report empirical results obtained from
the application of additive PLM. We also provide the results of testing the additive
specification as well as the linearity of each nonparametric component in additive
PLM regression. In Section 4 concludes.
2.2
Additive Partially Linear Model and Testing
Hypothesis
There are many fields of empirical economics in which explanatory variables and
their second power are included in regression analysis to capture nonlinear effects;
40
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
In order to estimate the functions ma (xQ) we first estimate the function m (x) with
a multidimensional local smoother and then integrate out the variables different
from Xa. This method can be applied to estimate all the components, and finally
the regression function m(-) is estimated by summing an estimator ifi of tp, so we
get that:
ms(X3) = 4, + ¿
for j=l,...,n.
¿
Kh [X3a - Xia)
filM-Yi
¡4}
The expression to get the estimation of each component rna (•) defined
in [4], is called the internalized marginal integration estimator (IMIE) because of
the joint density that appears under the summation sign. For a detailed explanation
see Dette, von Lieres and Sperlich (2004) and references therein. Note that IMIE
does not provide exactly the orthogonal projection onto the space of additive functions. In other words, the sum of the estimated nonparametric components does
not necessarily recover the complete conditional mean because the interaction terms
are excluded from the regression. So, it is very interesting to establish whether the
sum of additive components is the conditional mean. Therefore, it is necessary to
carry out a specification test. With this in mind, we are concerned with testing the
validity of the additive specification of the regression function m (x) in eq.[l]. Thus,
the null hypothesis to be tested can be formulated as:
HQ :m(x)=
ms (x)
[5]
against a general alternative that Ho is false. An adaptive test statistics is implemented by Horowitz and Spokoiny (2001) (among others) in the context of parametric models against nonparametric alternatives; and by Rodriguez-Poo, Sperlich
and Vieu (2005) in the context of semi and nonparametric against a nonparametric
alternative. However, it should be remarked that the first implementation in the
context of nonparametric additive separable models against a fully nonparametric
alternative adaptive test was by Barrientos and Sperlich (2005).
The first test statistic is defined as the square of the differences between the
semiparametric fit and the fully nonparametric estimator, extending the concept
42
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
introduced by Hardle and Mammen (1993). In order to test the validity of our
hypothesis, we also consider the test statistics introduced by Gózalo and Linton
(2001) and Rodriguez-Poo, Sperlich and Vieu (2005) defined as:
J
n z—
t=i
1 "
n ¿—'
t=i
n
n
12
w (Xi)
[8]
2=1
where é,: = Yx — rhs(Xi) are the residuals under the additive model and ú¿ = Yt —
rn[(Xi) denote the corresponding residuals of the unrestricted model. In this study
we use the well-known Nadaraya (l964)-Watson (1964) estimator for the unrestricted
model. These test statistics can be used not only in specification testing defined by
(5) but also to test the linearity of individual nonparametric components, see Hardle,
Huet Mammen and Sperlich (2004). More exactly, we can test the null hypothesis
HQ : ma (x a ) = 8xa for all a and for some 6
Now we discuss the procedure for computing the critical values. Note that our
idea is based on a combination of adaptive test statistics with both bootstrap 2 and
subsampling 3 schemes. For the former case see Horowitz and Spokoiny (2001) and
2
To obtain bootstrap critical values we consider the following steps. 1) To obtain the bandwidth
from cross-validation, hc». 2) To estimate rhs (as) = 4> + !C a eA ^ai^a)
-%) To use the bootstrap
scheme to get e* for each i = 1, ...,7i. 4) For each i = 1, ...n generate Y* = ms(Xl) + e ' : where e*
is sampled randomly and we use the data {Y*, Xt}^=1 to estimate J7i s (i) under Ho- 5) Repeat the
process 2-4 B times to obtain {T"hj and use these B values to construct the empirical bootstrap
distribution.
The bootstrap errors e* are generated by multiplying the original estimated residuals from the
semipaiametric model, e, = Y¡; - rhs {Xi): by a random variable with standard distribution. This
procedure provides exactly the same first and second moments for c\ and for s*.
3
In the subsampling case one takes all subsamples of size b from the original sample {Xt. Y¿}.
The problem in selectitig the subsample size b is similar to the problem in selecting the bandwidth
in nonparametric regression analysis: the assumptions on the param<3ier b to require that b/n —> 0
and b —-» oo as n —> oo. Unfortunately, such asymptotic conditions are no help m solving the
43
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
for the latter one see Rodriguez-Poo, Sperlich and Vieu (2005). It is remarkable
that using subsampling to get an estimator of the variance of the restricted errors
guarantees consistency under Hi. Having estimated semiparametric and nonparametric models, rhs (•) and m(-j respectively, we construct the origina! test statistics
denoted by Tjk- As the distribution of Tjk varies with k (the bandwidth for the
alternative) we define the standard test statistic denoted by
I9|
f* - M i
where /¿- and -0? are the estimated mean and variance of the test Tjk for j = 1,2,3
(where j denotes test statistics [6]-[8]). Then we compute the test statistics based
on the resampling data (boootstrap and subsampling data), denoted by:
J
T-k - A;
v*
[10]
This creates a family of test statistics {T^, k G Kn} where the choice of k makes the
difference between the null and global alternative hypotheses. In order to maximize
power we take the maximum of f*k over a finite set of bandwidth values Kn with
cardinality L. Then we define the final test statistics by means of:
** = %*£**
Where Kn = {k = a^n'^5
I 11 )
i = l , . . . , £ } , a {I) = [l + (cx {I - l)" 1 )] n" 1 / 5 and
cx — 7 (max (Xi) - min (Xi)) with 7 e (0,1).
The testing procedure rejects Ho if at least one of the k e Kn the original
test statistic is significantly larger than the bootstrap analogues. In Horowitz and
Spokoiny (2001) the estimators for variance and bias are asked to be consistent under
alternative hypothesis. Note that this is only necessary for efficiency; for consistency
of the test, it is sufficient for the difference between real variance and estimate to
block size choice problem in finite samples. Instead, it is possible to use an algorithm to estimate
a "good" subsample size. This method has been applied in practice in another contexts with
good results, see for instance Rodriguez-Poo. Sperlich and Vieu (2004) and Neumeyer and Sperlich
(2005).
44
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
be bounded. Nevertheless, Rodriguez -Poo, Sperlich and Vieu (2005) suggest using
a subsampling scheme in order to get a consistent estimator of variance under Hi
and thus to have optimal power. They also discuss size problems of bootstrap tests
when the null model is non or semiparametric and show that the subsampling based
analogue suffers less from this problem.
2.3
The Shape of Engel Curves and Specification
Testing
The most usual structure in consumer behavior analysis is the so-called WorkingLeser specification. In this model each expenditure expenditure is defined over the
logarithm of total expenditure. Thus the model has a simple structure given by:
Wl
= f{\nXt)+£j
[12]
where W{ is the budget expenditure, In Xi is the log total expenditure and el is an
error term satisfying E (ex\ \nXi) = 0. Empirical analysis using parametric specification in eq.[12] can be found in the literature on consumer behavior, see Deaton and
Muellbauer (1980a, 1980b). For empirical unidimensional nonparametric analysis
see Blundell, Browning and Crawford (2003) and references therein. Instead of a
Working-Leser specification we can assume that consumer demand could be modelled by means of an additive structure as in eq[2], such that:
Wi = ijj + mx (InXii) + m2(X2i)
+m3(X3i)
+ £i
i = l,...,n
[13]
where In Xit is the log total expenditure, X2i and X3i are the age and schooling and
€i is assumed to satisfy E (e^Xi) ~ 0. Consider the augmented model:
Wi = il} + Z$k + mi (In Xu) + m2 [X2r) + m 3 (X3l) + ez
i — 1, ...,n
[14]
where Z¿ is a set of discrete or continuous variables of dimension K,
P is a K x 1 vector of parameters and e, is assumed to satisfy E(et\Zl,
Xi) = 0.
The models given by [13] and [14] are motivated because they allow us to include
45
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
other regressors with nonlinear effects, and at the same time to reduce the curse of
dimensionality; which may be the main weakness of nonparametric techniques. To
estimate the model [14] we follow the treatment of Hengartner and Sperlich (2005).
There are many ways to get a Vn-consistent estimator of ¡5: we use Robinson's
(1988) method. Let $ be an estimator of/?. Eq.[14j can be written as:
w, = ^ + mi (In Xu) + m2 (X2l) + m 3 (X3t) + &
[15]
where w¿ = wi — Zifik and ^t = e¿ + Zi l/3k — /3k) is the new composite error term.
The intercept term ip can be v^-consistently estimated by ip = Y — ZTJ3 where Y
and X are the sample mean. As in eq.[14] we can apply to eq.[15] the procedure
described in Section 2 to obtain estimates of mi (lnXi), m,2 {X2) and 777.3 (X3).
Now we turn to the problem mentioned in the Introduction about constructing
regressors to overcome the endogeneity problem. For a detailed explanation, see
Sperlich (2005). Let x¿ be an unobservable or endogenous variable and let Xi be a
generated regressor4, it is then possible to write x,: = x + b (x) + a (x), where b(-) is
the bias term such that &(-) —> 0 as n —» 00 and a (•) is the variance term. In order
to obtain consistent estimates of density and conditional mean and thus construct
¿i, with the help of instruments or even with help from different data sets it is
possible to estimate the reduced regression form, semi-, non- or even parametrically
(first step) and then use it in the structural regression (second step), instead of the
original regressor.
The procedure can be described as follows. Let {W} be the set of exogenous
variables, including the log of total income. Note that we are worried about endogeneity due to jointly determination of total expenditure, lnXi, and expenditure on
different categories of goods (endogeneity due to simultaneity). Suppose that l n ^ i
is endogenous such that:
ln(X!) =g{W)
+U
[16]
'with bias and variance of order O (g 2 ) and O ( - ^ j )
Theorem 2 in Sperlich (2005)
46
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
in order to fulfil the assumptions of
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
where E (£\W) = E{U\W)
= 0 for W = (Z,X2iX3),
but £ ( £ | m X : ) ¿ 0. Putting
[15] and [16] together we get
UJ
= rf> +
mi
(g (W) + U) + m2 (X 2 ) + m3 (X3) + (
applying the modeling mi (g (W) + U) = mg (g (W)) + A {(/), what somehow means
in the end that you assume additive in the exogenous impact of the explanatory
variables what is possible as assuming [16]. The we get:
w =
u
ip + mg(g(W))
= i> + mg{g(W))
+ m2(X2)
+ m3{X3)
+ m2{X2)+m3{X3)
+ X{U)+£
+l
[17]
[18]
where E [A {U) \W] = 0, E \^\W\ = 0 and E [£\g {W), X2, X3) = 0. The expression
in eq[l8] is the model that we have estimated only with the additional burden of
a pre-estimation g (W) consistently. Note that this methodology certainly involve
less difficulties (and is faster) than Newey and Powell's (2003) approach.
Household expenditures typically display variation respect to demographic composition. Then, we can use additive specification to pool across household types.
However, Blundell et.al (2003) suggested modifications to take into account integrability conditions (integrability is related to the problem of recovering a consumer's
utility function from his demand functions). Note that in eq[14], the Z matrix represent a household composition variables for each household observation i. This
means that we imposed a restriction on the way in which demographics affect expenditures (if j index is referred to specific category of good then we are interested
in imposing the restriction Z¿ = Zij, that is demographic composition affects in the
same way the consumption of different goods). Thus, under stated restriction on Z
matrix, our empirical researching did not provide evidence of linearity of mj (•) in
our system (see Section 3 and Table 4).
Blundell et al.
(2003) agrees that an alternative specification that does not
impose restriction on the form of m¡ (In _X\) is a straightforward extension of additive
PLM: Wi = ^ + Ztak + m-j (In Xu - £ {Z[9)) + m2 {X2l) + m3 (X3t) in which £ {Z¡9)
47
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
is some known function5 of a finite set of parameters 9 (otherwise mj (•) might be
linear in In X\ whenever Slutsky symmetry conditions are satisfied).
2.3.1
D a t a U s e d in this Application
In our application we consider mainly four broad categories of goods, Food (including
alcohol and tobacco), Clothing (including shoes), Transport (personal and public)
and Leisure (recreational activities, publications and general teaching). We draw
data from the 1990-1991 Spanish Expenditure Survey (SES) and for the purposes
of our study we select only houses with three children or less. Total income, total
expenditure and expenditure categories are measured in pesetas (yearly) at constant
1983 prices. In order to preserve a degree of homogeneity in most of aspects we
use a subset of married (or cohabiting) couples of household in the Madrid regional
community. This leaves us with 757 observations, 12.4% comprising couples without
children, 20.02% couples with one child, 47% couples with two children and 20.03%
couples with three children. Table 1 gives brief descriptive statistics for the main
variables used in the empirical analysis. '
5
As they suggested (. (Z[9) can be interpreted as the log of a general equivalence scale for
household i.
48
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
Table 1. Descriptive statistics for budget expenditure data
Variables
Mean
Std.dev
Min
Max
Food expenditure
709216
348565
344776
3307304
2254260
Clothing expenditure
304160
335535
7200
Transport expenditure
413226
486898
3640
2426126
Leisure expenditure
231988
265513
999
2128000
Total Expenditure
3162401
1397284
1039319
9304396
14.87
0.429
13.85
16.04
2052240
2289599
282504
42000000
Log total income
14.37
0.50
12.5
17.5
HHAge
40.6
10.6
21
80
HH Schooling
5.1
2.47
1
10
HNAD
2.1
0.5
1
4
HHSEX
0.90
0
1
Child_0
0.124
0
1
Child_l
0.202
0
1
Child_2
0.470
0
1
Child 3
0.203
0
1
Log total Expenditure
Total income
2.3.2
S o m e P i c t u r e s of t h e E x p e n d i t u r e
Total E x p e n d i t u r e Relationship
expenditure-Log
In this section we present the estimated additive partially linear regression of the Engel curves for the four budget expenditures in our SES sample. Each figure presents
the estimated marginal effect together with 90% bootstrap pointwise confidence
bands (dashed lines). In all cases we present kernel regression for the quartic kernel
| | (1 — u2) I (\u\ < 1) where I (•) is the indicator function, using the leave-one-out
cross-validation method to automatic bandwidth choice, hcv = 0.72 in the direction
of interest and b = 6hcv in the nuisance direction as in Dette von Lieres and Sperlich
(2004). In order to estimate the parametric part of the model [13] we have used a
set of discrete variables such as number of adults, sex and dummies for number of
children; this kind of regressor traditionally enters into the regression function in
the parametric part.
As usually, it is assumed that income is partially correlated with expenditure and
49
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
we can suppose that it is not correlated with the errors in model [13], therefore log
total income is a natural instrument to the log total expenditure. Then, based on
generated regressor and constructed variable methods we adjust the estimations for
any possible endogeneity of log total expenditure with the existing data as described
in Section 3.1. The set of exogenous variables includes the log income and its power
(up to the fourth one), age and schooling6.
Figures 1-4 show the estimated marginal effect of log total expenditure, age and
schooling on budget expenditures, controlled parametrically by the sex, number of
adults in households and dummies for number of children and corrected for any
possible endogeneity. It is clear from the plots that the effect of total expenditure
on the different budget expenditures is nonlinear. We can see in the case of transport
and leisure expenditures that this effect is increasing and monotone, whereas in the
case of food and clothing it is also increasing, but less stable for different levels of
total expenditure.
Note that the effect of schooling on expenditure on different goods is nonlinear.
In the cases of leisure, clothing and even transport it is interesting to observe the
pronounced effect for values of schooling close to the average (at which point the
greatest expenditure expenditure is reached). Note that leisure and clothing are
necessary goods (but not basics like food), so this behavior could be related to low
returns on education (whenever there is a strong correlation between income and
schooling, such a relation is generally observed in practice), so that consumers prefer
to dedicate their budget to basic goods. We remark that food expenditure does not
include food outside the household. It might be assumed that head of the household
might take some meals (e.g lunch) outside the house. On the other hand, in the
case of transport expenditure, we note an increasing effect up to values close to
the average for schooling, but from that point onwards the expenditure becomes
stabilized.
Another possible explanation for the behavior of leisure expenditure with respect
''Estimations for the model given by [13] with no endogeneity correction are available from the
author on request.
54
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
to schooling, is that high levels of schooling in couples that have many children are
accompanied by high income levels, and more hours of work per week, so that they
have no time for leisure. This idea is not so absurd if we consider that more than
half of households (67.03%) have two or three children to support.
According to our results, in the households with the oldest heads there is a
tendency to spend less money than in the households with younger heads, this
effect is notable at least for a range of ages between 30 and 40. It is explained,
at least in part, because the households with the oldest heads have less children to
support. Unlike leisure and transport expenditures, in the cases of food and clothing
expenditures this decreasing effect is considerable but not dramatic. Note that we
include the number of children parametrically, so this explanation makes sense if we
keep other effects unchanged. However, except for food and clothing expenditures,
the estimated parameters have no major impact.
Another question to take in to account is that 90% of household heads in our
SES sample are men: from the sociological point of view they pay less attention
to fashion, so this may explain, partly, the decrease in spending on clothes for
household heads of 40 and over. In the case of leisure and transport, the effect by
ages is dramatically decreasing: of course older heads have less recreational activities
and spend less time outside the household, so the use of transportation (private and
public) diminishes with age.
In regard to variables included parametrically, we remark that the number of
adults has no effect on consumer demand; the estimated parameter in each regression
is not statistically significant. On the other hand, the effect of sex is important
and different depending on the expenditure considered (except for transport). The
results tell us that men spend less money on food than on clothing and leisure. Table
2 displays estimated parameter in the additive PLM.
If the model is chosen correctly, the results quantify the extent to which each
variable affects consumer behavior. Clearly, the findings of the estimated additive
PLM have to be checked: this can be done by considering the test statistics described
in Section 2.
55
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
Table 2. Estimation of linear part in APLM
Variable
Estimated parameter
t-Statistics
Food Expenditure
|
Constant
13.490
73.78
Sex HH
-0.384
-2.535
Number of Adults
0.063
0.614
One child
0.407
2.441
Two children
0.594
3.584
Three children
1.151
3.612
Clothing Expenditure
Constant
11.03
26.938
Sex HH
1.196
3.535
Numbers of adults
-0.305
-1.328
One child
-0.986
Two Children
-0.967
-2.564
Three Children
-2.525
-3.532
(
-2.606
Leisure Expenditure
Constant
|
10.369
j
30.295
Sex HH
0.710
2.507
Numbers of adults
0.202
1.050
One child
-0.234
-0.739
Two children
-0.001
-0.003
Three children
-0.752
-1.256
Transportation Expenditure
2.3.3
Constant
-0.279
33.964
Sex HH
-0.279
-0.924
Number of adults
-0.008
One child
0.592
1.787
Two children
0.692
2.099
Three children
0.854
1.347
|
-0.043
¡
Specification Testing
Table 3 reports the p-values for testing additivity adjusted for any possible endogeneity problem. Since the choice of bandwidth is a crucial point, especially for the
56
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
bootstrap needed for the test procedure, we present results for different smoothing
parameters gr € {0.75,0.85,0.95} r—1,2,3. In order to apply the procedure described in Section 2 we implement 500 bootstrap replications; and we use 100 subsamples each of 70% and 60% of the size of the original sample n for our subsampling
scheme. To estimate the model under alternative hypotheses (fully nonparametric
model) we define a set Kn (with cardinality ~L=10) of bandwidths A; in a range from
0.3 to 2.
Note that the percentage of rejection is not so large for leisure and transport
expenditures with the bootstrap scheme. However, this situation is partly corrected
with the subsampling scheme where the percentage of rejection is increases, especially in the case of leisure. On the other hand, we find that test statistics r, and
T 3 give us a strong evidence of additive separable specification. Similar results are
obtained with the subsampling scheme in the sense that we are able to reject the
null hypothesis for all test statistics for each expenditure categories. In summary,
the null hypothesis is not rejected for the household types considered for all test
statistics with both resampling schemes and for all bandwidths.
Table 3. Testing Additive Specification
Bootstrap
Band
Clothing
Food
T~2
^
.66
9i
.81
.65] .84
92
93 "~ .63
1
.85
7"3
.95
.99 ^ 9 4
.99
JEL.
~6TH
.66
.55
b2
r2
.93
Transport
T2J^ T3
.22^.99 | -13_[ -99^ .97 | .12 .95
24j_,99 -91 , ,14_[ ,99J .95 | .14 j .95
T2J_T3
.99
Food
Block
Tl
Leisure
Ti
T2
T 3 J T\
.15
.99
92
.25 .99 .89
Subsampling
Clothing
Leisure
.94
.15
.95
Transport
T ^ Tl j T 2 J T 3
?3
Ti j T 2
T 3 | Ty
T2 j
.88
.99
.99
.99
.99
.99
.94
1.0
.82
.13
.90
1.0
.99
.99
1.0
.92
.83
1.0
.54
.18 | 1.0
.95
Certainly, the results from Table 3 need to be interpreted carefully, since the
test is telling us that model is clearly separable. We do not know whether one of
the regressors in the nonparametric part has a linear effect. Note that the results
57
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Model in a System of Engel Curves
of testing additivity in Table 3 tell us that the model is additive separable in its
nonparametric part, but they tell us nothing about the linearity of each component.
In other words, it is possible to accept the nonparametric additive (separability) hypothesis even if one of those regressors has a linear effect on expenditure on different
goods. The computed p-values concerned with testing linearity of each nonparametric component from model [13] are shown in Table 4. For this testing hypothesis
procedure we use a bootstrap scheme for two bandwidth g\=l and g<z=l-2. Again,
we define a set Kn (with cardinality h—10) of bandwidths k in a range from 0.35
to 2.
Note that for the clothing expenditure we are able to reject linearity of schooling
at 10% for both bandwidths, and for the food expenditure we reject linearity of
schooling at 7.9% (7.6% for £2); in both cases with test statistic T-¡. For clothing
expenditure, similar results on linearity of schooling are obtained with r 2 . With test
r 3 the percentage of rejection of linearity of schooling decreases to 6%. For the food
expenditure, we are only able to reject linearity of age at 10% for both bandwidths.
In the rest of the cases, we reject the linear effect hypothesis of age, schooling and
expenditure at a < 5% for all test statistics and for all bandwidths.
Table 4. Testing Individual linearity
Age
Schooling
1
Expenditure |
T3
Tl
T2
M
.095
.10
.062
.060
.018
.016
.05
.05
0
.024 "1)791 .008
.020 .076 .008
.020
.020
0
0
0
0 1
0
01
.030
.030
0
0
0
0
0
0
0
0
.002
.004
.002
.002
0
0
0
0
0
•
ri
T2
Tz
T\
gi
g2
.018
.016
.014
,014
.034
.028
.10
.10
gi
g2
.10
.10
.002
.004
Leisure
gi
g2
.020
.010
0
0
.004
.004
.008
.008
Transp
gi
g2
0
0
.018
.026
0
0
.004
.004
Clothing
Food
0 I
1
0
Note that in general, linearity of age and schooling is rejected for every expenditure type. Moreover, for all test statistics and for all bandwidths the linear effect of
total expenditure on expenditures categories is strongly rejected. From Tables 3-4
58
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Est ¡mating and Testing An Additive Partially Linear Model in a System of Engel Curves
we conclude that the results are coherent with the shape of the curves estimated in
Figures 1-4. This gives us an idea of the robustness and reliability of our methods.
2.4
Conclusions and Future Research
This paper applies semiparametric additive PLM regression techniques for studying
the relationship between consumption and household characteristics based on the
Spanish Expenditure Survey. On the one hand, in the case of clothing and leisure,
the additive specification for nonparametric components is (weakly) supported for
test statistics based on errors of the additive PLM model and non,-semiparametric
estimators, with the bootstrap scheme. However, with Tj and r 3 test statistics we
are unable to reject the null hypothesis of additivity for different resampling schemes.
On the other hand, additive separable nonlinear effects are completely supported by
the results on specification testing. In general terms, there is no evidence to assert
that any linear effect of regressors of interest on the different expenditure categories
is observed in the subsample SES data used in this analysis. In conclusion, the
results from Tables 3-4 allow us to assert that the joint effect of total expenditure,
age and schooling on expenditures categories is nonlinear additive separable.
The general results obtained from the estimation and testing of Engel curves
show that modelling the effects of total expenditure on the different expenditure
types simultaneously with other regressors such as those included here certainly
deserves better treatment than usually found in one-dimensional semiparametric
analysis.
In particular we observe that households with younger heads tend to
behave differently from other households, and clearly this fact is not captured in an
Engel curve system in which only linear and quadratic age effects are included in the
empirical specification. Of course, it could be reasonable testing that nonparametric
effect of the age is quadratic rather than linear. However, we think that a perfect
quadratic specification maybe fitted only for household head in the 40 and 50 age
range in some cases and in the 35 and 55 age range in other ones. Note that
nonparametric estimation gives us the possibility for observing the effect of whole
59
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating ant! Testing An Additive Partially Linear Model in a System of Engei Curves
age range on expenditure commodities.
Note that in this paper we only take into account a partial household composition
(we only control for number of children, sex and number of adults). Therefore, a
reasonable extension of empirical analysis with additive PLM (simple additivity
does not allow such analysis) could be carried out by introducing more demographic
variation to obtain variety in behavior (more regions, labor market, temporal dummy
to capture price effects, etc.). Moreover, we could be interested in allowing Z%j vary
in any way vvith j and Stlusky symmetry, then would be necessary to impose a
function to get general equivalence scale in order to fulfill conditions of proposition
5 in Blundell et al (2003).
Another interesting point to investigate is whether changes in consumer preferences take place over time and then to make an extension to dynamic models.
One can take data from 1980 and 1990, for instance, and to make a comparison of
consumer behavior. This would be an interesting question for the future together
the inclusion of more categories of goods (health, furniture house, rent, etc).
60
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partiaiiy Linear Mode! in a System of Engei Curves
REFERENCES
Arevalo, R., Cardelus, M. T., and Ruiz-Casti])o, J. (1998) La Encuesta de Presupuestos Familiares de 1990-91. http://www.eco.uc3m.es/epf90-91.html.
Banks, J. Blundell, R., and Lewbel, A. (1997). Quadratic Engel Curves and Consumer Demand. The Review of Economics and Statistics, 79, No 4, 527-539.
Barrientos-Marin, J and S. Sperlich (2006). The Size Problem of Kernel Based
Bootstrap Test When the Null Is Nonparametric. Working in progress. University
of Alicante
Bierens, H and H. Pott-Buter (1990) Specification of Household Engel Curves by
Nonparametric Regression. Econometric Reviews, 9, 123-184.
Blundell, R., Duncan, A., and Pendakur, K. (1998). Seminparametric Estimation
and Consumer Demand. Journal of Applied Econometrics, 13, No 5, 435-461.
Blundell, Richard; Browning, Martin and Ian A. Crawford (2003). Nonparametric
Engel Curve and Revelead Preference. Econometrica, 71, No 1, 205-240.
Deaton, A and J. Muellbauer (1980a). An Almost Ideal Demand System. American
Economic Review, 70, 321-326.
Deaton, A and J. Muellbauer (1980b). Economic and Consumer Behavior. Cambridge University Press, Cambridge.
Dette Holger, C. Von Lieres and S. Sperlich (2004) A Comparison of Different Nonparametric Method for Inference on Additive Models. Nonparametric Statistics, 00,
1-25.
Gózalo, P. L. and O. B. Linton (2001) Testing Additivity in Generalized Nonparametric regression models with Estimated Parameters.
Journal of Econometrics,
104: 1-48.
Hardk, W. and E. Mammen (1993) Comparing Nonparametric Versus Parametric
Regression Fits. Annals of Statistics, 21, No. 4, 1926-1947.
61
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter Estimating and Testing An Additive Partially Linear Mode! in a System of Engel Curves
Hardle, W., Huet, S., Mammen, E., and Sperlich, S (2004) Semiparametric Additive
Indices for Binary Response and Generalized Additive Models. Econometric Theory,
20, 265-300.
Hardle, W., Miiller, M., Sperlich, S., and Axel Werwatz. Nonparametric and Semiparametric Models. Springer Series in Statistics. Springer-Verlag, 2004.
Hengartner, N and S. Sperlich (2005) Rate Optimal Estimation with the Integration
Method in the Presence of Many Covariates. Journal of Multivariate Analysis, 95,
Issue 2, 246-272.
Horowitz. J, L and V. Spokoiny (2002) An Adaptive, Rate-optimal Test of Parametric Mean-Regression Model Against A Nonparametric Alternative.
Econometrica,
69, No. 3, 599-631.
Leser, C. E. V (1963). Form of Engel Functions, 31, No 4, 694-703.
Linton, O. B., and J. P. Nielsen. (1995). A Kernel Method of Estimating Structured
Nonparametric regression Based on Marginal Integration. Biometrika, 82, 93-101.
Lyssiotou, P; Pashardes, P and Stengos, Thanasis (2001). Age Effects on Consumer
Demand: An Additive Partially Linear Regression Model, The Canadian Journal
of Economics, 35, No 1, 153-165.
Nadaraya, E. A. (1964). On Estimating Regression. Theory Probability Applied, 10.
Neumeyer, N and S. Sperlich (2005). Comparision of Separable Components in
Different Samples. Workin Paper, Universidad Carlos III.
Newey, W. K and J. Powell (2003) Instrumental Variables Estimation of Nonparametric Models. Econometrica, 71, 1565-1578,
Robinson, P (1988) Root N-Consistent Semiparametric Regression. Econometrica,
56, 931-54.
Rodriguez-Póo, J. M, S. Sperlich and P. Vieu (2005) And Adaptive Specification
Test For Semiparametric Models. Working Paper.
62
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Ir al índice/Tornar a l'índex
Chapter Bstimating and Testing An Additive Partially Linear Model in a System of Engel Curves
Sperlich, S. (2005). A Note on Nonparametric Estimation with Constructed Variables and Generated Regressors. Working Paper. Universidad Carlos III.
Stone, C J (1985). Additive Regression and Other Nonparametric Regression Models. Annals of Statistics 13: 689-705.
Stone, C J (1986). The Dimensionality Reduction Principle for Generalized Additive Models. Annals of Statistics 14, 592-506.
Watson, G. S (1964) Smooth Regression Analysis. Sankhya Ser. A 26.
Working, H. (1943) Statistical Laws of Family Expenditure. Journal of the American
Statistical Association, 38, 4-56.
63
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and
Functional Data
3.1
Introduction
NONPARAMETRIC STATISTICAL MODELS HAVE TAKEN AN IMPORTANT PLACE IN
STATISTICAL S C I E N C E .
At the same time, there is an increasing number of situation
coming from different fields of applied sciences in which the data are of functional,
nature (i.e. one observation can be a curve or a surface,...). A lot of data set
has been studied in the recent literature, dealing with chemometrics (see Frank
and Friedman (1993) ) and (Ferraty and Vieu (2003)), radar waveforms (see Hall
et.al (1991) and Dabo-Niang and Ferraty (2004)), biometrics (see Ramsay et.al
(1994) and Gasser et.al (1998)),...Many others examples can be found in (Ramsay
and Silverman (2002, 2005). Such kind of data are called functional data in the
sense that they come from observation of a functional variable. The combination
of the nonparametric models with the functional data leads to the problematic of
Nonparametric Functional Statistics which is a recent field of investigations (see
Ferraty and Vieu (2005) for both practical and theoretical aspects of this topics).
Several examples of Functional dataset could be found in Ferraty and Vieu
(2003), for instance, Electricity consumption data in U.S.A for time series prediction. We include another example about Inflation rate in Colombia from 1955 to
2004, provided by Colombia Central Bank (http://www.banrep.gov.co).
64
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
The idea is
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
Figure 3.1: 50 Continuous Time Series Curves
simple. Suppose that you have monthly Inflation rate during 50 years, then the set
of explanatory variables to be used in functional statistical methods is composed of
50 functional data which are 50 continuous time series curves, see Picture 3.1.
Now for introducing the main problematic of this paper, let us describe briefly a
functional data set which will illustration our purpose. Initially studied by (Borggaard
and Thodberg (1992)) this data set comes from a quality control problem in the food
industry an can be found in http://ltb.stat.cmu.edu/dataset/tecator.
It is concerns a
sample of finely chopped meat. For each piece of meat, one observes a spectrometric curve by means of the Tecator Infratec Food and Feed Analyzer Working in the
wavelength range 850-1050 nm by the near-infrared (NIR) transmission principle.
Figure 1 displays the 215 spectrometric curves X\,..., #235•
In addition, for each piece of meat, one knows the percentage of moisture (by
means of some chemical process), and one have at hand the 215 scalar responses
^li •••>^2i5- The spectrometric technology is generally faster and less expensive that
analytical chemical processes. Therefore, the spectrometric curves are more and
more used for identifying the component of any product instead of some chemical
65
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
Figure 3.2: The Spectrometric Curves
analytical processes. The aim is clear: can we predict the percentage of moisture in
a piece of meat from its spectrometric curve? In other words, the statistical question
is the following one: does a relationship between the A"s and Y's exists?.
From a statistical pint of view, this corresponds to a functional regression setting
because on wishes to predict a response Y from an explanatory functional variable
X. In addition, only regularity constraints will be assumed with respect to regression
operator (which model the relation between moisture and the spectrometric curves).
This leads us to the nonparametric context. So, the problematic of this work deal
with the nonparametric functional regression. On the one hand, some works dealing
with the nonparametric functional regression exist already in the recent literature
(see for instance Ferraty and Vieu (2002, 2005) for recent and deeper developments).
This nonparametric functional regression method is essentially based on an extension
of the well-known Nadaraya-Watson kernel regression estimator of the regression, see
Nadaraya (1964) and Watson (1964) to the case of functional explanatory variable.
On the other hand, local linear ideas have been developed in the regression context
66
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
for univariate and multivariate explanatory variable, see Wand and Jones (1995) for
an overview of this topic. This is a direct extension of the nonparametric regression
model which can be viewed as a local constant method.
An interesting investigation consist in proposing a locally modeling approach
when on regresses a scalar response on an explanatory functional variable. This is
the main aim of the paper. However the aim of this setting does not make easy
both the asymptotic study and the implementation of a natural generalization of
the multivariate local linear method. Therefore, one focuses on a simpler and faster
local approach. Section 2 described the position of our problem. Section 3 recalls
what is the multivariate local linear regression approach and introduces a functional
locally modeled regression method and a simpler a faster version. Some asymptotic
properties of the fast functional locally modeled regression are given in Section 4
whereas its behavior in practice is illustrated in Section 5 by means of spectrometric
dataset. Finally, the reader interested by theoretical developments will find detailed
proofs in Appendix.
3.2
Position of the Problem
This paper focuses on the nonparametric estimation of the regression operator defined by
Y = m(X) + e,
with
E[e\X} = 0.
where the explanatory variable X is valued in some infinite dimensional space Tí
and Y is scalar response. To this end we can use a functional kernel estimator (see
Ferraty and Vieu (2006) for a deep study), which is an extension to this functional
framework of the Nadaraya-Watson estimator. Based on n pairs (Xiy Yi)i=1
n
iden-
tically and independently distributed as (X,Y) the functional kernel estimator is
defined as follows:
.
M
_ Y^^K(h-^(X!Xz))Yt
67
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
where is a standard univariate kernel function, <5(-,-) locates one element of
TL with respect to another one, and h is the so-called bandwidth which plays the
role of a smoothing parameter. An interesting example for S (•, •) is to consider a
semimetric, which is an index of proximity well adapted to the spectrometric curves
as explained in Ferraty and Vieu (2002). Indeed as in the standard case (i.e. the
explanatory variable is a real r.v.), the kernel estimator rh (x)
ca
n be introduced as
the solution of the minimization problem
n
mm Y (Yi - mf K (/rM (x, *i))
¿=1
This way, rh can be called functional local constant regression estimator and
denoted by mpLC because it approximates locally the Y¿'s with a constant. One way
to define a new functional nonparametric regression estimator is to introduce it as
the solution of a more sophisticated minimization problem, which is the aim of the
next section.
3.3
Functional locally modeled regression
The main idea developed here is inspired by the local linear approach. Therefore
we recall briefly what is the guideline of the local linear regression in the finite
dimensional space case (i.e. a p-dimensional explanatory variable is considered).
Starting from the expression of the local linear estimator in the multivariate case, one
introduces extensions of the local regression able to take into account an explanatory
functional variable.
3.3.1
The p-dimensional case
The idea of local linear regression has been developed by Fan and Gibéis (1992)
and the minimax properties of the corresponding smoother has been stated by Fan
(1993). A good overview on this topic can be found in Wand and Jones (1995)
and Fan and Gibéis (1996), and we refer to García-Soidán et.al (2003), Hertgartner
68
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
et.al (2002) and Cheng et.al (2002) for more recent advances. Here we focus on
the multivariate approach, that is given p > 1 (i.e. we consider a p-dimensional
explanatory variable X). Given n independent real random vectors {X¿,Yj}¿=1
n
distributed as {X, Y} and according to Rupper and Wand (1994), a simpler way to
introduce the multivariate local linear estimator vn-^n, of m — E ( Y | X ) at x 6 R p
consist in solving the following minimization problem (Vo) '•
min
Y
fa
- a - (b,Xi - x)p)
Kp (H~l (Xf - x))
where (•. •) is some inner product defined onto W, H is generally a diagonal matrix
containing p bandwidths (one per dimension) and Kp is a standard p-varíate kernel.
The multivariate local linear estimator of m (x) is given by a (x) which is the solution
for a to the minimization problem (T-'o)-
3.3.2
T h e i n f i n i t e - d i m e n s i o n a l case: t h e f u n c t i o n a l s e t t i n g
There are various options for extending the local linear idea when the explanatory
variable is valued into some infinite-dimensional space H. The firts one is a generalization of the minimization problem (Vo) to the functional setting. The second
one proposes an interesting simplified version in a fast computational way. Before
going on, let us note that, in our functional regression setting, the p-dimensional
explanatory variable X (valued in some infinite-dimensional space H), whereas the
scalar response Y unchanged.
The functional locally modeled regression
A natural way for extending (Vo) to the functional case leads us to the following
functional minimization problem (V\):
n
min
T(Yt-
a- ^ (^)^x)f
i=l
69
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
K
(h-ló(x,^))
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
where T c Tí, S(•, -) locates one element of Tí with respect to another one, K is
a standard univariate kernel and $ a known operator from T x H2 into R such
that V( <= H # (-,£,£) = 0. In particular, for the given operator ^{rp,x,x')
=
w
(i/>, x — X')-H here (•, -)n is some inner product defined onto Ti, it is easy to see (V-¡)
as a direct extension to the functional case of the local linear regression problem.
Note that 5 (-, •) is a signed index of proximity between two elements of Tí, which
means that <5(-, •) is not constrained to be metric or a semimetric and can take
negative values. From a statistical point of view, we consider that a + Í» (ip, -, v) is
a good candidate for modelling the regression operator around v. In other words,
one suppose that m ( ) can be well approximated by a + ^ (ip, , x ) and as direct
consequences of the properties of ^ , a is a good candidate for fitting m(x) since
^(tptX-.x)
~ 0. Therefore, the solution a(x) for a to the minimization problem
(Pj) defines what we call the functional locally modelled (FLM) estimator of . The
main difference and also the main difficulty when we compare (PQ) with (T7;) resides
in the fact that the minimization acts is acting over the functional space T'. One
way of overcoming this problem is to suppose T of finite dimensional (D) and thus
one can replace the unknown function xjj with its finite basis expansion
D
^(•) = ][> d ed(-)
d=l
In addition, if we assume the operator '$' linear with respect to its first argument,
that leads us to the following minimization problem (Vi):
Clearly, from a practical pint of view, this problem is easier to solve and one
get the following expression for the locally modelled estimator mFLM (x) of m (x)
1&(Y\X — x) (which is the solution for a):
m(x)FLM^u'}(Q'KQy1Q'KY
70
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
=
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
where O is the n x D matrix such that
1 *K*!,x)
1
*{eD,XuX)
*(eD,X2,X)
*(ei,A' 2 ! x)
1 #(ei,*n,x)
•••
*(eD>^n,x)
K = d i a g ( # ( / ^ (x, Xi)),..., if (h-l5 ( Xl *n))), Y ' = \YU .., Yn] and u', = [1,0,.., 0] e
R D + 1 . From a theoretical point of view, the statement of asymptotic properties remain an heavy challenge because the matrix expression of rh {X)FLM
ma
kes unusable
the current theoretical advances in the functional setting. Therefore, we prefer to
focus on a simpler version
TU^FM
which will allow us to exploit the knowledge of
the kernel type estimator in the functional regression setting (see Ferraty and Vieu
(2006) for more details and relevant and extensive bibliography on this topic). This
is the aim of the next section.
A fast functional locally modelled regression
Instead of studying {{V\) or (V2)) o n e focuses on the following minimization problem
n
^ n
¿
Ty(Yl-a~bP(Xl,x)fK{h'15(X,Xl))
(a.b)&E
where /?(-,•) is a known operator from H2 into R such that V£ e H ¡3 (•, £, £) = 0.
As 5 (-, •), P (-, -) locates also one element of H with respect another one which means
that /?(-,•) can take also negative values. The fast functional locally modelled
(FFLM) kernel estimator mpFLM
of m is the solution for a of problem (V3) and we
have
™(x)FFLM
= u 'i (Q'pKQey1
G'/JKY
where
1
0(XltX)
P(Xn,X)
71
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
One of the main advantage is that one get the explicit expression:
™
uti Er=3 Wirt
n
(X)FFLM
^
Wi U
E n¿=1 V2-ij=l
with
Wij
= P% {Pi - Pj) KiKj
where Kl = K [h~15 ( x , ^ ) ) and Pi = P (Xi,\)-
It is clear that this is a faster
way for computing a locally modelled kernel-type estimator in a functional approach.
It is suffices to replace YLd=i ^d^ (ed-, •, •) with b@ (Xit x) in order to see (P3) as a
particular case of of (V2)- From statistical point of view, our FFLM approach
assumes that a — 6/3 (-, x) is a good way of approximation m (x) around x- In other
words, a is a good approximation of m (\) and hence TTIFFLU should be a. good
estimator of m. In addition, the explicit expression of TTIFFLM allows us to state
asymptotic properties based on the current knowledge on nonparametric functional
data analysis (see Ferraty and Vieu (2006)). Finally, this kernel type estimator
appears as a weighted average since it can be rewritten as follow:
n
mpFML (x) = J ]
W Y
ii
with j — 1,..., n and
W5
E
n
TT-^n
1 = 1 LJÍ=\
Wt i j
To complete this section, let us remark that we could take P {•,•) = 6(-, •)• But
P and 6 do not plays similar role. Indeed, P refers to the local behavior of the
regression whereas 5 concerns the local waiting. Therefore, as we will see in Section
5, authorizing two ways for locating one element with respect to another one (i.e.
P ^ S) can allow fit better the data from a practical point of view, in that sense,
considering P =¡¿ 5 can lead us to a more adaptive (or flexible) method.
72
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
3.4
Locally Modelled Regression and Functional Data
FFLM kernel-type estimator: asymptotic behavior
Our goal is to study the pointwise asymptotic behavior of TUFFLM (X) °f the regression operator m (x) = E (Y\X = x)i X being a fixed element of 7i. The main results
state the almost complete convergence which implies the almost sure convergence.
Before giving the theorems, let us introduce some assumption and terminologies.
First of all, let us star with a crucial hypothesis concerning the distribution of the
functional r . v ^ ,
(HI) (px (ui,u2) := P^
< 5(X,x)
< «2) and \fu > 0, ^px(-u,u)
> 0
In the following we will use the simpler notation
<px{u)
=<px{-u,u)
A soon as 5(-,-) is a metric or, more generally, a semimetric, <p^(u) can be
interpreted as a ball of H centered at x and radius u. When u becomes smail,
the terminology "small ball probabilities" is commonly used, which is a field of the
probability theory intensively investigated, see for instance Li and Shao (2001) for
an overview on this topic in relation with Gaussian process. Actually, the function
ipx (u) (which is a direct extension of the small ball probability notion) plays a
similar role in the functional case as the density function in the finite dimensional
setting. Indeed, it is usual in the multivariate nonparametric setting to estimate a
quantity at a point for which one has all around number a number of observations
large enough.
A common way for assuming that is to assume that the density
function valued at this point is strictly positive. In the infinite dimensional setting,
there is not a reference measured as the Lebesgue one in the finite dimensional
context. However, one has to make a similar assumption without density notion.
This is the goal of Hypothesis H I which translates in the functional context the fact
that we have at hand sufficiently observation around x and hence it makes sense to
estimate the regression operator at this point.
73
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
As it is standard in the nonparametric modelling, one has to consider regularity
assumptions. The first one concerns the unknown regression operator m which will
be supposed to verify one of the following constsraints:
(H2 C ): m e j / . - W - ^ R ,
lim
/ ( X ') =
f(x)\
and
(H2L): me{f:H-+U,
C\d{XlX')\v}
3(7 € R + , X' € 7i, !/(*') - f(x)\ <
Clearly, the first hypothesis is a continuity-type constraint which will aiiow us
to get pointwise convergence. Moreover, as soon as one wishes to state rate of
convergence, one have to introduce more restrictive constraint, which is the role
played by the second Lipschitz-type hypothesis.
An another regularity-type constraints is also necessary in order to control the
shape of the local functional (3 which can be expressed as follows:
(H3) 30 < M < M2, VX' £ H, Mx \d(XlX')\
< l/?(x,x')¡ <
^2¡d(X,X%
Now we focus on assumptions concerning the kernel estimator mppLM,
Let us
first introduce assumption on the kernel function K:
(H4) K is a differentiable function K : R —> R+, such that its support is ( - 1 , lj
These kind of kernels contain the standard symmetrical kernels used in the literature (uniform, triangle, quadratic, Epanechnikov,...) Once this class of kernels
defined, one can propose additional hypothesis acting on the estimator (and also on
the distribution of the functional r.v. X)
(H5) h is a positive sequence such that:
lim h = 0 and lim
n—too
(H6) 3n0, V?i > n0> (H?)
h
^
/^ ^
og
yA = 0.
n—•ocn'Px\
(zh, h) £ (z2K (z)) dz > C > 0
¡BixJl) P («• X) dP («) = o (jB{xh)
f? (ti, x) dP (u))
74
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
'
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally .Modelled Regression and Functional Data
One ends the listing of hypothesis by focusing on the scalar response Y through
its successive conditional moments:
(H8) VA; = 2,... 0fc : x -•* E (Yk\X — x) is
a
continuous operator with respect
to*Hypothesis (H2)-(H5) and (H8) are standard in the nonparametric functional
regression setting and extends what is usually assumed in the classical p-dimensional
nonparametric literature (see Ferraty and Vieu (2006) for a large discussion). Hypothesis (H6) precise the behavior of the bandwidth h in relation with the small
ball probabilities and the kernel function K. The new hypothesis (H7) about local
behavior of the operator (3 which models the local shape of the regression. Actually,
if ¡3 = d this assumption means that the local expectation of ¡3 is small enough
with respect to its moment of second order. In order to fix the ideas and to see the
unrestrictive feature of (H7), note that if the r.r.v. (3 (X, x) admits a differentiable
density with respect to the Lebesgue measure then (H7) is satisfied.
Now, we are ready for giving the two main results. The first one states the
pointwise almost complete convergence whereas the second one precise the rate of
convergence.
Theorem 1. Under (Hi),
fH2c),
rnFFLM
T h e o r e m 2. Under (til),
{H3)-fH8),
(x) - m (x) = Oa.CQ (1)
(H.2^), (H3)-(H8),
rhpFLM (x) -
m
we have:
we have:
(x) = O {hv) + 0 Q ,
iogn
mpx (h)
As it is usual in nonparametric statistics, when we wish to get a more accurate
asymptotic result, we have to introduce more restrictive assumptions. Clearly, this
75
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
is the case here because Theorem 2 needs that the regression operator m satisfies
the Lipschitz-type consdition (instead of the continuity-type one for Theorem 1). In
the remainder of this section, we give a sketch of the proof of both theorems. Let
us first introduce the folloiving quantities
n
1
VA;
n
J J
n (n - 1) E (wn) j-f ¿-¡
and in such way that we have
with E(rñ 0 (x)) = 1- The proof of the aforementioned theorems is based on the
following descomposition
rhFFLM(x)-™(x)
=
r^{(^i(x)-E(m1(x)))-(m(x)-E(m1(x)))}
m0 [x)
——n '"oW - 1 )
m0{x)
and also on the following lemmas:
Lemma 1 Assume that ( H I ) , (H3)-(H5) are satisfied; and
(i) if (H2c) holds, we have:
m(x)-E(m,(x)) = 0(l)
(ii) if (H2 L ) holds, we have:
m(x)-E(m1(x))-0(n
Lemma 2 Suppose that assumptioms ( H I ) , (H2c), (H3)-(H7) are satisfied,
(i) we have
™o {x)"l
= O,
76
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
•mpx (h)
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
(li) in addition, if (H8) holds, we get:
m 1 ( x ) - E ( m 1 ( X ) ) = 0„.c '
'
^
rupx[h) J
Lemmas l-(i) and 2 lead us to statement of Theorem 1. Lemma l-(ii) in addition
with Lemma 2 allow us to get Theorem 2. Detailed proofs of these lemmas can be
found in the Appendix.
3.5
FFLM regression in action
The aim of this section is to give an idea on the behavior of the FFLM method
from a practical point of view. To do that, we consider the functional data set
(X*,Yi)i_1
2J5
where the vector X*' = (A¿ (Ai),..., A¿ (A10o)) is the ith discretized
spectrometric curves presented in the introduction ant Y¿ is the corresponding percentage of moisture. Before implementing our local regression and as most of statistical methods, we have to fix some quantities.
As a fist important practical aspect consist in fixing the local shape of the regression, that is the operator /?(-, •). To this end, let us consider the usual inner
product
{f,9)n = J f(t)g(t)dt
According to our experiment of such a spectrometric dataset, we propose to
define a family of local shapes as follow:
Where 0 is a fixed real-valued function and where x^
denotes the gth derivative
of the function x- ® is selected among the eigenfunctions of the covariates operator
^ ((xi
_
X (9) ) (#. Xj •. X2 ) ) • From practical point of view, we get a dicretized
version of this eigenfunctions by computing the eigenvectors of the empirical covariances operator
1
' ie£
77
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
where £ is a subset of {1,.... 215} (£ will called learning sample). The idea of
choosing 9 through the spectral analysis of the covariance operator is driven by the
functional principal components analysis methodology (see
). Indeed, this is
a useful tool for exhibing functional directions which can reveal pertain information.
Here, for our current dataset we tried several parameters q and 6 and the best results
in terms of prediction are obtained for q = 1 and for the third eigenfunction (i.e.
the eigenfunction associated to the third larger eigenvalue).
A second crucial point concerns the choice for the operator 5(-, •) which allows
to locate a curve with respect to another one. According to our experiment of
spectrometric curves, as well adapted local tool is a semimetric based on the second
derivative
Some motivations for using this kind of the smoothing parameter can be found
in Ferraty and Vieu (2002, 2006). It is worth noting that considering also the second
derivative for ¡5 leads to worse MSE. So, allowing and index of proximity S to be
different of the local shape /? makes this FFLM method more flexible in the sense
that the fit is better.
The last point concerns to the choice of the smoothing parameter (i.e. the bandwidth h), which is very standard problem in nonparametric statistics. As described
in Chapter 7 of Ferraty and Vieu (2006), we did it by using Cross-Validation procedure over fcNN-type bandwidth.
In order to illustrate the pertinence of the FFLM method, one splits randomly
the initial sample into two subsamples.
The first one (Pdi, Yi)ieC
"Learning sample", allows to build the estimator (i.e. rnFFLM
usually called
(•)). the second
subsample (A¿, Yi)i&r usually called the "Testing sample" allows to predict percentage of moisture (i.e. rnFFLM
(Xi)ieT).
Of course £ and T are constructed such that
£ U T = {1, ...,215} and £ n T = <fi. Now, one way to evaluate the performance of
78
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
MEAN SQUARE
FLC
ERRORS
FFLM
AVERAGE
Figure 3.3: Distribution of the MSE' for each method
this method is to compute the mean square errors (MSE):
MSE
(A T) = rj-
£
{Yi ~ KFLM
(*)) 2
which is a usual index for evaluating the quality of fitting to the data. In order to
get more robust results, we constructed randomly 100 learnings and testing samples
\^
'
J
/i=l,...,100
which allows us to get 100 quantities MSE (£{s\T^)
for s = 1,...,100. In
addition, to show the good behavior of the FFLM method, we also implemented
the functional local constant regression estimate mFLC
(see Section 2) with the
similar bandwidth choice procedure. Figure 2, displays several box-and-whiskers
which summarizes the distribution of the mean square errors computed over the 100
experiments: the left one correspond to the FLC regression, the middle one gives
and idea of the performance of the FFLM method, and the right one correspond to
the "AVERAGE" method which predict moisture by using 0.5 {mpic + rnpFLM)79
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
In this situation, the local modelled approach seems globally to fit better to
the data. Actually, looking at the AVERAGE method we see that, if we run both
functional nonparametric approaches, we improve the quality of fitting in terms of
MSE. In other words, the FFLM method has to be seen as an complementary
nonparametric tools of prediction, and not only as a competitive one with respect
to existing nonparametric functional method.
3.6
Conclusions
One can see that Fats-FLM method as a good alternative to the FLC approach. It
is worth noting that a low computational cost drive us to significant improvement
in terms of the quality of the prediction. This good behavior makes this functional
local modelled regression very attractive, at least for the kind of spectrometric data
set. as emphasized in the implementation, this is also complementary functional
prediction method in the sense that combined with the functional local constant
regression, the obtained mean square errors overpasses those coming from FLC and
Fats-FML.
In addition, this work offers very interesting perspectives of investigations. A
first direction concerns the statement of theoretical properties with respect to the
cross-validation methodology (see Benhenni, et.al 2006) for asymptotic dealing with
optimal local bandwidth choice for the functional local constant regression method).
A second track would consist in looking for a more pertinent functional direction (i.e.
9) by proposing an "optimal" linear combination of eigenfunctions. Of course, this
deserves a deeper investigations from both practical and theoretical point of view.
At a third time, one can focuses on the function locally modelled (FML) regression
explained in Section 3.2.2. Surely, the
computational cost will enlarge but we
can expect a significant improvement in therms of quality of prediction because it
would allow to construct a large family of operators (3 (-, •). At the same time, this
statistical functional method opens new theoretical challenges as soon as we wishes
to state asymptotic properties.
80
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
Finally, this works is a first step toward local statistical models taking into
account functional variables. This is also an encouragement to pursue further investigations in this topic.
81
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
Appendix
We start this section with a preliminary technical lemma. In the following C is
a generic constant (0 < C < oo).
L e m m a 3. Under ( H I ) , (H3)-(H6), we have:
(0 V(Jfc,0 GN* x N ' . E Í ^ I ^ l ' )
<Cti<fx(h)
(u) E (KiPl) > Ch2ipx (h)
Proof.
(i) Note that (H3) implies:
and because of kernel K is bounded on [—1,1] we get:
K^\^\lh~l<CK,t\5{Xl,x)\lh-ll[-l,í]{h-l\5{?¿ux)\)
and thus, we have
K*\f3,\lh-l)<apx(h)
which is the claimed result
(ii) By using (H3) we have:
E(K10l)>CE(6(X1,X)2K1
Moreover, we can write for
E K,
dU,X)
t2K{t)P~^{t)
2
h
•1
1 r rt
i L/-1
=
/ J
~-{u2K{u))+K{-l))du
du
K(-l)ipx(h)
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
'
dP*^
(t)
l[Uil]{t)dPd^{t)^u{u2K{u))
+
j - i
82
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
the last equation coming from the Fubini's theorem. In addition, it's easy to
check that:
i{x.X)
1 K 1 ) (i) dPS-
(t) <P(uh<5
(X, X) < h)
J -i
in order to write
Ki
K (-1) <px (h) + ^
h?
Lx
(uh, h) ~ {u2K (u))\ du
Finally, it comes:
h~2E {Ktfl)
> C<px (h) (K (-1) - ^ - ~ £
(px (uh, h) j - (u2K (u)f) du*j
It suffices to use (H6) for obtaining the desired lower bound, which ends the
proof of lemma 3-(M)
P r o o f of L e m m a 1.
On the one hand, we have
E(w 12 )
on the other hand, it hold
E (E (rm (x) \X2)) = —
1
rE (W12E (Y2\X2))
E(tüi 2 )
which allows us to write:
\m(x)E(m0(x))~E(m1(X))\
E(wl2)
<
sup
<{wn{m(x)
-m(X2)))
\m(x)-m(x')\
X£B(x,h)
Combining lemma 3-(n) with (H2c) in order to get lemma l-(i).
we use (H2¿) instead of (H2c), it is clear that
83
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
However, if
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
sup
\m(x) — m(x')\
— O (hv)
xeB(x,h)
which leads us to lemma 1-(M)
P r o o f of L e m m a 2.
(ii) Let us start the proof remarking that
1^
n2h2cpx
mi{x)
n(n
KJYA/I
(hy
s2
(1)
— l)E(u>i2)
n j ^ hyx {h)J yn ^
53
h<£x (/i)
¿4
which allows to write:
mi (x) - E (mi (x)) = Q [S& - E (S1S2) - (S3S4 - E (S3S4))}
At first, we have
SiS2-E(SiS2)
-
(5i-E(51))(52-E(52)) + (52-E(52))E(5i)
+(5i - E (5i))E (5 2 ) + E (5i) E (5 2 ) - E (S1S2)
(2)
It remains to study each term of expression (2):
(a) Let us write
s,
" E ( s , ) -;£i^£(S)—-h
In order to apply Corollary A.8-Ü in Ferraty and Vieu (2006), we focus on absolute moments of the r.r.v. ZU'E|Z£|
E\vAhTm{K3Y3-E{K3Y3))
=
J2 ck,m (KjYj)k
<
<Px (hym
(3)
(E (/i i y i )) m " f c ( - I ) " " "
\m—k
J2 ck,mE {K*ok (XJ) |E ( ^ m ( ^ ) ) |
fc=0
84
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
(4)
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
the last inequality is obtained by conditioning on X\. In addittion, (H2C) implies
that m (XI) = m (x) + o (1) whereas we get ak (X\) = <7fc (x) + 1 as soon as (H8) is
checked. This combined with (4) allow to write
E|Z£|
=
O L^(/i)-
m
^ E « ) ( E ( ^ ) ) m—k
fc=0
=
O
max <px(h)
fc+1
\ K=0,...,m
= Ofaih)-"*1)
knowing that E (K^) = 0(<px(h))
(5)
(see Lemma 3-(i) with 1 = 0). Finally, it
suffices to apply Corollary A.8-Ü in Ferraty and Vieu (2006) with a2n = tpx (/i) _1 to
get
(b) In the same way, writing
is not difficult to see that:
m
E\Z%\ < ^ - ^ ( ^ — ^ c ^ E ^ f ) |E(/r3/3?)r~fc
(4)
fc=0
which, combined with Lemma 3-(i) when 1=2 implies that:
<\z*\ = o{<px(h)- m + l i
Once again, we apply Corollary A.8-Ü in Ferraty and Vieu (2006) with a.2
<Px (hy1
t0
get
B(a) =0
*-
7
~ tó> < '
85
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
_
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
[c] Note t h a t
E(Si)
=
ipxihr'EiKiYj
^x(hylE(Kim(Xl))
=
and because of mi (X\). — rn (x) + o (1) we get that E (Si) ~ O (1). Similarly,
we get that E(S,2) = 0(1).
E(SiS2).
Now it remains to study the quantityE (5"i) E (52) -
To this end, let us remark that
E(S1)E(S2)-E(S1S2)
n{n
=(l-
1)
n^
h'2<px
)
(h)-2E{K10l)B(K1Y1)
Using similar arguments outlined above, it is easy to see that
E (Si) E (S2) - E (SjSa) = O (n" 1 )
which is negligible with respect to
\/^~n^-
Finally, items (a), (b) and (c) states that
SiS2~E(S1S2)
= Oa.co
logn
(8)
n
^ V Vx (h) t
Similar arguments allows to conclude that
S3S4~E(S3S4)
= Oa.,
logn
xV
(9)
n<
Px (h) J
Now to achieve the proof, we have to study the upper bound of the quantity
Firstly, note that
h\E(pK1)\
(3(u,X)dP(u)
<Ch
B(xA)
and (H7) implies that
h\E(PK1)\
= o[ /
lB(x,h)
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
pz(u,X)dP(u)
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
by applying Lemma 3-(i) with K — l[-i,i]> k = 1 and I = 2 we get that
/
p2(u,x)dP(u)<C'h2^(h)
JB(x,h)
which implies that
E(/31K1) = o(<px(h))
Now, Lemma 3-(M) and the last result allows us to write:
E (w12) = E (KiPfj E (Kij - E (KxPJ > Ch2^>x {h)
As a direct consequence we get that
Q =
o(i)
which ends the proof of Lemma 2-(ii).
(i) This result can be deduced directly from Lemma 2-(M) by taking Yi = 1. In
this case, Hypothesis (H8) is unnecessary.
87
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Chapter 3
Locally Modelled Regression and Functional Data
REFERENCES
Abraham, C , Cornillon, P., Matzner-Lober, E., N Molinari. (2003) Unsupervised
Curve Clustering Using B-spline. Scandinavian Journal of Statistics., 30, 581-595.
Benhenni, K., Ferraty, F., Rachdi, M., and P. Vieu. (2006). Nonparametric Regression for Functional Data: Optimal local Bandwidth Choice. Submitted.
Boente, C , and Fraiman, R. Kernel-based Functional Principal Components. Statistics and Probability Letters, 48, 335-345 (2000).
Borggaard, C and Thodberg, H. (1992). Optimal Minimal Neural Interpretation of
Spectra. Analytical Chemistry, 64, 545-551.
Cheng, M-Y and Hall, P. (2002). Error Dependent Smoothing Rules in Local Linear
Regression. Statistica Sinica, 12, 429-447.
Castro, P. E., Lawton, W. H., and Sylvestre, E. A. Principal Models of Variation
for Processes with Continuous Sample Curves. Technometrics, 28, 329-337.
Dabo-Niang, S., Ferraty, F., and Vieu, P. (2004) Nonparametric Unsupervised Classification of Satellite Wave Altimeter Form. In Proceedings in Computational Statistics, ed. J. Antoch, Physica-Verlag, Heidelberg, 878-886.
Dauxois, J., Pousee, A., Romain, Y. (1982) Asymptotic Theory for the Principipal
Component Analysis of a Random Vector function: Some application to Statistical
Inference. Journal of Multivariate Anlysis, 12, 136-154.
Fan, J (1993). Local Linear Regression Smoother and Their Minimax Efficiencies.
Annals of Statistics, 21, 1, 196-216.
Fan, J and Gijbels, I (1992). Variable Bandwidth and Local Linear Regression
Smoothers. Annals of Statistics, 20, 2008-2036.
Fan, J and Gijbels, I (1996). Local Polynomial Modelling and its Applications,
Chapman and Hall, London.
Ferraty, F and P. Vieu (2002) The Functional Nonparametric Model and Application
to Spectrometric Data. Comput. and Statistics, 17, 545-564
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Locally Modelled Regression and Functional Data
Chapter 3
Ferraty, F and P. Vieu (2003) Functional Nonparametric Statistics in Action. Quantification and Simulation
Processes, Humboldt-Universitat Zu Berlin. Discussion
Paper No 39.
Ferraty, F and P. Vieu (2004). Nonparametric Models For Functional Data, with
Applications in Regression, Time Series Prediction and Curve Discrimination. NonParametric Statistics, 16, 1-2, 111-125.
Ferraty, F and P. Vieu (2006). Nonparametric Modelling for Functional Data Analysis. Theory and Practice. Springer, New York (In print).
Frank, I, E., and Friedman, J, H. (1993). A Statistical View of Some Chemometric
Regression Tools (with discussion). Technometrics, 35, 109-148.
Gasser, T.,Hall, P and Presnell, B (1998). Nonparametric Estimation of the Mode
of a Distribution of Random Curves. J. R Statistics Society. Serie B, 60, 681-691.
García-Soidán, P., Gonzalez-Manteiga, W and Febrero-Bande, M (2003). Local
Linear Regression Estimation of the Variogram. Statist.Probab.Letters.,
64, 169-
179.
Hall, P., Poskitt, P and Presnell, D (2001). A Functional Data-Analytic Approach
to Signal Discrimination. Technometric, 35, 140-143.
Hertgarner, N., Wegkamp, M., and Matzner-Lober, E (2002) Bandwidth Selection
for Local Linear Regression Smoothers. Journal of the Royal Statistical Society.
Serie B Stat. Methodology., 64, 791-804.
Li, W. V., and Shao, Q. M. (2001). Gaussian Processes: Inequalyties, Small Ball
probabilities and Applications. In: C.R. Rao and D. Shanbhag (eds.)
Stochatics
processes, Theory and Methods. Handbook of Statistics, 19, North-Holland, Amsterdam.
Locantore, N., Marrón, J., Simpson,D., Tripoli, N., Zhang, J and Cohen, K (1999).
Robust Principal Componente Analysis for Functional Data (with discussion). Test,
8, 1-174.
89
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Some Practical Problems of Recent Nonparametric Procedures: Testing, Estimation and Application. Jorge Barrientos-Marín
Ir al índice/Tornar a l'índex
Chapter 3
Locally Modelled Regression and Functional Data
Muller, H-G (1987) Weighted Local Regression and Kernel Methods for Nonparametric Curve. Journal of the American Statistical Associations, 82, 397, 231-328.
Nadaraya, E. A. (1964). On Estimating Regression. Theory Probability Applied, 10.
Ramsay, J. O and C. J Dalzell (1991). Some Tools for Functional Data Analysis.
Journal of the Royal Statistics Society. Series B, 53, 3, 539-572.
Ramsay, J,. Altman, N., Bock, R (1994). Variation in Height Acceleration in the
Fels Growht Data. Canadian Journal of Statistics, 22, 89-102.
Ramsay, J. O and B. W. Silverman (2002). Applied Functional Data Anlysis: Methods and Case Studies. Springer-Verlang, New York.
Ramsay, J. O and B. W. Silverman (2005). Functional Data Analysis. 2nd Edition.
Springer-Verlang, New York.
Ruppert, D and M. P. Wand (1994) Multivariate Locally Weighted Least Squares
Regression. Annals of Statistics, 22, 3, 1346-1370.
Wand, M. P and M. C. Jones (1995) Kernel Smoothing. Monographs on Statistics
and Applied Probability, 60. Chapman & Hall.
Watson, G. S (1964) Smooth Regression Analysis. Sankhya Ser. A 26.
90
Tesis doctoral de la Universidad de Alicante. Tesi doctoral de la Universitat d'Alacant. 2007
Descargar