当前位置: X-MOL 学术AStA. Adv. Stat. Anal. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Confidence regions and other tools for an extension of correspondence analysis based on cumulative frequencies
AStA Advances in Statistical Analysis ( IF 1.4 ) Pub Date : 2020-10-26 , DOI: 10.1007/s10182-020-00382-5
Antonello D’Ambra , Pietro Amenta , Eric J. Beh

Over the past 50 years, correspondence analysis (CA) has increasingly been used by data analysts to examine the association structure of categorical variables that are cross-classified to form a contingency table. However, the literature has paid little attention to the case where the variables are ordinal. Indeed, Pearson’s chi-squared statistic \(X^{2}\) can perform badly in studying the association between ordinal categorical variables (Agresti in An introduction to categorical data analysis, Wiley, Hoboken, 1996; Barlow et al. in Statistical inference under order restrictions, Wiley, New York, 1972). Taguchi’s (Nair in Technometrics 28(4):283–291, 1986; Nair in J Am Stat Assoc 82:283–291, 1987) and Hirotsu’s (Biometrika 73: 165–173, 1986) statistics have been introduced in the literature as simple alternatives to Pearson’s index for contingency tables with ordered categorical variables. Taguchi’s statistic takes into account the presence of an ordinal categorical variable by considering the cumulative sum of the cell frequencies across the variable. An extension of correspondence analysis using a decomposition of Taguchi’s statistic has been introduced to accommodate this feature of the variables. This considers the impact of differences between adjacent ordered categories on the association between row and column categories. Therefore, the main aim of this paper is to introduce a confidence region for each of the ordered categories so that one may determine the statistical significance of a category with respect to the null hypothesis of independence. We highlight that the construction of these circles has not been considered in the literature for this approach to CA. We also introduce a suitable decomposition of Taguchi’s statistic to test the statistical significance of each column category.



中文翻译:

置信区域和其他工具,用于根据累积频率扩展对应分析

在过去的50年中,数据分析人员越来越多地使用对应分析(CA)来检查交叉分​​类以形成列联表的分类变量的关联结构。但是,文献很少关注变量为序数的情况。确实,皮尔逊的卡方统计量\(X ^ {2} \)可以在研究有序分类变量之间的关联方面表现不佳(Agresti,《分类数据分析简介》,Wiley,Hoboken,1996; Barlow等人在有序限制的统计推断中,Wiley,纽约,1972)。Taguchi(Nair in Technometrics 28(4):283–291,1986; Nair in J Am Stat Assoc 82:283–291,1987)和Hirotsu(Biometrika 73:165–173,1986)的统计资料已在文献中引入如下:带有序分类变量的列联表的Pearson索引的简单替代方案。Taguchi的统计数据通过考虑跨变量的单元格频率的累加总和来考虑序数分类变量的存在。引入了使用田口统计的分解进行对应分析的扩展,以适应变量的这一特征。这考虑了相邻排序类别之间的差异对行类别和列类别之间的关联的影响。因此,本文的主要目的是为每个有序类别引入一个置信区域,以便可以确定一个类别相对于独立性零假设的统计显着性。我们着重指出,对于CA的这种方法,文献中尚未考虑这些圈子的构建。我们还介绍了田口统计的适当分解,以检验每个列类别的统计显着性。这考虑了相邻排序类别之间的差异对行类别和列类别之间的关联的影响。因此,本文的主要目的是为每个有序类别引入一个置信区域,以便可以确定一个类别相对于独立性零假设的统计显着性。我们着重指出,对于CA的这种方法,文献中尚未考虑这些圈子的构建。我们还介绍了田口统计的适当分解,以检验每个列类别的统计显着性。这考虑了相邻排序类别之间的差异对行类别和列类别之间的关联的影响。因此,本文的主要目的是为每个有序类别引入一个置信区域,以便可以确定一个类别相对于独立性零假设的统计显着性。我们着重指出,对于CA的这种方法,文献中尚未考虑这些圈子的构建。我们还介绍了田口统计的适当分解,以检验每个列类别的统计显着性。本文的主要目的是为每个有序类别引入一个置信区域,以便可以确定一个类别相对于独立性零假设的统计显着性。我们着重指出,对于CA的这种方法,文献中尚未考虑这些圈子的构建。我们还介绍了田口统计的适当分解,以检验每个列类别的统计显着性。本文的主要目的是为每个有序类别引入一个置信区域,以便可以确定一个类别相对于独立性零假设的统计显着性。我们着重指出,对于CA的这种方法,文献中尚未考虑这些圈子的构建。我们还介绍了田口统计的适当分解,以检验每个列类别的统计显着性。

更新日期:2020-10-30
down
wechat
bug