-
Estimation of parameters of logistic regression for two-stage randomized response technique Comput. Stat. (IF 0.744) Pub Date : 2021-01-18 Pei-Chieh Chang, Kim-Hung Pho, Shen-Ming Lee, Chin-Shang Li
When a survey study is related to sensitive issues such as political orientation, sexual orientation, and income, respondents may not be willing to reply truthfully, which leads to bias results. To protect the respondents’ privacy and improve their willingness to provide true answers, Warner (J Am Stat Assoc 60:63–69, 1965) proposed the randomized response (RR) technique in which respondents select
-
Variable selection in partially linear additive hazards model with grouped covariates and a diverging number of parameters Comput. Stat. (IF 0.744) Pub Date : 2021-01-15 Arfan Raheen Afzal, Jing Yang, Xuewen Lu
In regression models with a grouping structure among the explanatory variables, variable selection at the group and within group individual variable level is important to improve model accuracy and interpretability. In this article, we propose a hierarchical bi-level variable selection approach for censored survival data in the linear part of a partially linear additive hazards model where the covariates
-
Factor dimension determination for panel interactive effects models: an orthogonal projection approach Comput. Stat. (IF 0.744) Pub Date : 2021-01-15 Cheng Hsiao, Yimeng Xie, Qiankun Zhou
We consider a computationally simple orthogonal projection method to implement the (Bai and Ng in Econometrica 70:191–221, 2002) information criterion to select the factor dimension for panel interactive effects models that bypasses issues arising from the joint estimation of the slope coefficients and factor structure. Our simulations show that it performs well in cases the method can be implemented
-
Variational approximation for importance sampling Comput. Stat. (IF 0.744) Pub Date : 2021-01-13 Xiao Su, Yuguo Chen
We propose an importance sampling algorithm with proposal distribution obtained from variational approximation. This method combines the strength of both importance sampling and variational method. On one hand, this method avoids the bias from variational method. On the other hand, variational approximation provides a way to design the proposal distribution for the importance sampling algorithm. Theoretical
-
Bayesian Multiple Change-Points Detection in a Normal Model with Heterogeneous Variances Comput. Stat. (IF 0.744) Pub Date : 2021-01-12 Sang Gil Kang, Woo Dong Lee, Yongku Kim
This study considers the problem of multiple change-points detection. For this problem, we develop an objective Bayesian multiple change-points detection procedure in a normal model with heterogeneous variances. Our Bayesian procedure is based on a combination of binary segmentation and the idea of the screening and ranking algorithm (Niu and Zhang in Ann Appl Stat 6:1306–1326, 2012). Using the screening
-
Goodness-of-fit testing of survival models in the presence of Type–II right censoring Comput. Stat. (IF 0.744) Pub Date : 2021-01-08 M. Cockeran, S. G. Meintanis, L. Santana, J. S. Allison
We consider a variety of tests for testing goodness–of–fit in a parametric Cox proportional hazards (PH) and accelerated failure time (AFT) model in the presence of Type–II right censoring. The testing procedures considered can be divided in two categories: an approach involving transforming the data to a complete sample and an approach using test statistics that can directly accommodate Type-II right
-
A partial least squares approach for function-on-function interaction regression Comput. Stat. (IF 0.744) Pub Date : 2021-01-06 Ufuk Beyaztas, Han Lin Shang
A partial least squares regression is proposed for estimating the function-on-function regression model where a functional response and multiple functional predictors consist of random curves with quadratic and interaction effects. The direct estimation of a function-on-function regression model is usually an ill-posed problem. To overcome this difficulty, in practice, the functional data that belong
-
Designing robust modified R control charts for asymmetric distributions under ranked set and median ranked set sampling Comput. Stat. (IF 0.744) Pub Date : 2021-01-05 Nursel Koyuncu, Derya Karagöz
Presence of outliers or contamination in the process control affect the construction of quality control limits badly. Therefore, more attention is to paid robust methods describing the data majority. The main focus of this study is to construct robust R charts by using ranked set sampling (RSS) and median ranked set sampling (MRSS) designs under contaminated skewed distributions such as Marshall–Olkin
-
On the transition laws of p -tempered $$\alpha $$ α -stable OU-processes Comput. Stat. (IF 0.744) Pub Date : 2021-01-04 Michael Grabchak
We derive an explicit representation for the transition law of a p-tempered \(\alpha \)-stable process of Ornstein–Uhlenbeck-type and use it to develop a methodology for simulation. Our results apply in both the univariate and multivariate cases. Special attention is given to the case where \(p\le \alpha \), which is more complicated and requires developing the new class of so-called incomplete gamma
-
On the exact distribution of the likelihood ratio test statistic for testing the homogeneity of the scale parameters of several inverse Gaussian distributions Comput. Stat. (IF 0.744) Pub Date : 2021-01-03 Mahmood Kharrati-Kopaei
Several researchers have addressed the problem of testing the homogeneity of the scale parameters of several independent inverse Gaussian distributions based on the likelihood ratio test. However, only approximations of the distribution function of the test statistic are available in the literature. In this note, we present the exact distribution of the likelihood ratio test statistic for testing the
-
Variables skip-lot sampling plans on the basis of process capability index for products with a low fraction of defectives Comput. Stat. (IF 0.744) Pub Date : 2021-01-03 Chien-Wei Wu, Ming-Hung Shu, Pei-An Wang, Bi-Min Hsu
The skip-lot sampling plan (SkSP) is employed in supply chains to decrease the amount of inspection required for submitted lots when they have demonstrated a succession of lots with excellent quality. As only some fractions of lots are examined, the cost of inspection is reduced. With the current abundance of high-yield products, however, the majority of SkSP schemes have been utilized for attributes
-
Outlier detection under a covariate-adjusted exponential regression model with censored data Comput. Stat. (IF 0.744) Pub Date : 2021-01-02 Yingli Pan, Zhan Liu, Guangyu Song
Exponential regression models with censored data are most widely used in practice. In the modeling process, there exist situations where the covariates are not directly observed but are observed after being contaminated by unknown functions of an observable confounder in a multiplicative manner. The problem of outlier detection is a fundamental and important problem in applied statistics. In this paper
-
Correction to: The 8-parameter Fisher–Bingham distribution on the sphere Comput. Stat. (IF 0.744) Pub Date : 2020-09-30 Tianlu Yuan
In the original publication of the article, the corrections in Eq. (13) were missed, in which 2v − 1 was changed to 2v in the exponent.
-
Spatio-temporal change of support modeling with R Comput. Stat. (IF 0.744) Pub Date : 2020-09-23 Andrew M. Raim, Scott H. Holan, Jonathan R. Bradley, Christopher K. Wikle
Spatio-temporal change of support methods are designed for statistical analysis on spatial and temporal domains which can differ from those of the observed data. Previous work introduced a parsimonious class of Bayesian hierarchical spatio-temporal models, which we refer to as STCOS, for the case of Gaussian outcomes. Application of STCOS methodology from this literature requires a level of proficiency
-
Analysis of type I and II error rates of Bayesian and frequentist parametric and nonparametric two-sample hypothesis tests under preliminary assessment of normality Comput. Stat. (IF 0.744) Pub Date : 2020-09-20 Riko Kelter
Testing for differences between two groups is among the most frequently carried out statistical methods in empirical research. The traditional frequentist approach is to make use of null hypothesis significance tests which use p values to reject a null hypothesis. Recently, a lot of research has emerged which proposes Bayesian versions of the most common parametric and nonparametric frequentist two-sample
-
Comprehensive world university ranking based on ranking aggregation Comput. Stat. (IF 0.744) Pub Date : 2020-09-18 Yang Zhang, Yu Xiao, Jun Wu, Xin Lu
Many university rankings have been proposed in recent decades. The remarkable divergence among various rankings leads to confusion for decision-makers. In this paper, we propose to generate a comprehensive world university ranking by aggregating existing individual university rankings. We present a new graph-based rank aggregation method by defining a competition graph of universities, in which each
-
Economic design of memory-type control charts: The fallacy of the formula proposed by Lorenzen and Vance (1986) Comput. Stat. (IF 0.744) Pub Date : 2020-09-17 Amir Ahmadi-Javid, Mohsen Ebadi
Memory-type statistical control charts, such as exponentially weighted moving average (EWMA) and cumulative sum (CUSUM), are broadly-used statistical feedback policies for detecting small quality changes in univariate and multivariate processes. Many papers on economic-statistical design of these control charts used the general formula proposed by Lorenzen and Vance (Technometrics 28(1):3–10, 1986)
-
An efficient algorithm for joint feature screening in ultrahigh-dimensional Cox’s model Comput. Stat. (IF 0.744) Pub Date : 2020-09-12 Xiaolin Chen, Catherine Chunling Liu, Sheng Xu
The Cox model is an exceedingly popular semiparametric hazard regression model for the analysis of time-to-event accompanied by explanatory variables. Within the ultrahigh-dimensional data setting, not like the marginal screening strategy, there is a joint feature screening method based on the partial likelihood of the Cox model but it leaves computational feasibility unsolved. In this paper, we develop
-
A method for computing tolerance intervals for a location-scale family of distributions Comput. Stat. (IF 0.744) Pub Date : 2020-09-05 Ngan Hoang-Nguyen-Thuy, K. Krishnamoorthy
The problems of computing two-sided tolerance intervals (TIs) and equal-tailed TIs for a location-scale family of distributions are considered. The TIs are constructed using one-sided tolerance limits with the Bonferroni adjustments and then adjusting the confidence levels so that the coverage probabilities of the TIs are equal to the specified nominal confidence level. The methods are simple, exact
-
A stationary bootstrap test about two mean vectors comparison with somewhat dense differences and fewer sample size than dimension Comput. Stat. (IF 0.744) Pub Date : 2020-09-02 Zhengbang Li, Fuxiang Liu, Luanjie Zeng, Guoxin Zuo
Two sample mean vectors comparison hypothesis testing problems often emerge in modern biostatistics. Many tests are proposed for detecting relatively dense signals with somewhat dense nonzero components in mean vectors differences. One kind of these tests is based on some quadratic forms about two sample mean vectors differences. Another kind of these tests is based on some quadratic forms about studentized
-
Finite mixtures of skew Laplace normal distributions with random skewness Comput. Stat. (IF 0.744) Pub Date : 2020-09-01 Fatma Zehra Doğru, Olcay Arslan
In this paper, the shape mixtures of the skew Laplace normal (SMSLN) distribution is introduced as a flexible extension of the skew Laplace normal distribution which is also a heavy-tailed distribution. The SMSLN distribution includes an extra shape parameter, which controls skewness and kurtosis. Some distributional properties of this distribution are derived. Besides, we propose finite mixtures of
-
Estimation of parameters in multivariate wrapped models for data on a p -torus Comput. Stat. (IF 0.744) Pub Date : 2020-07-24 Anahita Nodehi, Mousa Golalizadeh, Mehdi Maadooliat, Claudio Agostinelli
Multivariate circular observations, i.e. points on a torus arise frequently in fields where instruments such as compass, protractor, weather vane, sextant or theodolite are used. Multivariate wrapped models are often appropriate to describe data points scattered on p-dimensional torus. However, the statistical inference based on such models is quite complicated since each contribution in the log-likelihood
-
R package for statistical inference in dynamical systems using kernel based gradient matching: KGode Comput. Stat. (IF 0.744) Pub Date : 2020-07-23 Mu Niu, Joe Wandy, Rónán Daly, Simon Rogers, Dirk Husmeier
Many processes in science and engineering can be described by dynamical systems based on nonlinear ordinary differential equations (ODEs). Often ODE parameters are unknown and not directly measurable. Since nonlinear ODEs typically have no closed form solution, standard iterative inference procedures require a computationally expensive numerical integration of the ODEs every time the parameters are
-
Bayesian inference of nonlinear hysteretic integer-valued GARCH models for disease counts Comput. Stat. (IF 0.744) Pub Date : 2020-07-18 Cathy W. S. Chen, Sangyeol Lee, K. Khamthong
This study proposes a class of nonlinear hysteretic integer-valued GARCH models in order to describe the occurrence of weekly dengue hemorrhagic fever cases via three meteorological covariates: precipitation, average temperature, and relative humidity. The proposed model adopts the hysteretic three-regime switching mechanism with a buffer zone that are able to explain various characteristics. This
-
Optimal imputation of the missing data using multi auxiliary information Comput. Stat. (IF 0.744) Pub Date : 2020-07-18 Shashi Bhushan, Abhay Pratap Pandey
This article deals with some new imputation methods by extending the work of Bhushan and Pandey using multi-auxiliary information. The popularly used imputation like mean imputation, ratio method of imputation, regression method of imputation and power transformation method are special cases of the proposed methods apart from being less efficient than the proposed methods. The proposed imputation methods
-
A modified Canny edge detector based on weighted least squares Comput. Stat. (IF 0.744) Pub Date : 2020-07-15 Xu Qin
Edge detection is the front-end processing stage in most computer vision and image understanding systems. Among various edge detection techniques, Canny edge detector is the one of most commonly used. In this paper a modified Canny edge detection technique focusing on change of the Sobel operator is proposed. Instead of convolution kernels, the weighted least squares method is utilized to calculate
-
Computation of the expected value of a function of a chi-distributed random variable Comput. Stat. (IF 0.744) Pub Date : 2020-07-13 Paul Kabaila, Nishika Ranathunga
We consider the problem of numerically evaluating the expected value of a smooth bounded function of a chi-distributed random variable, divided by the square root of the number of degrees of freedom. This problem arises in the contexts of simultaneous inference, the selection and ranking of populations and in the evaluation of multivariate t probabilities. It also arises in the assessment of the coverage
-
Advanced algorithms for penalized quantile and composite quantile regression Comput. Stat. (IF 0.744) Pub Date : 2020-07-12 Matthew Pietrosanu, Jueyu Gao, Linglong Kong, Bei Jiang, Di Niu
In this paper, we discuss a family of robust, high-dimensional regression models for quantile and composite quantile regression, both with and without an adaptive lasso penalty for variable selection. We reformulate these quantile regression problems and obtain estimators by applying the alternating direction method of multipliers (ADMM), majorize-minimization (MM), and coordinate descent (CD) algorithms
-
Dirichlet process mixtures under affine transformations of the data Comput. Stat. (IF 0.744) Pub Date : 2020-07-12 Julyan Arbel, Riccardo Corradin, Bernardo Nipoti
Location-scale Dirichlet process mixtures of Gaussians (DPM-G) have proved extremely useful in dealing with density estimation and clustering problems in a wide range of domains. Motivated by an astronomical application, in this work we address the robustness of DPM-G models to affine transformations of the data, a natural requirement for any sensible statistical method for density estimation and clustering
-
Two generalized nonparametric methods for estimating like densities Comput. Stat. (IF 0.744) Pub Date : 2020-07-12 Zongyuan Shang, Alan Ker
This article presents two generalized nonparametric methods for estimating multiple, possibly like, densities. The first generalization contains the Nadaraya–Watson estimator, the Jones et al. (Biometrika 82(2):327–338, 1995) bias reduction estimator, and Ker (Stat Probab Lett 117:23–30, 2016) possibly similar estimator as special cases. The second generalization contains the Nadaraya–Watson estimator
-
Robust weighted Gaussian processes Comput. Stat. (IF 0.744) Pub Date : 2020-07-09 Ruben Ramirez-Padron, Boris Mederos, Avelino J. Gonzalez
This paper presents robust weighted variants of batch and online standard Gaussian processes (GPs) to effectively reduce the negative impact of outliers in the corresponding GP models. This is done by introducing robust data weighers that rely on robust and quasi-robust weight functions that come from robust M-estimators. Our robust GPs are compared to various GP models on four datasets. It is shown
-
Penalized weighted composite quantile regression for partially linear varying coefficient models with missing covariates Comput. Stat. (IF 0.744) Pub Date : 2020-07-09 Jun Jin, Tiefeng Ma, Jiajia Dai, Shuangzhe Liu
In this paper we study partially linear varying coefficient models with missing covariates. Based on inverse probability-weighting and B-spline approximations, we propose a weighted B-spline composite quantile regression method to estimate the non-parametric function and the regression coefficients. Under some mild conditions, we establish the asymptotic normality and Horvitz–Thompson property of the
-
Greedy clustering of count data through a mixture of multinomial PCA Comput. Stat. (IF 0.744) Pub Date : 2020-07-08 Nicolas Jouvin, Pierre Latouche, Charles Bouveyron, Guillaume Bataillon, Alain Livartowski
Count data is becoming more and more ubiquitous in a wide range of applications, with datasets growing both in size and in dimension. In this context, an increasing amount of work is dedicated to the construction of statistical models directly accounting for the discrete nature of the data. Moreover, it has been shown that integrating dimension reduction to clustering can drastically improve performance
-
Transformation mixture modeling for skewed data groups with heavy tails and scatter Comput. Stat. (IF 0.744) Pub Date : 2020-07-06 Yana Melnykov, Xuwen Zhu, Volodymyr Melnykov
For decades, Gaussian mixture models have been the most popular mixtures in literature. However, the adequacy of the fit provided by Gaussian components is often in question. Various distributions capable of modeling skewness or heavy tails have been considered in this context recently. In this paper, we propose a novel contaminated transformation mixture model that is constructed based on the idea
-
KLERC: kernel Lagrangian expectile regression calculator Comput. Stat. (IF 0.744) Pub Date : 2020-06-25 Songfeng Zheng
As a generalization to the ordinary least square regression, expectile regression, which can predict conditional expectiles, is fitted by minimizing an asymmetric square loss function on the training data. In literature, the idea of support vector machine was introduced to expectile regression to increase the flexibility of the model, resulting in support vector expectile regression (SVER). This paper
-
An accelerated EM algorithm for mixture models with uncertainty for rating data Comput. Stat. (IF 0.744) Pub Date : 2020-06-22 Rosaria Simone
The paper is framed within the literature around Louis’ identity for the observed information matrix in incomplete data problems, with a focus on the implied acceleration of maximum likelihood estimation for mixture models. The goal is twofold: to obtain direct expressions for standard errors of parameters from the EM algorithm and to reduce the computational burden of the estimation procedure for
-
Clustering method for censored and collinear survival data Comput. Stat. (IF 0.744) Pub Date : 2020-06-21 Silvia Liverani, Lucy Leigh, Irene L. Hudson, Julie E. Byles
In this paper we propose a Dirichlet process mixture model for censored survival data with covariates. This model is suitable in two scenarios. First, this method can be used to identify clusters determined by both the censored survival data and the predictors. Second, this method is suitable for highly correlated predictors, in cases when the usual survival models cannot be implemented because they
-
A Bayesian quantile regression approach to multivariate semi-continuous longitudinal data Comput. Stat. (IF 0.744) Pub Date : 2020-06-20 Jayabrata Biswas, Kiranmoy Das
Quantile regression is a powerful tool for modeling non-Gaussian data, and also for modeling different quantiles of the probability distributions of the responses. We propose a Bayesian approach of estimating the quantiles of multivariate longitudinal data where the responses contain excess zeros. We consider a Tobit regression approach, where the latent responses are estimated using a linear mixed
-
Usage of the GO estimator in high dimensional linear models Comput. Stat. (IF 0.744) Pub Date : 2020-06-18 Murat Genç, M. Revan Özkale
This paper discusses simultaneous parameter estimation and variable selection and presents a new penalized regression method. The method is based on the idea that the coefficient estimates are shrunken towards a predetermined coefficient vector which represents the prior information. This method can result in smaller length estimates of the coefficients depending on the prior information compared to
-
Bayesian joint-quantile regression Comput. Stat. (IF 0.744) Pub Date : 2020-06-15 Yingying Hu, Huixia Judy Wang, Xuming He, Jianhua Guo
Estimation of low or high conditional quantiles is called for in many applications, but commonly encountered data sparsity at the tails of distributions makes this a challenging task. We develop a Bayesian joint-quantile regression method to borrow information across tail quantiles through a linear approximation of quantile coefficients. Motivated by a working likelihood linked to the asymmetric Laplace
-
What is an optimal value of k in k-fold cross-validation in discrete Bayesian network analysis? Comput. Stat. (IF 0.744) Pub Date : 2020-06-13 Bruce G. Marcot, Anca M. Hanea
Cross-validation using randomized subsets of data—known as k-fold cross-validation—is a powerful means of testing the success rate of models used for classification. However, few if any studies have explored how values of k (number of subsets) affect validation results in models tested with data of known statistical properties. Here, we explore conditions of sample size, model structure, and variable
-
A dominance approach for comparing the performance of VaR forecasting models Comput. Stat. (IF 0.744) Pub Date : 2020-05-24 Laura Garcia-Jorcano; Alfonso Novales
We introduce three dominance criteria to compare the performance of alternative value at risk (VaR) forecasting models. The three criteria use the information provided by a battery of VaR validation tests based on the frequency and size of exceedances, offering the possibility of efficiently summarizing a large amount of statistical information. They do not require the use of any loss function defined
-
Estimating the number of clusters via a corrected clustering instability Comput. Stat. (IF 0.744) Pub Date : 2020-05-18 Jonas M. B. Haslbeck, Dirk U. Wulff
We improve instability-based methods for the selection of the number of clusters k in cluster analysis by developing a corrected clustering distance that corrects for the unwanted influence of the distribution of cluster sizes on cluster instability. We show that our corrected instability measure outperforms current instability-based measures across the whole sequence of possible k, overcoming limitations
-
Time-dependent stress–strength reliability models based on phase type distribution Comput. Stat. (IF 0.744) Pub Date : 2020-05-10 Joby K. Jose; M. Drisya
In many of the real-life situations, the strength of a system and stress applied to it changes as time changes. In this paper, we consider time-dependent stress–strength reliability models subjected to random stresses at random cycles of time. Each run of the system causes a change in the strength of the system over time. We obtain the stress–strength reliability of the system at time t when the initial
-
Modified empirical likelihood-based confidence intervals for data containing many zero observations Comput. Stat. (IF 0.744) Pub Date : 2020-04-30 Patrick Stewart, Wei Ning
Data containing many zeroes is popular in statistical applications, such as survey data. A confidence interval based on the traditional normal approximation may lead to poor coverage probabilities, especially when the nonzero values are highly skewed and the sample size is small or moderately large. The empirical likelihood (EL), a powerful nonparametric method, was proposed to construct confidence
-
BEST: a decision tree algorithm that handles missing values Comput. Stat. (IF 0.744) Pub Date : 2020-04-18 Cédric Beaulac; Jeffrey S. Rosenthal
The main contribution of this paper is the development of a new decision tree algorithm. The proposed approach allows users to guide the algorithm through the data partitioning process. We believe this feature has many applications but in this paper we demonstrate how to utilize this algorithm to analyse data sets containing missing values. We tested our algorithm against simulated data sets with various
-
Comparing scale parameters in several gamma distributions with known shapes Comput. Stat. (IF 0.744) Pub Date : 2020-04-04 Ali Akbar Jafari,Javad Shaabani
In this paper, we present eleven approaches for testing the equality of scale parameters in gamma distributions when shape parameters are known. These approaches are applicable to other problems such as testing homogeneity of variances in normal distributions, verifying equality of scale-like parameters in inverse Gaussian distributions, and comparing scale parameters in two-parameter exponential distributions
-
A time series model based on dependent zero inflated counting series Comput. Stat. (IF 0.744) Pub Date : 2020-04-04 Nisreen Shamma,Mehrnaz Mohammadpour,Masoumeh Shirozhan
In this paper, we introduce a new generalized negative binomial thinning operator with dependent counting series. Some properties of the thinning operator are derived. A new stationary integer-valued autoregressive model based on the thinning operator is constructed. In addition various properties of the process are determined, unknown parameters are estimated by several methods and the behavior of
-
Maximum likelihood estimation based on ranked set sampling designs for two extensions of the Lindley distribution with uncensored and right-censored data Comput. Stat. (IF 0.744) Pub Date : 2020-04-03 Cesar Augusto Taconeli,Suely Ruiz Giolo
Ranked set sampling (RSS) has been proved to be a cost-efficient alternative to simple random sampling (SRS). However, there are situations where some measurements are censored, which may not ensure the superiority of RSS over SRS. In this paper, the performance of the maximum likelihood estimators is examined when the data are assumed to follow a Power Lindley or a Weighted Lindley distribution, and
-
A Bayesian perspective of statistical machine learning for big data Comput. Stat. (IF 0.744) Pub Date : 2020-04-01 Rajiv Sambasivan; Sourish Das; Sujit K. Sahu
Statistical Machine Learning (SML) refers to a body of algorithms and methods by which computers are allowed to discover important features of input data sets which are often very large in size. The very task of feature discovery from data is essentially the meaning of the keyword ‘learning’ in SML. Theoretical justifications for the effectiveness of the SML algorithms are underpinned by sound principles
-
Models for autoregressive processes of bounded counts: How different are they? Comput. Stat. (IF 0.744) Pub Date : 2020-03-27 Hee-Young Kim,Christian H. Weiß,Tobias A. Möller
We focus on purely autoregressive (AR)-type models defined on the bounded range \(\{0,1,\ldots , n\}\) with a fixed upper limit \(n \in \mathbb {N}\). These include the binomial AR model, binomial AR conditional heteroscedasticity (ARCH) model, binomial-variation AR model with their linear conditional mean, nonlinear max-binomial AR model, and binomial logit-ARCH model. We consider the key problem
-
Statistical inference for Markov chains with applications to credit risk Comput. Stat. (IF 0.744) Pub Date : 2020-03-25 Linda Möstel,Marius Pfeuffer,Matthias Fischer
The focus of this paper is on the derivation of confidence and credibility intervals for Markov chains when discrete-time, continuous-time or discretely observed continuous-time data are available. Thereby, our contribution is threefold: First, we discuss and compare multinomial confidence regions for the rows of discrete-time Markov transition matrices in the light of empirical characteristics of
-
Meta-analysis of individual patient data with semi-competing risks under the Weibull joint frailty–copula model Comput. Stat. (IF 0.744) Pub Date : 2020-03-24 Bo-Hong Wu,Hirofumi Michimae,Takeshi Emura
In meta-analysis of individual patient data with semi-competing risks, the joint frailty–copula model has been proposed, where frailty terms account for the between-study heterogeneity and copulas account for dependence between terminal and nonterminal event times. In the previous works, the baseline hazard functions in the joint frailty–copula model are estimated by the nonparametric model or the
-
word.alignment: an R package for computing statistical word alignment and its evaluation Comput. Stat. (IF 0.744) Pub Date : 2020-03-23 Neda Daneshgar,Majid Sarmad
Word alignment has lots of applications in various natural language processing (NLP) tasks. As far as we are aware, there is no word alignment package in the R environment. In this paper, word.alignment, a new R software package is introduced which implements a statistical word alignment model as an unsupervised learning. It uses IBM Model 1 as a machine translation model based on the use of the EM
-
Bayesian robust estimation of partially functional linear regression models using heavy-tailed distributions Comput. Stat. (IF 0.744) Pub Date : 2020-03-19 Guodong Shan,Yiheng Hou,Baisen Liu
Functional linear regression (FLR) is a popular method that studies the relationship between a scalar response and a functional predictor. A common estimation procedure for the FLR model is using maximum likelihood by assuming normal distributions for measurement errors; however this method may make inferences vulnerable to the presence of outliers. In this article, we introduce a robust estimation
-
Multiple imputation and functional methods in the presence of measurement error and missingness in explanatory variables Comput. Stat. (IF 0.744) Pub Date : 2020-03-18 Firouzeh Noghrehchi; Jakub Stoklosa; Spiridon Penev
In many applications involving regression analysis, explanatory variables (or covariates) may be imprecisely measured or may contain missing values. Although there exists a vast literature on measurement error modeling to account for errors-in-variables, and on missing data methodology to handle missingness, very few methods have been developed to simultaneously address both. In this paper, we consider
-
Geometric ergodicity of a Metropolis-Hastings algorithm for Bayesian inference of phylogenetic branch lengths Comput. Stat. (IF 0.744) Pub Date : 2020-03-12 David A. Spade
This manuscript extends the work of Spade et al. (Math Biosci 268:9–21, 2015) to an examination of a fully-updating version of a Metropolis-Hastings algorithm for inference of phylogenetic branch lengths. This approach serves as an intermediary between theoretical assessment of Markov chain convergence, which in phylogenetic settings is typically difficult to do analytically, and output-based convergence
-
A partitioned quasi-likelihood for distributed statistical inference Comput. Stat. (IF 0.744) Pub Date : 2020-03-09 Guangbao Guo,Yue Sun,Xuejun Jiang
In the big data setting, working data sets are often distributed on multiple machines. However, classical statistical methods are often developed to solve the problems of single estimation or inference. We employ a novel parallel quasi-likelihood method in generalized linear models, to make the variances between different sub-estimators relatively similar. Estimates are obtained from projection subsets
-
Measuring and testing interdependence among random vectors based on Spearman’s \(\rho \) and Kendall’s \(\tau \) Comput. Stat. (IF 0.744) Pub Date : 2020-03-09 Lingyue Zhang,Dawei Lu,Xiaoguang Wang
Inspired by the correlation matrix and based on the generalized Spearman’s \(\rho \) and Kendall’s \(\tau \) between random variables proposed in Lu et al. ( J Nonparametr Stat 30(4):860–883, 2018), \(\rho \)-matrix and \(\tau \)-matrix are suggested for multivariate data sets. The matrices are used to construct the \(\rho \)-measure and the \(\tau \)-measure among random vectors with statistical estimation
-
A robust joint modeling approach for longitudinal data with informative dropouts Comput. Stat. (IF 0.744) Pub Date : 2020-03-09 Weiping Zhang,Feiyue Xie,Jiaxin Tan
This article proposes a robust method for analysing longitudinal continuous responses with informative dropouts and potential outliers by using the multivariate t -distribution. We specify a dropout mechanism and a missing covariate distribution and incorporate them into the complete data log-likelihood. Unlike the existing approaches which mainly focus on the inference of regression mean and dropouts
Contents have been reproduced by permission of the publishers.