-
Unsupervised cluster analysis of SARS‐CoV‐2 genomes reflects its geographic progression and identifies distinct genetic subgroups of SARS‐CoV‐2 virus Genet. Epidemiol. (IF 1.954) Pub Date : 2021-01-08 Georg Hahn; Sanghun Lee; Scott T. Weiss; Christoph Lange
Over 10,000 viral genome sequences of the SARS‐CoV‐2virus have been made readily available during the ongoing coronavirus pandemic since the initial genome sequence of the virus was released on the open access Virological website (http://virological.org/) early on January 11. We utilize the published data on the single stranded RNAs of 11,132 SARS‐CoV‐2 patients in the GISAID database, which contains
-
The utility of the Laplace effect size prior distribution in Bayesian fine‐mapping studies Genet. Epidemiol. (IF 1.954) Pub Date : 2021-01-06 Kevin Walters; Angela Cox; Hannuun Yaacob
The Gaussian distribution is usually the default causal single‐nucleotide polymorphism (SNP) effect size prior in Bayesian population‐based fine‐mapping association studies, but a recent study showed that the heavier‐tailed Laplace prior distribution provided a better fit to breast cancer top hits identified in genome‐wide association studies. We investigate the utility of the Laplace prior as an effect
-
Multi‐tissue transcriptome‐wide association studies Genet. Epidemiol. (IF 1.954) Pub Date : 2020-12-28 Nastasiya F. Grinberg; Chris Wallace
A transcriptome‐wide association study (TWAS) attempts to identify disease associated genes by imputing gene expression into a genome‐wide association study (GWAS) using an expression quantitative trait loci (eQTL) data set and then testing for associations with a trait of interest. Regulatory processes may be shared across related tissues and one natural extension of TWAS is harnessing cross‐tissue
-
Statistical methods with exhaustive search in the identification of gene–gene interactions for colorectal cancer Genet. Epidemiol. (IF 1.954) Pub Date : 2020-11-24 Somayeh Kafaie; Ling Xu; Ting Hu
Though additive forms of heritability are primarily studied in genetics, nonlinear, non‐additive gene–gene interactions, that is, epistasis, could explain a portion of the missing heritability in complex human diseases including cancer. In recent years, powerful computational methods have been introduced to understand multivariable genetic factors of these complex human diseases in extremely high‐dimensional
-
A unified linear mixed model for familial relatedness and population structure in genetic association studies Genet. Epidemiol. (IF 1.954) Pub Date : 2020-11-11 Nicholas DeVogel; Paul L. Auer; Regina Manansala; Andrea Rau; Tao Wang
Familial relatedness (FR) and population structure (PS) are two major sources for genetic correlation. In the human population, both FR and PS can further break down into additive and dominant components to account for potential additive and dominant genetic effects. In this study, besides the classical additive genomic relationship matrix, a dominant genomic relationship matrix is introduced. A link
-
Novel score test to increase power in association test by integrating external controls Genet. Epidemiol. (IF 1.954) Pub Date : 2020-11-08 Yatong Li; Seunggeun Lee
Recent advances in genotyping and sequencing technologies have enabled genetic association studies to leverage high‐quality genotyped data to identify variants accounting for a substantial portion of disease risk. The usage of external controls, whose genomes have already been genotyped and are publicly available, could be a cost‐effective approach to increase the power of association testing. There
-
Exploring gene-gene interaction in family-based data with an unsupervised machine learning method: EPISFA. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-01 Xiao Xiang,Siyue Wang,Tianyi Liu,Mengying Wang,Jiawen Li,Jin Jiang,Tao Wu,Yonghua Hu
Gene–gene interaction (G × G) is thought to fill the gap between the estimated heritability of complex diseases and the limited genetic proportion explained by identified single‐nucleotide polymorphisms. The current tools for exploring G × G were often developed for case‐control designs with less considerations for their applications in families. Family‐based studies are robust against bias led from
-
Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-10 Alvaro N Barbeira,Owen J Melia,Yanyu Liang,Rodrigo Bonazzola,Gao Wang,Heather E Wheeler,François Aguet,Kristin G Ardlie,Xiaoquan Wen,Hae K Im
The integration of transcriptomic studies and genome‐wide association studies (GWAS) via imputed expression has seen extensive application in recent years, enabling the functional characterization and causal gene prioritization of GWAS loci. However, the techniques for imputing transcriptomic traits from DNA variation remain underdeveloped. Furthermore, associations found when linking eQTL studies
-
Rare variants association testing for a binary outcome when pooling individual level data from heterogeneous studies Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-22 Tamar Sofer; Na Guo
Whole genome sequencing (WGS) and whole exome sequencing studies are used to test the association of rare genetic variants with health traits. Many existing WGS efforts now aggregate data from heterogeneous groups, for example, combining sets of individuals of European and African ancestries. We here investigate the statistical implications on rare variant association testing with a binary trait when
-
An exploration of genetic association tests for disease risk and age at onset Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-19 Eden R. Martin; Xiaoyi R. Gao; Yi‐Ju Li
Risk genes influence the chance of an individual developing disease over their lifetime, although the age at onset (AAO) genes influence disease timing. These two categories are not disjoint; a gene that influences AAO might also appear to influence the risk. When an allele influences both AAO and risk, a reasonable question is whether we would have more power to detect association using a statistical
-
On the application, reporting, and sharing of in silico simulations for genetic studies Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-16 Kaleigh Riggs; Huann‐Sheng Chen; Melissa Rotunno; Bing Li; Naoko I. Simonds; Leah E. Mechanic; Bo Peng
In silico simulations play an indispensable role in the development and application of statistical models and methods for genetic studies. Simulation tools allow for the evaluation of methods and investigation of models in a controlled manner. With the growing popularity of evolutionary models and simulation‐based statistical methods, genetic simulations have been applied to a wide variety of research
-
MF‐TOWmuT: Testing an optimally weighted combination of common and rare variants with multiple traits using family data Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-13 Cheng Gao; Qiuying Sha; Shuanglin Zhang; Kui Zhang
With rapid advancements of sequencing technologies and accumulations of electronic health records, a large number of genetic variants and multiple correlated human complex traits have become available in many genetic association studies. Thus, it becomes necessary and important to develop new methods that can jointly analyze the association between multiple genetic variants and multiple traits. Compared
-
JEM: A joint test to estimate the effect of multiple genetic variants on DNA methylation Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-10 Chloé Sarnowski; Tianxiao Huan; Deepti Jain; Chunyu Liu; Chen Yao; Roby Joehanes; Daniel Levy; Josée Dupuis
Multiple methods have been proposed to aggregate genetic variants in a gene or a region and jointly test their association with a trait of interest. However, these joint tests do not provide estimates of the individual effect of each variant. Moreover, few methods have evaluated the joint association of multiple variants with DNA methylation. We propose a method based on linear mixed models to estimate
-
Variation in cancer risk among families with genetic susceptibility Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-08 Theodore Huang; Danielle Braun; Henry T. Lynch; Giovanni Parmigiani
Germline mutations in many genes have been shown to increase the risk of developing cancer. This risk can vary across families who carry mutations in the same gene due to differences in the specific variants, gene–gene interactions, other susceptibility mutations, environmental factors, and behavioral factors. We develop an analytic tool to explore this heterogeneity using family history data. We propose
-
Detecting rare copy number variants from Illumina genotyping arrays with the CamCNV pipeline: Segmentation of z‐scores improves detection and reliability Genet. Epidemiol. (IF 1.954) Pub Date : 2020-10-05 Joe Dennis; Logan Walker; Jonathan Tyrer; Kyriaki Michailidou; Douglas F. Easton
The intensities from genotyping array data can be used to detect copy number variants (CNVs) but a high level of noise in the data and overlap between different copy‐number intensity distributions produces unreliable calls, particularly when only a few probes are covered by the CNV. We present a novel pipeline (CamCNV) with a series of steps to reduce noise and detect more reliably CNVs covering as
-
Statistical approaches for meta‐analysis of genetic mutation prevalence Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-30 Margaux L. A. Hujoel; Giovanni Parmigiani; Danielle Braun
Estimating the prevalence of rare germline genetic mutations in the general population is of interest as it can inform genetic counseling and risk management. Most studies that estimate the prevalence of mutations are performed in high‐risk populations, and each study is designed with differing inclusion criteria, resulting in ascertained populations. Quantifying the effects of ascertainment is necessary
-
Innovative approach to identify multigenomic and environmental interactions associated with birth defects in family‐based hybrid designs Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-30 Xiang‐Yang Lou; Ting‐Ting Hou; Shou‐Ye Liu; Hai‐Ming Xu; Feng Lin; Xinyu Tang; Stewart L. MacLeod; Mario A. Cleves; Charlotte A. Hobbs
Genes, including those with transgenerational effects, work in concert with behavioral, environmental, and social factors via complex biological networks to determine human health. Understanding complex relationships between causal factors underlying human health is an essential step towards deciphering biological mechanisms. We propose a new analytical framework to investigate the interactions between
-
TWO‐SIGMA: A novel two‐component single cell model‐based association method for single‐cell RNA‐seq data Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-29 Eric Van Buren; Ming Hu; Chen Weng; Fulai Jin; Yan Li; Di Wu; Yun Li
In this paper, we develop TWO‐SIGMA, a TWO‐component SInGle cell Model‐based Association method for differential expression (DE) analyses in single‐cell RNA‐seq (scRNA‐seq) data. The first component models the probability of “drop‐out” with a mixed‐effects logistic regression model and the second component models the (conditional) mean expression with a mixed‐effects negative binomial regression model
-
Integrative genomic analysis in African American children with asthma finds three novel loci associated with lung function Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-29 Pagé C. Goddard; Kevin L. Keys; Angel C. Y. Mak; Eunice Y. Lee; Amy K. Liu; Lesly‐Anne Samedy‐Bates; Oona Risse‐Adams; María G. Contreras; Jennifer R. Elhawary; Donglei Hu; Scott Huntsman; Sam S. Oh; Sandra Salazar; Celeste Eng; Blanca E. Himes; Marquitta J. White; Esteban G. Burchard
Bronchodilator (BD) drugs are commonly prescribed for treatment and management of obstructive lung function present with diseases such as asthma. Administration of BD medication can partially or fully restore lung function as measured by pulmonary function tests. The genetics of baseline lung function measures taken before BD medication have been extensively studied, and the genetics of the BD response
-
Loci identified by a genome-wide association study of carotid artery stenosis in the eMERGE network. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-22 Melody R Palmer,Daniel S Kim,David R Crosslin,Ian B Stanaway,Elisabeth A Rosenthal,David S Carrell,David J Cronkite,Adam Gordon,Xiaomeng Du,Yatong K Li,Marc S Williams,Chunhua Weng,Qiping Feng,Rongling Li,Sarah A Pendergrass,Hakon Hakonarson,David Fasel,Sunghwan Sohn,Patrick Sleiman,Samuel K Handelman,Elizabeth Speliotes,Iftikhar J Kullo,Eric B Larson,,Gail P Jarvik
Carotid artery atherosclerotic disease (CAAD) is a risk factor for stroke. We used a genome‐wide association (GWAS) approach to discover genetic variants associated with CAAD in participants in the electronic Medical Records and Genomics (eMERGE) Network. We identified adult CAAD cases with unilateral or bilateral carotid artery stenosis and controls without evidence of stenosis from electronic health
-
locStra: Fast analysis of regional/global stratification in whole-genome sequencing studies. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-14 Georg Hahn,Sharon M Lutz,Julian Hecker,Dmitry Prokopenko,Michael H Cho,Edwin K Silverman,Scott T Weiss,Christoph Lange,
locStra is an ‐package for the analysis of regional and global population stratification in whole‐genome sequencing (WGS) studies, where regional stratification refers to the substructure defined by the loci in a particular region on the genome. Population substructure can be assessed based on the genetic covariance matrix, the genomic relationship matrix, and the unweighted/weighted genetic Jaccard
-
Integration of multiomic annotation data to prioritize and characterize inflammation and immune-related risk variants in squamous cell lung cancer. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-14 Ryan Sun,Miao Xu,Xihao Li,Sheila Gaynor,Hufeng Zhou,Zilin Li,Yohan Bossé,Stephen Lam,Ming-Sound Tsao,Adonina Tardon,Chu Chen,Jennifer Doherty,Gary Goodman,Stig E Bojesen,Maria T Landi,Mattias Johansson,John K Field,Heike Bickeböller,H-Erich Wichmann,Angela Risch,Gadi Rennert,Suzanne Arnold,Xifeng Wu,Olle Melander,Hans Brunnström,Loic Le Marchand,Geoffrey Liu,Angeline Andrew,Eric Duell,Lambertus A Kiemeney
Clinical trial results have recently demonstrated that inhibiting inflammation by targeting the interleukin‐1β pathway can offer a significant reduction in lung cancer incidence and mortality, highlighting a pressing and unmet need to understand the benefits of inflammation‐focused lung cancer therapies at the genetic level. While numerous genome‐wide association studies (GWAS) have explored the genetic
-
Mendelian randomization analysis with survival outcomes. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-12 Youngjoo Cho,Andrea Rau,Alex Reiner,Paul L Auer
Mendelian randomization (MR) is an established approach for assessing the causal effects of heritable exposures on outcomes. Outcomes of interest often include binary clinical endpoints, but may also include censored survival times. We explore the implications of both the Cox proportional hazard model and the additive hazard model in the context of MR, with a specific emphasis on two‐stage methods
-
A comprehensive genetic and epidemiological association analysis of vitamin D with common diseases/traits in the UK Biobank. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-12 Yixuan Ye,Hongxi Yang,Yaogang Wang,Hongyu Zhao
Vitamin D has been intensively studied for its association with human health, but the scope of such association and the causal role of vitamin D remain controversial. We aim to comprehensively investigate the links between vitamin D and human health through both epidemiological and Mendelian randomization (MR) analyses. We examined the epidemiological associations between serum 25‐hydroxyvitamin D
-
Cover Image Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-11 Nicolas Vince; Venceslas Douillard; Estelle Geffard; Diogo Meyer; Erick C. Castelli; Steven J. Mack; Sophie Limou; Pierre‐Antoine Gourraud
-
A general framework for integrative analysis of incomplete multiomics data. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-21 Dan-Yu Lin,Donglin Zeng,David Couper
There is a tremendous current interest in measuring multiple types of omics features (e.g., DNA sequences, RNA expressions, methylation profiles, metabolic profiles, protein expressions) on a large number of subjects. Although genotypes are typically available for all study subjects, other data types may be measured only on a subset of subjects due to cost or other constraints. In addition, quantitative
-
Penalized variance components for association of multiple genes with traits Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-22 Daniel J. Schaid; Jason P. Sinnwell; Nicholas B. Larson; Jun Chen
Variance component models have gained popularity for genetic analyses, driven by their flexibility to simultaneously analyze multiple genetic variants in a gene by kernel statistics, and their ability to account for population stratification via genomic relationship matrices. For exploratory analyses with modest sample sizes and a potentially large number of variance components, it can be challenging
-
A principal component approach to improve association testing with polygenic risk scores. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-21 Brandon J Coombes,Alexander Ploner,Sarah E Bergen,Joanna M Biernacka
Polygenic risk scores (PRSs) have become an increasingly popular approach for demonstrating polygenic influences on complex traits and for establishing common polygenic signals between different traits. PRSs are typically constructed using pruning and thresholding (P+T), but the best choice of parameters is uncertain; thus multiple settings are used and the best is chosen. Optimization can lead to
-
Leveraging existing GWAS summary data of genetically correlated and uncorrelated traits to improve power for a new GWAS. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-16 Haoran Xue,Chong Wu,Wei Pan
In spite of the tremendous success of genome‐wide association studies (GWAS) in identifying genetic variants associated with complex traits and common diseases, many more are yet to be discovered. Hence, it is always desirable to improve the statistical power of GWAS. Paralleling with the intensive efforts of integrating GWAS with functional annotations or other omic data, we propose leveraging other
-
SNP-HLA Reference Consortium (SHLARC): HLA and SNP data sharing for promoting MHC-centric analyses in genomics. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-18 Nicolas Vince,Venceslas Douillard,Estelle Geffard,Diogo Meyer,Erick C Castelli,Steven J Mack,Sophie Limou,Pierre-Antoine Gourraud
Genome‐wide associations studies have repeatedly identified the major histocompatibility complex genomic region (6p21.3) as key in immune pathologies. Researchers have also aimed to extend the biological interpretation of associations by focusing directly on human leukocyte antigen (HLA) polymorphisms and their combination as haplotypes. To circumvent the effort and high costs of HLA typing, statistical
-
Shared genomic segment analysis with equivalence testing. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-16 Sukanya Horpaopan,Cathy S J Fann,Mark Lathrop,Jurg Ott
An important aspect of disease gene mapping is replication, that is, a putative finding in one group of individuals is confirmed in another set of individuals. As it can happen by chance that individuals share an estimated disease position, we developed a statistical approach to determine the p‐value for multiple individuals or families to share a possibly small number of candidate susceptibility variants
-
Dissecting the genetic overlap of smoking behaviors, lung cancer, and chronic obstructive pulmonary disease: A focus on nicotinic receptors and nicotine metabolizing enzyme. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-16 Michael J Bray,Li-Shiun Chen,Louis Fox,Dana B Hancock,Robert C Culverhouse,Sarah M Hartz,Eric O Johnson,Mengzhen Liu,James D McKay,Nancy L Saccone,John E Hokanson,Scott I Vrieze,Rachel F Tyndale,Timothy B Baker,Laura J Bierut
Smoking is a major contributor to lung cancer and chronic obstructive pulmonary disease (COPD). Two of the strongest genetic associations of smoking‐related phenotypes are the chromosomal regions 15q25.1, encompassing the nicotinic acetylcholine receptor subunit genes CHRNA5‐CHRNA3‐CHRNB4, and 19q13.2, encompassing the nicotine metabolizing gene CYP2A6. In this study, we examined genetic relations
-
Chances and challenges of machine learning-based disease classification in genetic association studies illustrated on age-related macular degeneration. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-02 Felix Guenther,Caroline Brandl,Thomas W Winkler,Veronika Wanner,Klaus Stark,Helmut Kuechenhoff,Iris M Heid
Imaging technology and machine learning algorithms for disease classification set the stage for high‐throughput phenotyping and promising new avenues for genome‐wide association studies (GWAS). Despite emerging algorithms, there has been no successful application in GWAS so far. We establish machine learning‐based phenotyping in genetic association analysis as misclassification problem. To evaluate
-
Colorectal cancer risk based on extended family history and body mass index. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-16 Heather M Ochs-Balcom,Priyanka Kanth,James M Farnham,Samir Abdelrahman,Lisa A Cannon-Albright
Family history and body mass index (BMI) are well‐known risk factors for colorectal cancer (CRC), however, their joint effects are not well described. Using linked data for genealogy, self‐reported height and weight from driver's licenses, and the Utah Surveillance, Epidemiology, and End‐Results cancer registry, we found that an increasing number of first‐degree relatives (FDR) with CRC is associated
-
Statistical considerations for the analysis of massively parallel reporter assays data. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-18 Dandi Qiao,Corwin M Zigler,Michael H Cho,Edwin K Silverman,Xiaobo Zhou,Peter J Castaldi,Nan H Laird
Noncoding DNA contains gene regulatory elements that alter gene expression, and the function of these elements can be modified by genetic variation. Massively parallel reporter assays (MPRA) enable high‐throughput identification and characterization of functional genetic variants, but the statistical methods to identify allelic effects in MPRA data have not been fully developed. In this study, we demonstrate
-
Set-based genetic association and interaction tests for survival outcomes based on weighted V statistics. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-09-07 Chenxi Li,Di Wu,Qing Lu
With advancements in high‐throughout technologies, studies have been conducted to investigate the role of massive genetic variants in human diseases. While set‐based tests have been developed for binary and continuous disease outcomes, there are few computationally efficient set‐based tests available for time‐to‐event outcomes. To facilitate the genetic association and interaction analyses of time‐to‐event
-
Detecting X-linked common and rare variant effects in family-based sequencing studies. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-30 Asuman S Turkmen,Shili Lin
The breakthroughs in next generation sequencing have allowed us to access data consisting of both common and rare variants, and in particular to investigate the impact of rare genetic variation on complex diseases. Although rare genetic variants are thought to be important components in explaining genetic mechanisms of many diseases, discovering these variants remains challenging, and most studies
-
Efficient gene-environment interaction tests for large biobank-scale sequencing studies. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-30 Xinyu Wang,Elise Lim,Ching-Ti Liu,Yun Ju Sung,Dabeeru C Rao,Alanna C Morrison,Eric Boerwinkle,Alisa K Manning,Han Chen
Complex human diseases are affected by genetic and environmental risk factors and their interactions. Gene–environment interaction (GEI) tests for aggregate genetic variant sets have been developed in recent years. However, existing statistical methods become rate limiting for large biobank‐scale sequencing studies with correlated samples. We propose efficient Mixed‐model Association tests for GEne–Environment
-
Supervariants identification for breast cancer. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-17 Jianchang Hu,Ting Li,Shiying Wang,Heping Zhang
In genome‐wide association studies, signals associated with rare variants and interactions between genes are hard to detect even when the sample size is in tens of thousands. To overcome these problems, we examine the concept of supervariant. Like the classic concept of the gene, a supervariant is a combination of alleles in multiple loci, but the contributing loci can be anywhere in the genome. We
-
Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-09 Corbin Quick,Pramod Anugu,Solomon Musani,Scott T Weiss,Esteban G Burchard,Marquitta J White,Kevin L Keys,Francesco Cucca,Carlo Sidore,Michael Boehnke,Christian Fuchsberger
A key aim for current genome‐wide association studies (GWAS) is to interrogate the full spectrum of genetic variation underlying human traits, including rare variants, across populations. Deep whole‐genome sequencing is the gold standard to fully capture genetic variation, but remains prohibitively expensive for large sample sizes. Array genotyping interrogates a sparser set of variants, which can
-
Practical implementation of frailty models in Mendelian risk prediction. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-07 Theodore Huang,Malka Gorfine,Li Hsu,Giovanni Parmigiani,Danielle Braun
There are numerous statistical models used to identify individuals at high risk of cancer due to inherited mutations. Mendelian models predict future risk of cancer by using family history with estimated cancer penetrances (age‐ and sex‐specific risk of cancer given the genotype of the mutations) and mutation prevalences. However, there is often residual risk heterogeneity across families even after
-
Power loss due to testing association between covariate-adjusted traits and genetic variants. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-08 Pranav Yajnik,Michael Boehnke
Multiple linear regression is commonly used to test for association between genetic variants and continuous traits and estimate genetic effect sizes. Confounding variables are controlled for by including them as additional covariates. An alternative technique that is increasingly used is to regress out covariates from the raw trait and then perform regression analysis with only the genetic variants
-
Validating the doubly weighted genetic risk score for the prediction of type 2 diabetes in the Lifelines and Estonian Biobank cohorts. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-14 Katri Pärna,Harold Snieder,Kristi Läll,Krista Fischer,Ilja Nolte
As many cases of type 2 diabetes (T2D) are likely to remain undiagnosed, better tools for early detection of high‐risk individuals are needed to prevent or postpone the disease. We investigated the value of the doubly weighted genetic risk score (dwGRS) for the prediction of incident T2D in the Lifelines and Estonian Biobank (EstBB) cohorts. The dwGRS uses an additional weight for each single nucleotide
-
Assessing exposure effects on gene expression. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-08 Sarah A Reifeis,Michael G Hudgens,Mete Civelek,Karen L Mohlke,Michael I Love
In observational genomics data sets, there is often confounding of the effect of an exposure on gene expression. To adjust for confounding when estimating the exposure effect, a common approach involves including potential confounders as covariates with the exposure in a regression model of gene expression. However, when the exposure and confounders interact to influence gene expression, the fitted
-
Group analysis of distance matrices. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-21 Jinjuan Wang,Jialu Li,Wenjun Xiong,Qizhai Li
Distance‐based regression model has become a powerful approach to identifying phenotypic associations in many fields. It is found to be particularly useful for high‐dimensional biological and genetic data with proper distance or similarity measures being available. The pseudo F statistic used in this model accumulates information and is effective when the signals, that is the variations represented
-
Estimating the effects of copy-number variants on intelligence using hierarchical Bayesian models. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-11 Lai Jiang,Guillaume Huguet,Catherine Schramm,Antonio Ciampi,Antoine Main,Claudine Passo,Martineau Jean-Louis,Maude Auger,Gunter Schumann,David Porteous,Sébastien Jacquemont,Celia M T Greenwood
It is challenging to estimate the phenotypic impact of the structural genome changes known as copy‐number variations (CNVs), since there are many unique CNVs which are nonrecurrent, and most are too rare to be studied individually. In recent work, we found that CNV‐aggregated genomic annotations, that is, specifically the intolerance to mutation as measured by the pLI score (probability of being loss‐of‐function
-
Joint testing of donor and recipient genetic matching scores and recipient genotype has robust power for finding genes associated with transplant outcomes. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-11 Victoria L Arthur,Weihua Guan,Bao-Li Loza,Brendan Keating,Jinbo Chen
Genetic matching between transplant donor and recipient pairs has traditionally focused on the human leukocyte antigen (HLA) regions of the genome, but recent studies suggest that matching for non‐HLA regions may be important as well. We assess four genetic matching scores for use in association analyses of transplant outcomes. These scores describe genetic ancestry distance using identity‐by‐state
-
A framework for pathway knowledge driven prioritization in genome-wide association studies. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-10 Shrayashi Biswas,Soumen Pal,Partha P Majumder,Samsiddhi Bhattacharjee
Many variants with low frequencies or with low to modest effects likely remain unidentified in genome‐wide association studies (GWAS) because of stringent genome‐wide thresholds for detection. To improve the power of detection, variant prioritization based on their functional annotations and epigenetic landmarks has been used successfully. Here, we propose a novel method of prioritization of a GWAS
-
Incorporating multiple sets of eQTL weights into gene-by-environment interaction analysis identifies novel susceptibility loci for pancreatic cancer. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-10 Tianzhong Yang,Hongwei Tang,Harvey A Risch,Sarah H Olson,Gloria Peterson,Paige M Bracci,Steven Gallinger,Rayjean J Hung,Rachel E Neale,Ghislaine Scelo,Eric J Duell,Robert C Kurtz,Kay-Tee Khaw,Gianluca Severi,Malin Sund,Nick Wareham,Christopher I Amos,Donghui Li,Peng Wei
It is of great scientific interest to identify interactions between genetic variants and environmental exposures that may modify the risk of complex diseases. However, larger sample sizes are usually required to detect gene‐by‐environment interaction (G × E) than required to detect genetic main association effects. To boost the statistical power and improve the understanding of the underlying molecular
-
Polygenic modelling of treatment effect heterogeneity. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-08-10 Zhi Ming Xu,Stephen Burgess
Mendelian randomization is the use of genetic variants to assess the effect of intervening on a risk factor using observational data. We consider the scenario in which there is a pharmacomimetic (i.e., treatment‐mimicking) genetic variant that can be used as a proxy for a particular pharmacological treatment that changes the level of the risk factor. If the association of the pharmacomimetic genetic
-
Evaluating shared genetic influences on nonsyndromic cleft lip/palate and oropharyngeal neoplasms. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-24 Laurence J Howe,Gibran Hemani,Corina Lesseur,Valérie Gaborieau,Kerstin U Ludwig,Elisabeth Mangold,Paul Brennan,Andy R Ness,Beate St Pourcain,George Davey Smith,Sarah J Lewis
It has been hypothesised that nonsyndromic cleft lip/palate (nsCL/P) and cancer may share aetiological risk factors. Population studies have found inconsistent evidence for increased incidence of cancer in nsCL/P cases, but several genes (e.g., CDH1, AXIN2) have been implicated in the aetiologies of both phenotypes. We aimed to evaluate shared genetic aetiology between nsCL/P and oral cavity/oropharyngeal
-
An enhanced machine learning tool for cis-eQTL mapping with regularization and confounder adjustments. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-07-22 Kang K Yan,Hongyu Zhao,Joseph T Wu,Herbert Pang
Many expression quantitative trait loci (eQTL) studies have been conducted to investigate the biological effects of variants in gene regulation. However, these eQTL studies may suffer from low or moderate statistical power and overly conservative false‐discovery rate. In practice, most algorithms for eQTL identification do not model the joint effects of multiple genetic variants with weak or moderate
-
Evaluation of population stratification adjustment using genome-wide or exonic variants. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-30 Yuning Chen,Gina M Peloso,Ching-Ti Liu,Anita L DeStefano,Josée Dupuis
Population stratification may cause an inflated type‐I error and spurious association when assessing the association between genetic variations with an outcome. Many genetic association studies are now using exonic variants, which captures only 1% of the genome, however, population stratification adjustments have not been evaluated in the context of exonic variants. We compare the performance of two
-
Truncated tests for combining evidence of summary statistics. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-06-24 Deliang Bu,Qinglong Yang,Zhen Meng,Sanguo Zhang,Qizhai Li
To date, thousands of genetic variants to be associated with numerous human traits and diseases have been identified by genome‐wide association studies (GWASs). The GWASs focus on testing the association between single trait and genetic variants. However, the analysis of multiple traits and single nucleotide polymorphisms (SNPs) might reflect physiological process of complex diseases and the corresponding
-
The 2020 Annual Meeting of the International Genetic Epidemiology Society. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-05-23
-
A fast and powerful eQTL weighted method to detect genes associated with complex trait using GWAS summary data. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-04-29 Jianjun Zhang,Sicong Xie,Samantha Gonzales,Jianguo Liu,Xuexia Wang
Although genomewide association studies (GWASs) have identified many genetic variants underlying complex traits, a large fraction of heritability still remains unexplained. Integrative analysis that incorporates additional information, such as expression quantitativetrait locus (eQTL) data into sequencing studies (denoted as transcriptomewide association study [TWAS]), can aid the discovery of trait‐associated
-
Penalized models for analysis of multiple mediators. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-04-27 Daniel J Schaid,Jason P Sinnwell
Mediation analysis attempts to determine whether the relationship between an independent variable (e.g., exposure) and an outcome variable can be explained, at least partially, by an intermediate variable, called a mediator. Most methods for mediation analysis focus on one mediator at a time, although multiple mediators can be jointly analyzed by structural equation models (SEMs) that account for correlations
-
A comparison of robust Mendelian randomization methods using summary data. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-04-06 Eric A W Slob,Stephen Burgess
The number of Mendelian randomization (MR) analyses including large numbers of genetic variants is rapidly increasing. This is due to the proliferation of genome-wide association studies, and the desire to obtain more precise estimates of causal effects. Since it is unlikely that all genetic variants will be valid instrumental variables, several robust methods have been proposed. We compare nine robust
-
Gene-based and pathway-based testing for rare-variant association in affected sib pairs. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-04-01 Razvan G Romanescu,Jessica Green,Irene L Andrulis,Shelley B Bull
Next generation sequencing technologies have made it possible to investigate the role of rare variants (RVs) in disease etiology. Because RVs associated with disease susceptibility tend to be enriched in families with affected individuals, study designs based on affected sib pairs (ASP) can be more powerful than case-control studies. We construct tests of RV-set association in ASPs for single genomic
-
Identifying blood pressure loci whose effects are modulated by multiple lifestyle exposures. Genet. Epidemiol. (IF 1.954) Pub Date : 2020-03-29 Oyomoare L Osazuwa-Peters,R J Waken,Karen L Schwander,Yun Ju Sung,Paul S de Vries,Sarah M Hartz,Daniel I Chasman,Alanna C Morrison,Laura J Bierut,Chengjie Xiong,Lisa de Las Fuentes,D C Rao
Although multiple lifestyle exposures simultaneously impact blood pressure (BP) and cardiovascular health, most analysis so far has considered each single lifestyle exposure (e.g., smoking) at a time. Here, we exploit gene–multiple lifestyle exposure interactions to find novel BP loci. For each of 6,254 Framingham Heart Study participants, we computed lifestyle risk score (LRS) value by aggregating