Copyright © 2008 The American Society of Human Genetics. All rights reserved.
The American Journal of Human Genetics, Volume 82, Issue 4, 859-872, 03 April 2008
doi:10.1016/j.ajhg.2008.01.016
Article
Claudio Verzilli2, 10, Tina Shah1, 10, Juan P. Casas2, Juliet Chapman2, Manjinder Sandhu3, Sally L. Debenham3, Matthijs S. Boekholdt4, Kay Tee Khaw3, Nicholas J. Wareham5, Richard Judson6, Emelia J. Benjamin7, Sekar Kathiresan7, Martin G. Larson7, Jian Rong7, Reecha Sofat1, Steve E. Humphries8, Liam Smeeth2, Gianpiero Cavalleri9, John C. Whittaker2,
,
and Aroon D. Hingorani1
1 Centre for Clinical Pharmacology, University College London, London WC1E 6JF, UK
2 Department of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
3 Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
4 Department of Cardiology, Academic Medical Center, Amsterdam 1100 DD, Netherlands
5 MRC Epidemiology Unit, University of Cambridge, Cambridge CB2 0QQ, UK
6 Genaissance Pharmaceuticals, New Haven, CT 06511, USA
7 Framingham Heart Study, Framingham, MA 01702-5827, USA
8 Centre for Cardiovascular Genetics, University College London, London WC1E 6JF, UK
9 Molecular and Cellular Therapeutics, RCSI Research Institute Royal College of Surgeons in Ireland, Dublin 2, Ireland
Corresponding authorAbstract
Robust assessment of genetic effects on quantitative traits or complex-disease risk requires synthesis of evidence from multiple studies. Frequently, studies have genotyped partially overlapping sets of SNPs within a gene or region of interest, hampering attempts to combine all the available data. By using the example of C-reactive protein (CRP) as a quantitative trait, we show how linkage disequilibrium in and around its gene facilitates use of Bayesian hierarchical models to integrate informative data from all available genetic association studies of this trait, irrespective of the SNP typed. A variable selection scheme, followed by contextualization of SNPs exhibiting independent associations within the haplotype structure of the gene, enhanced our ability to infer likely causal variants in this region with population-scale data. This strategy, based on data from a literature based systematic review and substantial new genotyping, facilitated the most comprehensive evaluation to date of the role of variants governing CRP levels, providing important information on the minimal subset of SNPs necessary for comprehensive evaluation of the likely causal relevance of elevated CRP levels for coronary-heart-disease risk by Mendelian randomization. The same method could be applied to evidence synthesis of other quantitative traits, whenever the typed SNPs vary among studies, and to assist fine mapping of causal variants.
| Nonparametric Tests of Association of Multiple Genes with Human Disease The American Journal of Human Genetics, Volume 76, Issue 5, 1 May 2005, Pages 780-793 Daniel J. Schaid, Shannon K. McDonnell, Scott J. Hebbring, Julie M. Cunningham and Stephen N. Thibodeau Abstract The genetic basis of many common human diseases is expected to be highly heterogeneous, with multiple causative loci and multiple alleles at some of the causative loci. Analyzing the association of disease with one genetic marker at a time can have weak power, because of relatively small genetic effects and the need to correct for multiple testing. Testing the simultaneous effects of multiple markers by multivariate statistics might improve power, but they too will not be very powerful when there are many markers, because of the many degrees of freedom. To overcome some of the limitations of current statistical methods for case-control studies of candidate genes, we develop a new class of nonparametric statistics that can simultaneously test the association of multiple markers with disease, with only a single degree of freedom. Our approach, which is based on U-statistics, first measures a score over all markers for pairs of subjects and then compares the averages of these scores between cases and controls. Genetic scoring for a pair of subjects is measured by a “kernel” function, which we allow to be fairly general. However, we provide guidelines on how to choose a kernel for different types of genetic effects. Our global statistic has the advantage of having only one degree of freedom and achieves its greatest power advantage when the contrasts of average genotype scores between cases and controls are in the same direction across multiple markers. Simulations illustrate that our proposed methods have the anticipated type I–error rate and that they can be more powerful than standard methods. Application of our methods to a study of candidate genes for prostate cancer illustrates their potential merits, and offers guidelines for interpretation. Abstract | | |
| Transmission/Disequilibrium Test Meets Measured Haplotype Analysis: Family-Based Association Analysis Guided by Evolution of Haplotypes The American Journal of Human Genetics, Volume 68, Issue 5, 1 May 2001, Pages 1250-1263 Howard Seltman, Kathryn Roeder and B. Devlin Abstract Family data teamed with the transmission/disequilibrium test (TDT), which simultaneously evaluates linkage and association, is a powerful means of detecting disease-liability alleles. To increase the information provided by the test, various researchers have proposed TDT-based methods for haplotype transmission. Haplotypes indeed produce more-definitive transmissions than do the alleles comprising them, and this tends to increase power. However, the larger number of haplotypes, relative to alleles at individual loci, tends to decrease power, because of the additional degrees of freedom required for the test. An optimal strategy would focus the test on particular haplotypes or groups of haplotypes. In this report we develop such an approach by combining the theory of TDT with that of measured haplotype analysis (MHA). MHA uses the evolutionary relationships among haplotypes to produce a limited set of hypothesis tests and to increase the interpretability of these tests. The theory of our approach, called the “evolutionary tree” (ET)–TDT, is developed for two cases: when haplotype transmission is certain and when it is not. Simulations show the ET-TDT can be more powerful than other proposed methods under reasonable conditions. More importantly, our results show that, when multiple polymorphisms are found within the gene, the ET-TDT can be useful for determining which polymorphisms affect liability. Abstract | | |
| Finding Haplotype Block Boundaries by Using the Minimum-Description-Length Principle The American Journal of Human Genetics, Volume 73, Issue 2, 1 August 2003, Pages 336-354 Eric C. Anderson and John Novembre Abstract We present a method for detecting haplotype blocks that simultaneously uses information about linkage-disequilibrium decay between the blocks and the diversity of haplotypes within the blocks. By use of phased single-nucleotide polymorphism data, our method partitions a chromosome into a series of adjacent, nonoverlapping blocks. The partition is made by choosing among a family of Markov models for block structure in a chromosomal region. Specifically, in the model, the occurrence of haplotypes within blocks follows a time-inhomogeneous Markov process along the chromosome, and we choose among possible partitions by using the two-stage minimum-description-length criterion. When applied to data simulated from the coalescent with recombination hotspots, our method reliably situates block boundaries at the hotspots and infrequently places block boundaries at sites with background levels of recombination. We apply three previously published block-finding methods to the same data, showing that they either are relatively insensitive to recombination hotspots or fail to discriminate between background sites of recombination and hotspots. When applied to the 5q31 data of Daly et al., our method identifies more block boundaries in agreement with those found by Daly et al. than do other methods. These results suggest that our method may be useful for designing association-based mapping studies that exploit haplotype blocks. Abstract | | |