Copyright © 2006 The American Society of Human Genetics. All rights reserved.
The American Journal of Human Genetics, Volume 79, Issue 4, 679-694, 1 October 2006
doi:10.1086/508264
Article
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
Address for correspondence and reprints: Dr. Andrew Morris, Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford OX3 7BN, United KingdomAbstract
Multilocus analysis of single-nucleotide–polymorphism (SNP) haplotypes may provide evidence of association with disease, even when the individual loci themselves do not. Haplotype-based methods are expected to outperform single-SNP analyses because (i) common genetic variation can be structured into haplotypes within blocks of strong linkage disequilibrium and (ii) the functional properties of a protein are determined by the linear sequence of amino acids corresponding to DNA variation on a haplotype. Here, I propose a flexible Bayesian framework for modeling haplotype association with disease in population-based studies of candidate genes or small candidate regions. I employ a Bayesian partition model to describe the correlation between marker-SNP haplotypes and causal variants at the underlying functional polymorphism(s). Under this model, haplotypes are clustered according to their similarity, in terms of marker-SNP allele matches, which is used as a proxy for recent shared ancestry. Haplotypes within a cluster are then assigned the same probability of carrying a causal variant at the functional polymorphism(s). In this way, I can account for the dominance effect of causal variants, here corresponding to any deviation from a multiplicative contribution to disease risk. The results of a detailed simulation study demonstrate that there is minimal cost associated with modeling these dominance effects, with substantial gains in power over haplotype-based methods that do not incorporate clustering and that assume a multiplicative model of disease risks.
| Regression-Based Association Analysis with Clustered Haplotypes through Use of Genotypes The American Journal of Human Genetics, Volume 78, Issue 2, 1 February 2006, Pages 231-242 Jung-Ying Tzeng, Chih-Hao Wang, Jau-Tsuen Kao and Chuhsing Kate Hsiao Abstract Haplotype-based association analysis has been recognized as a tool with high resolution and potentially great power for identifying modest etiological effects of genes. However, in practice, its efficacy has not been as successfully reproduced as expected in theory. One primary cause is that such analysis tends to require a large number of parameters to capture the abundant haplotype varieties, and many of those are expended on rare haplotypes for which studies would have insufficient power to detect association even if it existed. To concentrate statistical power on more-relevant inferences, in this study, we developed a regression-based approach using clustered haplotypes to assess haplotype-phenotype association. Specifically, we generalized the probabilistic clustering methods of Tzeng to the generalized linear model (GLM) framework established by Schaid et al. The proposed method uses unphased genotypes and incorporates both phase uncertainty and clustering uncertainty. Its GLM framework allows adjustment of covariates and can model qualitative and quantitative traits. It can also evaluate the overall haplotype association or the individual haplotype effects. We applied the proposed approach to study the association between hypertriglyceridemia and the apolipoprotein A5 gene. Through simulation studies, we assessed the performance of the proposed approach and demonstrate its validity and power in testing for haplotype-trait association. Abstract | | |
| A Score-Statistic Approach for the Mapping of Quantitative-Trait Loci with Sibships of Arbitrary Size The American Journal of Human Genetics, Volume 70, Issue 2, 1 February 2002, Pages 412-424 K. Wang and J. Huang Abstract The Haseman-Elston method is widely used for the mapping of quantitative-trait loci. However, this method does not use all the information in the data, because it only considers the sib-pair trait-value difference. In addition, the Haseman-Elston method was developed for independent sib pairs; its generalization to nonindependent sib pairs is not straightforward. Here we introduce a score test statistic derived from a normal likelihood based on multiplex sibship data, conditional on identical-by-descent sharing statuses. This score test is asymptotically equivalent to the corresponding likelihood-ratio test, but it is much easier to implement. Because the proposed test uses all of the trait values, it makes more efficient use of the data than does the Haseman-Elston method. The proposed test is naturally applicable to sibships of arbitrary size. The finite-sample properties of the proposed score statistic are evaluated via simulations. Abstract | | |
| On the Probabilities of Identity States in Permutable Populations The American Journal of Human Genetics, Volume 62, Issue 3, 1 March 1998, Pages 698-702 C. Cannings Abstract Summary:
Génin and Clerget-Darpoux recently discussed the derivation of the probabilities of identity states for populations in which there was some degree of kinship, primarily to allow the extension of the classical affected-sib-pair method to such populations. It is argued here that their derivation makes certain assumptions that are valid only for some very restricted population models and that are not needed for an appropriate treatment. Here the probabilities of the identity states of two individuals with a given genealogical relationship are specified in terms of the kinship parameters of the underlying population, from which the founders of the individuals' genealogy have been randomly selected. It is argued that an appropriate representation for a permutable population, one in which gene identity does not depend on the pattern of genes across individuals, requires three parameters. This representation is related to that of Génin and Clerget-Darpoux and to that of Weir. Abstract | | |