Copyright © 2007 The American Society of Human Genetics. All rights reserved.
The American Journal of Human Genetics, Volume 81, Issue 6, 1221-1231, 1 December 2007
doi:10.1086/522239
Article
Stephen L. Gutherya, Benjamin A. Salisburyb, Manish S. Pungliyab, J. Claiborne Stephensc and Michael Bamshadd, e,
, 
a Department of Pediatrics, University of Utah, Salt Lake City
b Genaissance Pharmaceuticals, New Haven, CT
c Departments of Pediatrics and Genome Sciences, University of Washington, Motif BioSciences, New York
d Departments of Pediatrics and Genome Sciences, University of Washington, Seattle
e Children’s Hospital and Regional Medical Center, Seattle
Address for correspondence and reprints: Dr. Michael Bamshad, Department of Pediatrics, Division of Genetics and Developmental Medicine, University of Washington School of Medicine, 1959 NE Pacific Street, HSB RR349, Seattle, WA 98195Abstract
The common-variant/common-disease model predicts that most risk alleles underlying complex health-related traits are common and, therefore, old and found in multiple populations, rather than being rare or population specific. Accordingly, there is widespread interest in assessing the population structure of common alleles. However, such assessments have been confounded by analysis of data sets with bias toward ascertainment of common alleles (e.g., HapMap and Perlegen) or in which a relatively small number of genes and/or populations were sampled. The aim of this study was to examine the structure of common variation ascertained in major U.S. populations, by resequencing the exons and flanking regions of 3,873 genes in 154 chromosomes from European, Latino/Hispanic, Asian, and African Americans generated by the Genaissance Resequencing Project. The frequency distributions of private and common single-nucleotide polymorphisms (SNPs) were measured, and the extent to which common SNPs were shared across populations was analyzed using several different estimators of population structure. Most SNPs that were common in one population were present in multiple populations, but SNPs common in one population were frequently not common in other populations. Moreover, SNPs that were common in two or more populations often differed significantly in frequency from one population to another, particularly in comparisons of African Americans versus other U.S. populations. These findings indicate that, even if the bulk of alleles underlying complex health-related traits are common SNPs, geographic ancestry might well be an important predictor of whether a person carries a risk allele.
| Analysis of Genomic Admixture in Uyghur and Its Implication in Mapping Strategy The American Journal of Human Genetics, Volume 82, Issue 4, 11 April 2008, Pages 883-894 Shuhua Xu, Wei Huang, Ji Qian and Li Jin Abstract The Uyghur (UIG) population, settled in Xinjiang, China, is a population presenting a typical admixture of Eastern and Western anthropometric traits. We dissected its genomic structure at population level, individual level, and chromosome level by using 20,177 SNPs spanning nearly the entire chromosome 21. Our results showed that UIG was formed by two-way admixture, with 60% European ancestry and 40% East Asian ancestry. Overall linkage disequilibrium (LD) in UIG was similar to that in its parental populations represented in East Asia and Europe with regard to common alleles, and UIG manifested elevation of LD only within 500 kb and at a level of 0.1 < r2 < 0.8 when ancestry-informative markers (AIMs) were used. The size of chromosomal segments that were derived from East Asian and European ancestries averaged 2.4 cM and 4.1 cM, respectively. Both the magnitude of LD and fragmentary ancestral chromosome segments indicated a long history of Uyghur. Under the assumption of a hybrid isolation (HI) model, we estimated that the admixture event of UIG occurred about 126 [107∼146] generations ago, or 2520 [2140∼2920] years ago assuming 20 years per generation. In spite of the long history and short LD of Uyghur compared with recent admixture populations such as the African-American population, we suggest that mapping by admixture LD (MALD) is still applicable in the Uyghur population but ∼10-fold AIMs are necessary for a whole-genome scan. Abstract | | |
| A Genomewide Single-Nucleotide–Polymorphism Panel for Mexican American Admixture Mapping The American Journal of Human Genetics, Volume 80, Issue 6, 1 June 2007, Pages 1014-1023 Chao Tian, David A. Hinds, Russell Shigeta, Sharon G. Adler, Annette Lee, Madeleine V. Pahl, Gabriel Silva, John W. Belmont, Robert L. Hanson, William C. Knowler, Peter K. Gregersen, Dennis G. Ballinger and Michael F. Seldin Abstract For admixture mapping studies in Mexican Americans (MAM), we define a genomewide single-nucleotide–polymorphism (SNP) panel that can distinguish between chromosomal segments of Amerindian (AMI) or European (EUR) ancestry. These studies used genotypes for >400,000 SNPs, defined in EUR and both Pima and Mayan AMI, to define a set of ancestry-informative markers (AIMs). The use of two AMI populations was necessary to remove a subset of SNPs that distinguished genotypes of only one AMI subgroup from EUR genotypes. The AIMs set contained 8,144 SNPs separated by a minimum of 50 kb with only three intermarker intervals >1 Mb and had EUR/AMI FST values >0.30 (mean FST=0.48) and Mayan/Pima FST values <0.05 (mean FST<0.01). Analysis of a subset of these SNP AIMs suggested that this panel may also distinguish ancestry between EUR and other disparate AMI groups, including Quechuan from South America. We show, using realistic simulation parameters that are based on our analyses of MAM genotyping results, that this panel of SNP AIMs provides good power for detecting disease-associated chromosomal segments for genes with modest ethnicity risk ratios. A reduced set of 5,287 SNP AIMs captured almost the same admixture mapping information, but smaller SNP sets showed substantial drop-off in admixture mapping information and power. The results will enable studies of type 2 diabetes, rheumatoid arthritis, and other diseases among which epidemiological studies suggest differences in the distribution of ancestry-associated susceptibility. Abstract | | |
| Estimating Ethnic Admixture from Pedigree Data The American Journal of Human Genetics, Volume 82, Issue 3, 3 March 2008, Pages 748-755 Janet S. Sinsheimer, Christopher L. Plaisier, Adriana Huertas-Vazquez, Carlos Aguilar-Salinas, Teresa Tusie-Luna, Päivi Pajukanta and Kenneth Lange Abstract This paper introduces a likelihood method of estimating ethnic admixture that uses individuals, pedigrees, or a combination of individuals and pedigrees. For each founder of a pedigree, admixture proportions are calculated by conditioning on the pedigree-wide genotypes at all ancestry-informative markers. These estimates are then propagated down the pedigree to the nonfounders by a simple averaging process. The large-sample standard errors of the founders' proportions can be similarly transformed into standard errors for the admixture proportions of the descendants. These standard errors are smaller than the corresponding standard errors when each individual is treated independently. Both hard and soft information on a founder's ancestry can be accommodated in this scheme, which has been implemented in the genetic software package Mendel. The utility of the method is demonstrated on simulated data and a real data example involving Mexican families of mixed Amerindian and Spanish ancestry. Abstract | | |