Copyright © 2008 The American Society of Human Genetics. All rights reserved.
The American Journal of Human Genetics, Volume 82, Issue 4, 883-894, 20 March 2008
doi:10.1016/j.ajhg.2008.01.017
Article
Shuhua Xu1, 2, Wei Huang3, Ji Qian2 and Li Jin1, 2,
, 
1 Chinese Academy of Sciences and Max Planck Society (CAS-MPG) Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
2 Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology and Center for Evolutionary Biology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai 200433, China
3 Chinese National Human Genome Center at Shanghai, Shanghai 201203, China
Corresponding authorAbstract
The Uyghur (UIG) population, settled in Xinjiang, China, is a population presenting a typical admixture of Eastern and Western anthropometric traits. We dissected its genomic structure at population level, individual level, and chromosome level by using 20,177 SNPs spanning nearly the entire chromosome 21. Our results showed that UIG was formed by two-way admixture, with 60% European ancestry and 40% East Asian ancestry. Overall linkage disequilibrium (LD) in UIG was similar to that in its parental populations represented in East Asia and Europe with regard to common alleles, and UIG manifested elevation of LD only within 500 kb and at a level of 0.1 < r2 < 0.8 when ancestry-informative markers (AIMs) were used. The size of chromosomal segments that were derived from East Asian and European ancestries averaged 2.4 cM and 4.1 cM, respectively. Both the magnitude of LD and fragmentary ancestral chromosome segments indicated a long history of Uyghur. Under the assumption of a hybrid isolation (HI) model, we estimated that the admixture event of UIG occurred about 126 [107∼146] generations ago, or 2520 [2140∼2920] years ago assuming 20 years per generation. In spite of the long history and short LD of Uyghur compared with recent admixture populations such as the African-American population, we suggest that mapping by admixture LD (MALD) is still applicable in the Uyghur population but ∼10-fold AIMs are necessary for a whole-genome scan.
| A Coalescence-Guided Hierarchical Bayesian Method for Haplotype Inference The American Journal of Human Genetics, Volume 79, Issue 2, 1 August 2006, Pages 313-322 Yu Zhang, Tianhua Niu and Jun S. Liu Abstract Haplotype inference from phase-ambiguous multilocus genotype data is an important task for both disease-gene mapping and studies of human evolution. We report a novel haplotype-inference method based on a coalescence-guided hierarchical Bayes model. In this model, a hierarchical structure is imposed on the prior haplotype frequency distributions to capture the similarities among modern-day haplotypes attributable to their common ancestry. As a consequence, the model both allows distinct haplotypes to have different a priori probabilities according to the inferred hierarchical ancestral structure and results in a proper joint posterior distribution for all the parameters of interest. A Markov chain–Monte Carlo scheme is designed to draw from this posterior distribution. By using coalescence-based simulation and empirically generated data sets (Whitehead Institute's inflammatory bowel disease data sets and HapMap data sets), we demonstrate the merits of the new method in comparison with HAPLOTYPER and PHASE, with or without the presence of recombination hotspots and missing genotypes. Abstract | | |
| Evaluation of Genetic Variation Contributing to Differences in Gene Expression between Populations The American Journal of Human Genetics, Volume 82, Issue 3, 3 March 2008, Pages 631-640 Wei Zhang, Shiwei Duan, Emily O. Kistner, Wasim K. Bleibel, R. Stephanie Huang, Tyson A. Clark, Tina X. Chen, Anthony C. Schweitzer, John E. Blume, Nancy J. Cox and M. Eileen Dolan Abstract Gene expression is a complex quantitative trait partially regulated by genetic variation in DNA sequence. Population differences in gene expression could contribute to some of the observed differences in susceptibility to common diseases and response to drug treatments. We characterized gene expression in the full set of HapMap lymphoblastoid cell lines derived from individuals of European and African ancestry for 9156 transcript clusters (gene-level) evaluated with the Affymetrix GeneChip Human Exon 1.0 ST Array. Gene expression was found to differ significantly between these samples for 383 transcript clusters. Biological processes including ribosome biogenesis and antimicrobial humoral response were found to be enriched in these differential genes, suggesting their possible roles in contributing to the population differences at a higher level than that of mRNA expression and in response to environmental information. Genome-wide association studies for local or distant genetic variants that correlate with the differentially expressed genes enabled identification of significant associations with one or more single-nucleotide polymorphisms (SNPs), consistent with the hypothesis that genetic factors and not simply population identity or other characteristics (age of cell lines, length of culture, etc.) contribute to differences in gene expression in these samples. Our results provide a comprehensive view of the genes differentially expressed between populations and the enriched biological processes involved in these genes. We also provide an evaluation of the contributions of genetic variation and nongenetic factors to the population differences in gene expression. Abstract | | |
| Genetic Architecture of Transcript-Level Variation in Humans The American Journal of Human Genetics, Volume 82, Issue 5, 9 May 2008, Pages 1101-1113 Shiwei Duan, R. Stephanie Huang, Wei Zhang, Wasim K. Bleibel, Cheryl A. Roe, Tyson A. Clark, Tina X. Chen, Anthony C. Schweitzer, John E. Blume, Nancy J. Cox and M. Eileen Dolan Abstract We report here the results of testing the pairwise association of 12,747 transcriptional gene-expression values with more than two million single-nucleotide polymorphisms (SNPs) in samples of European (CEPH from Utah; CEU) and African (Yoruba from Ibadan; YRI) ancestry. We found 4,677 and 5,125 significant associations between expression quantitative nucleotides (eQTNs) and transcript clusters in the CEU and the YRI samples, respectively. The physical distance between an eQTN and its associated transcript cluster was referred to as the intrapair distance. An association with 4 Mb or less intrapair distance was defined as local; otherwise, it was defined as distant. The enrichment analysis of functional categories shows that genes harboring the local eQTNs are enriched in the categories related to nucleosome and chromatin assembly; the genes harboring the distant eQTNs are enriched in the categories related to transmembrane signal transduction, suggesting that these biological pathways are likely to play a significant role in regulation of gene expression. We highlight in the EPHX1 gene a deleterious nonsynonymous SNP that is distantly associated with gene expression of ORMDL3, a susceptibility gene for asthma. Abstract | | |