Chromosome X-wide analysis of positive selection in human populations: from common and private signals to selection impact on inactivated genes and enhancers-like signatures
biorxiv(2021)
摘要
The ability of detecting adaptive (positive) selection in the genome has opened the possibility of understanding the genetic bases of population-specific adaptations genome-wide. Here we present the analysis of recent selective sweeps specifically in the X chromosome in different human populations from the third phase of the 1000 Genomes Project using three different haplotype-based statistics. We describe numerous instances of genes under recent positive selection that fit the regimes of hard and soft sweeps, showing a higher amount of detectable sweeps in sub-Saharan Africans than in non-Africans (Europe and East Asia). A global enrichment is seen in neural-related processes while numerous genes related to fertility appear among the top candidates, reflecting the importance of reproduction in human evolution. Commonalities with previously reported genes under positive selection are found, while particularly strong new signals are reported in specific populations or shared across different continental groups. We report an enrichment of signals in genes that escape X chromosome inactivation, which may contribute to the differentiation between sexes. We also provide evidence of a widespread presence of soft-sweep-like signatures across the chromosome and a global enrichment of highly scoring regions that overlap potential regulatory elements. Among these, enhancers-like signatures seem to present putative signals of positive selection that might be in concordance with selection in their target genes. Also, particularly strong signals appear in regulatory regions that show differential activities, which might point to population-specific regulatory adaptations.
### Competing Interest Statement
The authors have declared no competing interest.
* CADD
: Combined Annotation Dependent Depletion
DAF
: Derived Allele Frequency
EHH
: Extended Haplotype Homozygosity
eQTL
: Expression Quantitative Trait Loci
GO
: Gene Ontology
HACER
: Human Active Enhancer to interpret Regulatory variants database
HMM
: Hidden Markov Model
iHH12
: Integrated Haplotype Homozygosity pooled test
iHS
: Integrated Haplotype Score test
Kb
: Kilobases
LD
: Linkage disequilibrium
MAF
: Minor Allele Frequency
Mb
: Megabases
nPAR
: Non-pseudoautosomal region
nSL
: Number of segregating sites by length test
OEA
: Overrepresentation Enrichment Analysis
PAR
: Pseudoautosomal region
SFS
: Site Frequency Spectrum
SNP
: Single Nucleotide Polymorphism
TFBS
: Transcription Factor Binding Site
XCI
: X Chromosome Inactivation
XIC
: X-inactivation center
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要