EA-YPredictor:基于Y-STR数据的家系特异性单倍群归属判别分析软件

殷才湧, 孙辉, 周怀谷, 金力, 李士林

刑事技术 ›› 2020, Vol. 45 ›› Issue (2) : 117-124. DOI: 10.16467/j.1008-3650.2020.02.002
“十三五”专栏

EA-YPredictor:基于Y-STR数据的家系特异性单倍群归属判别分析软件

  • 殷才湧1, 2, 3, 孙辉4, 周怀谷5, 金力1, 2, 3, 6, 李士林1, 2, 3, 6, *
作者信息 +

EA-YPredictor: One New Software Developed to Predict Pedigree Haplogroup Based on Y-STR Haplotypes

  • YIN Caiyong1, 2, 3, SUN Hui4, ZHOU Huaigu5, JIN Li1, 2, 3, 6, LI Shilin1, 2, 3, 6, *
Author information +
History +

摘要

目的 Y染色体为男性所特有,其遗传标记蕴含着丰富的生物地理信息,故可溯源家系,在嫌疑人排查和追踪中发挥作用。Y-STR突变率较高,而Y-SNP突变率极低,几乎不会发生回复突变,所以后代男性群体携带祖先特有的Y-SNP。本研究期望通过现在我国Y库建设中通用的17个Y-STR的单倍型数据预测Y-SNP单倍群细支。方法 基于前期观察,选取千人基因组计划III期中的513例东亚人群(中国及周边区域)作为基础数据集,在Java平台和Microsoft Excel软件框架下,以遗传距离计算和Y染色体进化树构建手段相联合研发Y-STR数据的家系特异性单倍群归属判别分析软件:EA-YPredictor。结果 本研究揭示了15个单倍群大支下的核心单倍型。通过随机选取70个公开数据库样本,EA-YPredictor软件预测准确性达到92.8%(95%置信区间:[84.1%, 97.6%])。结论 在Y-SNP复合扩增检测尚无定论的情况下,本软件可基于二代测序样本对Y-STR数据库样本进行单倍群细支的准确预测,能适用于辅助家系单倍群判断。随着测序技术的不断换代和优化,更多高通量的Y-STR和Y-SNP数据补充将会使本软件进一步优化。此外,本软件对于Y数据库中Y-SNP遗传标记的筛查建库有一定指向作用。

Abstract

Y chromosome, male specific, carres the genetic markers that could indicate the bio-geographic information of unknown male individuals, therefore playing vital and unique roles in pedigree searching and individuals tracking. Y-SNPs, the genetic markers hardly occurring of reverse mutation, harbor extreme low mutation rate compared to Y-STRs. Thus, male individuals of the same lineages always carry the specific Y-SNPs of their identical ancestors. However, Y-SNP typing is time- and cost- consuming, making it not an ideal tool for investigation. To target suspects’ pedigree for investigation purpose, Y-SNP haplogroups were here tentatively predicted from the current 17 Y-STR haplotypes that are commonly used for Y-database construction. With re-evaluation of previous research results and analysis of the open East-Asian datasets from 1000-Genome Project (Phase III), one new Y-STR-based pedigree-specific haplogroup prediction software, EA-YPredictor, was developed through calculation of genetic distance and Y chromosomal phylogenetic tree reconstruction, hence successfully having screened the 15 Y-SNP major haplogroups out from the core 17 Y-STR haplotypes on the adaptable Java platform or Microsoft Excel formulation. Randomly selected of 70 new samples, EA-YPredictor was tested of its prediction accuracy (92.8%, 95% CI: [84.1%, 97.6%]), revealing the exact Y haplogroup affiliation to the males of East Asian ancestry, thereby demonstrating its validity to predict the Y-SNP haplogroup of samples in existing Y chromosome database. Following the next-generation sequencing technology to confirm the location and annotation of more Y-STRs and Y-SNPs, EA-YPredictor will be definitely optimized further so that more indications would be provided on screening which set of Y-SNP markers from Y chromosome databases.

关键词

法医遗传学 / Y染色体单倍群预测 / Y-STR / Y-SNP / 东亚男性人群

Key words

forensic genetics / Y chromosomal haplogroup prediction / Y-STR (short tandem repeat) / Y-SNP (single nucleotide polymorphism) / East Asian male populations

引用本文

导出引用
殷才湧, 孙辉, 周怀谷, 金力, 李士林. EA-YPredictor:基于Y-STR数据的家系特异性单倍群归属判别分析软件. 刑事技术. 2020, 45(2): 117-124 https://doi.org/10.16467/j.1008-3650.2020.02.002
YIN Caiyong, SUN Hui, ZHOU Huaigu, JIN Li, LI Shilin. EA-YPredictor: One New Software Developed to Predict Pedigree Haplogroup Based on Y-STR Haplotypes. Forensic Science and Technology. 2020, 45(2): 117-124 https://doi.org/10.16467/j.1008-3650.2020.02.002

参考文献

[1] FAN Y, SILBER S J.Y Chromosome Infertility[J]. GeneReviews®, Seattle(WA), 1993.
[2] GRAVES J A.Sex chromosome specialization and degeneration in mammals[J]. Cell, 2006, 124(5): 901-914.
[3] JOBLING M A, TYLER-SMITH C.Fathers and sons: the Y chromosome and human evolution[J]. Trends Genet, 1995, 11(11): 449-456.
[4] BALLANTYNE K N, GOEDBLOED M, FANG R, et al.Mutability of Y-chromosomal microsatellites: rates, characteristics, molecular bases, and forensic implications[J]. The American Journal of Human Genethcs, 2010, 87(3): 341-353.
[5] MENDEZ F L, KRAHN T, SCHRACK B, et al.An African American paternal lineage adds an extremely ancient root to the human Y chromosome phylogenetic tree[J]. The American Journal of Human Genethcs, 2013, 92(3): 454-459.
[6] WHIT A.Haplogroup prediction from Y-STR values using an allele-frequency approach[J]. Journal of Genetic Genealogy, 2005, 1: 1-7.
[7] WHIT A.Haplogroup prediction from Y-STR values using a Bayesian-allele-frequency approach[J]. Journal of Genetic Genealogy, 2006, 2: 34-39.
[8] SCHLECHT J, KAPLAN M E, BARNARD K, et al.Machine-learning approaches for classifying haplogroup from Y chromosome STR data[J]. Plos Computational Biology, 2008, 4(6):e1000093.
[9] KE Y, SU B, SONG X, et al.African origin of modern humans in East Asia: a tale of 12,000 Y chromosomes[J]. Science, 2001, 292(5519): 1151-1153.
[10] AUTON A, BROOKS L D, DURBIN R M, et al.A global reference for human genetic variation[J]. Nature, 2015, 526(7571): 68-74.
[11] SUDMANT P H, RAUSCH T, GARDNER E J, et al.An integrated map of structural variation in 2,504 human genomes[J]. Nature, 2015,526(7571):75-81.
[12] NEI M.Phylogenetic analysis in molecular evolutionary genetics[J]. Annuul Review of Genetics, 1996, 30: 371-403.
[13] WU W, REN W, HAO H, et al.Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China[J]. International Journal of Legal Medicine, 2018, 132(5): 1317-1319.
[14] YANG Y, WANG W, CHENG F, et al.Haplotypic polymorphisms and mutation rate estimates of 22 Y-chromosome STRs in the Northern Chinese Han father-son pairs[J]. Scientific Reports, 2018,8(1):7135.
[15] WANG Y, ZHANG Y J, ZHANG C C, et al.Genetic polymorphisms and mutation rates of 27 Y-chromosomal STRs in a Han population from Guangdong Province, Southern China[J]. Forensic Science International:Genetics, 2016, 21: 5-9.
[16] BALLANTYNE K N, KEERL V, WOLLSTEIN A, et al.A new future of forensic Y-chromosome analysis: rapidly mutating Y-STRs for differentiating male relatives and paternal lineages[J]. Forensic Science International:Genetics, 2012, 6(2): 208-218.
[17] KIMURA M, CROW JF.The number of alleles that can be maintained in a finite population[J]. Genetics, 1964, 49(4): 725-738.
[18] WATAHIKI H, FUJII K, FUKAGAWA T, et al.Polymorphisms and microvariant sequences in the Japanese population for 25 Y-STR markers and their relationships to Y-chromosome haplogroups[J]. Forensic Science International:Genetics, 2019, 41: e1-e7.
[19] ARROYO-PARDO E, GUSMAO L, LOPEZ-PARRA A M, et al. Genetic variability of 16 Y-chromosome STRs in a sample from Equatorial Guinea (Central Africa)[J]. Forensic Science International, 2005, 149(1): 109-113.

基金

“十三五”国家重点研发计划课题(2016YFC1201700); 上海市市级科技重大专项(2017SHZDZX01); 中央级公益性科研院所基本科研业务费专项资金项目(2017JB004); 公安部物证鉴定中心开放课题(2015FGKFKT01); 上海市刑事科学技术研究院现场物证重点实验室开放课题(KEH1322878)

文章所在专题

二代DNA测序技术

447

Accesses

0

Citation

Detail

段落导航
相关文章

/