论文标题

在机器学习时代推断细菌特征的遗传力

Inferring the heritability of bacterial traits in the era of machine learning

论文作者

Mai, The Tien, Lees, John A, Gladstone, Rebecca A, Corander, Jukka

论文摘要

遗传力的量化是遗传学中的基本逃亡者,它允许评估添加剂遗传变异对感兴趣特征变异性的贡献。评估性状遗传力的传统计算方法已在定量遗传学领域发展。但是,现代人口基因组学的兴起具有较大的样本量,导致了几种基于机器学习的新方法来推断遗传力。在本文中,我们系统地总结了机器学习的最新进展,该进步可用于推断遗传力。我们专注于这些方法在细菌基因组中的应用,在细菌基因组中,遗传力在理解诸如抗生素耐药性和毒力等表型中起着关键作用,由于抗菌耐药性的频率上升,这一点尤其重要。通过设计一种遗传力模型,该模型结合了全基因组链接的现实模式,不平衡经常重组细菌病原体,我们测试了包括GCTA在内的各种推理方法的广泛范围。除了合成数据基准外,我们还提供了多种细菌病原体抗生素耐药性特征的方法的比较。基准测试和实际数据分析的见解表明,不同方法的高度可变性能,并表明遗传力推断可能会从量身定制方法来对目标生物的特定遗传结构的特定遗传结构中受益。

Quantification of heritability is a fundamental desideratum in genetics, which allows an assessment of the contribution of additive genetic variation to the variability of a trait of interest. The traditional computational approaches for assessing the heritability of a trait have been developed in the field of quantitative genetics. However, the rise of modern population genomics with large sample sizes has led to the development of several new machine learning based approaches to inferring heritability. In this paper, we systematically summarize recent advances in machine learning which can be used to infer heritability. We focus on an application of these methods to bacterial genomes, where heritability plays a key role in understanding phenotypes such as antibiotic resistance and virulence, which are particularly important due to the rising frequency of antimicrobial resistance. By designing a heritability model incorporating realistic patterns of genome-wide linkage disequilibrium for a frequently recombining bacterial pathogen, we test the performance of a wide spectrum of different inference methods, including also GCTA. In addition to the synthetic data benchmark, we present a comparison of the methods for antibiotic resistance traits for multiple bacterial pathogens. Insights from the benchmarking and real data analyses indicate a highly variable performance of the different methods and suggest that heritability inference would likely benefit from tailoring of the methods to the specific genetic architecture of the target organism.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源