论文标题
算法分类器的分类表示学习和RG流动运算符
Categorical Representation Learning and RG flow operators for algorithmic classifiers
论文作者
论文摘要
遵循前两个作者的分类表示学习(ARXIV:2103.14770)的早期形式主义,我们讨论了“基于RG-Flow的分类器”的构建。 Borrowing ideas from theory of renormalization group flows (RG) in quantum field theory, holographic duality, and hyperbolic geometry, and mixing them with neural ODE's, we construct a new algorithmic natural language processing (NLP) architecture, called the RG-flow categorifier or for short the RG categorifier, which is capable of data classification and generation in all layers.我们将算法平台应用于生物医学数据集,并在序列功能映射的领域显示其性能。特别是,我们将RG分类器应用于流感病毒的特定基因组序列,并显示我们的技术如何从给定的基因组序列中提取信息,找到其隐藏的对称性和主导特征,对其进行分类,并使用训练有素的数据来对新的Plausible预测与新的Plausible序列相关的与新病毒相关的新序列,从而避免了人类免疫系统。当前文章的内容是前两位作者最近提交的美国专利申请的一部分(美国专利申请号:63/313.504)。
Following the earlier formalism of the categorical representation learning (arXiv:2103.14770) by the first two authors, we discuss the construction of the "RG-flow based categorifier". Borrowing ideas from theory of renormalization group flows (RG) in quantum field theory, holographic duality, and hyperbolic geometry, and mixing them with neural ODE's, we construct a new algorithmic natural language processing (NLP) architecture, called the RG-flow categorifier or for short the RG categorifier, which is capable of data classification and generation in all layers. We apply our algorithmic platform to biomedical data sets and show its performance in the field of sequence-to-function mapping. In particular we apply the RG categorifier to particular genomic sequences of flu viruses and show how our technology is capable of extracting the information from given genomic sequences, find their hidden symmetries and dominant features, classify them and use the trained data to make stochastic prediction of new plausible generated sequences associated with new set of viruses which could avoid the human immune system. The content of the current article is part of the recent US patent application submitted by first two authors (U.S. Patent Application No.: 63/313.504).