论文标题
数据驱动的网络神经科学:关于数据收集和基准测试
Data-Driven Network Neuroscience: On Data Collection and Benchmark
论文作者
论文摘要
本文介绍了功能性人脑网络数据的全面收集,以在神经科学,机器学习和图形分析的交集中进行潜在的研究。解剖学和功能性MRI图像已用于了解人脑的功能连通性,并且在识别基本神经退行性疾病(例如阿尔茨海默氏症,帕金森氏症和自闭症)方面尤其重要。最近,使用机器学习和图形分析以大脑网络形式对大脑的研究变得越来越流行,尤其是为了预测这些疾病的早期发作。表示为图的大脑网络保留了传统检查方法无法捕获的丰富结构和位置信息。但是,缺乏公开访问的大脑网络数据可阻止研究人员无法进行数据驱动的探索。主要困难之一在于复杂的域特异性预处理步骤以及将数据从MRI图像转换为脑网络所需的详尽计算。我们通过从公共数据库和私人来源收集大量MRI图像来弥合这一差距,与域专家合作,做出明智的设计选择,并预处理MRI图像以生成大脑网络数据集的集合。该数据集来自6种不同的来源,覆盖4个大脑条件,由2,702名受试者组成。我们测试了12个机器学习模型上的图形数据集,以提供基准并验证最近的图形分析模型的数据质量。 To lower the barrier to entry and promote the research in this interdisciplinary field, we release our brain network data and complete preprocessing details including codes at https://doi.org/10.17608/k6.auckland.21397377 and https://github.com/brainnetuoa/data_driven_network_neuroscience.
This paper presents a comprehensive and quality collection of functional human brain network data for potential research in the intersection of neuroscience, machine learning, and graph analytics. Anatomical and functional MRI images have been used to understand the functional connectivity of the human brain and are particularly important in identifying underlying neurodegenerative conditions such as Alzheimer's, Parkinson's, and Autism. Recently, the study of the brain in the form of brain networks using machine learning and graph analytics has become increasingly popular, especially to predict the early onset of these conditions. A brain network, represented as a graph, retains rich structural and positional information that traditional examination methods are unable to capture. However, the lack of publicly accessible brain network data prevents researchers from data-driven explorations. One of the main difficulties lies in the complicated domain-specific preprocessing steps and the exhaustive computation required to convert the data from MRI images into brain networks. We bridge this gap by collecting a large amount of MRI images from public databases and a private source, working with domain experts to make sensible design choices, and preprocessing the MRI images to produce a collection of brain network datasets. The datasets originate from 6 different sources, cover 4 brain conditions, and consist of a total of 2,702 subjects. We test our graph datasets on 12 machine learning models to provide baselines and validate the data quality on a recent graph analysis model. To lower the barrier to entry and promote the research in this interdisciplinary field, we release our brain network data and complete preprocessing details including codes at https://doi.org/10.17608/k6.auckland.21397377 and https://github.com/brainnetuoa/data_driven_network_neuroscience.