Paper title
Patapsco: A Python Framework for Cross-Language Information Retrieval Experiments
Paper authors
Paper abstract
While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many language pairs, to be scalable to large document collections, and to support reproducible experiments driven by a configuration file. We include Patapsco results on standard CLIR collections using multiple settings.