Paper title
OCR quality affects perceived usefulness of historical newspaper clippings -- a user study
Paper authors
Paper abstract
The effects of Optical Character Recognition (OCR) quality on historical information retrieval have so far been studied in data-oriented scenarios that measure the effectiveness of retrieval results. Such studies have either focused on the effects of artificially degraded OCR quality (see, e.g., [1-2]) or used test collections containing texts based on authentic low-quality OCR data (see, e.g., [3]). In this paper, the effects of OCR quality are studied in a user-oriented information retrieval setting. Thirty-two users each subjectively evaluated the query results of six topics (out of 30 topics), based on pre-formulated queries, in a simulated work task setting. To the best of our knowledge, our simulated work task experiment is the first to show empirically that users' subjective relevance assessments of retrieved documents are affected by a change in the quality of optically read text. Users of historical newspaper collections have so far commented on the effects of OCR'ed data quality mainly in impressionistic ways, and controlled user environments for studying the effects of OCR quality on users' relevance assessments of retrieval results have been missing. To remedy this, the National Library of Finland (NLF) set up an experimental query environment for the contents of one Finnish historical newspaper, Uusi Suometar 1869-1918, in order to compare users' evaluations of search results for digitized newspaper articles at two different OCR qualities. The query interface presented the same underlying document to the user in one of two versions, based on either the lower or the higher OCR quality, and the choice was randomized. The users did not know about the quality differences in the article texts they evaluated. The main result of the study is that improved optical character recognition quality significantly affects the perceived usefulness of historical newspaper articles. The mean average evaluation score for the improved OCR results was 7.94% higher than the mean average evaluation score for the old OCR results.
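As a rough illustration of the two mechanics the abstract mentions (randomly choosing which OCR version to show, and expressing the difference between mean scores as a percentage), a minimal Python sketch might look like the following. The function names, the "old"/"new" labels, and the example ratings are all hypothetical; the abstract does not specify the rating scale or how the study was implemented, and the numbers below are not the study's data.

```python
import random
from statistics import mean


def pick_ocr_version(rng: random.Random) -> str:
    """Randomly choose which OCR version of a document to present.

    The labels "old" and "new" are illustrative placeholders.
    """
    return rng.choice(["old", "new"])


def relative_improvement(old_scores, new_scores) -> float:
    """Percentage by which the mean of new_scores exceeds the mean of old_scores."""
    old_mean = mean(old_scores)
    new_mean = mean(new_scores)
    return (new_mean - old_mean) / old_mean * 100.0


# Hypothetical ratings, invented for illustration only.
example_old = [2, 3, 1, 2]
example_new = [3, 3, 2, 2]
improvement = relative_improvement(example_old, example_new)
```

Under these assumptions, a reported figure such as 7.94% would correspond to the value returned by `relative_improvement` when it is applied to the study's actual old-OCR and improved-OCR scores.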