论文标题

哪些图书馆数字化忽略了:预测英语小说的数字替代物的可用性

What Library Digitization Leaves Out: Predicting the Availability of Digital Surrogates of English Novels

论文作者

Riddell, Allen, Bassett, Troy J.

论文摘要

图书馆数字化已为公众提供了超过十万19世纪的英语书籍。数字化的书籍是否反映了出版书籍的人数?一个肯定的答案将使书籍和文学历史学家能够将主要数字图书馆的持有用作已发表作品人群的代理,从而使他们享有收集代表样本的劳动。我们通过利用1836年和1838年不列颠群岛出版的小说的详尽书目来解决这个问题,确定其中哪些小说在互联网档案馆,Hathitrust,Google Books和British Librarals中至少有一位数字代理。我们发现数字替代的可用性不是随机的。某些类型的小说,尤其是以多卷格式出版的男性和小说撰写的小说,其数字替代物的价格明显高于其他类型的小说。由于导致这种结果的过程不太可能与1830年代后期隔离,因此这些发现表明,在邻近的几十年前和其他出版类型(例如非小说)中,可能会观察到类似的模式。

Library digitization has made more than a hundred thousand 19th-century English-language books available to the public. Do the books which have been digitized reflect the population of published books? An affirmative answer would allow book and literary historians to use holdings of major digital libraries as proxies for the population of published works, sparing them the labor of collecting a representative sample. We address this question by taking advantage of exhaustive bibliographies of novels published for the first time in the British Isles in 1836 and 1838, identifying which of these novels have at least one digital surrogate in the Internet Archive, HathiTrust, Google Books, and the British Library. We find that digital surrogate availability is not random. Certain kinds of novels, notably novels written by men and novels published in multivolume format, have digital surrogates available at distinctly higher rates than other kinds of novels. As the processes leading to this outcome are unlikely to be isolated to the novel and the late 1830s, these findings suggest that similar patterns will likely be observed during adjacent decades and in other genres of publishing (e.g., non-fiction).

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源