论文标题

沿着兔子的洞走:特征wikipedia阅读会议的长尾巴

Going Down the Rabbit Hole: Characterizing the Long Tail of Wikipedia Reading Sessions

论文作者

Piccardi, Tiziano, Gerlach, Martin, West, Robert

论文摘要

“ Wiki Rabbit洞”被非正式地定义为导航路径,其次是Wikipedia读者,导致他们进行长期探索,有时涉及意外文章。尽管Wiki Rabbit洞是互联网文化中的一个流行概念,但我们目前对它们的动态的理解仅基于轶事报告。为了弥合这一差距,本文对掉入Wiki Rabbit洞的读者的导航痕迹进行了大规模的定量表征。首先,我们代表用户会话作为导航树,并根据这些树的深度操作Wiki Rabbit孔的概念。然后,我们根据结构模式,时间特性和局部探索来表征兔子孔会话。 我们发现文章的布局影响了兔子洞会话的结构,并且夜晚的兔子孔会话的比例更高。此外,从有关娱乐,体育,政治和历史的文章开始,读者更有可能陷入兔子洞。最后,我们观察到,平均而言,即使在兔子洞会议期间,读者也倾向于通过留在第一篇文章的语义社区来关注一个主题。 这些发现有助于我们理解Wikipedia读者在网络上的信息需求和用户行为。

"Wiki rabbit holes" are informally defined as navigation paths followed by Wikipedia readers that lead them to long explorations, sometimes involving unexpected articles. Although wiki rabbit holes are a popular concept in Internet culture, our current understanding of their dynamics is based on anecdotal reports only. To bridge this gap, this paper provides a large-scale quantitative characterization of the navigation traces of readers who fell into a wiki rabbit hole. First, we represent user sessions as navigation trees and operationalize the concept of wiki rabbit holes based on the depth of these trees. Then, we characterize rabbit hole sessions in terms of structural patterns, time properties, and topical exploration. We find that article layout influences the structure of rabbit hole sessions and that the fraction of rabbit hole sessions is higher during the night. Moreover, readers are more likely to fall into a rabbit hole starting from articles about entertainment, sports, politics, and history. Finally, we observe that, on average, readers tend to stay focused on one topic by remaining in the semantic neighborhood of the first articles even during rabbit hole sessions. These findings contribute to our understanding of Wikipedia readers' information needs and user behavior on the Web.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源