论文标题

利用Google的出版商特定ID来检测网站管理

Leveraging Google's Publisher-specific IDs to Detect Website Administration

论文作者

Papadogiannakis, Emmanouil, Papadopoulos, Panagiotis, Markatos, Evangelos P., Kourtellis, Nicolas

论文摘要

数字广告是互联网上内容货币化的最流行方式。出版商产生了新网站,而较旧网站则易于换手,目的是通过用户流量获利。在这个不断发展的生态系统中,有效回答诸如以下问题:哪些实体将哪些网站货币化是一项挑战?普通实体通常会以哪些类别的网站获利以及这些网站的多样化?这个网站管理生态系统如何改变? 在本文中,我们提出了一种基于图形的新方法,以利用与广告相关的发行商特定的ID来检测网络上的管理。我们将我们的方法应用于前100万个网站,并研究网站管理创建图的特征。我们的发现表明,大约90%的网站与一个发行商相关联,而小型出版商倾向于管理不太受欢迎的网站。我们对多达800万个网站进行历史分析,并找到一个新的,不断增加的(中介)发行商,这些发行商可以控制数百个网站的流量,从而寻求广告市场的份额。我们还观察到,随着时间的流逝,网站倾向于从大型管理员转移到小型管理员。

Digital advertising is the most popular way for content monetization on the Internet. Publishers spawn new websites, and older ones change hands with the sole purpose of monetizing user traffic. In this ever-evolving ecosystem, it is challenging to effectively answer questions such as: Which entities monetize what websites? What categories of websites does an average entity typically monetize on and how diverse are these websites? How has this website administration ecosystem changed across time? In this paper, we propose a novel, graph-based methodology to detect administration of websites on the Web, by exploiting the ad-related publisher-specific IDs. We apply our methodology across the top 1 million websites and study the characteristics of the created graphs of website administration. Our findings show that approximately 90% of the websites are associated each with a single publisher, and that small publishers tend to manage less popular websites. We perform a historical analysis of up to 8 million websites, and find a new, constantly rising number of (intermediary) publishers that control and monetize traffic from hundreds of websites, seeking a share of the ad-market pie. We also observe that over time, websites tend to move from big to smaller administrators.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源