论文标题
跟踪巴西联邦官方公报的环境政策变化
Tracking environmental policy changes in the Brazilian Federal Official Gazette
论文作者
论文摘要
尽管它的大部分能源产生来自可再生能源,但由于强烈的耕作和生物群落(例如亚马逊雨林)的森林耕种,巴西是世界上最大的温室气体发射器之一,亚马逊雨林(Amazon Rainforest)的保存对于遵守巴黎协议至关重要。尽管如此,无论大堂或普遍的政治取向,所有政府的法律诉讼都会每天在巴西联邦官方公报(BFOG或葡萄牙的“Diáriooficial daUnião”)中发表。但是,当局每天都会颁布数百个法令,手动分析所有这些过程并找出哪些可能造成严重的环境危害绝对是繁重的。在本文中,我们提出了一种策略,旨在构成自动化技术和域专家知识,以处理来自BFOG的所有数据。我们还提供了葡萄牙的政府行动跟踪器,这是一个高度策划的数据集,由域名专家注释,涉及有关巴西环境政策的联邦政府行为。最后,我们在此数据集中的类别任务上构建并比较了四个不同的NLP模型。我们最佳模型的F1得分为$ 0.714 \ pm 0.031 $。将来,该系统应通过最少的人类监督来扩展所有文档的高质量跟踪,并有助于提高社会对政府行动的认识。
Even though most of its energy generation comes from renewable sources, Brazil is one of the largest emitters of greenhouse gases in the world, due to intense farming and deforestation of biomes such as the Amazon Rainforest, whose preservation is essential for compliance with the Paris Agreement. Still, regardless of lobbies or prevailing political orientation, all government legal actions are published daily in the Brazilian Federal Official Gazette (BFOG, or "Diário Oficial da União" in Portuguese). However, with hundreds of decrees issued every day by the authorities, it is absolutely burdensome to manually analyze all these processes and find out which ones can pose serious environmental hazards. In this paper, we present a strategy to compose automated techniques and domain expert knowledge to process all the data from the BFOG. We also provide the Government Actions Tracker, a highly curated dataset, in Portuguese, annotated by domain experts, on federal government acts about the Brazilian environmental policies. Finally, we build and compared four different NLP models on the classfication task in this dataset. Our best model achieved a F1-score of $0.714 \pm 0.031$. In the future, this system should serve to scale up the high-quality tracking of all oficial documents with a minimum of human supervision and contribute to increasing society's awareness of government actions.