论文标题

机器学习模型,以识别墨西哥公共采购合同中的腐败

A machine learning model to identify corruption in México's public procurement contracts

论文作者

Aldana, Andrés, Falcón-Cortés, Andrea, Larralde, Hernán

论文摘要

政府腐败的成本和影响范围从损害一个国家的经济增长到影响其公民的福祉和安全。政府依赖与私营部门实例之间的公共契约(称为公共采购)是腐败实践的肥沃土地,在全球范围内造成了巨大的货币损失。因此,确定和阻止政府与私营部门之间的腐败活动至关重要。但是,由于几个因素,公共采购的腐败是识别和追踪的挑战,导致腐败的做法未被注意。本文提出了一个基于随机森林分类器集合的机器学习模型,我们称之为“超福”,以识别和预测墨西哥公共采购数据中的腐败合同。该方法的结果正确检测了数据集中评估的大​​多数损坏和非腐败合同。此外,我们发现该模型中考虑的最关键的预测因素是与买家与供应商之间的关系相关的,而不是与单个合同的特征相关的预测因素。此外,这里提出的方法足够通用,可以接受来自其他国家的数据的培训。总体而言,我们的工作提出了一种工具,可以帮助决策过程,以识别,预测和分析公共采购合同中的腐败。

The costs and impacts of government corruption range from impairing a country's economic growth to affecting its citizens' well-being and safety. Public contracting between government dependencies and private sector instances, referred to as public procurement, is a fertile land of opportunity for corrupt practices, generating substantial monetary losses worldwide. Thus, identifying and deterring corrupt activities between the government and the private sector is paramount. However, due to several factors, corruption in public procurement is challenging to identify and track, leading to corrupt practices going unnoticed. This paper proposes a machine learning model based on an ensemble of random forest classifiers, which we call hyper-forest, to identify and predict corrupt contracts in México's public procurement data. This method's results correctly detect most of the corrupt and non-corrupt contracts evaluated in the dataset. Furthermore, we found that the most critical predictors considered in the model are those related to the relationship between buyers and suppliers rather than those related to features of individual contracts. Also, the method proposed here is general enough to be trained with data from other countries. Overall, our work presents a tool that can help in the decision-making process to identify, predict and analyze corruption in public procurement contracts.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源