论文标题
利用基于机器学习的单杆组决策聚合的元认知特征
Exploiting Meta-Cognitive Features for a Machine-Learning-Based One-Shot Group-Decision Aggregation
论文作者
论文摘要
集体决策过程(例如众包)的结果通常取决于其各个成员的观点被汇总的程序。流行的聚合方法(例如多数规则)通常无法产生最佳结果,尤其是在高复杂任务中。依赖元认知信息的方法,例如基于置信度的方法和令人惊讶的流行选择,已显示出各种任务的改进。但是,仍然有大量没有最佳解决方案的情况。我们的目的是利用元认知信息并从中学习,以增强小组产生正确答案的能力。具体而言,我们提出了两种不同的特征代表方法:(1)以响应为中心的特征表示(RCR),它重点介绍了个别响应实例的特征,以及(2)以答案为中心的特征表示(ACR),重点是每个潜在答案的特征。使用这两种功能代表方法,我们训练机器学习(ML)模型,以预测响应的正确性和答案的正确性。训练有素的模型被用作基于ML的聚合方法的基础,该方法与其他基于ML的技术相反,其优势是作为一种“单一”技术,独立于人群特定的组成和个人记录,并适应各种情况。为了评估我们的方法论,我们为不同的任务收集了2490个响应,我们用于特征工程和ML模型的培训。我们通过提出的基于ML的聚合方法的性能测试了我们的功能代表方法。与使用基于标准规则的聚合方法相比,成功率的增长率增加了20%至35%。
The outcome of a collective decision-making process, such as crowdsourcing, often relies on the procedure through which the perspectives of its individual members are aggregated. Popular aggregation methods, such as the majority rule, often fail to produce the optimal result, especially in high-complexity tasks. Methods that rely on meta-cognitive information, such as confidence-based methods and the Surprisingly Popular Option, had shown an improvement in various tasks. However, there is still a significant number of cases with no optimal solution. Our aim is to exploit meta-cognitive information and to learn from it, for the purpose of enhancing the ability of the group to produce a correct answer. Specifically, we propose two different feature-representation approaches: (1) Response-Centered feature Representation (RCR), which focuses on the characteristics of the individual response instances, and (2) Answer-Centered feature Representation (ACR), which focuses on the characteristics of each of the potential answers. Using these two feature-representation approaches, we train Machine-Learning (ML) models, for the purpose of predicting the correctness of a response and of an answer. The trained models are used as the basis of an ML-based aggregation methodology that, contrary to other ML-based techniques, has the advantage of being a "one-shot" technique, independent from the crowd-specific composition and personal record, and adaptive to various types of situations. To evaluate our methodology, we collected 2490 responses for different tasks, which we used for feature engineering and for the training of ML models. We tested our feature-representation approaches through the performance of our proposed ML-based aggregation methods. The results show an increase of 20% to 35% in the success rate, compared to the use of standard rule-based aggregation methods.