论文标题
当日交货公平
Same-Day Delivery with Fairness
论文作者
论文摘要
在过去的几年中,对当天分娩(SDD)的需求迅速增加,在19009年大流行期间尤其蓬勃发展。快速增长并非没有挑战。在2016年,由于成员资格的浓度低,距离仓库距离很远,因此某些少数民族社区被排除在获得亚马逊的SDD服务之外,这引起了人们对公平性的担忧。在本文中,我们研究了向客户提供公平SDD服务的问题。服务区域分为不同的区域。在一天的过程中,客户要求提供SDD服务以及请求和交付地点的时间安排不提前。调度员在交付截止日期之前动态分配了车辆以向被接受的客户进行交付。除了总体服务率(实用程序)外,我们还最大程度地提高了所有地区(公平)的最低区域服务率。我们将问题建模为多目标马尔可夫决策过程,并开发深入的Q学习解决方案方法。我们介绍了从费率到实际服务的新型学习转变,这创造了一个稳定,有效的学习过程。计算结果证明了我们方法在不同客户地理学中在空间和时间上减轻不公平的有效性。我们还显示,这种有效性在不同的仓库位置是有效的,为企业提供了从任何位置获得更好公平性的机会。此外,我们考虑了忽略服务公平性的影响,结果表明,当客户对服务水平的期望很高时,我们的政策最终优于公用事业驱动的基线。
The demand for same-day delivery (SDD) has increased rapidly in the last few years and has particularly boomed during the COVID-19 pandemic. The fast growth is not without its challenge. In 2016, due to low concentrations of memberships and far distance from the depot, certain minority neighborhoods were excluded from receiving Amazon's SDD service, raising concerns about fairness. In this paper, we study the problem of offering fair SDD-service to customers. The service area is partitioned into different regions. Over the course of a day, customers request for SDD service, and the timing of requests and delivery locations are not known in advance. The dispatcher dynamically assigns vehicles to make deliveries to accepted customers before their delivery deadline. In addition to the overall service rate (utility), we maximize the minimal regional service rate across all regions (fairness). We model the problem as a multi-objective Markov decision process and develop a deep Q-learning solution approach. We introduce a novel transformation of learning from rates to actual services, which creates a stable and efficient learning process. Computational results demonstrate the effectiveness of our approach in alleviating unfairness both spatially and temporally in different customer geographies. We also show this effectiveness is valid with different depot locations, providing businesses with an opportunity to achieve better fairness from any location. Further, we consider the impact of ignoring fairness in service, and results show that our policies eventually outperform the utility-driven baseline when customers have a high expectation on service level.