TY - JOUR
T1 - Use of Proximal Policy Optimization for the Joint Replenishment Problem
AU - Vanvuchelen, Nathalie
AU - Gijsbrechts, Joren
AU - Boute, Robert
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/8
Y1 - 2020/8
N2 - Deep reinforcement learning has been coined as a promising research avenue to solve sequential decision-making problems, especially if few is known about the optimal policy structure. We apply the proximal policy optimization algorithm to the intractable joint replenishment problem. We demonstrate how the algorithm approaches the optimal policy structure and outperforms two other heuristics. Its deployment in supply chain control towers can orchestrate and facilitate collaborative shipping in the Physical Internet.
AB - Deep reinforcement learning has been coined as a promising research avenue to solve sequential decision-making problems, especially if few is known about the optimal policy structure. We apply the proximal policy optimization algorithm to the intractable joint replenishment problem. We demonstrate how the algorithm approaches the optimal policy structure and outperforms two other heuristics. Its deployment in supply chain control towers can orchestrate and facilitate collaborative shipping in the Physical Internet.
KW - Collaborative Shipping
KW - Deep Reinforcement Learning
KW - Joint Replenishment Problem
KW - Machine Learning
KW - Physical Internet
KW - Proximal Policy Optimization
UR - http://www.scopus.com/inward/record.url?scp=85083654780&partnerID=8YFLogxK
U2 - 10.1016/j.compind.2020.103239
DO - 10.1016/j.compind.2020.103239
M3 - Article
AN - SCOPUS:85083654780
SN - 0166-3615
VL - 119
JO - Computers in Industry
JF - Computers in Industry
M1 - 103239
ER -