“Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers” (2026) Inspire Smart Systems, 1(1). Available at: https://inspirequill.org/index.php/inspireSmartSystems/article/view/9-25 (Accessed: 28 January 2026).