Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers. (2026). Inspire Smart Systems, 1(1). https://inspirequill.org/index.php/inspireSmartSystems/article/view/9-25