“Reinforcement Learning Approach Based on Proximal Policy Optimization Algorithm for Efficient Last Mile Delivery Using Smart Lockers”. 2026. Inspire Smart Systems 1 (1). https://inspirequill.org/index.php/inspireSmartSystems/article/view/9-25.