“Reinforcement Learning Approach Based on Proximal Policy Optimization Algorithm for Efficient Last Mile Delivery Using Smart Lockers”. Inspire Smart Systems Journal 1, no. 1 (January 1, 2026): 9–25. Accessed March 15, 2026. https://inspirequill.org/index.php/inspireSmartSystems/article/view/15.