“Reinforcement Learning Approach Based on Proximal Policy Optimization Algorithm for Efficient Last Mile Delivery Using Smart Lockers.” 2026. Inspire Smart Systems Journal 1 (1): 9–25. https://doi.org/10.65718/inspireSS.2026.3002.