Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers. (2026). Inspire Smart Systems Journal, 1(1), 9-25. https://doi.org/10.65718/inspireSS.2026.3002