“Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers” (2026) Inspire Smart Systems Journal, 1(1), pp. 9–25. doi:10.65718/inspireSS.2026.3002.