[1]
“Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers”, Inspire Smart Systems, vol. 1, no. 1, pp. 9–25, Jan. 2026, doi: 10.65718/inspireSS.2026.3002.