Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers. Inspire Smart Systems Journal, [S. l.], v. 1, n. 1, p. 9–25, 2026. DOI: 10.65718/inspireSS.2026.3002. Disponível em: https://inspirequill.org/index.php/inspireSmartSystems/article/view/15. Acesso em: 15 mar. 2026.