Reinforcement learning approach based on Proximal Policy Optimization algorithm for efficient last mile delivery using Smart Lockers. Inspire Smart Systems, [S. l.], v. 1, n. 1, 2026. Disponível em: https://inspirequill.org/index.php/inspireSmartSystems/article/view/9-25. Acesso em: 28 jan. 2026.