News

Researchers at The University of Texas at Austin and Cognizant AI Labs have developed an AI-driven system that leverages 175 ...
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
Patrick MacAlpine and Peter Stone.
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Transfer Learning for Reinforcement Learning on a Physical Robot. Samuel Barrett, Matt E. Taylor, and Peter Stone. In Ninth International Conference on Autonomous Agents and Multiagent Systems - ...
Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions. The lack of action information both ...
UT Austin Villa RoboCup 3D Simulation Base Code Release. Patrick MacAlpine and Peter Stone. In Sven Behnke, Daniel D. Lee, Sanem Sariel, and Raymond Sheh, editors, RoboCup 2016: Robot Soccer World Cup ...
Recent work has shown that deep neural networks are capable of approximating both value functions and policies in reinforcement learning domains featuring continuous state and action spaces. However, to ...
To Teach or not to Teach? Decision Making Under Uncertainty in Ad Hoc Teams. Peter Stone and Sarit Kraus. In The Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), ...
Reasoning about Hypothetical Agent Behaviours and their Parameters. Stefano Albrecht and Peter Stone. In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems ...
Relaxed Exploration Constrained Reinforcement Learning. Shahaf S. Shperberg, Bo Liu, and Peter Stone. @InProceedings{shahaf_shperberg_AAMAS_2024, author = {Shahaf S. Shperberg and Bo Liu and Peter ...
Current approaches to learning cooperative multi-agent behaviors assume relatively restrictive settings. In standard fully cooperative multi-agent reinforcement learning, the learning algorithm controls ...