TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.
Design and Optimization of an Omnidirectional Humanoid Walk:A Winning Approach at the RoboCup 2011 3D Simulation Competition. Patrick MacAlpine, Samuel Barrett, Daniel Urieli, Victor Vu, and Peter ...
Classically, imitation learning algorithms have been developed for idealized situations, e.g., the demonstrations are often required to be collected in the exact same environment and usually include ...
Recent work has shown that deep neural networks are capable ofapproximating both value functions and policies in reinforcementlearning domains featuring continuous state and actionspaces. However, to ...
Congestion is one of the biggest challenges faced by the transportation community, accounting for an estimated 87.2 billion dollars in losses in 2007 alone. As such, transportation professionals need ...
Transfer learning is a method where an agent reuses knowledge learned in a source task to improve learning on a target task. Recent work has shown that transfer learning can be extended to the idea of ...
As president in my sophomore year, CSB tried to do a couple of things. One was social bonding—bringing everyone together through socials and having fun. The other part was sourcing career networking ...
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...
Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000. @Article(MASsurvey ...
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration. Tobias Jung and Peter Stone. @InProceedings{ECML10-jung, author = "Tobias Jung and Peter Stone", title = ...