Mitigating Value Hallucination in Dyna-Style Planning via Multistep Predecessor Models
Abstract
Index Terms
- Mitigating Value Hallucination in Dyna-Style Planning via Multistep Predecessor Models
Recommendations
Multi-step linear Dyna-style planning
NIPS'09: Proceedings of the 22nd International Conference on Neural Information Processing SystemsIn this paper we introduce a multi-step linear Dyna-style planning algorithm. The key element of the multi-step linear Dyna is a multi-step linear model that enables multi-step projection of a sampled feature and multi-step planning based on the ...
Dyna-style planning with linear function approximation and prioritized sweeping
UAI'08: Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial IntelligenceWe consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available after each interaction with the world. This paper develops an explicitly model-...
Dyna-MLAC: Trading Computational and Sample Complexities in Actor-Critic Reinforcement Learning
BRACIS '15: Proceedings of the 2015 Brazilian Conference on Intelligent Systems (BRACIS)Sampling and computation budgets are two of the key elements that determine the performance of a reinforcement learning algorithm. In essence, any reinforcement learning agent must sample the environment and perform some computation over the samples to ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
Publisher
AI Access Foundation
El Segundo, CA, United States
Publication History
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 60Total Downloads
- Downloads (Last 12 months)60
- Downloads (Last 6 weeks)24
Other Metrics
Citations
View Options
Get Access
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in