Skip to content

hardikp/papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 

Repository files navigation

2018-11

  • Exploration by Random Network Distillation - arXiv
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding - arXiv

2018-05

2018-04

  • Measuring the Intrinsic Dimension of Objective Landscapes - arXiv
  • Prefrontal cortex as a meta-reinforcement learning system - BioArxiv

2018-03

  • An Analysis of Neural Language Modeling at Multiple Scales - arXiv
  • Averaging Weights Leads to Wider Optima and Better Generalization - arXiv
  • Machine Theory of Mind - arXiv
  • On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization - arXiv
  • Diversity is All You Need: Learning Skills without a Reward Function - arXiv

2017-12

  • Breaking the Softmax Bottleneck: A High-Rank RNN Language Model - arXiv
  • Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm - arXiv

2017-11

2017-10

  • Playing Atari with Deep Reinforcement Learning - arXiv - paper
  • Deep Reinforcement Learning: An Overview - arXiv
  • A Brief Survey of Deep Reinforcement Learning - arXiv
  • A Deep Reinforcement Learning Chatbot - arXiv

2017-09

2017-08

About

Summary of selected research papers

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published