首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > 其他 > beliefbox

beliefbox

  • 资源大小:10.92 MB
  • 上传时间:2021-06-30
  • 下载次数:0次
  • 浏览次数:0次
  • 资源积分:1积分
  • 标      签:

资 源 简 介

Statistical inference and planning Markov decision processes and algorithms for reinforcement learning. Some highlights include: Bayesian estimators including: - Parametric conjugate distributions (e.g. Dirichlet/Multinomial) - Non-parametric methods (Gaussian processes, various tree models) - Approximate Bayesian Computation (ABC) - Various (problem-specific) Monte-Carlo samplers. (Approximate) dynamic programming algorithms - Backwards induction / value iteration - Policy iteration - Rollout sampling policy iteration - Least-Squares Policy Iteration - Least-Squares Temporal Differences - Fitted Value / Q - iteration. Reinforcement learning algorithms: Stochastic approximators (Q-learning, Sarsa and various generalisations) Upper-confidence bound algorihtms (UCB/UCRL) Bayesian algorithms (Thompson sampling, Upper/Lower Bayesian Bound algorithms) Gradient-based Bellman error minimisation (GBRL) Example rl-glue

文 件 列 表

beliefbox-r943
dat
bin
Doxyfile
lib
src
inc
Makefile
README
VIP VIP
  • Zzz 1天前 成为了本站会员

  • Katou Megumi 1天前 成为了本站会员

  • 1天前 成为了本站会员

  • 流浪 1天前 成为了本站会员

  • 也是一生 1天前 成为了本站会员

  • king666 2天前 成为了本站会员

  • ﹏約啶℡ 2天前 成为了本站会员

  • Long for 2天前 成为了本站会员

  • 2天前 成为了本站会员

  • 金. 2天前 成为了本站会员

0.185310s