Pavlov

Navigation Menu
  • archive
  • about
  • contact

March 2019

This archive holds all posts form March 2019.

Interactive Bandits

Code, Contextual, Lablog, R, Research, Statistics • March 2, 2019 • Robin van Emden

To help students get a better feel for three of the most popular “multi-armed bandit” exploration/exploitation balancing strategies (Epsilon Greedy, Thompson Sampling, and Upper Confidence Bound), I combined my R package “contextual” with the versatile …

Copyright 2018 | Robin van Emden | Pavlov.tech