Notes on "Deep Bayesian Bandits Showdown" (Riquelme et al., 2018)
I probably overuse the normal-inverse-gamma posterior. Every time I build a bandit system, every time I need uncertainty quantification for sequential decisions, I end up back at conjugate linear regression.