notstr's avatar
notstr 6 days ago
Key Tools for Poker/Blackjack These games require Reinforcement Learning methods like: MDPs/POMDPs (Markov Decision Processes for Blackjack; Partially Observable MDPs for Poker). Counterfactual Regret Minimization (CFR) for Poker. Deep RL (e.g., Q-learning with neural networks). ... Way out of my league

Replies (1)

notstr's avatar
notstr 6 days ago
Maybe start with a slot machine? (k-arm bandit problem)