Yeah I reset the vibe and it worked. My original prompt was way too complicated. Thank you
Replies (3)
Less is more
Key Tools for Poker/Blackjack
These games call for reinforcement learning methods like:
MDPs/POMDPs (Markov Decision Processes for Blackjack; Partially Observable MDPs for Poker).
Counterfactual Regret Minimization (CFR) for Poker.
Deep RL (e.g., Q-learning with neural networks).
...
Way out of my league
Maybe start with a slot machine? (the k-armed bandit problem)
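For anyone who wants to try the slot machine idea, here is a rough sketch of an epsilon-greedy agent on a k-armed bandit. Nothing in it comes from the thread: the function name `run_bandit`, the payout probabilities, and the `epsilon` value are all made up for illustration, and each arm is assumed to pay 1 with a fixed probability and 0 otherwise.

```python
import random


def run_bandit(payout_probs, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy play on a k-armed bandit with 0/1 payouts.

    payout_probs is a hypothetical list of win probabilities, one per arm.
    The agent never sees these values; it only sees the rewards it collects.
    """
    rng = random.Random(seed)
    k = len(payout_probs)
    counts = [0] * k        # how many times each arm was pulled
    estimates = [0.0] * k   # running average reward per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick a random arm.
            arm = rng.randrange(k)
        else:
            # Exploit: pick the arm with the best estimate so far.
            arm = max(range(k), key=lambda a: estimates[a])

        # Pull the arm: reward is 1 with the arm's (hidden) probability.
        reward = 1 if rng.random() < payout_probs[arm] else 0

        # Incremental update of the sample mean for this arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = run_bandit([0.2, 0.5, 0.8])
    print("estimated payouts:", [round(e, 2) for e in estimates])
    print("pulls per arm:", counts)
    print("total reward:", total)
```

There is no state to track here, just arms and rewards, which is why it makes a gentler first step than Blackjack or Poker. Raising `epsilon` means more exploring, and lowering it means leaning harder on whichever arm currently looks best.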