Pluribus (poker bot)

Pluribus is a computer poker player using artificial intelligence built by Facebook's AI Lab and Carnegie Mellon University. Pluribus plays the poker variation no-limit Texas hold 'em and is "the first bot to beat humans in a complex multiplayer competition".[1]

According to the Pluribus creators, "Developing a superhuman AI for multiplayer poker was the widely recognized main remaining milestone" in computer poker prior to Pluribus. Pluribus relies on offline self-play to build a base strategy, but then continues to learn in real-time during its online play. The base strategy was computed in eight days, and at market rates would cost about $144 to produce, much smaller than contemporary superhuman game-playing milestones such as AlphaZero. In AI, two-player zero-sum games (such as heads-up hold'em) are usually won by approximating a Nash equilibrium strategy; however, this approach does not work for games with three or more players. Pluribus instead uses an approach which lacks strong theoretical guarantees, but nevertheless appears to work well empirically at defeating human players. Across the competitions, Pluribus won an average of over 30 milli big blinds per game. Pluribus' self-learned play style avoids "limping" (calling the big blind), and engages in "donk betting" (ending a round with a call and starting the next round by betting) more often than human experts do.[2]

Among expert poker players, Jason Les stated he felt "very hopeless. You don't feel like there’s anything you can do to win." Chris Ferguson stated "Pluribus is a very hard opponent to play against. It's really hard to pin him down on any kind of hand." Jimmy Chou stated "Whenever playing the bot, I feel like I pick up something new to incorporate into my game." In The Wall Street Journal, science editor Daniela Hernandez characterized Pluribus as "advanced at a key human skill — deception".[3][4]

Playing No-Limit Hold'em against five professional poker players, Pluribus won an average of $5 per hand with winnings of $1,000 per hour, which Facebook described as a "decisive margin of victory."[5][6]

Following the victory, the developers declined to release the source code, out of fear it would be misused to surreptitiously cheat against human poker players in online matches.[7][8]

References

  1. Solly, Meilan (15 July 2019). "This Poker-Playing A.I. Knows When to Hold 'Em and When to Fold 'Em". Smithsonian. Retrieved 23 February 2023.
  2. Brown, Noam; Sandholm, Tuomas (30 August 2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. Bibcode:2019Sci...365..885B. doi:10.1126/science.aay2400. PMID 31296650.
  3. Vincent, James (11 July 2019). "Facebook and CMU's 'superhuman' poker AI beats human pros". The Verge. Retrieved 4 February 2021.
  4. Hernandez, Daniela (11 July 2019). "Computers Can Now Bluff Like a Poker Champ. Better, Actually". Wall Street Journal. Retrieved 4 February 2021.
  5. Brown, Noam; Sandholm, Tuomas (2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. Bibcode:2019Sci...365..885B. doi:10.1126/science.aay2400. PMID 31296650. S2CID 195892791.
  6. Brown, Noam (11 July 2019). "Facebook, Carnegie Mellon build first AI that beats pros in 6-player poker". ai.facebook.com. Retrieved 23 February 2023.
  7. Ouellette, Jennifer (11 July 2019). "Facebook AI Pluribus defeats top poker professionals in 6-player Texas Hold 'em". Ars Technica.
  8. Knight, Will (11 July 2019). "Facebook's new poker-playing AI could wreck the online poker industry—so it's not being released". MIT Technology Review.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.