CMU-CS-22-119
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-22-119

Using intuitive behavior models to
adapt to and work with human teammates in Hanabi

Arnav Mahajan

M.S. Thesis

May 2022

CMU-CS-22-119.pdf


Keywords: Human-Computer Interaction, Artificial Intelligence, Behavioral Models

An agent that can rapidly and accurately model its teammate is a powerful tool in the field of Collaborative AI. Furthermore, if an approximation for this goal was possible in the field of Human-AI Collaboration, teams of people and machines could be more efficient and effective immediately after starting to work together. Using the cooperative card game Hanabi as a testbed, we developed the Chief agent, which models teammates using a pool of intuitive behavioral models. To achieve the goal of rapid learning, it uses Bayesian inference to quickly evaluate the different models relative to each other. To generate an accurate model, it uses historical data augmented by up-to-date knowledge and sampling methods to handle environmental noise and unknowns. We demonstrate that the Chief's mechanisms for modeling and understanding the teammate show promise, but the overall performance still can use improvement to reliably outperform a solution which skips inferring a best strategy and assumes all strategies in the pool are equally likely for the teammate.

54 pages

Thesis Committee:
Reid Simmons (Advisor)
Pat Virtue

Srinivasan Seshan, Head, Computer Science Department
Martial Hebert, Dean, School of Computer Science


Return to: SCS Technical Report Collection
School of Computer Science

This page maintained by [email protected]