Human-AI Collaboration with Bandit Feedback

Ruijiang Gao, Maytal Saar-Tsechansky, Maria De-Arteaga, Ligong Han, Min Kyung Lee, Matthew Lease

Producció científica: Capítol de llibreContribució a congrés/conferènciaAvaluat per experts

19 Cites (Scopus)

Resum

Human-machine complementarity is important when neither the algorithm nor the human yield dominant performance across all instances in a given domain. Most research on algorithmic decision-making solely centers on the algorithm's performance, while recent work that explores human-machine collaboration has framed the decision-making problems as classification tasks. In this paper, we first propose and then develop a solution for a novel human-machine collaboration problem in a bandit feedback setting. Our solution aims to exploit the human-machine complementarity to maximize decision rewards. We then extend our approach to settings with multiple human decision makers. We demonstrate the effectiveness of our proposed methods using both synthetic and real human responses, and find that our methods outperform both the algorithm and the human when they each make decisions on their own. We also show how personalized routing in the presence of multiple human decision-makers can further improve the human-machine team performance.

Idioma originalAnglès
Títol de la publicacióProceedings of the 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
EditorsZhi-Hua Zhou
Lloc de publicacióMontreal
EditorInternational Joint Conferences on Artificial Intelligence
Pàgines1722-1728
Nombre de pàgines7
ISBN (electrònic)9780999241196
DOIs
Estat de la publicacióPublicada - d’ag. 2021
Publicat externament
Esdeveniment30th International Joint Conference on Artificial Intelligence, IJCAI 2021 - Virtual, Online, Canada
Durada: 19 d’ag. 202127 d’ag. 2021

Sèrie de publicacions

NomIJCAI International Joint Conference on Artificial Intelligence
ISSN (imprès)1045-0823

Conferència

Conferència30th International Joint Conference on Artificial Intelligence, IJCAI 2021
País/TerritoriCanada
CiutatVirtual, Online
Període19/08/2127/08/21

Fingerprint

Navegar pels temes de recerca de 'Human-AI Collaboration with Bandit Feedback'. Junts formen un fingerprint únic.

Com citar-ho