CS Seminar – Aadirupa Saha

CS Seminar – Aadirupa Saha

Speaker: Aadirupa Saha, Toyota Technological Institute at Chicago

Bio: Aadirupa is visiting faculty at TTI Chicago. Before this, she was a postdoctoral researcher at Microsoft Research New York City. She obtained her Ph.D. from the Department of Computer Science, Indian Institute of Science, Bangalore, advised by Aditya Gopalan and Chiranjib Bhattacharyya. Aadirupa was an intern at Microsoft Research, Bangalore, Inria, Paris, and Google AI, Mountain View.

Her research interests include Bandits, Reinforcement Learning, Optimization, Learning theory, Algorithms. Off late, she is also very interested in working on problems in the intersection of ML and Game theory, Algorithmic fairness, and Privacy.

Logistics:

Date: Wednesday, Oct 19, 2022
Location: Northwestern University
Room: Mudd Library 3514
Time: 12:00 pm ( Chicago Time)
Panopto: Click here to watch the recording

Titles and Abstracts:

Title: Battling Bandits: Exploiting Preference Feedback towards Efficient Information Aggregation

Abstract: Studies have revealed that users often find it easier to elicit their preferences in terms of relative feedback, say “Do you prefer Item A over B?”, rather than their absolute counterparts: “How much do you score items A and B on a scale of [0-10]?”. Drawing inspirations, in the search for an effective feedback mechanism, this led to the famous formulation of Dueling Bandits (DB), which is a widely studied online learning framework for efficient information aggregation from relative / comparative feedback. However despite the novel objective, unfortunately, most of the existing DB techniques were limited only to simpler settings of finite decision spaces, and stochastic environments, which are unrealistic in practice.

In this talk, we will start with the basic problem formulations for DB and familiarize ourselves with some of the breakthrough results. Following this, will dive deeper into a more practical framework of contextual dueling bandits (C-DB) where the goal of the learner is to make customized predictions based on the user contexts: We will see a new algorithmic approach that can efficiently achieve the optimal regret performance for this problem, resolving an open problem from Dudík et al. [COLT, 2015]. We will conclude the talk with some interesting open problems.

[The discussion on C-DB setup is based on joint work with Akshay Krishnamurthy (MSR, NYC), ALT 2022]

IDEAL

The Institute for Data, Econometrics, Algorithms, and Learning

CS Seminar – Aadirupa Saha