Open AccessPosted Content
Learning Strategy-Aware Linear Classifiers
TL;DR: The authors showed that Stackelberg and external regret for the problem of strategic classification are strongly incompatible, i.e., there exist worst-case scenarios where any sequence of actions providing sublinear external regret might result in linear Stackeberg regret and vice versa.
read more
Abstract: We address the question of repeatedly learning linear classifiers against agents who are strategically trying to game the deployed classifiers, and we use the Stackelberg regret to measure the performance of our algorithms. First, we show that Stackelberg and external regret for the problem of strategic classification are strongly incompatible: i.e., there exist worst-case scenarios, where any sequence of actions providing sublinear external regret might result in linear Stackelberg regret and vice versa. Second, we present a strategy-aware algorithm for minimizing the Stackelberg regret for which we prove nearly matching upper and lower regret bounds. Finally, we provide simulations to complement our theoretical analysis. Our results advance the growing literature of learning from revealed preferences, which has so far focused on "smoother" assumptions from the perspective of the learner and the agents respectively.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Causal Feature Discovery through Strategic Modification.
TL;DR: It is shown that even simple behavior on the learner's part allows her to simultaneously accurately recover which features have an impact on an agent's true label, and incentivize agents to invest in these impactful features, rather than in features that have no effect on their true label.
34
•Posted Content
Auctions Between Regret-Minimizing Agents.
Yoav Kolumbus,Noam Nisan +1 more
TL;DR: In this article, the authors analyze regret minimization algorithms in a repeated auction on behalf of their users and show that in second price auctions the players have incentives to mis-report their true valuations to their own learning agents, while in the first price auction it is a dominant strategy for all players to truthfully report their valuations.
9
Optimal Data Acquisition with Privacy-Aware Agents
01 Feb 2023
TL;DR: In this paper , the authors study the problem faced by a data analyst or platform that wishes to collect private data from privacy-aware agents and provide a semi-closed form characterization of the optimal choice of agent weights for the platform in two variants of their model.
7
•Proceedings Article
Classification with Few Tests through Self-Selection
Hanrui Zhang,Yu Cheng,Vincent Conitzer +2 more
- 18 May 2021
TL;DR: In this article, the authors study test-based binary classification, where a principal either accepts or rejects agents based on the outcomes they get in a set of tests, and they focus on the case where agents can be either "good" or "bad" (corresponding to their distribution over test outcomes).
5
Fundamental Bounds on Online Strategic Classification
Saba Ahmadi,Avrim Blum,Kunhe Yang +2 more
- 23 Feb 2023
TL;DR: In this article , the problem of online binary classification where strategic agents can manipulate their observable features in predefined ways, modeled by a manipulation graph, in order to receive a positive classification was studied.
References
•Book
Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems
Sébastien Bubeck,Nicolò Cesa-Bianchi +1 more
- 12 Dec 2012
TL;DR: In this article, the authors focus on regret analysis in the context of multi-armed bandit problems, where regret is defined as the balance between staying with the option that gave highest payoff in the past and exploring new options that might give higher payoffs in the future.
2.5K
Gambling in a rigged casino: The adversarial multi-armed bandit problem
Peter Auer,Nicolò Cesa-Bianchi,Yoav Freund,Robert E. Schapire +3 more
- 23 Oct 1995
TL;DR: A solution to the bandit problem in which an adversary, rather than a well-behaved stochastic process, has complete control over the payoffs is given.
Online convex optimization in the bandit setting: gradient descent without a gradient
Abraham D. Flaxman,Adam Tauman Kalai,H. Brendan McMahan +2 more
- 23 Jan 2005
TL;DR: It is possible to use gradient descent without seeing anything more than the value of the functions at a single point, and the guarantees hold even in the most general case: online against an adaptive adversary.
•Book
Introduction to Multi-Armed Bandits
Aleksandrs Slivkins
- 31 Oct 2019
TL;DR: This book provides a more introductory, textbook-like treatment of multi-armed bandits, providing a self-contained, teachable technical introduction and a brief review of the further developments.
Actionable Recourse in Linear Classification
TL;DR: An integer programming toolkit is presented to measure the feasibility and difficulty of recourse in a target population, and generate a list of actionable changes for a person to obtain a desired outcome, and illustrate how recourse can be significantly affected by common modeling practices.
571