Showing 1–18 of 18 results for author: Fiez, T

  1. arXiv:2506.03062

    cs.LG stat.ML

    Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation

    Authors: Qining Zhang, Tanner Fiez, Yi Liu, Wenyang Liu

    Abstract: Standard A/B tests in online experiments face statistical power challenges when testing multiple candidates simultaneously, while adaptive experimental designs (AED) alone fall short in inferring experiment statistics such as the average treatment effect, especially with many metrics (e.g., revenue, safety) and heterogeneous variances. This paper proposes a fixed-budget multi-metric AED framework…

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2409.13847

    cs.LG cs.IR

    Segment Discovery: Enhancing E-commerce Targeting

    Authors: Qiqi Li, Roopali Singh, Charin Polpanumas, Tanner Fiez, Namita Kumar, Shreya Chakrabarti

    Abstract: Modern e-commerce services frequently target customers with incentives or interventions to engage them in their products such as games, shopping, video streaming, etc. This customer engagement increases acquisition of more customers and retention of existing ones, leading to more business for the company while improving customer experience. Often, customers are either randomly targeted or targeted…

    Submitted 30 December, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

    Comments: Accepted at the CONSEQUENCES'24 workshop, co-located with ACM RecSys'24

  3. arXiv:2406.10738

    cs.LG stat.ME

    Adaptive Experimentation When You Can't Experiment

    Authors: Yao Zhao, Kwang-Sung Jun, Tanner Fiez, Lalit Jain

    Abstract: This paper introduces the \emph{confounded pure exploration transductive linear bandit} (\texttt{CPET-LB}) problem. As a motivating example, often online services cannot directly assign users to specific control or treatment experiences either for business or practical reasons. In these settings, naively comparing treatment and control groups that may result from self-selection can lead to biased…

    Submitted 15 June, 2024; originally announced June 2024.

  4. arXiv:2402.10870

    cs.LG stat.ME

    Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice

    Authors: Tanner Fiez, Houssam Nassif, Yu-Cheng Chen, Sergio Gamez, Lalit Jain

    Abstract: Adaptive experimental design (AED) methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. However, the behavior and guarantees of such methods are not well-understood beyond idealized stationary settings. This paper shares lessons learned regarding the challenges of naively using AED system…

    Submitted 26 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Journal ref: The Web Conference (WWW'24), Singapore, pp. 3586 - 3597, 2024

  5. Neural Insights for Digital Marketing Content Design

    Authors: Fanjie Kong, Yuan Li, Houssam Nassif, Tanner Fiez, Ricardo Henao, Shreya Chakrabarti

    Abstract: In digital marketing, experimenting with new website content is one of the key levers to improve customer engagement. However, creating successful marketing content is a manual and time-consuming process that lacks clear guiding principles. This paper seeks to close the loop between content creation and online experimentation by offering marketers AI-driven actionable insights based on historical…

    Submitted 7 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Knowledge Discovery and Data Mining (KDD'23), Long Beach, CA, pp. 4320-4332, 2023

  6. arXiv:2210.14369

    cs.LG stat.ME

    Adaptive Experimental Design and Counterfactual Inference

    Authors: Tanner Fiez, Sergio Gamez, Arick Chen, Houssam Nassif, Lalit Jain

    Abstract: Adaptive experimental design methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. This paper shares lessons learned regarding the challenges and pitfalls of naively using adaptive experimentation systems in industrial settings where non-stationarity is prevalent, while also providing pers…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: In Workshops of the Conference on Recommender Systems (RecSys), 2022

  7. arXiv:2111.03377

    cs.GT cs.LG cs.MA

    Online Learning in Periodic Zero-Sum Games

    Authors: Tanner Fiez, Ryann Sim, Stratis Skoulakis, Georgios Piliouras, Lillian Ratliff

    Abstract: A seminal result in game theory is von Neumann's minmax theorem, which states that zero-sum games admit an essentially unique equilibrium solution. Classical learning results build on this theorem to show that online no-regret dynamics converge to an equilibrium in a time-average sense in zero-sum games. In the past several years, a key research direction has focused on characterizing the day-to-d…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS 2021

  8. arXiv:2109.12286

    cs.LG

    Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

    Authors: Liyuan Zheng, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov, Lillian J. Ratliff

    Abstract: The hierarchical interaction between the actor and critic in actor-critic based reinforcement learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this viewpoint and model the actor and critic interaction as a two-player general-sum game with a leader-follower structure known as a Stackelberg game. Given this abstraction, we propose a meta-framework for Stackelbe…

    Submitted 25 September, 2021; originally announced September 2021.

  9. arXiv:2106.01488

    cs.LG cs.GT

    Minimax Optimization with Smooth Algorithmic Adversaries

    Authors: Tanner Fiez, Chi Jin, Praneeth Netrapalli, Lillian J. Ratliff

    Abstract: This paper considers minimax optimization $\min_x \max_y f(x, y)$ in the challenging setting where $f$ can be both nonconvex in $x$ and nonconcave in $y$. Though such optimization problems arise in many machine learning paradigms including training generative adversarial networks (GANs) and adversarially robust models, many fundamental issues remain in theory, such as the absence of efficiently co…

    Submitted 2 June, 2021; originally announced June 2021.

  10. arXiv:2012.08382

    cs.GT cs.LG cs.MA

    Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games

    Authors: Stratis Skoulakis, Tanner Fiez, Ryann Sim, Georgios Piliouras, Lillian Ratliff

    Abstract: The predominant paradigm in evolutionary game theory and more generally online learning in games is based on a clear distinction between a population of dynamic agents that interact given a fixed, static game. In this paper, we move away from the artificial divide between dynamic agents and static games, to introduce and analyze a large class of competitive settings where both the agents and the g…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: To appear in AAAI 2021

  11. arXiv:2009.14820

    cs.LG cs.GT eess.SY stat.ML

    Gradient Descent-Ascent Provably Converges to Strict Local Minmax Equilibria with a Finite Timescale Separation

    Authors: Tanner Fiez, Lillian Ratliff

    Abstract: We study the role that a finite timescale separation parameter $τ$ has on gradient descent-ascent in two-player non-convex, non-concave zero-sum games where the learning rate of player 1 is denoted by $γ_1$ and the learning rate of player 2 is defined to be $γ_2=τγ_1$. Existing work analyzing the role of timescale separation in gradient descent-ascent has primarily focused on the edge cases of pla…

    Submitted 30 September, 2020; originally announced September 2020.

  12. arXiv:2007.07079

    cs.AI cs.IR cs.LG

    A SUPER* Algorithm to Optimize Paper Bidding in Peer Review

    Authors: Tanner Fiez, Nihar B. Shah, Lillian Ratliff

    Abstract: A number of applications involve sequential arrival of users, and require showing each user an ordering of items. A prime example (which forms the focus of this paper) is the bidding process in conference peer review where reviewers enter the system sequentially, each reviewer needs to be shown the list of submitted papers, and the reviewer then "bids" to review some papers. The order of the paper…

    Submitted 31 July, 2020; v1 submitted 27 June, 2020; originally announced July 2020.

  13. arXiv:1906.08399

    stat.ML cs.LG

    Sequential Experimental Design for Transductive Linear Bandits

    Authors: Tanner Fiez, Lalit Jain, Kevin Jamieson, Lillian Ratliff

    Abstract: In this paper we introduce the transductive linear bandit problem: given a set of measurement vectors $\mathcal{X}\subset \mathbb{R}^d$, a set of items $\mathcal{Z}\subset \mathbb{R}^d$, a fixed confidence $δ$, and an unknown vector $θ^{\ast}\in \mathbb{R}^d$, the goal is to infer $\text{argmax}_{z\in \mathcal{Z}} z^\topθ^\ast$ with probability $1-δ$ by making as few sequentially chosen noisy meas…

    Submitted 19 June, 2019; originally announced June 2019.

  14. arXiv:1906.01217

    cs.GT cs.LG eess.SY

    Convergence of Learning Dynamics in Stackelberg Games

    Authors: Tanner Fiez, Benjamin Chasnov, Lillian J. Ratliff

    Abstract: This paper investigates the convergence of learning dynamics in Stackelberg games. In the class of games we consider, there is a hierarchical game being played between a leader and a follower with continuous action spaces. We establish a number of connections between the Nash and Stackelberg equilibrium concepts and characterize conditions under which attracting critical points of simultaneous gra…

    Submitted 6 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: This version includes numerical results training generative adversarial networks

    MSC Class: math.OC

  15. arXiv:1807.02297

    cs.LG cs.AI eess.SY stat.ML

    Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences

    Authors: Tanner Fiez, Shreyas Sekar, Liyuan Zheng, Lillian J. Ratliff

    Abstract: The design of personalized incentives or recommendations to improve user engagement is gaining prominence as digital platform providers continually emerge. We propose a multi-armed bandit framework for matching incentives to users, whose preferences are unknown a priori and evolving dynamically in time, in a resource constrained environment. We design an algorithm that combines ideas from three di…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: Published as a conference paper in Conference on Uncertainty in Artificial Intelligence (UAI) 2018

  16. arXiv:1806.05749

    cs.GT eess.SY

    Adaptive Incentive Design

    Authors: Lillian J. Ratliff, Tanner Fiez

    Abstract: We apply control theoretic and optimization techniques to adaptively design incentives. In particular, we consider the problem of a planner with an objective that depends on data from strategic decision makers. The planner does not know the process by which the strategic agents make decisions. Under the assumption that the agents are utility maximizers, we model their interactions as a non-coopera…

    Submitted 14 June, 2018; originally announced June 2018.

  17. arXiv:1803.04008

    cs.LG

    Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback

    Authors: Tanner Fiez, Shreyas Sekar, Lillian J. Ratliff

    Abstract: We study a multi-armed bandit problem in a dynamic environment where arm rewards evolve in a correlated fashion according to a Markov chain. Different than much of the work on related problems, in our formulation a learning algorithm does not have access to either a priori information or observations of the state of the Markov chain and only observes smoothed reward feedback following time interva…

    Submitted 1 March, 2019; v1 submitted 11 March, 2018; originally announced March 2018.

    Comments: Significant revision of prior version including deeper discussion of related work, gap-independent regret bounds, and regret bounds for discounted rewards

  18. arXiv:1702.06156

    cs.CY

    How Much Urban Traffic is Searching for Parking? Simulating Curbside Parking as a Network of Finite Capacity Queues

    Authors: Chase Dowling, Tanner Fiez, Lillian Ratliff, Baosen Zhang

    Abstract: With the increasing availability of transaction data collected by digital parking meters, paid curbside parking can be advantageously modeled as a network of interdependent queues. In this article we introduce methods for analyzing a special class of networks of finite capacity queues, where tasks arrive from an exogenous source, join the queue if there is an available server or are rejected and m…

    Submitted 11 May, 2018; v1 submitted 20 February, 2017; originally announced February 2017.

    Comments: Updated May 11, 2018 (fixed formatting errors)