Search | arXiv e-print repository

Learning in Random Utility Models Via Online Decision Problems

Abstract: This paper examines the Random Utility Model (RUM) in repeated stochastic choice settings where decision-makers lack full information about payoffs. We propose a gradient-based learning algorithm that embeds RUM into an online decision-making framework. Our analysis establishes Hannan consistency for a broad class of RUMs, meaning the average regret relative to the best fixed action in hindsight v… ▽ More This paper examines the Random Utility Model (RUM) in repeated stochastic choice settings where decision-makers lack full information about payoffs. We propose a gradient-based learning algorithm that embeds RUM into an online decision-making framework. Our analysis establishes Hannan consistency for a broad class of RUMs, meaning the average regret relative to the best fixed action in hindsight vanishes over time. We also show that our algorithm is equivalent to the Follow-The-Regularized-Leader (FTRL) method, offering an economically grounded approach to online optimization. Applications include modeling recency bias and characterizing coarse correlated equilibria in normal-form games △ Less

Submitted 19 June, 2025; originally announced June 2025.

arXiv:2504.08783 [pdf, other]

Mensuração da Transferência de Riqueza em Planos de Contribuição Definida com a Marcação de Ativos na Curva

Authors: Eduardo Fraga L. de Melo, Rodrigo S. Targino

Abstract: The methodology for measuring financial assets in defined contribution (DC) pension plans has significant implications whether wealth transfers will occur among participants. In December 2024, a regulatory act was issued for Closed Pension Entities, allowing the use of the hold-to-maturity (HTM) measurement method of treasury bonds in DC plans. This article quantifies the financial impact on parti… ▽ More The methodology for measuring financial assets in defined contribution (DC) pension plans has significant implications whether wealth transfers will occur among participants. In December 2024, a regulatory act was issued for Closed Pension Entities, allowing the use of the hold-to-maturity (HTM) measurement method of treasury bonds in DC plans. This article quantifies the financial impact on participants of adopting HTM valuation in these plans, using real data from the term structure of the real interest rates to assess the resulting wealth transfers. The analysis highlights how HTM valuation creates asymmetries in financial outcomes, benefiting some participants at the expense of others. Wealth transfers occur both during any withdrawal of funds and at the time of contributions, including portfolio reallocations that involve buying or selling bonds. Partial use of HTM or attempts to immunize outflows do not completely eliminate wealth transfers. The results reinforce that the use of mark-to-market (MTM) valuation of assets in DC plans prevents wealth transfers and, consequently, financial losses for participants. O método de mensuração de ativos financeiros em planos de previdência na modalidade de contribuição definida (CD, ou contribuição variável CV, na fase de acumulação) tem implicações significativas se haverá transferência de riqueza entre os participantes. Em Dez/2024 foi publicada norma para as Entidades Fechadas de Previdência Complementar possibilitando o uso da marcação na curva de títulos públicos federais nos planos CD e CV na fase de acumulação. Este artigo quantifica o impacto financeiro nos participantes da adoção da marcação na curva (HTM {\it Hold to Maturity}) nestes planos, utilizando dados reais da estrutura a termo da taxa de juros de cupom de IPCA para avaliar as transferências de riqueza resultantes dessa adoção. A análise evidencia como a marcação na curva gera assimetrias nos resultados financeiros, beneficiando alguns participantes em detrimento de outros. As transferências de riqueza ocorrem tanto em qualquer retirada de recursos quanto também na entrada (contribuições), inclusive realocações da carteira que impliquem venda ou compra de títulos. O uso do HTM de forma parcial ou a tentativa de imunização de saídas não eliminam por completo transferências de riqueza. Os resultados reforçam que, para fins de cotização, o uso da marcação a mercado (MTM {\it Mark to Market}) de ativos em planos CD (e CV na fase de diferimento) evita transferências de riqueza e, por consequência, prejuízos financeiros aos seus participantes. △ Less

Submitted 5 April, 2025; originally announced April 2025.

Comments: in Portuguese language

arXiv:2402.01892 [pdf, other]

Censored Beliefs and Wishful Thinking

Authors: Jarrod Burgh, Emerson Melo

Abstract: We present a model elucidating wishful thinking, which comprehensively incorporates both the costs and benefits associated with biased beliefs. Our findings reveal that wishful thinking behavior can be characterized as equivalent to superquantile-utility maximization within the domain of threshold beliefs distortion cost functions. By leveraging this equivalence, we establish WT as driving decisio… ▽ More We present a model elucidating wishful thinking, which comprehensively incorporates both the costs and benefits associated with biased beliefs. Our findings reveal that wishful thinking behavior can be characterized as equivalent to superquantile-utility maximization within the domain of threshold beliefs distortion cost functions. By leveraging this equivalence, we establish WT as driving decision-makers to exhibit a preference for choices characterized by skewness and increased risk. Furthermore, we discuss how our framework facilitates the study of optimistic stochastic choice and optimistic risk aversion. △ Less

Submitted 27 January, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: This paper, previously circulated under the title "Wishful Thinking is Risky Thinking: A Statistical-Distance Based Approach,'' has been divided into two separate and complementary works: "Wishful Thinking is Risky Thinking" [arXiv:2307.02422] and "Censored Beliefs and Wishful Thinking."

arXiv:2307.02422 [pdf, other]

Wishful Thinking is Risky Thinking

Authors: Jarrod Burgh, Emerson Melo

Abstract: We develop a model of wishful thinking that incorporates the costs and benefits of biased beliefs. We establish the connection between distorted beliefs and risk, revealing how wishful thinking can be understood in terms of risk measures. Our model accommodates extreme beliefs, allowing wishful-thinking decision-makers to assign zero probability to undesirable states and positive probability to ot… ▽ More We develop a model of wishful thinking that incorporates the costs and benefits of biased beliefs. We establish the connection between distorted beliefs and risk, revealing how wishful thinking can be understood in terms of risk measures. Our model accommodates extreme beliefs, allowing wishful-thinking decision-makers to assign zero probability to undesirable states and positive probability to otherwise impossible states. △ Less

Submitted 2 February, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

Comments: This paper, previously circulated under the title "Wishful Thinking is Risky Thinking: A Statistical-Distance Based Approach," has been divided into two separate and complementary works: "Wishful Thinking is Risky Thinking" and "Censored Beliefs and Wishful Thinking."

arXiv:2303.05888 [pdf, other]

A Distributionally Robust Random Utility Model

Authors: David Müller, Emerson Melo, Ruben Schlotter

Abstract: This paper introduces the distributionally robust random utility model (DRO-RUM), which allows the preference shock (unobserved heterogeneity) distribution to be misspecified or unknown. We make three contributions using tools from the literature on robust optimization. First, by exploiting the notion of distributionally robust social surplus function, we show that the DRO-RUM endogenously generat… ▽ More This paper introduces the distributionally robust random utility model (DRO-RUM), which allows the preference shock (unobserved heterogeneity) distribution to be misspecified or unknown. We make three contributions using tools from the literature on robust optimization. First, by exploiting the notion of distributionally robust social surplus function, we show that the DRO-RUM endogenously generates a shock distributionthat incorporates a correlation between the utilities of the different alternatives. Second, we show that the gradient of the distributionally robust social surplus yields the choice probability vector. This result generalizes the celebrated William-Daly-Zachary theorem to environments where the shock distribution is unknown. Third, we show how the DRO-RUM allows us to nonparametrically identify the mean utility vector associated with choice market data. This result extends the demand inversion approach to environments where the shock distribution is unknown or misspecified. We carry out several numerical experiments comparing the performance of the DRO-RUM with the traditional multinomial logit and probit models. △ Less

Submitted 10 March, 2023; originally announced March 2023.

arXiv:2208.03370

On the Distributional Robustness of Finite Rational Inattention Models

Authors: Emerson Melo

Abstract: In this paper we study a rational inattention model in environments where the decision maker faces uncertainty about the true prior distribution over states. The decision maker seeks to select a stochastic choice rule over a finite set of alternatives that is robust to prior ambiguity. We fully characterize the distributional robustness of the rational inattention model in terms of a tractable con… ▽ More In this paper we study a rational inattention model in environments where the decision maker faces uncertainty about the true prior distribution over states. The decision maker seeks to select a stochastic choice rule over a finite set of alternatives that is robust to prior ambiguity. We fully characterize the distributional robustness of the rational inattention model in terms of a tractable concave program. We establish necessary and sufficient conditions to construct robust consideration sets. Finally, we quantify the impact of prior uncertainty, by introducing the notion of \emph{Worst-Case Sensitivity}. △ Less

Submitted 4 May, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

Comments: Outdated version

arXiv:2112.10993 [pdf, ps, other]

Learning in Random Utility Models Via Online Decision Problems

Authors: Emerson Melo

Abstract: This paper studies the Random Utility Model (RUM) in a repeated stochastic choice situation, in which the decision maker is imperfectly informed about the payoffs of each available alternative. We develop a gradient-based learning algorithm by embedding the RUM into an online decision problem. We show that a large class of RUMs are Hannan consistent (\citet{Hahn1957}); that is, the average differe… ▽ More This paper studies the Random Utility Model (RUM) in a repeated stochastic choice situation, in which the decision maker is imperfectly informed about the payoffs of each available alternative. We develop a gradient-based learning algorithm by embedding the RUM into an online decision problem. We show that a large class of RUMs are Hannan consistent (\citet{Hahn1957}); that is, the average difference between the expected payoffs generated by a RUM and that of the best-fixed policy in hindsight goes to zero as the number of periods increase. In addition, we show that our gradient-based algorithm is equivalent to the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the machine learning literature to model learning in repeated stochastic choice problems. Thus, we provide an economically grounded optimization framework to the FTRL algorithm. Finally, we apply our framework to study recency bias, no-regret learning in normal form games, and prediction markets. △ Less

Submitted 12 August, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

arXiv:2010.02398 [pdf, other]

A Recursive Logit Model with Choice Aversion and Its Application to Transportation Networks

Authors: Austin Knies, Jorge Lorca, Emerson Melo

Abstract: We propose a recursive logit model which captures the notion of choice aversion by imposing a penalty term that accounts for the dimension of the choice set at each node of the transportation network. We make three contributions. First, we show that our model overcomes the correlation problem between routes, a common pitfall of traditional logit models, and that the choice aversion model can be se… ▽ More We propose a recursive logit model which captures the notion of choice aversion by imposing a penalty term that accounts for the dimension of the choice set at each node of the transportation network. We make three contributions. First, we show that our model overcomes the correlation problem between routes, a common pitfall of traditional logit models, and that the choice aversion model can be seen as an alternative to these models. Second, we show how our model can generate violations of regularity in the path choice probabilities. In particular, we show that removing edges in the network may decrease the probability for existing paths. Finally, we show that under the presence of choice aversion, adding edges to the network can make users worse off. In other words, a type of Braess's paradox can emerge outside of congestion and can be characterized in terms of a parameter that measures users' degree of choice aversion. We validate these contributions by estimating this parameter over GPS traffic data captured on a real-world transportation network. △ Less

Submitted 18 October, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

Comments: 58 pages, 12 figures, 6 tables; forthcoming at Transportation Research Part B: Methodological

arXiv:1709.09117 [pdf, ps, other]

Discrete Choice and Rational Inattention: a General Equivalence Result

Authors: Mogens Fosgerau, Emerson Melo, Andre de Palma, Matthew Shum

Abstract: This paper establishes a general equivalence between discrete choice and rational inattention models. Matejka and McKay (2015, AER) showed that when information costs are modelled using the Shannon entropy function, the resulting choice probabilities in the rational inattention model take the multinomial logit form. By exploiting convex-analytic properties of the discrete choice model, we show tha… ▽ More This paper establishes a general equivalence between discrete choice and rational inattention models. Matejka and McKay (2015, AER) showed that when information costs are modelled using the Shannon entropy function, the resulting choice probabilities in the rational inattention model take the multinomial logit form. By exploiting convex-analytic properties of the discrete choice model, we show that when information costs are modelled using a class of generalized entropy functions, the choice probabilities in any rational inattention model are observationally equivalent to some additive random utility discrete choice model and vice versa. Thus any additive random utility model can be given an interpretation in terms of boundedly rational behavior. This includes empirically relevant specifications such as the probit and nested logit models. △ Less

Submitted 26 September, 2017; originally announced September 2017.

Showing 1–9 of 9 results for author: Melo, E