Search | arXiv e-print repository

Improving choice model specification using reinforcement learning

Authors: Gabriel Nova, Sander van Cranenburgh, Stephane Hess

Abstract: Discrete choice modelling is a theory-driven modelling framework for understanding and forecasting choice behaviour. To obtain behavioural insights, modellers test several competing model specifications in their attempts to discover the 'true' data generation process. This trial-and-error process requires expertise, is time-consuming, and relies on subjective theoretical assumptions. Although meta… ▽ More Discrete choice modelling is a theory-driven modelling framework for understanding and forecasting choice behaviour. To obtain behavioural insights, modellers test several competing model specifications in their attempts to discover the 'true' data generation process. This trial-and-error process requires expertise, is time-consuming, and relies on subjective theoretical assumptions. Although metaheuristics have been proposed to assist choice modellers, they treat model specification as a classic optimisation problem, relying on static strategies, applying predefined rules, and neglecting outcomes from previous estimated models. As a result, current metaheuristics struggle to prioritise promising search regions, adapt exploration dynamically, and transfer knowledge to other modelling tasks. To address these limitations, we introduce a deep reinforcement learning-based framework where an 'agent' specifies models by estimating them and receiving rewards based on goodness-of-fit and parsimony. Results demonstrate the agent dynamically adapts its strategies to identify promising specifications across data generation processes, showing robustness and potential transferability, without prior domain knowledge. △ Less

Submitted 6 June, 2025; originally announced June 2025.

Comments: 13 pages, 7 figures

arXiv:2506.03693 [pdf, ps, other]

Combine and conquer: model averaging for out-of-distribution forecasting

Authors: Stephane Hess, Sander van Cranenburgh

Abstract: Travel behaviour modellers have an increasingly diverse set of models at their disposal, ranging from traditional econometric structures to models from mathematical psychology and data-driven approaches from machine learning. A key question arises as to how well these different models perform in prediction, especially when considering trips of different characteristics from those used in estimatio… ▽ More Travel behaviour modellers have an increasingly diverse set of models at their disposal, ranging from traditional econometric structures to models from mathematical psychology and data-driven approaches from machine learning. A key question arises as to how well these different models perform in prediction, especially when considering trips of different characteristics from those used in estimation, i.e. out-of-distribution prediction, and whether better predictions can be obtained by combining insights from the different models. Across two case studies, we show that while data-driven approaches excel in predicting mode choice for trips within the distance bands used in estimation, beyond that range, the picture is fuzzy. To leverage the relative advantages of the different model families and capitalise on the notion that multiple `weak' models can result in more robust models, we put forward the use of a model averaging approach that allocates weights to different model families as a function of the \emph{distance} between the characteristics of the trip for which predictions are made, and those used in model estimation. Overall, we see that the model averaging approach gives larger weight to models with stronger behavioural or econometric underpinnings the more we move outside the interval of trip distances covered in estimation. Across both case studies, we show that our model averaging approach obtains improved performance both on the estimation and validation data, and crucially also when predicting mode choices for trips of distances outside the range used in estimation. △ Less

Submitted 4 June, 2025; originally announced June 2025.

arXiv:2411.01704 [pdf, ps, other]

Understanding the decision-making process of choice modellers

Authors: Gabriel Nova, Sander van Cranenburgh, Stephane Hess

Abstract: Discrete Choice Modelling serves as a robust framework for modelling human choice behaviour across various disciplines. Building a choice model is a semi structured research process that involves a combination of a priori assumptions, behavioural theories, and statistical methods. This complex set of decisions, coupled with diverse workflows, can lead to substantial variability in model outcomes.… ▽ More Discrete Choice Modelling serves as a robust framework for modelling human choice behaviour across various disciplines. Building a choice model is a semi structured research process that involves a combination of a priori assumptions, behavioural theories, and statistical methods. This complex set of decisions, coupled with diverse workflows, can lead to substantial variability in model outcomes. To better understand these dynamics, we developed the Serious Choice Modelling Game, which simulates the real world modelling process and tracks modellers' decisions in real time using a stated preference dataset. Participants were asked to develop choice models to estimate Willingness to Pay values to inform policymakers about strategies for reducing noise pollution. The game recorded actions across multiple phases, including descriptive analysis, model specification, and outcome interpretation, allowing us to analyse both individual decisions and differences in modelling approaches. While our findings reveal a strong preference for using data visualisation tools in descriptive analysis, it also identifies gaps in missing values handling before model specification. We also found significant variation in the modelling approach, even when modellers were working with the same choice dataset. Despite the availability of more complex models, simpler models such as Multinomial Logit were often preferred, suggesting that modellers tend to avoid complexity when time and resources are limited. Participants who engaged in more comprehensive data exploration and iterative model comparison tended to achieve better model fit and parsimony, which demonstrate that the methodological choices made throughout the workflow have significant implications, particularly when modelling outcomes are used for policy formulation. △ Less

Submitted 6 June, 2025; v1 submitted 3 November, 2024; originally announced November 2024.

Comments: 35 pages, 7 figures

arXiv:2404.13198 [pdf]

An economically-consistent discrete choice model with flexible utility specification based on artificial neural networks

Authors: Jose Ignacio Hernandez, Niek Mouter, Sander van Cranenburgh

Abstract: Random utility maximisation (RUM) models are one of the cornerstones of discrete choice modelling. However, specifying the utility function of RUM models is not straightforward and has a considerable impact on the resulting interpretable outcomes and welfare measures. In this paper, we propose a new discrete choice model based on artificial neural networks (ANNs) named "Alternative-Specific and Sh… ▽ More Random utility maximisation (RUM) models are one of the cornerstones of discrete choice modelling. However, specifying the utility function of RUM models is not straightforward and has a considerable impact on the resulting interpretable outcomes and welfare measures. In this paper, we propose a new discrete choice model based on artificial neural networks (ANNs) named "Alternative-Specific and Shared weights Neural Network (ASS-NN)", which provides a further balance between flexible utility approximation from the data and consistency with two assumptions: RUM theory and fungibility of money (i.e., "one euro is one euro"). Therefore, the ASS-NN can derive economically-consistent outcomes, such as marginal utilities or willingness to pay, without explicitly specifying the utility functional form. Using a Monte Carlo experiment and empirical data from the Swissmetro dataset, we show that ASS-NN outperforms (in terms of goodness of fit) conventional multinomial logit (MNL) models under different utility specifications. Furthermore, we show how the ASS-NN is used to derive marginal utilities and willingness to pay measures. △ Less

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2308.08276 [pdf]

Computer vision-enriched discrete choice models, with an application to residential location choice

Authors: Sander van Cranenburgh, Francisco Garrido-Valenzuela

Abstract: Visual imagery is indispensable to many multi-attribute decision situations. Examples of such decision situations in travel behaviour research include residential location choices, vehicle choices, tourist destination choices, and various safety-related choices. However, current discrete choice models cannot handle image data and thus cannot incorporate information embedded in images into their re… ▽ More Visual imagery is indispensable to many multi-attribute decision situations. Examples of such decision situations in travel behaviour research include residential location choices, vehicle choices, tourist destination choices, and various safety-related choices. However, current discrete choice models cannot handle image data and thus cannot incorporate information embedded in images into their representations of choice behaviour. This gap between discrete choice models' capabilities and the real-world behaviour it seeks to model leads to incomplete and, possibly, misleading outcomes. To solve this gap, this study proposes "Computer Vision-enriched Discrete Choice Models" (CV-DCMs). CV-DCMs can handle choice tasks involving numeric attributes and images by integrating computer vision and traditional discrete choice models. Moreover, because CV-DCMs are grounded in random utility maximisation principles, they maintain the solid behavioural foundation of traditional discrete choice models. We demonstrate the proposed CV-DCM by applying it to data obtained through a novel stated choice experiment involving residential location choices. In this experiment, respondents faced choice tasks with trade-offs between commute time, monthly housing cost and street-level conditions, presented using images. As such, this research contributes to the growing body of literature in the travel behaviour field that seeks to integrate discrete choice modelling and machine learning. △ Less

Submitted 16 August, 2023; originally announced August 2023.

arXiv:2104.10973 [pdf]

doi 10.1016/j.tra.2022.03.027

Traveller behaviour in public transport in the early stages of the COVID-19 pandemic in the Netherlands

Authors: Sanmay Shelat, Oded Cats, Sander van Cranenburgh

Abstract: Public transport ridership around the world has been hit hard by the COVID-19 pandemic. Travellers are likely to adapt their behaviour to avoid the risk of transmission and these changes may even be sustained after the pandemic. To evaluate travellers' behaviour in public transport networks during these times and assess how they will respond to future changes in the pandemic, we conduct a stated c… ▽ More Public transport ridership around the world has been hit hard by the COVID-19 pandemic. Travellers are likely to adapt their behaviour to avoid the risk of transmission and these changes may even be sustained after the pandemic. To evaluate travellers' behaviour in public transport networks during these times and assess how they will respond to future changes in the pandemic, we conduct a stated choice experiment with train travellers in the Netherlands. We specifically assess behaviour related to three criteria affecting the risk of COVID-19 transmission: (i) crowding, (ii) exposure duration, and (iii) prevalent infection rate. Observed choices are analysed using a latent class choice model which reveals two, nearly equally sized traveller segments: 'COVID Conscious' and 'Infection Indifferent'. The former has a significantly higher valuation of crowding, accepting, on average 8.75 minutes extra waiting time to reduce one person on-board. Moreover, they demonstrate a strong desire to sit without anybody in their neighbouring seat and are quite sensitive to changes in the prevalent infection rate. By contrast, Infection Indifferent travellers' value of crowding (1.04 waiting time minutes/person) is only slightly higher than pre-pandemic estimates and they are relatively unaffected by infection rates. We find that older and female travellers are more likely to be COVD Conscious while those reporting to use the trains more frequently during the pandemic tend to be Infection Indifferent. Further analysis also reveals differences between the two segments in attitudes towards the pandemic and self-reported rule-following behaviour. The behavioural insights from this study will not only contribute to better demand forecasting for service planning but will also inform public transport policy decisions aimed at curbing the shift to private modes. △ Less

Submitted 13 April, 2022; v1 submitted 22 April, 2021; originally announced April 2021.

Journal ref: Transp. Res. A: Policy Pract. 159 (2022) 357-371

arXiv:2101.11948 [pdf]

doi 10.1016/j.jocm.2021.100340

Choice modelling in the age of machine learning -- discussion paper

Authors: S. Van Cranenburgh, S. Wang, A. Vij, F. Pereira, J. Walker

Abstract: Since its inception, the choice modelling field has been dominated by theory-driven modelling approaches. Machine learning offers an alternative data-driven approach for modelling choice behaviour and is increasingly drawing interest in our field. Cross-pollination of machine learning models, techniques and practices could help overcome problems and limitations encountered in the current theory-dr… ▽ More Since its inception, the choice modelling field has been dominated by theory-driven modelling approaches. Machine learning offers an alternative data-driven approach for modelling choice behaviour and is increasingly drawing interest in our field. Cross-pollination of machine learning models, techniques and practices could help overcome problems and limitations encountered in the current theory-driven modelling paradigm, such as subjective labour-intensive search processes for model selection, and the inability to work with text and image data. However, despite the potential benefits of using the advances of machine learning to improve choice modelling practices, the choice modelling field has been hesitant to embrace machine learning. This discussion paper aims to consolidate knowledge on the use of machine learning models, techniques and practices for choice modelling, and discuss their potential. Thereby, we hope not only to make the case that further integration of machine learning in choice modelling is beneficial, but also to further facilitate it. To this end, we clarify the similarities and differences between the two modelling paradigms; we review the use of machine learning for choice modelling; and we explore areas of opportunities for embracing machine learning models and techniques to improve our practices. To conclude this discussion paper, we put forward a set of research questions which must be addressed to better understand if and how machine learning can benefit choice modelling. △ Less

Submitted 24 November, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

Comments: 40 pages, 2 tables, 0 figures

Journal ref: Journal of Choice Modelling 42 (2022): 100340

Showing 1–7 of 7 results for author: van Cranenburgh, S