Showing 1–26 of 26 results for author: Lessmann, S

Searching in archive cs.
  1. arXiv:2412.09232  [pdf, other]

    cs.LG

    Uplift modeling with continuous treatments: A predict-then-optimize approach

    Authors: Simon De Vos, Christopher Bockel-Rickermann, Stefan Lessmann, Wouter Verbeke

    Abstract: The goal of uplift modeling is to recommend actions that optimize specific outcomes by determining which entities should receive treatment. One common approach involves two steps: first, an inference step that estimates conditional average treatment effects (CATEs), and second, an optimization step that ranks entities based on their CATE values and assigns treatment to the top k within a given bud…

    Submitted 20 May, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  2. arXiv:2411.14463  [pdf, other]

    cs.CL cs.AI econ.GN

    Leveraging AI and NLP for Bank Marketing: A Systematic Review and Gap Analysis

    Authors: Christopher Gerling, Stefan Lessmann

    Abstract: This paper explores the growing impact of AI and NLP in bank marketing, highlighting their evolving roles in enhancing marketing strategies, improving customer engagement, and creating value within this sector. While AI and NLP have been widely studied in general marketing, there is a notable gap in understanding their specific applications and potential within the banking sector. This research ad…

    Submitted 17 November, 2024; originally announced November 2024.

  3. arXiv:2411.03372  [pdf, other]

    cs.LG

    Energy Price Modelling: A Comparative Evaluation of four Generations of Forecasting Methods

    Authors: Alexandru-Victor Andrei, Georg Velev, Filip-Mihai Toma, Daniel Traian Pele, Stefan Lessmann

    Abstract: Energy is a critical driver of modern economic systems. Accurate energy price forecasting plays an important role in supporting decision-making at various levels, from operational purchasing decisions at individual business organizations to policy-making. A significant body of literature has looked into energy price forecasting, investigating a wide range of methods to improve accuracy and inform…

    Submitted 5 November, 2024; originally announced November 2024.

  4. arXiv:2409.19377  [pdf, other]

    stat.ML cs.LG

    Interpretable, multi-dimensional Evaluation Framework for Causal Discovery from observational i.i.d. Data

    Authors: Georg Velev, Stefan Lessmann

    Abstract: Nonlinear causal discovery from observational data imposes strict identifiability assumptions on the formulation of structural equations utilized in the data generating process. The evaluation of structure learning methods under assumption violations requires a rigorous and interpretable approach, which quantifies both the structural similarity of the estimation with the ground truth and the capac…

    Submitted 16 December, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

  5. arXiv:2407.13009  [pdf, other]

    stat.ML cs.LG

    Fighting Sampling Bias: A Framework for Training and Evaluating Credit Scoring Models

    Authors: Nikita Kozodoi, Stefan Lessmann, Morteza Alamgir, Luis Moreira-Matias, Konstantinos Papakonstantinou

    Abstract: Scoring models support decision-making in financial institutions. Their estimation and evaluation are based on the data of previously accepted applicants with known repayment behavior. This creates sampling bias: the available labeled data offers a partial picture of the distribution of candidate borrowers, which the model is supposed to score. The paper addresses the adverse effect of sampling bi…

    Submitted 17 July, 2024; originally announced July 2024.

  6. arXiv:2403.15886  [pdf, other]

    cs.CL cs.AI cs.LG

    Leveraging Zero-Shot Prompting for Efficient Language Model Distillation

    Authors: Lukas Vöge, Vincent Gurgul, Stefan Lessmann

    Abstract: This paper introduces a novel approach for efficiently distilling LLMs into smaller, application-specific models, significantly reducing operational costs and manual labor. Addressing the challenge of deploying computationally intensive LLMs in specific applications or edge devices, this technique utilizes LLMs' reasoning capabilities to generate labels and natural language rationales for unlabele…

    Submitted 23 March, 2024; originally announced March 2024.

  7. arXiv:2312.05234  [pdf, other]

    stat.ML cs.LG

    The impact of heteroskedasticity on uplift modeling

    Authors: Björn Bokelmann, Stefan Lessmann

    Abstract: There are various applications where companies need to decide to which individuals they should best allocate treatment. To support such decisions, uplift models are applied to predict treatment effects on an individual level. Based on the predicted treatment effects, individuals can be ranked and treatment allocation can be prioritized according to this ranking. An implicit assumption, which has…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures

  8. arXiv:2311.14759  [pdf, other]

    q-fin.ST cs.LG stat.ML

    Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data

    Authors: Vincent Gurgul, Stefan Lessmann, Wolfgang Karl Härdle

    Abstract: We introduce novel approaches to cryptocurrency price forecasting, leveraging Machine Learning (ML) and Natural Language Processing (NLP) techniques, with a focus on Bitcoin and Ethereum. By analysing news and social media content, primarily from Twitter and Reddit, we assess the impact of public sentiment on cryptocurrency markets. A distinctive feature of our methodology is the application of th…

    Submitted 25 October, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

  9. arXiv:2308.02680  [pdf, other]

    cs.CY cs.LG

    Fair Models in Credit: Intersectional Discrimination and the Amplification of Inequity

    Authors: Savina Kim, Stefan Lessmann, Galina Andreeva, Michael Rovatsos

    Abstract: The increasing usage of new data sources and machine learning (ML) technology in credit modeling raises concerns with regard to potentially unfair decision-making that relies on protected characteristics (e.g., race, sex, age) or other socio-economic and demographic data. The authors demonstrate the impact of such algorithmic bias in the microfinance context. Difficulties in assessing credit are di…

    Submitted 1 August, 2023; originally announced August 2023.

  10. arXiv:2307.11845  [pdf, other]

    cs.CL cs.AI cs.CV q-fin.CP

    Multimodal Document Analytics for Banking Process Automation

    Authors: Christopher Gerling, Stefan Lessmann

    Abstract: Traditional banks face increasing competition from FinTechs in the rapidly evolving financial ecosystem. Raising operational efficiency is vital to address this challenge. Our study aims to improve the efficiency of document-intensive business processes in banking. To that end, we first review the landscape of business documents in the retail segment. Banking documents often contain text, layout,…

    Submitted 26 November, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: A Preprint

  11. The Deep Promotion Time Cure Model

    Authors: Victor Medina-Olivares, Stefan Lessmann, Nadja Klein

    Abstract: We propose a novel method for predicting time-to-event in the presence of cure fractions based on flexible survival models integrated into a deep neural network framework. Our approach allows for non-linear relationships and high-dimensional interactions between covariates and survival and is suitable for large-scale applications. Furthermore, we allow the method to incorporate an identified pred…

    Submitted 19 May, 2023; originally announced May 2023.

  12. arXiv:2211.00921  [pdf, other]

    q-fin.RM cs.LG

    A Data-driven Case-based Reasoning in Bankruptcy Prediction

    Authors: Wei Li, Wolfgang Karl Härdle, Stefan Lessmann

    Abstract: There has been intensive research regarding machine learning models for predicting bankruptcy in recent years. However, the lack of interpretability limits their growth and practical implementation. This study proposes a data-driven explainable case-based reasoning (CBR) system for bankruptcy prediction. Empirical results from a comparative study show that the proposed approach performs superior t…

    Submitted 2 November, 2022; originally announced November 2022.

  13. arXiv:2204.05781  [pdf]

    q-fin.ST cs.LG

    Forecasting Cryptocurrency Returns from Sentiment Signals: An Analysis of BERT Classifiers and Weak Supervision

    Authors: Duygu Ider, Stefan Lessmann

    Abstract: Anticipating price developments in financial markets is a topic of continued interest in forecasting. Fueled by advancements in deep learning and natural language processing (NLP) together with the availability of vast amounts of textual data in the form of news articles, social media postings, etc., an increasing number of studies incorporate text-based predictors in forecasting models. We contribu…

    Submitted 19 March, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 29 pages

  14. arXiv:2112.08060  [pdf, other]

    cs.LG cs.CV stat.ML

    Leveraging Image-based Generative Adversarial Networks for Time Series Generation

    Authors: Justin Hellermann, Stefan Lessmann

    Abstract: Generative models for images have gained significant attention in computer vision and natural language processing due to their ability to generate realistic samples from complex data distributions. To leverage the advances of image-based generative models for the time series domain, we propose a two-dimensional image representation for time series, the Extended Intertemporal Return Plot (XIRP). Ou…

    Submitted 31 August, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

  15. arXiv:2111.11344  [pdf, other]

    cs.LG stat.ML

    Modeling Irregular Time Series with Continuous Recurrent Units

    Authors: Mona Schirmer, Mazin Eltayeb, Stefan Lessmann, Maja Rudolph

    Abstract: Recurrent neural networks (RNNs) are a popular choice for modeling sequential data. Modern RNN architectures assume constant time-intervals between observations. However, in many datasets (e.g. medical records) observation times are irregular and can carry important information. To address this challenge, we propose continuous recurrent units (CRUs) -- a neural architecture that can naturally hand…

    Submitted 26 July, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted at ICML 2022, Baltimore, Maryland

  16. arXiv:2105.14599  [pdf]

    cs.IR

    Personalization in E-Grocery: Top-N versus Top-k Rankings

    Authors: Franziska Scherpinski, Stefan Lessmann

    Abstract: Business success in e-commerce depends on customer perceived value. A customer with high perceived value buys, returns, and recommends items. The perceived value is at risk whenever the information load harms users' shopping experience. In e-grocery, shoppers face an overwhelming number of items, the majority of which is irrelevant for the shopper. Recommender systems (RS) enable businesses to mas…

    Submitted 30 May, 2021; originally announced May 2021.

    MSC Class: 62P20 ACM Class: H.3.3

  17. arXiv:2103.01907  [pdf, other]

    stat.ML cs.LG q-fin.RM

    Fairness in Credit Scoring: Assessment, Implementation and Profit Implications

    Authors: Nikita Kozodoi, Johannes Jacob, Stefan Lessmann

    Abstract: The rise of algorithmic decision-making has spawned much research on fair machine learning (ML). Financial institutions use ML for building risk scorecards that support a range of credit-related decisions. Yet, the literature on fair ML in credit scoring is scarce. The paper makes three contributions. First, we revisit statistical fairness criteria and examine their adequacy for credit scoring. Se…

    Submitted 17 June, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to European Journal of Operational Research

  18. arXiv:2101.03336  [pdf]

    cs.LG

    Interpretable Multiple Treatment Revenue Uplift Modeling

    Authors: Robin M. Gubela, Stefan Lessmann

    Abstract: Big data and business analytics are critical drivers of business and societal transformations. Uplift models support a firm's decision-making by predicting the change of a customer's behavior due to a treatment. Prior work examines models for single treatments and binary customer responses. The paper extends corresponding approaches by developing uplift models for multiple treatments and continuou…

    Submitted 9 January, 2021; originally announced January 2021.

    Journal ref: Proceedings of the 26th Americas Conference on Information Systems (AMCIS 2020)

  19. arXiv:2008.09202  [pdf, other]

    cs.LG

    Conditional Wasserstein GAN-based Oversampling of Tabular Data for Imbalanced Learning

    Authors: Justin Engelmann, Stefan Lessmann

    Abstract: Class imbalance is a common problem in supervised learning and impedes the predictive performance of classification models. Popular countermeasures include oversampling the minority class. Standard methods like SMOTE rely on finding nearest neighbours and linear interpolations which are problematic in case of high-dimensional, complex data distributions. Generative Adversarial Networks (GANs) have…

    Submitted 20 August, 2020; originally announced August 2020.

  20. Response Transformation and Profit Decomposition for Revenue Uplift Modeling

    Authors: Robin M. Gubela, Stefan Lessmann, Szymon Jaroszewicz

    Abstract: Uplift models support decision-making in marketing campaign planning. Estimating the causal effect of a marketing treatment, an uplift model facilitates targeting communication to responsive customers and efficient allocation of marketing budgets. Research into uplift models focuses on conversion models to maximize incremental sales. The paper introduces uplift modeling strategies for maximizing i…

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: 53 pages including online appendix

    Journal ref: European Journal of Operational Research 2019

  21. arXiv:1910.00393  [pdf, other]

    cs.LG stat.AP stat.ML

    Affordable Uplift: Supervised Randomization in Controlled Experiments

    Authors: Johannes Haupt, Daniel Jacob, Robin M. Gubela, Stefan Lessmann

    Abstract: Customer scoring models are the core of scalable direct marketing. Uplift models provide an estimate of the incremental benefit from a treatment that is used for operational decision-making. Training and monitoring of uplift models require experimental data. However, the collection of data under randomized treatment assignment is costly, since random targeting deviates from an established targetin…

    Submitted 1 October, 2019; originally announced October 2019.

    MSC Class: 68U35

  22. arXiv:1909.11114  [pdf, other]

    stat.AP cs.LG stat.ML

    Churn Prediction with Sequential Data and Deep Neural Networks. A Comparative Analysis

    Authors: C. Gary Mena, Arno De Caigny, Kristof Coussement, Koen W. De Bock, Stefan Lessmann

    Abstract: Off-the-shelf machine learning algorithms for prediction such as regularized logistic regression cannot exploit the information of time-varying features without previously using an aggregation procedure of such sequential data. However, recurrent neural networks provide an alternative approach by which time-varying features can be readily used for modeling. This paper assesses the performance of n…

    Submitted 24 September, 2019; originally announced September 2019.

  23. arXiv:1909.06108  [pdf, ps, other]

    stat.ML cs.LG q-fin.RM

    Shallow Self-Learning for Reject Inference in Credit Scoring

    Authors: Nikita Kozodoi, Panagiotis Katsas, Stefan Lessmann, Luis Moreira-Matias, Konstantinos Papakonstantinou

    Abstract: Credit scoring models support loan approval decisions in the financial services industry. Lenders train these models on data from previously granted credit applications, where the borrowers' repayment behavior has been observed. This approach creates sample bias. The scoring model (i.e., classifier) is trained on accepted cases only. Applying the resulting model to screen credit applications from…

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: Preprint of the paper accepted to ECML PKDD 2019

    Journal ref: ECML PKDD 2019. Lecture Notes in Computer Science, vol 11908. Springer, Cham

  24. arXiv:1901.01726  [pdf]

    cs.SE

    Evaluating software defect prediction performance: an updated benchmarking study

    Authors: Libo Li, Stefan Lessmann, Bart Baesens

    Abstract: Accurately predicting faulty software units helps practitioners target faulty units and prioritize their efforts to maintain software quality. Prior studies use machine-learning models to detect faulty software code. We revisit past studies and point out potential improvements. Our new study proposes a revised benchmarking configuration. The configuration considers many new dimensions, such as cla…

    Submitted 7 January, 2019; originally announced January 2019.

  25. arXiv:1812.06175  [pdf, other]

    q-fin.RM cs.LG stat.AP

    Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting

    Authors: Yaodong Yang, Alisa Kolesnikova, Stefan Lessmann, Tiejun Ma, Ming-Chien Sung, Johnnie E. V. Johnson

    Abstract: The paper examines the potential of deep learning to support decisions in financial risk management. We develop a deep learning model for predicting whether individual spread traders secure profits from future trades. This task embodies typical modeling challenges faced in risk and behavior forecasting. Conventional machine learning requires data that is representative of the feature-target relati…

    Submitted 17 November, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: Within the "equal" contribution, Yaodong Yang contributed the core deep learning algorithm along with its experimental results, and the first draft of the manuscript (including Figures 1, 2, 3, 4, 7, 8, 9, and 11, and Table 3)

  26. Robust identification of email tracking: A machine learning approach

    Authors: Johannes Haupt, Benedict Bender, Benjamin Fabian, Stefan Lessmann

    Abstract: Email tracking allows email senders to collect fine-grained behavior and location data on email recipients, who are uniquely identifiable via their email address. Such tracking invades user privacy in that email tracking techniques gather data without user consent or awareness. Striving to increase privacy in email communication, this paper develops a detection engine to be the core of a selective…

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: Accepted publication, In press, European Journal of Operational Research, 2018