Skip to main content

Showing 1–29 of 29 results for author: Bravo, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.06090  [pdf, other

    cs.LG

    Concept Bottleneck Language Models For protein design

    Authors: Aya Abdelsalam Ismail, Tuomas Oikarinen, Amy Wang, Julius Adebayo, Samuel Stanton, Taylor Joren, Joseph Kleinhenz, Allen Goodman, Héctor Corrada Bravo, Kyunghyun Cho, Nathan C. Frey

    Abstract: We introduce Concept Bottleneck Protein Language Models (CB-pLM), a generative masked language model with a layer where each neuron corresponds to an interpretable concept. Our architecture offers three key benefits: i) Control: We can intervene on concept values to precisely control the properties of generated proteins, achieving a 3 times larger change in desired concept values compared to basel… ▽ More

    Submitted 11 December, 2024; v1 submitted 9 November, 2024; originally announced November 2024.

  2. arXiv:2410.05177  [pdf, ps, other

    stat.ML cs.LG

    Are causal effect estimations enough for optimal recommendations under multitreatment scenarios?

    Authors: Sherly Alfonso-Sánchez, Kristina P. Sendova, Cristián Bravo

    Abstract: When making treatment selection decisions, it is essential to include a causal effect estimation analysis to compare potential outcomes under different treatments or controls, assisting in optimal selection. However, merely estimating individual treatment effects may not suffice for truly optimal decisions. Our study addressed this issue by incorporating additional criteria, such as the estimation… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 34 pages, 4 figures

    MSC Class: 62-07; 62P05

  3. arXiv:2407.15532  [pdf, other

    q-fin.PM cs.AI cs.SI q-fin.RM stat.ML

    Large-scale Time-Varying Portfolio Optimisation using Graph Attention Networks

    Authors: Kamesh Korangi, Christophe Mues, Cristián Bravo

    Abstract: Apart from assessing individual asset performance, investors in financial markets also need to consider how a set of firms performs collectively as a portfolio. Whereas traditional Markowitz-based mean-variance portfolios are widespread, network-based optimisation techniques offer a more flexible tool to capture complex interdependencies between asset values. However, most of the existing studies… ▽ More

    Submitted 3 February, 2025; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 39 pages, 10 figures, v2

  4. arXiv:2402.03966  [pdf, other

    cs.LG

    On dimensionality of feature vectors in MPNNs

    Authors: César Bravo, Alexander Kozachinskiy, Cristóbal Rojas

    Abstract: We revisit the classical result of Morris et al.~(AAAI'19) that message-passing graphs neural networks (MPNNs) are equal in their distinguishing power to the Weisfeiler--Leman (WL) isomorphism test. Morris et al.~show their simulation result with ReLU activation function and $O(n)$-dimensional feature vectors, where $n$ is the number of nodes of the graph. By introducing randomness into the arch… ▽ More

    Submitted 14 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 15 pages, 2 figures. Changes to the previous version: added reference to Amir et al.~(NeurIPS'23)

  5. arXiv:2402.00299  [pdf, other

    q-fin.GN cs.LG

    Attention-based Dynamic Multilayer Graph Neural Networks for Loan Default Prediction

    Authors: Sahab Zandi, Kamesh Korangi, María Óskarsdóttir, Christophe Mues, Cristián Bravo

    Abstract: Whereas traditional credit scoring tends to employ only individual borrower- or loan-level predictors, it has been acknowledged for some time that connections between borrowers may result in default risk propagating over a network. In this paper, we present a model for credit risk assessment leveraging a dynamic multilayer network built from a Graph Neural Network and a Recurrent Neural Network, e… ▽ More

    Submitted 24 June, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  6. arXiv:2311.07407  [pdf, other

    cs.CV

    Towards Automatic Honey Bee Flower-Patch Assays with Paint Marking Re-Identification

    Authors: Luke Meyers, Josué Rodríguez Cordero, Carlos Corrada Bravo, Fanfan Noel, José Agosto-Rivera, Tugrul Giray, Rémi Mégret

    Abstract: In this paper, we show that paint markings are a feasible approach to automatize the analysis of behavioral assays involving honey bees in the field where marking has to be as lightweight as possible. We contribute a novel dataset for bees re-identification with paint-markings with 4392 images and 27 identities. Contrastive learning with a ResNet backbone and triplet loss led to identity represent… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Paper 17, workshop "CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling", in conjunction with Computer Vision and Pattern Recognition (CVPR 2023), June 18, 2023, Vancouver, Canada

    ACM Class: I.4.8; I.4.9; J.3

  7. INFLECT-DGNN: Influencer Prediction with Dynamic Graph Neural Networks

    Authors: Elena Tiukhova, Emiliano Penaloza, María Óskarsdóttir, Bart Baesens, Monique Snoeck, Cristián Bravo

    Abstract: Leveraging network information for predictive modeling has become widespread in many domains. Within the realm of referral and targeted marketing, influencer detection stands out as an area that could greatly benefit from the incorporation of dynamic network representation due to the continuous evolution of customer-brand relationships. In this paper, we present INFLECT-DGNN, a new method for prof… ▽ More

    Submitted 10 September, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: 27 pages, 7 figures

    Journal ref: IEEE Access, 12, 115026-115041 (2024)

  8. Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning

    Authors: Sherly Alfonso-Sánchez, Jesús Solano, Alejandro Correa-Bahnsen, Kristina P. Sendova, Cristián Bravo

    Abstract: Reinforcement learning has been explored for many problems, from video games with deterministic environments to portfolio and operations management in which scenarios are stochastic; however, there have been few attempts to test these methods in banking problems. In this study, we sought to find and automatize an optimal credit card limit adjustment policy by employing reinforcement learning techn… ▽ More

    Submitted 16 February, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 29 pages, 16 figures

    Journal ref: Alfonso-Sanchez, S., Solano, J., Correa-Bahnsen, A., Sendova, K. P., & Bravo, C. (2024). Optimizing credit limit adjustments under adversarial goals using reinforcement learning. European Journal of Operational Research 315(2): 802-817

  9. Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams

    Authors: Mahsa Tavakoli, Rohitash Chandra, Fengrui Tian, Cristián Bravo

    Abstract: Knowing which factors are significant in credit rating assignment leads to better decision-making. However, the focus of the literature thus far has been mostly on structured data, and fewer studies have addressed unstructured or multi-modal datasets. In this paper, we present an analysis of the most effective architectures for the fusion of deep learning models for the prediction of company credi… ▽ More

    Submitted 25 November, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Journal ref: Applied Soft Computing, Volume 171, March 2025, 112771

  10. arXiv:2301.01212  [pdf, ps, other

    q-fin.RM cs.LG cs.SI

    Assessment of creditworthiness models privacy-preserving training with synthetic data

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Journal ref: Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469

  11. arXiv:2211.09664  [pdf, other

    cs.SI cs.AI cs.LG

    Influencer Detection with Dynamic Graph Neural Networks

    Authors: Elena Tiukhova, Emiliano Penaloza, María Óskarsdóttir, Hernan Garcia, Alejandro Correa Bahnsen, Bart Baesens, Monique Snoeck, Cristián Bravo

    Abstract: Leveraging network information for prediction tasks has become a common practice in many domains. Being an important part of targeted marketing, influencer detection can potentially benefit from incorporating dynamic network representation. In this work, we investigate different dynamic Graph Neural Networks (GNNs) configurations for influencer detection and evaluate their prediction performance u… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Conference workshop camera-ready paper - accepted at NeurIPS TGL 2022. 8 pages, 4 figures

  12. arXiv:2206.10074  [pdf, ps, other

    stat.CO cs.DM cs.SI

    Statistical network isomorphism

    Authors: Pierre Miasnikof, Alexander Y. Shestopaloff, Cristián Bravo, Yuri Lawryshyn

    Abstract: Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: 11 pages, two figures

  13. arXiv:2204.06122  [pdf, other

    cs.SI cs.LG

    On the dynamics of credit history and social interaction features, and their impact on creditworthiness assessment performance

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: For more than a half-century, credit risk management has used credit scoring models in each of its well-defined stages to manage credit risk. Application scoring is used to decide whether to grant a credit or not, while behavioral scoring is used mainly for portfolio management and to take preventive actions in case of default signals. In both cases, network data has recently been shown to be valu… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  14. Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction

    Authors: Matthew Stevenson, Christophe Mues, Cristián Bravo

    Abstract: LiDAR (short for "Light Detection And Ranging" or "Laser Imaging, Detection, And Ranging") technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes. To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains. However, the geographically granular and open-source nature of this data also lends itsel… ▽ More

    Submitted 1 August, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 29 pages, 13 figures. V2 - Published

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, 187, 378-392 (2022)

  15. arXiv:2111.14338  [pdf, other

    cs.CV cs.AI cs.LG

    Improving Deep Learning Interpretability by Saliency Guided Training

    Authors: Aya Abdelsalam Ismail, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gra… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Journal ref: Thirty-fifth Conference on Neural Information Processing Systems 2021

  16. On the combination of graph data for assessing thin-file borrowers' creditworthiness

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: The thin-file borrowers are customers for whom a creditworthiness assessment is uncertain due to their lack of credit history; many researchers have used borrowers' relationships and interactions networks in the form of graphs as an alternative data source to address this. Incorporating network data is traditionally made by hand-crafted feature engineering, and lately, the graph neural network has… ▽ More

    Submitted 16 September, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Journal ref: Expert Systems with Applications, 2022, 118809

  17. arXiv:2111.09902  [pdf, other

    q-fin.GN cs.CY cs.LG q-fin.RM

    A transformer-based model for default prediction in mid-cap corporate markets

    Authors: Kamesh Korangi, Christophe Mues, Cristián Bravo

    Abstract: In this paper, we study mid-cap companies, i.e. publicly traded companies with less than US $10 billion in market capitalisation. Using a large dataset of US mid-cap companies observed over 30 years, we look to predict the default probability term structure over the medium term and understand which data sources (i.e. fundamental, market or pricing data) contribute most to the default risk. Whereas… ▽ More

    Submitted 20 April, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: 38 pages, 6 figures, V4 published

    Journal ref: European Journal of Operational Research, 308, 306-320 (2023)

  18. Improving healthcare access management by predicting patient no-show behaviour

    Authors: David Barrera Ferro, Sally Brailsford, Cristián Bravo, Honora Smith

    Abstract: Low attendance levels in medical appointments have been associated with poor health outcomes and efficiency problems for service providers. To address this problem, healthcare managers could aim at improving attendance levels or minimizing the operational impact of no-shows by adapting resource allocation policies. However, given the uncertainty of patient behaviour, generating relevant informatio… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: v4 - 26 pages

    Journal ref: Decision Support Systems 138: 113398 (2020)

  19. arXiv:2010.13924  [pdf, other

    cs.LG stat.ML

    Benchmarking Deep Learning Interpretability in Time Series Predictions

    Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Re… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Journal ref: NeurIPS 2020

  20. Multilayer Network Analysis for Improved Credit Risk Prediction

    Authors: María Óskarsdóttir, Cristián Bravo

    Abstract: We present a multilayer network model for credit risk assessment. Our model accounts for multiple connections between borrowers (such as their geographic location and their economic activity) and allows for explicitly modelling the interaction between connected borrowers. We develop a multilayer personalized PageRank algorithm that allows quantifying the strength of the default exposure of any bor… ▽ More

    Submitted 26 July, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 24 pages, 15 figures. v4 - accepted

    Journal ref: Omega 105: 102520 (2021)

  21. arXiv:2005.14658  [pdf, other

    q-fin.GN cs.CY cs.LG stat.ML

    Super-App Behavioral Patterns in Credit Risk Models: Financial, Statistical and Regulatory Implications

    Authors: Luisa Roa, Alejandro Correa-Bahnsen, Gabriel Suarez, Fernando Cortés-Tejada, María A. Luque, Cristián Bravo

    Abstract: In this paper we present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. These alternative data sources have shown themselves to be immensely powerful in predicting borrower behavior in segments traditionally underserved by banks and financial institutions. Our results, validated across two countries,… ▽ More

    Submitted 4 January, 2021; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted - v2. 25 pages

    Journal ref: Expert Systems with Applications: 114486 (2020)

  22. arXiv:2005.12418  [pdf, other

    cs.SI cs.LG physics.soc-ph stat.ML

    Evolution of Credit Risk Using a Personalized Pagerank Algorithm for Multilayer Networks

    Authors: Cristián Bravo, María Óskarsdóttir

    Abstract: In this paper we present a novel algorithm to study the evolution of credit risk across complex multilayer networks. Pagerank-like algorithms allow for the propagation of an influence variable across single networks, and allow quantifying the risk single entities (nodes) are subject to given the connection they have to other nodes in the network. Multilayer networks, on the other hand, are network… ▽ More

    Submitted 10 August, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: Conference camera-ready paper - accepted at KDD MLF 2020. 15 pages, 10 figures

    Journal ref: Proceedings of the Third KDD Workshop on Machine Learning in Finance, joint with 26th ACM SIGKDD Conference on Knowledge Discovery in Databases (KDD MLF 2020). ACM, New York, NY, USA, 8 pages

  23. The value of text for small business default prediction: A deep learning approach

    Authors: Matthew Stevenson, Christophe Mues, Cristián Bravo

    Abstract: Compared to consumer lending, Micro, Small and Medium Enterprise (mSME) credit risk modelling is particularly challenging, as, often, the same sources of information are not available. Therefore, it is standard policy for a loan officer to provide a textual loan assessment to mitigate limited data availability. In turn, this statement is analysed by a credit expert alongside any available standard… ▽ More

    Submitted 7 July, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: 25 pages, 12 figures. v4 - Accepted

    Journal ref: European Journal of Operational Research 295 (2): 758-771 (2021)

  24. arXiv:2002.09931  [pdf, other

    cs.SI cs.CY cs.LG stat.ML

    The Value of Big Data for Credit Scoring: Enhancing Financial Inclusion using Mobile Phone Data and Social Network Analytics

    Authors: María Óskarsdóttir, Cristián Bravo, Carlos Sarraute, Jan Vanthienen, Bart Baesens

    Abstract: Credit scoring is without a doubt one of the oldest applications of analytics. In recent years, a multitude of sophisticated classification techniques have been developed to improve the statistical performance of credit scoring models. Instead of focusing on the techniques themselves, this paper leverages alternative data sources to enhance both statistical and economic model performance. The stud… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

    Journal ref: Applied Soft Computing, Volume 74, January 2019, Pages 26-39

  25. arXiv:2001.10994  [pdf, other

    cs.SI cs.CY

    Credit Scoring for Good: Enhancing Financial Inclusion with Smartphone-Based Microlending

    Authors: María Óskarsdóttir, Cristián Bravo, Carlos Sarraute, Bart Baesens, Jan Vanthienen

    Abstract: Globally, two billion people and more than half of the poorest adults do not use formal financial services. Consequently, there is increased emphasis on developing financial technology that can facilitate access to financial products for the unbanked. In this regard, smartphone-based microlending has emerged as a potential solution to enhance financial inclusion. We propose a methodology to impr… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: Thirty Ninth International Conference on Information Systems (ICIS), December 14, 2018, San Francisco, USA

  26. Social Network Analytics for Churn Prediction in Telco: Model Building, Evaluation and Network Architecture

    Authors: María Óskarsdóttir, Cristián Bravo, Wouter Verbeke, Carlos Sarraute, Bart Baesens, Jan Vanthienen

    Abstract: Social network analytics methods are being used in the telecommunication industry to predict customer churn with great success. In particular it has been shown that relational learners adapted to this specific problem enhance the performance of predictive models. In the current study we benchmark different strategies for constructing a relational learner by applying them to a total of eight dist… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

    Journal ref: Expert Systems with Applications, Volume 85, 1 November 2017, Pages 204-220

  27. A Comparative Study of Social Network Classifiers for Predicting Churn in the Telecommunication Industry

    Authors: Maria Óskarsdóttir, Cristián Bravo, Wouter Verbeke, Carlos Sarraute, Bart Baesens, Jan Vanthienen

    Abstract: Relational learning in networked data has been shown to be effective in a number of studies. Relational learners, composed of relational classifiers and collective inference methods, enable the inference of nodes in a network given the existence and strength of links to other nodes. These methods have been adapted to predict customer churn in telecommunication companies showing that incorporating… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

  28. arXiv:1910.12370  [pdf, other

    cs.LG stat.ML

    Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks

    Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Luiz Pessoa, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Journal ref: Neurips 2019

  29. arXiv:1804.06776  [pdf, other

    cs.LG stat.ML

    Improving Long-Horizon Forecasts with Expectation-Biased LSTM Networks

    Authors: Aya Abdelsalam Ismail, Timothy Wood, Héctor Corrada Bravo

    Abstract: State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon ex… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.