Skip to main content

Showing 1–8 of 8 results for author: von Ehrenheim, V

Searching in archive cs. Search in all archives.
.
  1. Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask

    Authors: Zineb Senane, Lele Cao, Valentin Leonhard Buchner, Yusuke Tashiro, Lei You, Pawel Herman, Mats Nordahl, Ruibo Tu, Vilhelm von Ehrenheim

    Abstract: Time Series Representation Learning (TSRL) focuses on generating informative representations for various Time Series (TS) modeling tasks. Traditional Self-Supervised Learning (SSL) methods in TSRL fall into four main categories: reconstructive, adversarial, contrastive, and predictive, each with a common challenge of sensitivity to noise and intricate data nuances. Recently, diffusion-based method… ▽ More

    Submitted 17 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Published as a full paper by KDD 2024 Research Track (12 pages as main paper and 11 pages as appendix). Source code available at https://github.com/llcresearch/TSDE

    ACM Class: G.3; I.6.5; I.2.4

  2. arXiv:2309.16888  [pdf, other

    cs.LG cs.AI cs.CE q-fin.PM

    Beyond Gut Feel: Using Time Series Transformers to Find Investment Gems

    Authors: Lele Cao, Gustaf Halvardsson, Andrew McCornack, Vilhelm von Ehrenheim, Pawel Herman

    Abstract: This paper addresses the growing application of data-driven approaches within the Private Equity (PE) industry, particularly in sourcing investment targets (i.e., companies) for Venture Capital (VC) and Growth Capital (GC). We present a comprehensive review of the relevant approaches and propose a novel approach leveraging a Transformer-based Multivariate Time Series Classifier (TMTSC) for predict… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Published by ICANN (33rd International Conference on Artificial Neural Networks) 2024 as full paper (15 pages and 7 figures)

    Report number: EQT-Motherbrain-Research-2023SIT MSC Class: 91B84 (Primary) 68T07 (Secondary) ACM Class: I.2.6; I.2.1; H.4.0

  3. arXiv:2309.12075  [pdf, other

    cs.CL cs.AI

    Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation

    Authors: Valentin Leonhard Buchner, Lele Cao, Jan-Christoph Kalo, Vilhelm von Ehrenheim

    Abstract: Prompt Tuning is emerging as a scalable and cost-effective method to fine-tune Pretrained Language Models (PLMs), which are often referred to as Large Language Models (LLMs). This study benchmarks the performance and computational efficiency of Prompt Tuning and baselines for multi-label text classification. This is applied to the challenging task of classifying companies into an investment firm's… ▽ More

    Submitted 12 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by NAACL 2024 industry track (6 pages, 4 figures). Source code to be found at https://github.com/EQTPartners/PTEC

    MSC Class: 68T50 ACM Class: I.2.7; I.2.0

  4. arXiv:2306.10649  [pdf, other

    cs.AI cs.CE cs.DB cs.LG

    CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

    Authors: Lele Cao, Vilhelm von Ehrenheim, Mark Granroth-Wilding, Richard Anselmo Stahl, Andrew McCornack, Armin Catovic, Dhiana Deva Cavacanti Rocha

    Abstract: In the investment industry, it is often essential to carry out fine-grained company similarity quantification for a range of purposes, including market mapping, competitor analysis, and mergers and acquisitions. We propose and publish a knowledge graph, named CompanyKG, to represent and learn diverse company features and relations. Specifically, 1.17 million companies are represented as nodes enri… ▽ More

    Submitted 9 June, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: CompanyKG (version 1.x). Published by IEEE Transactions on Big Data (12 pages, 10 figures and 2 tables) + Appendix (9 pages, 1 figures and 4 tables). Code: https://github.com/EQTPartners/CompanyKG ; Data: https://zenodo.org/record/8010239

    Report number: CompanyKG-V01 MSC Class: 05C85; 05C12; 68T07; 68T50; 05C90 ACM Class: E.0; I.2.1; I.2.6; H.4.0; J.0; I.2.8; I.2.7

  5. arXiv:2306.03313  [pdf, other

    cs.CL cs.AI

    A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models

    Authors: Lele Cao, Vilhelm von Ehrenheim, Astrid Berghult, Cecilia Henje, Richard Anselmo Stahl, Joar Wandborg, Sebastian Stan, Armin Catovic, Erik Ferm, Hannes Ingelhag

    Abstract: The Private Equity (PE) firms operate investment funds by acquiring and managing companies to achieve a high return upon selling. Many PE funds are thematic, meaning investment professionals aim to identify trends by covering as many industry sectors as possible, and picking promising companies within these sectors. So, inferring sectors for companies is critical to the success of thematic PE fund… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by FinNLP (Financial Technology and Natural Language Processing) @ IJCAI2023 as long paper (8 pages and 8 figures)

    MSC Class: 68T50; 68T05 ACM Class: I.2.7; I.2.1

  6. arXiv:2210.14195  [pdf, other

    q-fin.CP cs.LG

    Using Deep Learning to Find the Next Unicorn: A Practical Synthesis

    Authors: Lele Cao, Vilhelm von Ehrenheim, Sebastian Krakowski, Xiaoxue Li, Alexandra Lutz

    Abstract: Startups often represent newly established business models associated with disruptive innovation and high scalability. They are commonly regarded as powerful engines for economic and social development. Meanwhile, startups are heavily constrained by many factors such as limited financial funding and human resources. Therefore, the chance for a startup to eventually succeed is as rare as "spotting… ▽ More

    Submitted 10 June, 2024; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: A condensed version is published by IJCAI 2024 Workshop on FinNLP and Muffin (48 pages, 18 figures). ACL Link: https://aclanthology.org/2023.finnlp-1.6

    MSC Class: 68T07 ACM Class: H.1.0

  7. Simulation-Informed Revenue Extrapolation with Confidence Estimate for Scaleup Companies Using Scarce Time-Series Data

    Authors: Lele Cao, Sonja Horn, Vilhelm von Ehrenheim, Richard Anselmo Stahl, Henrik Landgren

    Abstract: Investment professionals rely on extrapolating company revenue into the future (i.e. revenue forecast) to approximate the valuation of scaleups (private companies in a high-growth stage) and inform their investment decision. This task is manual and empirical, leaving the forecast quality heavily dependent on the investment professionals' experiences and insights. Furthermore, financial data on sca… ▽ More

    Submitted 26 September, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Published in CIKM 2022 as full paper (12 pages and 6 figures). For data and code, see https://github.com/EQTPartners/sire

  8. arXiv:2109.03155  [pdf, other

    cs.CL cs.AI cs.LG

    PAUSE: Positive and Annealed Unlabeled Sentence Embedding

    Authors: Lele Cao, Emil Larsson, Vilhelm von Ehrenheim, Dhiana Deva Cavalcanti Rocha, Anna Martin, Sonja Horn

    Abstract: Sentence embedding refers to a set of effective and versatile techniques for converting raw text into numerical vector representations that can be used in a wide range of natural language processing (NLP) applications. The majority of these techniques are either supervised or unsupervised. Compared to the unsupervised methods, the supervised ones make less assumptions about optimization objectives… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: Accepted by EMNLP 2021 main conference as long paper (12 pages and 2 figures). For source code, see https://github.com/EQTPartners/pause