Skip to main content

Showing 1–17 of 17 results for author: Mara, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.07653  [pdf, ps, other

    cs.CL

    JobHop: A Large-Scale Dataset of Career Trajectories

    Authors: Iman Johary, Raphael Romero, Alexandru C. Mara, Tijl De Bie

    Abstract: Understanding labor market dynamics is essential for policymakers, employers, and job seekers. However, comprehensive datasets that capture real-world career trajectories are scarce. In this paper, we introduce JobHop, a large-scale public dataset derived from anonymized resumes provided by VDAB, the public employment service in Flanders, Belgium. Utilizing Large Language Models (LLMs), we process… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2410.18417  [pdf, other

    cs.CL cs.LG

    Large Language Models Reflect the Ideology of their Creators

    Authors: Maarten Buyl, Alexander Rogiers, Sander Noels, Guillaume Bied, Iris Dominguez-Catena, Edith Heiter, Iman Johary, Alexandru-Cristian Mara, Raphaël Romero, Jefrey Lijffijt, Tijl De Bie

    Abstract: Large language models (LLMs) are trained on vast amounts of data to generate natural language, enabling them to perform tasks like text summarization and question answering. These models have become popular in artificial intelligence (AI) assistants like ChatGPT and already play an influential role in how humans access information. However, the behavior of LLMs varies depending on their design, tr… ▽ More

    Submitted 30 January, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  3. arXiv:2209.08064  [pdf, other

    cs.LG cs.SI

    A Systematic Evaluation of Node Embedding Robustness

    Authors: Alexandru Mara, Jefrey Lijffijt, Stephan Günnemann, Tijl De Bie

    Abstract: Node embedding methods map network nodes to low dimensional vectors that can be subsequently used in a variety of downstream prediction tasks. The popularity of these methods has grown significantly in recent years, yet, their robustness to perturbations of the input data is still poorly understood. In this paper, we assess the empirical robustness of node embedding models to random and adversaria… ▽ More

    Submitted 30 November, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

  4. arXiv:2005.10701  [pdf, other

    cs.SI cs.LG stat.ML

    CSNE: Conditional Signed Network Embedding

    Authors: Alexandru Mara, Yoosof Mashayekhi, Jefrey Lijffijt, Tijl De Bie

    Abstract: Signed networks are mathematical structures that encode positive and negative relations between entities such as friend/foe or trust/distrust. Recently, several papers studied the construction of useful low-dimensional representations (embeddings) of these networks for the prediction of missing relations or signs. Existing embedding methods for sign prediction generally enforce different notions o… ▽ More

    Submitted 25 May, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

  5. Benchmarking Network Embedding Models for Link Prediction: Are We Making Progress?

    Authors: Alexandru Mara, Jefrey Lijffijt, Tijl De Bie

    Abstract: Network embedding methods map a network's nodes to vectors in an embedding space, in such a way that these representations are useful for estimating some notion of similarity or proximity between pairs of nodes in the network. The quality of these node representations is then showcased through results of downstream prediction tasks. Commonly used benchmark tasks such as link prediction, however, p… ▽ More

    Submitted 3 September, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

  6. Block-Approximated Exponential Random Graphs

    Authors: Florian Adriaens, Alexandru Mara, Jefrey Lijffijt, Tijl De Bie

    Abstract: An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs. By utilizing fast matrix block-approximation techniques, we propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions, while being able to meaningfully model both local information of the graph (e.g.,… ▽ More

    Submitted 26 August, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: Accepted for DSAA 2020 conference

  7. Semi-supervised Learning in Network-Structured Data via Total Variation Minimization

    Authors: Alexander Jung, Alfred O. Hero III, Alexandru Mara, Saeed Jahromi, Ayelet Heimowitz, Yonina C. Eldar

    Abstract: We propose and analyze a method for semi-supervised learning from partially-labeled network-structured data. Our approach is based on a graph signal recovery interpretation under a clustering hypothesis that labels of data points belonging to the same well-connected subset (cluster) are similar valued. This lends naturally to learning the labels by total variation (TV) minimization, which we solve… ▽ More

    Submitted 2 November, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  8. EvalNE: A Framework for Evaluating Network Embeddings on Link Prediction

    Authors: Alexandru Mara, Jefrey Lijffijt, Tijl De Bie

    Abstract: In this paper we present EvalNE, a Python toolbox for evaluating network embedding methods on link prediction tasks. Link prediction is one of the most popular choices for evaluating the quality of network embeddings. However, the complexity of this task requires a carefully designed evaluation pipeline in order to provide consistent, reproducible and comparable results. EvalNE simplifies this pro… ▽ More

    Submitted 22 January, 2019; originally announced January 2019.

  9. arXiv:1709.01402  [pdf, other

    stat.ML cs.LG

    Recovery Conditions and Sampling Strategies for Network Lasso

    Authors: Alexandru Mara, Alexander Jung

    Abstract: The network Lasso is a recently proposed convex optimization method for machine learning from massive network structured datasets, i.e., big data over networks. It is a variant of the well-known least absolute shrinkage and selection operator (Lasso), which is underlying many methods in learning and signal processing involving sparse models. Highly scalable implementations of the network Lasso can… ▽ More

    Submitted 3 September, 2017; originally announced September 2017.

    Comments: nominated as student paper award finalist at Asilomar 2017. arXiv admin note: substantial text overlap with arXiv:1704.02107

  10. arXiv:1612.01414  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Learning via Sparse Label Propagation

    Authors: Alexander Jung, Alfred O. Hero III, Alexandru Mara, Saeed Jahromi

    Abstract: This work proposes a novel method for semi-supervised learning from partially labeled massive network-structured datasets, i.e., big data over networks. We model the underlying hypothesis, which relates data points to labels, as a graph signal, defined over some graph (network) structure intrinsic to the dataset. Following the key principle of supervised learning, i.e., similar inputs yield simila… ▽ More

    Submitted 15 May, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

  11. arXiv:1611.00714  [pdf, ps, other

    cs.LG cs.DC

    Scalable Semi-Supervised Learning over Networks using Nonsmooth Convex Optimization

    Authors: Alexander Jung, Alfred O. Hero III, Alexandru Mara, Sabeur Aridhi

    Abstract: We propose a scalable method for semi-supervised (transductive) learning from massive network-structured datasets. Our approach to semi-supervised learning is based on representing the underlying hypothesis as a graph signal with small total variation. Requiring a small total variation of the graph signal representing the underlying hypothesis corresponds to the central smoothness assumption that… ▽ More

    Submitted 2 November, 2016; originally announced November 2016.

  12. arXiv:1212.5599  [pdf

    cs.CE

    Elaboration of a new tool for weather data sequences generation

    Authors: Laetitia Adelard, Thierry Alex Mara, Harry Boyer, Jean Claude Gatina

    Abstract: This paper deals about the presentation of a new software RUNEOLE used to provide weather data in buildings physics. RUNEOLE associates three modules leading to the description, the modelling and the generation of weather data. The first module is dedicated to the description of each climatic variable included in the database. Graphic representation is possible (with histograms for example). Mathe… ▽ More

    Submitted 21 December, 2012; originally announced December 2012.

    Comments: Available from http://www.ibpsa.org/m_bs1999.asp; IBPSA'99, International Building Performance Association, Tokyo : Japan (1999)

  13. arXiv:1212.5594  [pdf

    cs.NE cs.CE

    Black box modelling of HVAC system : improving the performances of neural networks

    Authors: Eric Fock, Thierry Alex Mara, Alfred Jean Philippe Lauret, Harry Boyer

    Abstract: This paper deals with neural networks modelling of HVAC systems. In order to increase the neural networks performances, a method based on sensitivity analysis is applied. The same technique is also used to compute the relevance of each input. To avoid the prediction errors in dry coil conditions, a metamodel for each capacity is derived from the neural networks. The regression coefficients of the… ▽ More

    Submitted 21 December, 2012; originally announced December 2012.

    Comments: Eighth International IBPSA Conference, Eindhoven : Netherlands (2003); Proceedings available at http://www.ibpsa.org/m_bs2003.asp

  14. arXiv:1212.5263  [pdf

    cs.CE

    Use of BESTEST procedure to improve a building thermal simulation program

    Authors: Ted Soubdhan, Thierry Alex Mara, Harry Boyer, Anis Younès

    Abstract: Validation of building energy simulation programs is of major interest to both users and modellers. To achieve such a task, it is essential to apply a methodology based on a priori test and empirical validation. A priori test consists in verifying that models embedded in a program and their implementation are correct. this should be achieved before carrying out experiments. The aim of this report… ▽ More

    Submitted 20 December, 2012; originally announced December 2012.

    Journal ref: World Renewable Energy Congress VI Renewables: The Energy for the 21st Century World Renewable Energy Congress VI 1-7 July 2000 Brighton, UK, Elsevier (Ed.) (2000) 1800-1803

  15. arXiv:1212.5255  [pdf

    cs.CE

    A Comparison between CODYRUN and TRNSYS, simulation models for thermal buildings behaviour

    Authors: Franck Lucas, Thierry Alex Mara, François Garde, Harry Boyer

    Abstract: Simulation codes of thermal behaviour could significantly improve housing construction design. Among the existing software, CODYRUN and TRNSYS are calculations codes of different conceptions. CODYRUN is exclusively dedicated to housing thermal behaviour, whereas TRNSYS is more generally used on any thermal system. The purpose of this article is to compare these two instruments in two different con… ▽ More

    Submitted 18 December, 2012; originally announced December 2012.

    Journal ref: Word Renewable Energy Congress, Florence : Italy (1998)

  16. arXiv:1212.3928  [pdf

    cs.CE

    A validation methodology aid for improving a thermal building model: Case of diffuse radiation accounting in a tropical climate

    Authors: A. J. P. Lauret, T. A. Mara, H. Boyer, L. Adelard, F. Garde

    Abstract: As part of our efforts to complete the software CODYRUN validation, we chose as test building a block of flats constructed in Reunion Island, which has a humid tropical climate. The sensitivity analysis allowed us to study the effects of both diffuse and direct solar radiation on our model of this building. With regard to the choice and location of sensors, this stage of the study also led us to m… ▽ More

    Submitted 17 December, 2012; originally announced December 2012.

    Journal ref: Energy and Buildings 33, 7 (2001) 711-718

  17. arXiv:1212.3924  [pdf

    cs.CE

    Building ventilation: A pressure airflow model computer generation and elements of validation

    Authors: H. Boyer, A. P. Lauret, L. Adelard, T. A. Mara

    Abstract: The calculation of airflows is of great importance for detailed building thermal simulation computer codes, these airflows most frequently constituting an important thermal coupling between the building and the outside on one hand, and the different thermal zones on the other. The driving effects of air movement, which are the wind and the thermal buoyancy, are briefly outlined and we look closely… ▽ More

    Submitted 17 December, 2012; originally announced December 2012.

    Journal ref: Energy and Buildings 29, 3 (1999) 283-292