-
KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes
Authors:
Eugenie Lai,
Gerardo Vitagliano,
Ziyu Zhang,
Sivaprasad Sudhir,
Om Chabra,
Anna Zeng,
Anton A. Zabreyko,
Chenning Li,
Ferdi Kossmann,
Jialin Ding,
Jun Chen,
Markos Markakis,
Matthew Russo,
Weiyang Wang,
Ziniu Wu,
Michael J. Cafarella,
Lei Cao,
Samuel Madden,
Tim Kraska
Abstract:
Constructing real-world data-to-insight pipelines often involves data extraction from data lakes, data integration across heterogeneous data sources, and diverse operations from data cleaning to analysis. The design and implementation of data science pipelines require domain knowledge, technical expertise, and even project-specific insights. AI systems have shown remarkable reasoning, coding, and…
▽ More
Constructing real-world data-to-insight pipelines often involves data extraction from data lakes, data integration across heterogeneous data sources, and diverse operations from data cleaning to analysis. The design and implementation of data science pipelines require domain knowledge, technical expertise, and even project-specific insights. AI systems have shown remarkable reasoning, coding, and understanding capabilities. However, it remains unclear to what extent these capabilities translate into successful design and execution of such complex pipelines. We introduce KRAMABENCH: a benchmark composed of 104 manually-curated real-world data science pipelines spanning 1700 data files from 24 data sources in 6 different domains. We show that these pipelines test the end-to-end capabilities of AI systems on data processing, requiring data discovery, wrangling and cleaning, efficient processing, statistical reasoning, and orchestrating data processing steps given a high-level task. Our evaluation tests 5 general models and 3 code generation models using our reference framework, DS-GURU, which instructs the AI model to decompose a question into a sequence of subtasks, reason through each step, and synthesize Python code that implements the proposed design. Our results on KRAMABENCH show that, although the models are sufficiently capable of solving well-specified data science code generation tasks, when extensive data processing and domain knowledge are required to construct real-world data science pipelines, existing out-of-box models fall short. Progress on KramaBench represents crucial steps towards developing autonomous data science agents for real-world applications. Our code, reference framework, and data are available at https://github.com/mitdbg/KramaBench.
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
A NuSTAR study of quasi-periodic oscillations from the ultraluminous X-ray sources in M82
Authors:
Hamza El Byad,
Matteo Bachetti,
Silvia Columbu,
Giuseppe Rodriguez,
Maura Pilia,
Matthew J. Middleton,
Dominic J Walton,
Murray Brightman,
Hannah Earnshaw,
Karl Forster,
Brian Grefenstette,
Felix Fürst,
Marianne Heida,
Matteo Imbrogno,
Eleonora Veronica Lai,
Thomas Maccarone
Abstract:
The study of quasi-periodic oscillations in X-ray binaries provides valuable insights into the physics of accretion around compact objects. The M82 galaxy hosts two ultraluminous X-ray sources (ULXs), one of which is suspected to harbor an intermediate-mass black hole. Using 39 NuSTAR observations acquired between 2014--2024, we investigate the aperiodic X-ray variability in M82. In particular, we…
▽ More
The study of quasi-periodic oscillations in X-ray binaries provides valuable insights into the physics of accretion around compact objects. The M82 galaxy hosts two ultraluminous X-ray sources (ULXs), one of which is suspected to harbor an intermediate-mass black hole. Using 39 NuSTAR observations acquired between 2014--2024, we investigate the aperiodic X-ray variability in M82. In particular, we study in detail the evolution of the QPO from M82 X-1 in the range 20--300 mHz. We do not find additional timing features in the data, besides a frequent broad noise component at lower frequencies. The QPO behaves similarly to other classes of low-frequency oscillations in accreting compact objects, both black holes and neutron stars.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
A Dynamic Fuzzy Rule and Attribute Management Framework for Fuzzy Inference Systems in High-Dimensional Data
Authors:
Ke Liu,
Jing Ma,
Edmund M-K Lai
Abstract:
This paper presents an Adaptive Dynamic Attribute and Rule (ADAR) framework designed to address the challenges posed by high-dimensional data in neuro-fuzzy inference systems. By integrating dual weighting mechanisms-assigning adaptive importance to both attributes and rules-together with automated growth and pruning strategies, ADAR adaptively streamlines complex fuzzy models without sacrificing…
▽ More
This paper presents an Adaptive Dynamic Attribute and Rule (ADAR) framework designed to address the challenges posed by high-dimensional data in neuro-fuzzy inference systems. By integrating dual weighting mechanisms-assigning adaptive importance to both attributes and rules-together with automated growth and pruning strategies, ADAR adaptively streamlines complex fuzzy models without sacrificing performance or interpretability. Experimental evaluations on four diverse datasets - Auto MPG (7 variables), Beijing PM2.5 (10 variables), Boston Housing (13 variables), and Appliances Energy Consumption (27 variables) show that ADAR-based models achieve consistently lower Root Mean Square Error (RMSE) compared to state-of-the-art baselines. On the Beijing PM2.5 dataset, for instance, ADAR-SOFENN attained an RMSE of 56.87 with nine rules, surpassing traditional ANFIS [12] and SOFENN [16] models. Similarly, on the high-dimensional Appliances Energy dataset, ADAR-ANFIS reached an RMSE of 83.25 with nine rules, outperforming established fuzzy logic approaches and interpretability-focused methods such as APLR. Ablation studies further reveal that combining rule-level and attribute-level weight assignment significantly reduces model overlap while preserving essential features, thereby enhancing explainability. These results highlight ADAR's effectiveness in dynamically balancing rule complexity and feature importance, paving the way for scalable, high-accuracy, and transparent neuro-fuzzy systems applicable to a range of real-world scenarios.
△ Less
Submitted 27 April, 2025;
originally announced April 2025.
-
Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence
Authors:
Eugenie Y. Lai,
Yeye He,
Surajit Chaudhuri
Abstract:
Business Intelligence (BI) plays a critical role in empowering modern enterprises to make informed data-driven decisions, and has grown into a billion-dollar business. Self-service BI tools like Power BI and Tableau have democratized the ``dashboarding'' phase of BI, by offering user-friendly, drag-and-drop interfaces that are tailored to non-technical enterprise users. However, despite these adva…
▽ More
Business Intelligence (BI) plays a critical role in empowering modern enterprises to make informed data-driven decisions, and has grown into a billion-dollar business. Self-service BI tools like Power BI and Tableau have democratized the ``dashboarding'' phase of BI, by offering user-friendly, drag-and-drop interfaces that are tailored to non-technical enterprise users. However, despite these advances, we observe that the ``data preparation'' phase of BI continues to be a key pain point for BI users today.
In this work, we systematically study around 2K real BI projects harvested from public sources, focusing on the data-preparation phase of the BI workflows. We observe that users often have to program both (1) data transformation steps and (2) table joins steps, before their raw data can be ready for dashboarding and analysis. A careful study of the BI workflows reveals that transformation and join steps are often intertwined in the same BI project, such that considering both holistically is crucial to accurately predict these steps. Leveraging this observation, we develop an Auto-Prep system to holistically predict transformations and joins, using a principled graph-based algorithm inspired by Steiner-tree, with provable quality guarantees. Extensive evaluations using real BI projects suggest that Auto-Prep can correctly predict over 70\% transformation and join steps, significantly more accurate than existing algorithms as well as language-models such as GPT-4.
△ Less
Submitted 15 April, 2025;
originally announced April 2025.
-
MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX
Authors:
Liuyue Xie,
George Z. Wei,
Avik Kuthiala,
Ce Zheng,
Ananya Bal,
Mosam Dabhi,
Liting Wen,
Taru Rustagi,
Ethan Lai,
Sushil Khyalia,
Rohan Choudhury,
Morteza Ziyadi,
Xu Zhang,
Hao Yang,
László A. Jeni
Abstract:
Frontier models have either been language-only or have primarily focused on vision and language modalities. Although recent advancements in models with vision and audio understanding capabilities have shown substantial progress, the field lacks a standardized evaluation framework for thoroughly assessing their cross-modality perception performance. We introduce MAVERIX~(Multimodal Audio-Visual Eva…
▽ More
Frontier models have either been language-only or have primarily focused on vision and language modalities. Although recent advancements in models with vision and audio understanding capabilities have shown substantial progress, the field lacks a standardized evaluation framework for thoroughly assessing their cross-modality perception performance. We introduce MAVERIX~(Multimodal Audio-Visual Evaluation Reasoning IndeX), a novel benchmark with 700 videos and 2,556 questions explicitly designed to evaluate multimodal models through tasks that necessitate close integration of video and audio information. MAVERIX uniquely provides models with audiovisual tasks, closely mimicking the multimodal perceptual experiences available to humans during inference and decision-making processes. To our knowledge, MAVERIX is the first benchmark aimed explicitly at assessing comprehensive audiovisual integration. Experiments with state-of-the-art models, including Gemini 1.5 Pro and o1, show performance approaching human levels (around 70% accuracy), while human experts reach near-ceiling performance (95.1%). With standardized evaluation protocols, a rigorously annotated pipeline, and a public toolkit, MAVERIX establishes a challenging testbed for advancing audiovisual multimodal intelligence.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Residual Learning Inspired Crossover Operator and Strategy Enhancements for Evolutionary Multitasking
Authors:
Ruilin Wang,
Xiang Feng,
Huiqun Yu,
Edmund M-K Lai
Abstract:
In evolutionary multitasking, strategies such as crossover operators and skill factor assignment are critical for effective knowledge transfer. Existing improvements to crossover operators primarily focus on low-dimensional variable combinations, such as arithmetic crossover or partially mapped crossover, which are insufficient for modeling complex high-dimensional interactions.Moreover, static or…
▽ More
In evolutionary multitasking, strategies such as crossover operators and skill factor assignment are critical for effective knowledge transfer. Existing improvements to crossover operators primarily focus on low-dimensional variable combinations, such as arithmetic crossover or partially mapped crossover, which are insufficient for modeling complex high-dimensional interactions.Moreover, static or semi-dynamic crossover strategies fail to adapt to the dynamic dependencies among tasks. In addition, current Multifactorial Evolutionary Algorithm frameworks often rely on fixed skill factor assignment strategies, lacking flexibility. To address these limitations, this paper proposes the Multifactorial Evolutionary Algorithm-Residual Learning (MFEA-RL) method based on residual learning. The method employs a Very Deep Super-Resolution (VDSR) model to generate high-dimensional residual representations of individuals, enhancing the modeling of complex relationships within dimensions. A ResNet-based mechanism dynamically assigns skill factors to improve task adaptability, while a random mapping mechanism efficiently performs crossover operations and mitigates the risk of negative transfer. Theoretical analysis and experimental results show that MFEA-RL outperforms state-of-the-art multitasking algorithms. It excels in both convergence and adaptability on standard evolutionary multitasking benchmarks, including CEC2017-MTSO and WCCI2020-MTSO. Additionally, its effectiveness is validated through a real-world application scenario.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Towards Reinforcement Learning for Exploration of Speculative Execution Vulnerabilities
Authors:
Evan Lai,
Wenjie Xiong,
Edward Suh,
Mohit Tiwari,
Mulong Luo
Abstract:
Speculative attacks such as Spectre can leak secret information without being discovered by the operating system. Speculative execution vulnerabilities are finicky and deep in the sense that to exploit them, it requires intensive manual labor and intimate knowledge of the hardware. In this paper, we introduce SpecRL, a framework that utilizes reinforcement learning to find speculative execution le…
▽ More
Speculative attacks such as Spectre can leak secret information without being discovered by the operating system. Speculative execution vulnerabilities are finicky and deep in the sense that to exploit them, it requires intensive manual labor and intimate knowledge of the hardware. In this paper, we introduce SpecRL, a framework that utilizes reinforcement learning to find speculative execution leaks in post-silicon (black box) microprocessors.
△ Less
Submitted 3 April, 2025; v1 submitted 23 February, 2025;
originally announced February 2025.
-
X-ray and optical polarization aligned with the radio jet ejecta in GX 339-4
Authors:
G. Mastroserio,
B. De Marco,
M. C. Baglio,
F. Carotenuto,
S. Fabiani,
T. D. Russell,
F. Capitanio,
Y. Cavecchi,
S. Motta,
D. M. Russell,
M. Dovciak,
M. Del Santo,
K. Alabarta,
A. Ambrifi,
S. Campana,
P. Casella,
S. Covino,
G. Illiano,
E. Kara,
E. V. Lai,
G. Lodato,
A. Manca,
I. Mariani,
A. Marino,
C. Miceli
, et al. (5 additional authors not shown)
Abstract:
We present the first X-ray polarization measurements of GX 339-4. IXPE observed this source twice during its 2023-2024 outburst, once in the soft-intermediate state and again during a soft state. The observation taken during the intermediate state shows significant ($4σ$) polarization degree P = $1.3\% \pm 0.3\%$ and polarization angle $θ$ = -74\degree $\pm$ 7\degree only in the 3 - 8 keV band. FO…
▽ More
We present the first X-ray polarization measurements of GX 339-4. IXPE observed this source twice during its 2023-2024 outburst, once in the soft-intermediate state and again during a soft state. The observation taken during the intermediate state shows significant ($4σ$) polarization degree P = $1.3\% \pm 0.3\%$ and polarization angle $θ$ = -74\degree $\pm$ 7\degree only in the 3 - 8 keV band. FORS2 at VLT observed the source simultaneously detecting optical polarization in the B, V, R, I bands (between $0.1%$ and $0.7\%$), all roughly aligned with the X-ray polarization. We also detect a discrete jet knot from radio observations taken later in time; this knot would have been ejected from the system around the same time as the hard-to-soft X-ray state transition and a bright radio flare occurred $\sim$3 months earlier. The proper motion of the jet knot provides a direct measurement of the jet orientation angle on the plane of the sky at the time of the ejection. We find that both the X-ray and optical polarization angles are aligned with the direction of the ballistic jet.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
Characterisation of the stellar wind in Cyg X-1 via modelling of colour-colour diagrams
Authors:
E. V. Lai,
B. De Marco,
Y. Cavecchi,
I. El Mellah,
M. Cinus,
C. M. Diez,
V. Grinberg,
A. A. Zdziarski,
P. Uttley,
M. Bachetti,
J. José,
G. Sala,
A. Różańska,
J. Wilms
Abstract:
Cygnus X-1 is a high mass X-ray binary where accretion onto the black hole is mediated by the stellar wind from the blue supergiant companion star HDE 226868. Depending on the position of the black hole along the orbit, X-ray observations can probe different layers of the stellar wind. Deeper wind layers can be investigated at superior conjunction (i.e. null orbital phases). We aim at characterisi…
▽ More
Cygnus X-1 is a high mass X-ray binary where accretion onto the black hole is mediated by the stellar wind from the blue supergiant companion star HDE 226868. Depending on the position of the black hole along the orbit, X-ray observations can probe different layers of the stellar wind. Deeper wind layers can be investigated at superior conjunction (i.e. null orbital phases). We aim at characterising the stellar wind in the Cyg X-1/HDE 226868 system analysing one passage at superior conjunction covered by XMM-Newton during the CHOCBOX campaign via modelling of colour-colour diagrams. Since X-ray absorption is energy-dependent, colour indices provide information on the parameters of the stellar wind, such as the column density $N_{H,w}$ and the covering factor $f_c$. We fitted colour-colour diagrams with models that include both a continuum and a stellar wind component. We used the KDE method to infer the unknown probability distribution of the data points in the colour-colour diagram, and selected the model corresponding to the highest likelihood. In order to study the temporal evolution of the wind around superior conjunction, we extracted and fitted time-resolved colour-colour diagrams. We found that the model that best describes the shape of the colour-colour diagram of Cyg X-1 at superior conjunction requires the wind to be partially ionised. The shape of the colour-colour diagram strongly varies during the analysed observation, as due to concurrent changes of the mean $N_{H,w}$ and the $f_c$ of the wind. Our results suggest the existence of a linear scaling between the rapid variability amplitude of $N_{H,w}$ (on time scales between 10 s and 11 ks) and its long term variations (on time scales 11>ks). Using the inferred best-fit values, we estimated the stellar mass loss rate to be $\sim 7\times10^{-6} {\rm M_{\odot}yr^{-1}}$ and the clumps to have a mass of $\sim10^{17}$ g.
△ Less
Submitted 11 August, 2024;
originally announced August 2024.
-
Causal Inference in the Multiverse of Hazard
Authors:
En-Yu Lai,
Yen-Tsung Huang
Abstract:
Hazard serves as a pivotal estimand in both practical applications and methodological frameworks. However, its causal interpretation poses notable challenges, including inherent selection biases and ill-defined populations to be compared between different treatment groups. In response, we propose a novel definition of counterfactual hazard within the framework of possible worlds. Instead of condit…
▽ More
Hazard serves as a pivotal estimand in both practical applications and methodological frameworks. However, its causal interpretation poses notable challenges, including inherent selection biases and ill-defined populations to be compared between different treatment groups. In response, we propose a novel definition of counterfactual hazard within the framework of possible worlds. Instead of conditioning on prior survival status as a conditional probability, our new definition involves intervening in the prior status, treating it as a marginal probability. Using single-world intervention graphs, we demonstrate that the proposed counterfactual hazard is a type of controlled direct effect. Conceptually, intervening in survival status at each time point generates a new possible world, where the proposed hazards across time points represent risks in these hypothetical scenarios, forming a "multiverse of hazard." The cumulative and average counterfactual hazards correspond to the sum and average of risks across this multiverse, respectively, with the actual world's risk lying between the two. This conceptual shift reframes hazards in the actual world as a collection of risks across possible worlds, marking a significant advancement in the causal interpretation of hazards.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Prompt Selection and Augmentation for Few Examples Code Generation in Large Language Model and its Application in Robotics Control
Authors:
On Tai Wu,
Frodo Kin Sun Chan,
Zunhao Zhang,
Yan Nei Law,
Benny Drescher,
Edmond Shiao Bun Lai
Abstract:
Few-shot prompting and step-by-step reasoning have enhanced the capabilities of Large Language Models (LLMs) in tackling complex tasks including code generation. In this paper, we introduce a prompt selection and augmentation algorithm aimed at improving mathematical reasoning and robot arm operations. Our approach incorporates a multi-stage example augmentation scheme combined with an example sel…
▽ More
Few-shot prompting and step-by-step reasoning have enhanced the capabilities of Large Language Models (LLMs) in tackling complex tasks including code generation. In this paper, we introduce a prompt selection and augmentation algorithm aimed at improving mathematical reasoning and robot arm operations. Our approach incorporates a multi-stage example augmentation scheme combined with an example selection scheme. This algorithm improves LLM performance by selecting a set of examples that increase diversity, minimize redundancy, and increase relevance to the question. When combined with the Program-of-Thought prompting, our algorithm demonstrates an improvement in performance on the GSM8K and SVAMP benchmarks, with increases of 0.3% and 1.1% respectively. Furthermore, in simulated tabletop environments, our algorithm surpasses the Code-as-Policies approach by achieving a 3.4% increase in successful task completions and a decrease of over 70% in the number of examples used. Its ability to discard examples that contribute little to solving the problem reduces the inferencing time of an LLM-powered robotics system. This algorithm also offers important benefits for industrial process automation by streamlining the development and deployment process, reducing manual programming effort, and enhancing code reusability.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Highly Significant Detection of X-Ray Polarization from the Brightest Accreting Neutron Star Sco X-1
Authors:
Fabio La Monaca,
Alessandro Di Marco,
Juri Poutanen,
Matteo Bachetti,
Sara E. Motta,
Alessandro Papitto,
Maura Pilia,
Fei Xie,
Stefano Bianchi,
Anna Bobrikova,
Enrico Costa,
Wei Deng,
Mingyu Ge,
Giulia Illiano,
Shu-Mei Jia,
Henric Krawczynski,
Eleonora V. Lai,
Kuan Liu,
Guglielmo Mastroserio,
Fabio Muleri,
John Rankin,
Paolo Soffitta,
Alexandra Veledina,
Filippo Ambrosino,
Melania Del Santo
, et al. (94 additional authors not shown)
Abstract:
The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which a…
▽ More
The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which allowed for a precise characterization of its broad-band spectrum from soft to hard X-rays. The source has been observed mainly in its soft state, with short periods of flaring. We also observed low-frequency quasi-periodic oscillations. From a spectro-polarimetric analysis, we associate a polarization to the accretion disk at <3.2% at 90% of confidence level, compatible with expectations for an electron-scattering dominated optically thick atmosphere at the Sco X-1 inclination of 44°; for the higher-energy Comptonized component, we obtain a polarization of 1.3(0.4)%, in agreement with expectations for a slab of Thomson optical depth of ~7 and an electron temperature of ~3 keV. A polarization rotation with respect to previous observations by OSO-8 and PolarLight, and also with respect to the radio-jet position angle, is observed. This result may indicate a variation of the polarization with the source state that can be related to relativistic precession or to a change in the corona geometry with the accretion flow.
△ Less
Submitted 24 January, 2024; v1 submitted 10 November, 2023;
originally announced November 2023.
-
ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense
Authors:
Kankan Zhou,
Eason Lai,
Wei Bin Au Yeong,
Kyriakos Mouratidis,
Jing Jiang
Abstract:
Humans possess a strong capability for reasoning beyond common sense. For example, given an unconventional image of a goldfish laying on the table next to an empty fishbowl, a human would effortlessly determine that the fish is not inside the fishbowl. The case, however, may be different for a vision-language model, whose reasoning could gravitate towards the common scenario that the fish is insid…
▽ More
Humans possess a strong capability for reasoning beyond common sense. For example, given an unconventional image of a goldfish laying on the table next to an empty fishbowl, a human would effortlessly determine that the fish is not inside the fishbowl. The case, however, may be different for a vision-language model, whose reasoning could gravitate towards the common scenario that the fish is inside the bowl, despite the visual input. In this paper, we introduce a novel probing dataset named ROME (reasoning beyond commonsense knowledge) to evaluate whether the state-of-the-art pre-trained vision-language models have the reasoning capability to correctly interpret counter-intuitive content. ROME contains images that defy commonsense knowledge with regards to color, shape, material, size and positional relation. Experiments on the state-of-the-art pre-trained vision-language models reveal that most of these models are still largely incapable of interpreting counter-intuitive scenarios. We hope that ROME will spur further investigations on reasoning beyond commonsense knowledge in vision-language research.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Extract-Transform-Load for Video Streams
Authors:
Ferdinand Kossmann,
Ziniu Wu,
Eugenie Lai,
Nesime Tatbul,
Lei Cao,
Tim Kraska,
Samuel Madden
Abstract:
Social media, self-driving cars, and traffic cameras produce video streams at large scales and cheap cost. However, storing and querying video at such scales is prohibitively expensive. We propose to treat large-scale video analytics as a data warehousing problem: Video is a format that is easy to produce but needs to be transformed into an application-specific format that is easy to query. Analog…
▽ More
Social media, self-driving cars, and traffic cameras produce video streams at large scales and cheap cost. However, storing and querying video at such scales is prohibitively expensive. We propose to treat large-scale video analytics as a data warehousing problem: Video is a format that is easy to produce but needs to be transformed into an application-specific format that is easy to query. Analogously, we define the problem of Video Extract-Transform-Load (V-ETL). V-ETL systems need to reduce the cost of running a user-defined V-ETL job while also giving throughput guarantees to keep up with the rate at which data is produced. We find that no current system sufficiently fulfills both needs and therefore propose Skyscraper, a system tailored to V-ETL. Skyscraper can execute arbitrary video ingestion pipelines and adaptively tunes them to reduce cost at minimal or no quality degradation, e.g., by adjusting sampling rates and resolutions to the ingested content. Skyscraper can hereby be provisioned with cheap on-premises compute and uses a combination of buffering and cloud bursting to deal with peaks in workload caused by expensive processing configurations. In our experiments, we find that Skyscraper significantly reduces the cost of V-ETL ingestion compared to adaptions of current SOTA systems, while at the same time giving robustness guarantees that these systems are lacking.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization
Authors:
Guan Wang,
Weihua Li,
Edmund M-K. Lai,
Quan Bai
Abstract:
The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant in…
▽ More
The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant information by generating short and salient content from long or multiple documents. Recent advances in pre-trained language models, such as ChatGPT, have demonstrated the potential of Large Language Models (LLMs) in text generation. However, LLMs require massive amounts of data and resources and are challenging to implement as offline applications. Furthermore, existing text summarization approaches often lack the ``adaptive" nature required to capture diverse aspects in opinion summarization, which is particularly detrimental to users with specific requirements or preferences. In this paper, we propose an Aspect-adaptive Knowledge-based Opinion Summarization model for product reviews, which effectively captures the adaptive nature required for opinion summarization. The model generates aspect-oriented summaries given a set of reviews for a particular product, efficiently providing users with useful information on specific aspects they are interested in, ensuring the generated summaries are more personalized and informative. Extensive experiments have been conducted using real-world datasets to evaluate the proposed model. The results demonstrate that our model outperforms state-of-the-art approaches and is adaptive and efficient in generating summaries that focus on particular aspects, enabling users to make well-informed decisions and catering to their diverse interests and preferences.
△ Less
Submitted 25 May, 2023;
originally announced June 2023.
-
KATSum: Knowledge-aware Abstractive Text Summarization
Authors:
Guan Wang,
Weihua Li,
Edmund Lai,
Jianhua Jiang
Abstract:
Text Summarization is recognised as one of the NLP downstream tasks and it has been extensively investigated in recent years. It can assist people with perceiving the information rapidly from the Internet, including news articles, social posts, videos, etc. Most existing research works attempt to develop summarization models to produce a better output. However, advent limitations of most existing…
▽ More
Text Summarization is recognised as one of the NLP downstream tasks and it has been extensively investigated in recent years. It can assist people with perceiving the information rapidly from the Internet, including news articles, social posts, videos, etc. Most existing research works attempt to develop summarization models to produce a better output. However, advent limitations of most existing models emerge, including unfaithfulness and factual errors. In this paper, we propose a novel model, named as Knowledge-aware Abstractive Text Summarization, which leverages the advantages offered by Knowledge Graph to enhance the standard Seq2Seq model. On top of that, the Knowledge Graph triplets are extracted from the source text and utilised to provide keywords with relational information, producing coherent and factually errorless summaries. We conduct extensive experiments by using real-world data sets. The results reveal that the proposed framework can effectively utilise the information from Knowledge Graph and significantly reduce the factual errors in the summary.
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Benchopt: Reproducible, efficient and collaborative optimization benchmarks
Authors:
Thomas Moreau,
Mathurin Massias,
Alexandre Gramfort,
Pierre Ablin,
Pierre-Antoine Bannier,
Benjamin Charlier,
Mathieu Dagréou,
Tom Dupré la Tour,
Ghislain Durif,
Cassio F. Dantas,
Quentin Klopfenstein,
Johan Larsson,
En Lai,
Tanguy Lefort,
Benoit Malézieux,
Badr Moufad,
Binh T. Nguyen,
Alain Rakotomamonjy,
Zaccharie Ramzi,
Joseph Salmon,
Samuel Vaiter
Abstract:
Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementat…
▽ More
Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementation work. As a result, validation is often very partial, which can lead to wrong conclusions that slow down the progress of research. We propose Benchopt, a collaborative framework to automate, reproduce and publish optimization benchmarks in machine learning across programming languages and hardware architectures. Benchopt simplifies benchmarking for the community by providing an off-the-shelf tool for running, sharing and extending experiments. To demonstrate its broad usability, we showcase benchmarks on three standard learning tasks: $\ell_2$-regularized logistic regression, Lasso, and ResNet18 training for image classification. These benchmarks highlight key practical findings that give a more nuanced view of the state-of-the-art for these problems, showing that for practical evaluation, the devil is in the details. We hope that Benchopt will foster collaborative work in the community hence improving the reproducibility of research findings.
△ Less
Submitted 28 October, 2022; v1 submitted 27 June, 2022;
originally announced June 2022.
-
The X-ray spectral-timing contribution of the stellar wind in the hard state of Cyg X-1
Authors:
E. V. Lai,
B. De Marco,
A. A. Zdziarski,
T. M. Belloni,
S. Mondal,
P. Uttley,
V. Grinberg,
J. Wilms,
A. Różańska
Abstract:
The clumpy stellar wind from the companion star in high mass X-ray binaries causes variable, partial absorption of the emission from the X-ray source. We studied XMM-Newton observations from the 7.22 d-long "Cyg X-1 Hard state Observations of a Complete Binary Orbit in X-rays" (CHOCBOX) monitoring campaign, in order to constrain the effects of the stellar wind on the short-timescale X-ray spectral…
▽ More
The clumpy stellar wind from the companion star in high mass X-ray binaries causes variable, partial absorption of the emission from the X-ray source. We studied XMM-Newton observations from the 7.22 d-long "Cyg X-1 Hard state Observations of a Complete Binary Orbit in X-rays" (CHOCBOX) monitoring campaign, in order to constrain the effects of the stellar wind on the short-timescale X-ray spectral-timing properties of the source. We find these properties to change significantly in the presence of the wind. In particular, the longest sampled timescales (corresponding to temporal frequencies of $ν\sim$ 0.1-1 Hz) reveal an enhancement of the fractional variability power, while on the shortest sampled timescales ($ν\sim$ 1-10 Hz) the variability is suppressed. In addition, we observe a reduction (by up to a factor of $\sim$ 1.8) of the otherwise high coherence between soft and hard band light curves, as well as of the amplitude of the hard X-ray lags intrinsic to the X-ray continuum. The observed increase of low frequency variability power can be explained in terms of variations of the wind column density as a consequence of motions of the intervening clumps. In this scenario (and assuming a terminal velocity of $v_{\infty}=2400\ {\rm km\ s^{-1}}$), we obtain an estimate of $l \sim$ 0.5-1.5 $\times 10^{-4} R_{\ast}$ for the average radial size of a clump. On the other hand, we suggest the behaviour at high frequencies to be due to scattering in an optically thicker medium, possibly formed by collision of the stellar wind with the edge of the disc.
△ Less
Submitted 14 February, 2022;
originally announced February 2022.
-
ABEM: An Adaptive Agent-based Evolutionary Approach for Mining Influencers in Online Social Networks
Authors:
Weihua Li,
Yuxuan Hu,
Shiqing Wu,
Quan Bai,
Edmund Lai
Abstract:
A key step in influence maximization in online social networks is the identification of a small number of users, known as influencers, who are able to spread influence quickly and widely to other users. The evolving nature of the topological structure of these networks makes it difficult to locate and identify these influencers. In this paper, we propose an adaptive agent-based evolutionary approa…
▽ More
A key step in influence maximization in online social networks is the identification of a small number of users, known as influencers, who are able to spread influence quickly and widely to other users. The evolving nature of the topological structure of these networks makes it difficult to locate and identify these influencers. In this paper, we propose an adaptive agent-based evolutionary approach to address this problem in the context of both static and dynamic networks. This approach is shown to be able to adapt the solution as the network evolves. It is also applicable to large-scale networks due to its distributed framework. Evaluation of our approach is performed by using both synthetic networks and real-world datasets. Experimental results demonstrate that the proposed approach outperforms state-of-the-art seeding algorithms in terms of maximizing influence.
△ Less
Submitted 13 April, 2021;
originally announced April 2021.
-
The inner flow geometry in MAXI J1820+070 during hard and hard-intermediate states
Authors:
B. De Marco,
A. A. Zdziarski,
G. Ponti,
G. Migliori,
T. M. Belloni,
A. Segovia Otero,
M. Dziełak,
E. V. Lai
Abstract:
[Abridged] Context: We present a systematic X-ray spectral-timing study of the recently discovered, exceptionally bright black hole X-ray binary system MAXI J1820+070. Our analysis focuses on the first part of the 2018 outburst, covering the rise throughout the hard state, the bright hard and hard-intermediate states, and the transition to the soft-intermediate state. Aims: We address the issue of…
▽ More
[Abridged] Context: We present a systematic X-ray spectral-timing study of the recently discovered, exceptionally bright black hole X-ray binary system MAXI J1820+070. Our analysis focuses on the first part of the 2018 outburst, covering the rise throughout the hard state, the bright hard and hard-intermediate states, and the transition to the soft-intermediate state. Aims: We address the issue of constraining the geometry of the innermost accretion flow and its evolution throughout an outburst. Methods: We employed two independent X-ray spectral-timing methods applied to the NICER data of MAXI J1820+070. We first identified and tracked the evolution of a characteristic frequency of soft X-ray reverberation lags. Then, we studied the spectral evolution of the quasi-thermal component responsible for the observed thermal reverberation lags. Results: The frequency of thermal reverberation lags steadily increases throughout most of the outburst, implying that the relative distance between the X-ray source and the disc decreases as the source softens. However, near transition this evolution breaks, showing a sudden increase (decrease) of lag amplitude (frequency). The temperature of the quasi-thermal component in covariance spectra consistently increases throughout all the analysed observations. Conclusions: The behaviour of thermal reverberation lags near transition might be related to the relativistic plasma ejections detected at radio wavelengths, suggesting a causal connection between the two phenomena. Throughout most of the hard and hard-intermediate states the disc is consistent with being truncated (with an inner radius $R_{\rm in}>\sim 10 R_{\rm g}$), reaching close to the innermost stable circular orbit only near transition.
△ Less
Submitted 6 August, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
-
An extreme Ultraluminous X-ray source X-1 in NGC 5055
Authors:
Samaresh Mondal,
Agata Rozanska,
Eleonora Veronica Lai,
Barbara De Marco
Abstract:
Aims. We analyzed multi-epoch X-ray data of the Ultraluminous X-ray source (ULX) NGC 5055 X-1, with luminosity up to $2.32\times10^{40}\ \rm erg\ s^{-1}$, in order to constrain the physical parameters of the source. Methods. We performed timing and spectral analysis of Chandra and XMM-Newton observations. We used spectral models which assume the emission is from an accreting black hole system. We…
▽ More
Aims. We analyzed multi-epoch X-ray data of the Ultraluminous X-ray source (ULX) NGC 5055 X-1, with luminosity up to $2.32\times10^{40}\ \rm erg\ s^{-1}$, in order to constrain the physical parameters of the source. Methods. We performed timing and spectral analysis of Chandra and XMM-Newton observations. We used spectral models which assume the emission is from an accreting black hole system. We fit the data with a multicolor disk (MCD) combined with a powerlaw (PL) or a thermal Comptonization (NTHCOMP) component, and compared those fits with a slim disk model. Results. The lightcurves of the source do not show significant variability. From the hardness ratios (3-10 keV/0.3-3 keV flux) we infer that the source is not spectrally variable. We found that the photon index is tightly, positively correlated with the unabsorbed 0.3-10 keV flux and the hydrogen column density. Furthermore, the temperature emissivity profile indicates a deviation from the standard sub-Eddington thin disk model. The source shows an inverse correlation between luminosity and inner disk temperature in all fitted models. Conclusions. Our analysis favors the source to be in an ultraluminous soft state. The positive correlations between the photon index and the flux, and between the photon index and the hydrogen column density may suggest the source is accreting at high Eddington ratios and might indicate the presence of a wind. The inverse luminosity relation with the inner disk temperature for all spectral models may indicate that the emission is geometrically beamed by an optically thick outflow.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
Heidelberg Colorectal Data Set for Surgical Data Science in the Sensor Operating Room
Authors:
Lena Maier-Hein,
Martin Wagner,
Tobias Ross,
Annika Reinke,
Sebastian Bodenstedt,
Peter M. Full,
Hellena Hempe,
Diana Mindroc-Filimon,
Patrick Scholz,
Thuy Nuong Tran,
Pierangela Bruno,
Anna Kisilenko,
Benjamin Müller,
Tornike Davitashvili,
Manuela Capek,
Minu Tizabi,
Matthias Eisenmann,
Tim J. Adler,
Janek Gröhl,
Melanie Schellenberg,
Silvia Seidlitz,
T. Y. Emmy Lai,
Bünyamin Pekdemir,
Veith Roethlingshoefer,
Fabian Both
, et al. (8 additional authors not shown)
Abstract:
Image-based tracking of medical instruments is an integral part of surgical data science applications. Previous research has addressed the tasks of detecting, segmenting and tracking medical instruments based on laparoscopic video data. However, the proposed methods still tend to fail when applied to challenging images and do not generalize well to data they have not been trained on. This paper in…
▽ More
Image-based tracking of medical instruments is an integral part of surgical data science applications. Previous research has addressed the tasks of detecting, segmenting and tracking medical instruments based on laparoscopic video data. However, the proposed methods still tend to fail when applied to challenging images and do not generalize well to data they have not been trained on. This paper introduces the Heidelberg Colorectal (HeiCo) data set - the first publicly available data set enabling comprehensive benchmarking of medical instrument detection and segmentation algorithms with a specific emphasis on method robustness and generalization capabilities. Our data set comprises 30 laparoscopic videos and corresponding sensor data from medical devices in the operating room for three different types of laparoscopic surgery. Annotations include surgical phase labels for all video frames as well as information on instrument presence and corresponding instance-wise segmentation masks for surgical instruments (if any) in more than 10,000 individual frames. The data has successfully been used to organize international competitions within the Endoscopic Vision Challenges 2017 and 2019.
△ Less
Submitted 23 February, 2021; v1 submitted 7 May, 2020;
originally announced May 2020.
-
Enhanced Particle Swarm Optimization Algorithms for Multiple-Input Multiple-Output System Modelling using Convolved Gaussian Process Models
Authors:
Gang Cao,
Edmund M-K Lai,
Fakhrul Alam
Abstract:
Convolved Gaussian Process (CGP) is able to capture the correlations not only between inputs and outputs but also among the outputs. This allows a superior performance of using CGP than standard Gaussian Process (GP) in the modelling of Multiple-Input Multiple-Output (MIMO) systems when observations are missing for some of outputs. Similar to standard GP, a key issue of CGP is the learning of hype…
▽ More
Convolved Gaussian Process (CGP) is able to capture the correlations not only between inputs and outputs but also among the outputs. This allows a superior performance of using CGP than standard Gaussian Process (GP) in the modelling of Multiple-Input Multiple-Output (MIMO) systems when observations are missing for some of outputs. Similar to standard GP, a key issue of CGP is the learning of hyperparameters from a set of input-output observations. It typically performed by maximizing the Log-Likelihood (LL) function which leads to an unconstrained nonlinear and non-convex optimization problem. Algorithms such as Conjugate Gradient (CG) or Broyden-Fletcher-Goldfarb-Shanno (BFGS) are commonly used but they often get stuck in local optima, especially for CGP where there are more hyperparameters. In addition, the LL value is not a reliable indicator for judging the quality intermediate models in the optimization process. In this paper, we propose to use enhanced Particle Swarm Optimization (PSO) algorithms to solve this problem by minimizing the model output error instead. This optimization criterion enables the quality of intermediate solutions to be directly observable during the optimization process. Two enhancements to the standard PSO algorithm which make use of gradient information and the multi- start technique are proposed. Simulation results on the modelling of both linear and nonlinear systems demonstrate the effectiveness of minimizing the model output error to learn hyperparameters and the performance of using enhanced algorithms.
△ Less
Submitted 12 July, 2017;
originally announced September 2017.
-
Gaussian Process Model Predictive Control of An Unmanned Quadrotor
Authors:
Gang Cao,
Edmund M-K Lai,
Fakhrul Alam
Abstract:
The Model Predictive Control (MPC) trajectory tracking problem of an unmanned quadrotor with input and output constraints is addressed. In this article, the dynamic models of the quadrotor are obtained purely from operational data in the form of probabilistic Gaussian Process (GP) models. This is different from conventional models obtained through Newtonian analysis. A hierarchical control scheme…
▽ More
The Model Predictive Control (MPC) trajectory tracking problem of an unmanned quadrotor with input and output constraints is addressed. In this article, the dynamic models of the quadrotor are obtained purely from operational data in the form of probabilistic Gaussian Process (GP) models. This is different from conventional models obtained through Newtonian analysis. A hierarchical control scheme is used to handle the trajectory tracking problem with the translational subsystem in the outer loop and the rotational subsystem in the inner loop. Constrained GP based MPC are formulated separately for both subsystems. The resulting MPC problems are typically nonlinear and non-convex. We derived 15 a GP based local dynamical model that allows these optimization problems to be relaxed to convex ones which can be efficiently solved with a simple active-set algorithm. The performance of the proposed approach is compared with an existing unconstrained Nonlinear Model Predictive Control (NMPC). Simulation results show that the two approaches exhibit similar trajectory tracking performance. However, our approach has the advantage of incorporating constraints on the control inputs. In addition, our approach only requires 20% of the computational time for NMPC.
△ Less
Submitted 12 July, 2017;
originally announced July 2017.
-
Gaussian Process Model Predictive Control of Unknown Nonlinear Systems
Authors:
Gang Cao,
Edmund M-K Lai,
Fakhrul Alam
Abstract:
Model Predictive Control (MPC) of an unknown system that is modelled by Gaussian Process (GP) techniques is studied in this paper. Using GP, the variances computed during the modelling and inference processes allow us to take model uncertainty into account. The main issue in using MPC to control systems modelled by GP is the propagation of such uncertainties within the control horizon. In this pap…
▽ More
Model Predictive Control (MPC) of an unknown system that is modelled by Gaussian Process (GP) techniques is studied in this paper. Using GP, the variances computed during the modelling and inference processes allow us to take model uncertainty into account. The main issue in using MPC to control systems modelled by GP is the propagation of such uncertainties within the control horizon. In this paper, two approaches to solve this problem, called GPMPC1 and GPMPC2, are proposed. With GPMPC1, the original Stochastic Model Predictive Control (SMPC) problem is relaxed to a deterministic nonlinear MPC based on a basic linearized GP local model. The resulting optimization problem, though non-convex, can be solved by the Sequential Quadratic Programming (SQP). By incorporating the model variance into the state vector, an extended local model is derived. This model allows us to relax the non-convex MPC problem to a convex one which can be solved by an active-set method efficiently. The performance of both approaches is demonstrated by applying them to two trajectory tracking problems. Results show that both GPMPC1 and GPMPC2 produce effective controls but GPMPC2 is much more efficient computationally.
△ Less
Submitted 4 December, 2016;
originally announced December 2016.
-
A Low Complexity Spectrum Sensing Scheme for Estimating Frequency Band Edges in Multi-Standard Military Communication Receivers
Authors:
S. J. Darak,
A. P. Vinod,
E. M-K. Lai
Abstract:
In a typical multi-standard military communication receiver, fast and reliable spectrum sensing unit is required to extract the information of multiple channels (frequency bands) present in a wideband input signal. In this paper, an energy detector based on our reconfigurable filter bank, in [5], for detecting the edge frequencies of the channels is proposed. Simulation results are presented to sh…
▽ More
In a typical multi-standard military communication receiver, fast and reliable spectrum sensing unit is required to extract the information of multiple channels (frequency bands) present in a wideband input signal. In this paper, an energy detector based on our reconfigurable filter bank, in [5], for detecting the edge frequencies of the channels is proposed. Simulation results are presented to show the trade-off between the time required to calculate edge frequencies of all the channels and the maximum normalized error in estimating the edge frequencies. The proposed method is compared with existing energy detector methods for complexity and performance. It is shown that, for a fixed number of input samples, error decreases with time in the proposed algorithm as compared to other methods where error is constant. Design examples and simulations show that the complexity of the proposed method is lower than the other methods for a given error in estimating the edge frequencies
△ Less
Submitted 14 August, 2016;
originally announced August 2016.
-
Design of Variable Bandpass Filters Using First Order Allpass Transformation And Coefficient Decimation
Authors:
S. J. Darak,
A. P. Vinod,
E. M-K. Lai
Abstract:
In this paper, the design of a computationally efficient variable bandpass digital filter is presented. The center frequency and bandwidth of this filter can be changed online without updating the filter coefficients. The warped filters, obtained by replacing each unit delay of a digital filter with an allpass filter, are widely used for various audio processing applications. However, warped filte…
▽ More
In this paper, the design of a computationally efficient variable bandpass digital filter is presented. The center frequency and bandwidth of this filter can be changed online without updating the filter coefficients. The warped filters, obtained by replacing each unit delay of a digital filter with an allpass filter, are widely used for various audio processing applications. However, warped filters fail to provide variable bandwidth bandpass responses for a given center frequency using first order allpass transformation. To overcome this drawback, our design is accomplished by combining warped filter with the coefficient decimation technique. The design example shows that the proposed variable digital filter is simple to design and offers a total gate count reduction of 36% and 65% over the warped filters compared to the designs presented in [3] and [1] respectively
△ Less
Submitted 14 August, 2016;
originally announced August 2016.
-
Unifying the Phase Diagrams of the Magnetic and Transport Properties of La_(2-x)Sr_xCuO_4, 0 < x < 0.05
Authors:
E. Lai,
R. J. Gooding
Abstract:
An extensive experimental and theoretical effort has led to a largely complete mapping of the magnetic phase diagram of La_(2-x)Sr_xCuO_4, and a microscopic model of the spin textures produced in the x < 0.05 regime has been shown to be in agreement with this phase diagram. Here we use this same model to derive a theory of the impurity-dominated, low temperature transport. Then, we present an an…
▽ More
An extensive experimental and theoretical effort has led to a largely complete mapping of the magnetic phase diagram of La_(2-x)Sr_xCuO_4, and a microscopic model of the spin textures produced in the x < 0.05 regime has been shown to be in agreement with this phase diagram. Here we use this same model to derive a theory of the impurity-dominated, low temperature transport. Then, we present an analysis of previously published data for two samples: x = 0.002 data from Chen et. al., and x = 0.04 data from Keimer et. al. We show that the transport mechanisms in the two systems are the same, even though they are on opposite sides of the observed insulator-to-metal transition. Our model of impurity effects on the impurity band conduction, variable-range hopping conduction, and coulomb gap conduction, is similar to that used to describe doped semiconductors. However, for La_(2-x)Sr_xCuO_4 we find that in addition to impurity-generated disorder effects, strong correlations are important and must be treated on a equal level with disorder. On the basis of this work we propose a phase diagram that is consistent with available magnetic and transport experiments, and which connects the undoped parent compound with the lowest x value for which La_(2-x)Sr_xCuO_4 is found to be superconducting, x about 0.06.
△ Less
Submitted 5 September, 1997;
originally announced September 1997.