Skip to main content

Showing 1–28 of 28 results for author: Lai, E

.
  1. arXiv:2506.06541  [pdf, ps, other

    cs.DB cs.AI cs.MA

    KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes

    Authors: Eugenie Lai, Gerardo Vitagliano, Ziyu Zhang, Sivaprasad Sudhir, Om Chabra, Anna Zeng, Anton A. Zabreyko, Chenning Li, Ferdi Kossmann, Jialin Ding, Jun Chen, Markos Markakis, Matthew Russo, Weiyang Wang, Ziniu Wu, Michael J. Cafarella, Lei Cao, Samuel Madden, Tim Kraska

    Abstract: Constructing real-world data-to-insight pipelines often involves data extraction from data lakes, data integration across heterogeneous data sources, and diverse operations from data cleaning to analysis. The design and implementation of data science pipelines require domain knowledge, technical expertise, and even project-specific insights. AI systems have shown remarkable reasoning, coding, and… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2505.16921  [pdf, ps, other

    astro-ph.HE

    A NuSTAR study of quasi-periodic oscillations from the ultraluminous X-ray sources in M82

    Authors: Hamza El Byad, Matteo Bachetti, Silvia Columbu, Giuseppe Rodriguez, Maura Pilia, Matthew J. Middleton, Dominic J Walton, Murray Brightman, Hannah Earnshaw, Karl Forster, Brian Grefenstette, Felix Fürst, Marianne Heida, Matteo Imbrogno, Eleonora Veronica Lai, Thomas Maccarone

    Abstract: The study of quasi-periodic oscillations in X-ray binaries provides valuable insights into the physics of accretion around compact objects. The M82 galaxy hosts two ultraluminous X-ray sources (ULXs), one of which is suspected to harbor an intermediate-mass black hole. Using 39 NuSTAR observations acquired between 2014--2024, we investigate the aperiodic X-ray variability in M82. In particular, we… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 16 pages, 8 figures, 2 tables

  3. arXiv:2504.19148  [pdf

    cs.AI

    A Dynamic Fuzzy Rule and Attribute Management Framework for Fuzzy Inference Systems in High-Dimensional Data

    Authors: Ke Liu, Jing Ma, Edmund M-K Lai

    Abstract: This paper presents an Adaptive Dynamic Attribute and Rule (ADAR) framework designed to address the challenges posed by high-dimensional data in neuro-fuzzy inference systems. By integrating dual weighting mechanisms-assigning adaptive importance to both attributes and rules-together with automated growth and pruning strategies, ADAR adaptively streamlines complex fuzzy models without sacrificing… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  4. arXiv:2504.11627  [pdf, other

    cs.DB

    Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence

    Authors: Eugenie Y. Lai, Yeye He, Surajit Chaudhuri

    Abstract: Business Intelligence (BI) plays a critical role in empowering modern enterprises to make informed data-driven decisions, and has grown into a billion-dollar business. Self-service BI tools like Power BI and Tableau have democratized the ``dashboarding'' phase of BI, by offering user-friendly, drag-and-drop interfaces that are tailored to non-technical enterprise users. However, despite these adva… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: full version of a paper accepted to VLDB 2025

  5. arXiv:2503.21699  [pdf, other

    cs.MM cs.AI cs.CV cs.SD eess.AS

    MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX

    Authors: Liuyue Xie, George Z. Wei, Avik Kuthiala, Ce Zheng, Ananya Bal, Mosam Dabhi, Liting Wen, Taru Rustagi, Ethan Lai, Sushil Khyalia, Rohan Choudhury, Morteza Ziyadi, Xu Zhang, Hao Yang, László A. Jeni

    Abstract: Frontier models have either been language-only or have primarily focused on vision and language modalities. Although recent advancements in models with vision and audio understanding capabilities have shown substantial progress, the field lacks a standardized evaluation framework for thoroughly assessing their cross-modality perception performance. We introduce MAVERIX~(Multimodal Audio-Visual Eva… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  6. arXiv:2503.21347  [pdf, other

    cs.NE cs.AI

    Residual Learning Inspired Crossover Operator and Strategy Enhancements for Evolutionary Multitasking

    Authors: Ruilin Wang, Xiang Feng, Huiqun Yu, Edmund M-K Lai

    Abstract: In evolutionary multitasking, strategies such as crossover operators and skill factor assignment are critical for effective knowledge transfer. Existing improvements to crossover operators primarily focus on low-dimensional variable combinations, such as arithmetic crossover or partially mapped crossover, which are insufficient for modeling complex high-dimensional interactions.Moreover, static or… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: 9 pages, 4 figures

  7. arXiv:2502.16756  [pdf, other

    cs.CR cs.AI

    Towards Reinforcement Learning for Exploration of Speculative Execution Vulnerabilities

    Authors: Evan Lai, Wenjie Xiong, Edward Suh, Mohit Tiwari, Mulong Luo

    Abstract: Speculative attacks such as Spectre can leak secret information without being discovered by the operating system. Speculative execution vulnerabilities are finicky and deep in the sense that to exploit them, it requires intensive manual labor and intimate knowledge of the hardware. In this paper, we introduce SpecRL, a framework that utilizes reinforcement learning to find speculative execution le… ▽ More

    Submitted 3 April, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  8. arXiv:2408.06856  [pdf, other

    astro-ph.HE astro-ph.IM

    X-ray and optical polarization aligned with the radio jet ejecta in GX 339-4

    Authors: G. Mastroserio, B. De Marco, M. C. Baglio, F. Carotenuto, S. Fabiani, T. D. Russell, F. Capitanio, Y. Cavecchi, S. Motta, D. M. Russell, M. Dovciak, M. Del Santo, K. Alabarta, A. Ambrifi, S. Campana, P. Casella, S. Covino, G. Illiano, E. Kara, E. V. Lai, G. Lodato, A. Manca, I. Mariani, A. Marino, C. Miceli , et al. (5 additional authors not shown)

    Abstract: We present the first X-ray polarization measurements of GX 339-4. IXPE observed this source twice during its 2023-2024 outburst, once in the soft-intermediate state and again during a soft state. The observation taken during the intermediate state shows significant ($4σ$) polarization degree P = $1.3\% \pm 0.3\%$ and polarization angle $θ$ = -74\degree $\pm$ 7\degree only in the 3 - 8 keV band. FO… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Submitted to ApJ

  9. arXiv:2408.05852  [pdf, other

    astro-ph.HE

    Characterisation of the stellar wind in Cyg X-1 via modelling of colour-colour diagrams

    Authors: E. V. Lai, B. De Marco, Y. Cavecchi, I. El Mellah, M. Cinus, C. M. Diez, V. Grinberg, A. A. Zdziarski, P. Uttley, M. Bachetti, J. José, G. Sala, A. Różańska, J. Wilms

    Abstract: Cygnus X-1 is a high mass X-ray binary where accretion onto the black hole is mediated by the stellar wind from the blue supergiant companion star HDE 226868. Depending on the position of the black hole along the orbit, X-ray observations can probe different layers of the stellar wind. Deeper wind layers can be investigated at superior conjunction (i.e. null orbital phases). We aim at characterisi… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Accepted for publication in A&A

  10. arXiv:2405.04446  [pdf, other

    stat.ME

    Causal Inference in the Multiverse of Hazard

    Authors: En-Yu Lai, Yen-Tsung Huang

    Abstract: Hazard serves as a pivotal estimand in both practical applications and methodological frameworks. However, its causal interpretation poses notable challenges, including inherent selection biases and ill-defined populations to be compared between different treatment groups. In response, we propose a novel definition of counterfactual hazard within the framework of possible worlds. Instead of condit… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  11. arXiv:2403.12999  [pdf

    cs.RO cs.AI cs.CL cs.LG

    Prompt Selection and Augmentation for Few Examples Code Generation in Large Language Model and its Application in Robotics Control

    Authors: On Tai Wu, Frodo Kin Sun Chan, Zunhao Zhang, Yan Nei Law, Benny Drescher, Edmond Shiao Bun Lai

    Abstract: Few-shot prompting and step-by-step reasoning have enhanced the capabilities of Large Language Models (LLMs) in tackling complex tasks including code generation. In this paper, we introduce a prompt selection and augmentation algorithm aimed at improving mathematical reasoning and robot arm operations. Our approach incorporates a multi-stage example augmentation scheme combined with an example sel… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 17 pages, 4 figures

  12. Highly Significant Detection of X-Ray Polarization from the Brightest Accreting Neutron Star Sco X-1

    Authors: Fabio La Monaca, Alessandro Di Marco, Juri Poutanen, Matteo Bachetti, Sara E. Motta, Alessandro Papitto, Maura Pilia, Fei Xie, Stefano Bianchi, Anna Bobrikova, Enrico Costa, Wei Deng, Mingyu Ge, Giulia Illiano, Shu-Mei Jia, Henric Krawczynski, Eleonora V. Lai, Kuan Liu, Guglielmo Mastroserio, Fabio Muleri, John Rankin, Paolo Soffitta, Alexandra Veledina, Filippo Ambrosino, Melania Del Santo , et al. (94 additional authors not shown)

    Abstract: The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which a… ▽ More

    Submitted 24 January, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Journal ref: ApJL 960 L11 (2024)

  13. arXiv:2310.19301  [pdf, other

    cs.CL cs.AI cs.CV

    ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense

    Authors: Kankan Zhou, Eason Lai, Wei Bin Au Yeong, Kyriakos Mouratidis, Jing Jiang

    Abstract: Humans possess a strong capability for reasoning beyond common sense. For example, given an unconventional image of a goldfish laying on the table next to an empty fishbowl, a human would effortlessly determine that the fish is not inside the fishbowl. The case, however, may be different for a vision-language model, whose reasoning could gravitate towards the common scenario that the fish is insid… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: This is the camera-ready version of the paper that will be published in the EMNLP 2023 Findings (Singapore, 6-10 December 2023)

  14. arXiv:2310.04830  [pdf, other

    cs.DB cs.CV cs.LG

    Extract-Transform-Load for Video Streams

    Authors: Ferdinand Kossmann, Ziniu Wu, Eugenie Lai, Nesime Tatbul, Lei Cao, Tim Kraska, Samuel Madden

    Abstract: Social media, self-driving cars, and traffic cameras produce video streams at large scales and cheap cost. However, storing and querying video at such scales is prohibitively expensive. We propose to treat large-scale video analytics as a data warehousing problem: Video is a format that is easy to produce but needs to be transformed into an application-specific format that is easy to query. Analog… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 26 pages, 23 figures

    Journal ref: Proc. VLDB Endow. 16, 9 (May 2023), 2302-2315

  15. arXiv:2306.05537  [pdf, other

    cs.CL

    AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization

    Authors: Guan Wang, Weihua Li, Edmund M-K. Lai, Quan Bai

    Abstract: The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant in… ▽ More

    Submitted 25 May, 2023; originally announced June 2023.

    Comments: 21 pages, 4 figures, 7 tables

  16. arXiv:2212.03371  [pdf, other

    cs.CL cs.AI

    KATSum: Knowledge-aware Abstractive Text Summarization

    Authors: Guan Wang, Weihua Li, Edmund Lai, Jianhua Jiang

    Abstract: Text Summarization is recognised as one of the NLP downstream tasks and it has been extensively investigated in recent years. It can assist people with perceiving the information rapidly from the Internet, including news articles, social posts, videos, etc. Most existing research works attempt to develop summarization models to produce a better output. However, advent limitations of most existing… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Presented at PKAW 2022 (arXiv:2211.03888) Report-no: PKAW/2022/02

    Report number: Report-no: PKAW/2022/02

  17. arXiv:2206.13424  [pdf, other

    cs.LG math.OC stat.ML

    Benchopt: Reproducible, efficient and collaborative optimization benchmarks

    Authors: Thomas Moreau, Mathurin Massias, Alexandre Gramfort, Pierre Ablin, Pierre-Antoine Bannier, Benjamin Charlier, Mathieu Dagréou, Tom Dupré la Tour, Ghislain Durif, Cassio F. Dantas, Quentin Klopfenstein, Johan Larsson, En Lai, Tanguy Lefort, Benoit Malézieux, Badr Moufad, Binh T. Nguyen, Alain Rakotomamonjy, Zaccharie Ramzi, Joseph Salmon, Samuel Vaiter

    Abstract: Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementat… ▽ More

    Submitted 28 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted in proceedings of NeurIPS 22; Benchopt library documentation is available at https://benchopt.github.io/

  18. The X-ray spectral-timing contribution of the stellar wind in the hard state of Cyg X-1

    Authors: E. V. Lai, B. De Marco, A. A. Zdziarski, T. M. Belloni, S. Mondal, P. Uttley, V. Grinberg, J. Wilms, A. Różańska

    Abstract: The clumpy stellar wind from the companion star in high mass X-ray binaries causes variable, partial absorption of the emission from the X-ray source. We studied XMM-Newton observations from the 7.22 d-long "Cyg X-1 Hard state Observations of a Complete Binary Orbit in X-rays" (CHOCBOX) monitoring campaign, in order to constrain the effects of the stellar wind on the short-timescale X-ray spectral… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 16 pages, 13 figures

  19. arXiv:2104.06563  [pdf, other

    cs.SI cs.AI

    ABEM: An Adaptive Agent-based Evolutionary Approach for Mining Influencers in Online Social Networks

    Authors: Weihua Li, Yuxuan Hu, Shiqing Wu, Quan Bai, Edmund Lai

    Abstract: A key step in influence maximization in online social networks is the identification of a small number of users, known as influencers, who are able to spread influence quickly and widely to other users. The evolving nature of the topological structure of these networks makes it difficult to locate and identify these influencers. In this paper, we propose an adaptive agent-based evolutionary approa… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 22 pages, 9 figures

    MSC Class: 68Txx (Primary); 68Uxx (Secondary) ACM Class: I.2.11; I.6.0

  20. The inner flow geometry in MAXI J1820+070 during hard and hard-intermediate states

    Authors: B. De Marco, A. A. Zdziarski, G. Ponti, G. Migliori, T. M. Belloni, A. Segovia Otero, M. Dziełak, E. V. Lai

    Abstract: [Abridged] Context: We present a systematic X-ray spectral-timing study of the recently discovered, exceptionally bright black hole X-ray binary system MAXI J1820+070. Our analysis focuses on the first part of the 2018 outburst, covering the rise throughout the hard state, the bright hard and hard-intermediate states, and the transition to the soft-intermediate state. Aims: We address the issue of… ▽ More

    Submitted 6 August, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in Astronomy & Astrophysics, matches published version

    Journal ref: A&A 654, A14 (2021)

  21. An extreme Ultraluminous X-ray source X-1 in NGC 5055

    Authors: Samaresh Mondal, Agata Rozanska, Eleonora Veronica Lai, Barbara De Marco

    Abstract: Aims. We analyzed multi-epoch X-ray data of the Ultraluminous X-ray source (ULX) NGC 5055 X-1, with luminosity up to $2.32\times10^{40}\ \rm erg\ s^{-1}$, in order to constrain the physical parameters of the source. Methods. We performed timing and spectral analysis of Chandra and XMM-Newton observations. We used spectral models which assume the emission is from an accreting black hole system. We… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: 8 pages, 10 figures, Accepted for publication in A&A

    Journal ref: A&A 642, A94 (2020)

  22. arXiv:2005.03501  [pdf

    cs.CV

    Heidelberg Colorectal Data Set for Surgical Data Science in the Sensor Operating Room

    Authors: Lena Maier-Hein, Martin Wagner, Tobias Ross, Annika Reinke, Sebastian Bodenstedt, Peter M. Full, Hellena Hempe, Diana Mindroc-Filimon, Patrick Scholz, Thuy Nuong Tran, Pierangela Bruno, Anna Kisilenko, Benjamin Müller, Tornike Davitashvili, Manuela Capek, Minu Tizabi, Matthias Eisenmann, Tim J. Adler, Janek Gröhl, Melanie Schellenberg, Silvia Seidlitz, T. Y. Emmy Lai, Bünyamin Pekdemir, Veith Roethlingshoefer, Fabian Both , et al. (8 additional authors not shown)

    Abstract: Image-based tracking of medical instruments is an integral part of surgical data science applications. Previous research has addressed the tasks of detecting, segmenting and tracking medical instruments based on laparoscopic video data. However, the proposed methods still tend to fail when applied to challenging images and do not generalize well to data they have not been trained on. This paper in… ▽ More

    Submitted 23 February, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Submitted to Nature Scientific Data

  23. arXiv:1709.04319  [pdf, ps, other

    cs.NE eess.SY

    Enhanced Particle Swarm Optimization Algorithms for Multiple-Input Multiple-Output System Modelling using Convolved Gaussian Process Models

    Authors: Gang Cao, Edmund M-K Lai, Fakhrul Alam

    Abstract: Convolved Gaussian Process (CGP) is able to capture the correlations not only between inputs and outputs but also among the outputs. This allows a superior performance of using CGP than standard Gaussian Process (GP) in the modelling of Multiple-Input Multiple-Output (MIMO) systems when observations are missing for some of outputs. Similar to standard GP, a key issue of CGP is the learning of hype… ▽ More

    Submitted 12 July, 2017; originally announced September 2017.

  24. arXiv:1707.04515  [pdf, ps, other

    eess.SY

    Gaussian Process Model Predictive Control of An Unmanned Quadrotor

    Authors: Gang Cao, Edmund M-K Lai, Fakhrul Alam

    Abstract: The Model Predictive Control (MPC) trajectory tracking problem of an unmanned quadrotor with input and output constraints is addressed. In this article, the dynamic models of the quadrotor are obtained purely from operational data in the form of probabilistic Gaussian Process (GP) models. This is different from conventional models obtained through Newtonian analysis. A hierarchical control scheme… ▽ More

    Submitted 12 July, 2017; originally announced July 2017.

    Comments: arXiv admin note: text overlap with arXiv:1612.01211

  25. arXiv:1612.01211  [pdf, ps, other

    eess.SY

    Gaussian Process Model Predictive Control of Unknown Nonlinear Systems

    Authors: Gang Cao, Edmund M-K Lai, Fakhrul Alam

    Abstract: Model Predictive Control (MPC) of an unknown system that is modelled by Gaussian Process (GP) techniques is studied in this paper. Using GP, the variances computed during the modelling and inference processes allow us to take model uncertainty into account. The main issue in using MPC to control systems modelled by GP is the propagation of such uncertainties within the control horizon. In this pap… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  26. arXiv:1608.04070  [pdf, other

    cs.IT

    A Low Complexity Spectrum Sensing Scheme for Estimating Frequency Band Edges in Multi-Standard Military Communication Receivers

    Authors: S. J. Darak, A. P. Vinod, E. M-K. Lai

    Abstract: In a typical multi-standard military communication receiver, fast and reliable spectrum sensing unit is required to extract the information of multiple channels (frequency bands) present in a wideband input signal. In this paper, an energy detector based on our reconfigurable filter bank, in [5], for detecting the edge frequencies of the channels is proposed. Simulation results are presented to sh… ▽ More

    Submitted 14 August, 2016; originally announced August 2016.

    Comments: nternational Conference on Communication, Science and Information Engineering (CCSIE) , London

  27. arXiv:1608.04069  [pdf

    cs.SD cs.IT

    Design of Variable Bandpass Filters Using First Order Allpass Transformation And Coefficient Decimation

    Authors: S. J. Darak, A. P. Vinod, E. M-K. Lai

    Abstract: In this paper, the design of a computationally efficient variable bandpass digital filter is presented. The center frequency and bandwidth of this filter can be changed online without updating the filter coefficients. The warped filters, obtained by replacing each unit delay of a digital filter with an allpass filter, are widely used for various audio processing applications. However, warped filte… ▽ More

    Submitted 14 August, 2016; originally announced August 2016.

    Comments: 18th Electronics New Zealand Conference (ENZCON)

  28. Unifying the Phase Diagrams of the Magnetic and Transport Properties of La_(2-x)Sr_xCuO_4, 0 < x < 0.05

    Authors: E. Lai, R. J. Gooding

    Abstract: An extensive experimental and theoretical effort has led to a largely complete mapping of the magnetic phase diagram of La_(2-x)Sr_xCuO_4, and a microscopic model of the spin textures produced in the x < 0.05 regime has been shown to be in agreement with this phase diagram. Here we use this same model to derive a theory of the impurity-dominated, low temperature transport. Then, we present an an… ▽ More

    Submitted 5 September, 1997; originally announced September 1997.

    Comments: 7 pages revtex with one .ps figure