Showing 1–50 of 289 results for author: Maa, A

  1. arXiv:2506.20650  [pdf, ps, other]

    cs.LG stat.ML

    Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer

    Authors: Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: The problem of learning to defer with multiple experts consists of optimally assigning input instances to experts, balancing the trade-off between their accuracy and computational cost. This is a critical challenge in natural language generation, but also in other fields such as image processing, and medical diagnostics. Recent studies have proposed surrogate loss functions to optimize deferral, b…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  2. A new window into the sub-parsec scale magnetic field in the Milky Way? Unveiling small-scale magneto-ionic structures with Faraday complexity

    Authors: Yik Ki Ma, Amit Seta, N. M. McClure-Griffiths, C. L. Van Eck, S. A. Mao, A. Ordog, J. C. Brown, T. O. Kovacs, Takuya Akahori, K. Kurahara, L. Oberhelman, C. S. Anderson

    Abstract: Radio broadband spectro-polarimetric observations are sensitive to the spatial fluctuations of the Faraday depth (FD) within the telescope beam. Such FD fluctuations are referred to as "Faraday complexity", and can unveil small-scale magneto-ionic structures in both the synchrotron-emitting and the foreground volumes. We explore the astrophysical origin of the Faraday complexity exhibited by 191 p…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 32 pages, 19 figures, MNRAS accepted

  3. arXiv:2506.08260  [pdf, ps, other]

    cs.CL cs.AI

    Automatic Generation of Inference Making Questions for Reading Comprehension Assessments

    Authors: Wanjing Anya Ma, Michael Flor, Zuowei Wang

    Abstract: Inference making is an essential but complex skill in reading comprehension (RC). Some inferences require resolving references across sentences, and some rely on using prior knowledge to fill in the detail that is not explicitly written in the text. Diagnostic RC questions can help educators provide more effective and targeted reading instruction and interventions for school-age students. We intro…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Accepted to the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025), co-located with the ACL 2025

  4. arXiv:2506.07048  [pdf, ps, other]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other

    Dimensionless Hierarchical Topological Phononic States

    Authors: Joel R. Pyfrom, Kai Sun, Jihong A. Ma

    Abstract: Topological insulators exhibit unique boundary states that are protected by the topology of the bulk bands, a phenomenon that has now been extended to classical systems such as phononics and mechanics. Typically, nontrivial topology in an $n$-dimensional bulk leads to the emergence of $(n-1)$-dimensional topologically protected boundary states. However, these states can often be gapped out by brea…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 15 pages, 9 figures

  5. arXiv:2505.20692  [pdf, ps, other]

    cs.HC cs.AI cs.CL

    Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions

    Authors: Saharsh Barve, Andy Mao, Jiayue Melissa Shi, Prerna Juneja, Koustuv Saha

    Abstract: Recent advances in generative AI have enabled visual content creation through text-to-image (T2I) generation. However, despite their creative potential, T2I models often replicate and amplify societal stereotypes -- particularly those related to gender, race, and culture -- raising important ethical concerns. This paper proposes a theory-driven bias detection rubric and a Social Stereotype Index (…

    Submitted 27 May, 2025; originally announced May 2025.

  6. arXiv:2505.08272  [pdf, ps, other]

    astro-ph.GA

    The Polarisation Sky Survey of the Universe's Magnetism (POSSUM): Science Goals and Survey Description

    Authors: B. M. Gaensler, G. H. Heald, N. M. McClure-Griffiths, C. S. Anderson, C. L. Van Eck, J. L. West, A. J. M. Thomson, J. P. Leahy, L. Rudnick, Y. K. Ma, Takuya Akahori, G. Gürkan, T. L. Landecker, S. A. Mao, S. P. O'Sullivan, W. Raja, X. Sun, T. Vernstrom, Lerato Baidoo, Ettore Carretti, A. R. Taylor, A. G. Willis, Erik Osinga, J. D. Livingston, E. L. Alexander, et al. (35 additional authors not shown)

    Abstract: The Australian SKA Pathfinder (ASKAP) offers powerful new capabilities for studying the polarised and magnetised Universe at radio wavelengths. In this paper, we introduce the Polarisation Sky Survey of the Universe's Magnetism (POSSUM), a groundbreaking survey with three primary objectives: (1) to create a comprehensive Faraday rotation measure (RM) grid of up to one million compact extragalactic…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: Accepted for publication in PASA. 32 pages, 9 figures, 1 table

  7. arXiv:2505.02361  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Learning simple heuristic rules for classifying materials based on chemical composition

    Authors: Andrew Ma, Marin Soljačić

    Abstract: In the past decade, there has been a significant interest in the use of machine learning approaches in materials science research. Conventional deep learning approaches that rely on complex, nonlinear models have become increasingly important in computational materials science due to their high predictive accuracy. In contrast to these approaches, we have shown in a recent work that a remarkably s…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 10 pages, 3 figures

  8. arXiv:2505.02222  [pdf, other]

    cs.LG stat.ML

    Practical Efficiency of Muon for Pretraining

    Authors: Essential AI: Ishaan Shah, Anthony M. Polloreno, Karl Stratos, Philip Monk, Adarsh Chaluvaraju, Andrew Hojel, Andrew Ma, Anil Thomas, Ashish Tanwer, Darsh J Shah, Khoi Nguyen, Kurt Smith, Michael Callahan, Michael Pust, Mohit Parmar, Peter Rushton, Platon Mazarakis, Ritvik Kapila, Saurabh Srivastava, Somanshu Singla, Tim Romanski, Yash Vanjani, Ashish Vaswani

    Abstract: We demonstrate that Muon, the simplest instantiation of a second-order optimizer, explicitly expands the Pareto frontier over AdamW on the compute-time tradeoff. We find that Muon is more effective than AdamW in retaining data efficiency at large batch sizes, far beyond the so-called critical batch size, while remaining computationally efficient, thus enabling more economical training. We study th…

    Submitted 19 May, 2025; v1 submitted 4 May, 2025; originally announced May 2025.

  9. arXiv:2505.00258  [pdf, ps, other]

    math.NA

    Quantile-RK and Double Quantile-RK Error Horizon Analysis

    Authors: Emeric Battaglia, Anna Ma

    Abstract: In solving linear systems of equations of the form $Ax=b$, corruptions present in $b$ affect stochastic iterative algorithms' ability to reach the true solution $x^\ast$ to the uncorrupted linear system. The randomized Kaczmarz method converges in expectation to $x^\ast$ up to an error horizon dependent on the conditioning of $A$ and the supremum norm of the corruption in $b$. To avoid this error…

    Submitted 30 April, 2025; originally announced May 2025.

    MSC Class: 65F10; 65F20

  10. arXiv:2504.13092  [pdf, other]

    cs.CV

    EventVAD: Training-Free Event-Aware Video Anomaly Detection

    Authors: Yihua Shao, Haojin He, Sijie Li, Siyu Chen, Xinwei Long, Fanhu Zeng, Yuxuan Fan, Muyang Zhang, Ziyang Yan, Ao Ma, Xiaochen Wang, Hao Tang, Yan Wang, Shuyan Li

    Abstract: Video Anomaly Detection (VAD) focuses on identifying anomalies within videos. Supervised methods require an amount of in-domain training data and often struggle to generalize to unseen anomalies. In contrast, training-free methods leverage the intrinsic world knowledge of large language models (LLMs) to detect anomalies but face challenges in localizing fine-grained visual transitions and diverse…

    Submitted 17 April, 2025; originally announced April 2025.

  11. arXiv:2504.04022  [pdf, other]

    cs.CL cs.AI

    Rethinking Reflection in Pre-Training

    Authors: Essential AI: Darsh J Shah, Peter Rushton, Somanshu Singla, Mohit Parmar, Kurt Smith, Yash Vanjani, Ashish Vaswani, Adarsh Chaluvaraju, Andrew Hojel, Andrew Ma, Anil Thomas, Anthony Polloreno, Ashish Tanwer, Burhan Drak Sibai, Divya S Mansingka, Divya Shivaprasad, Ishaan Shah, Karl Stratos, Khoi Nguyen, Michael Callahan, Michael Pust, Mrinal Iyer, Philip Monk, et al. (4 additional authors not shown)

    Abstract: A language model's ability to reflect on its own reasoning provides a key advantage for solving complex problems. While most recent research has focused on how this ability develops during reinforcement learning, we show that it actually begins to emerge much earlier - during the model's pre-training. To study this, we introduce deliberate errors into chains-of-thought and test whether the model c…

    Submitted 4 April, 2025; originally announced April 2025.

  12. arXiv:2503.22122  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV

    REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation

    Authors: Puzhen Yuan, Angyuan Ma, Yunchao Yao, Huaxiu Yao, Masayoshi Tomizuka, Mingyu Ding

    Abstract: Vision-language models (VLMs) have demonstrated remarkable capabilities in robotic planning, particularly for long-horizon tasks that require a holistic understanding of the environment for task decomposition. Existing methods typically rely on prior environmental knowledge or carefully designed task-specific prompts, making them struggle with dynamic scene changes or unexpected task conditions, e…

    Submitted 27 March, 2025; originally announced March 2025.

  13. arXiv:2503.21011  [pdf, other]

    cs.CL cs.AI

    Can Large Language Models Predict Associations Among Human Attitudes?

    Authors: Ana Ma, Derek Powell

    Abstract: Prior work has shown that large language models (LLMs) can predict human attitudes based on other attitudes, but this work has largely focused on predictions from highly similar and interrelated attitudes. In contrast, human attitudes are often strongly associated even across disparate and dissimilar topics. Using a novel dataset of human responses toward diverse attitude statements, we found that…

    Submitted 26 March, 2025; originally announced March 2025.

  14. arXiv:2503.18888  [pdf, other]

    cs.SE cs.CL cs.IR

    Toward building next-generation Geocoding systems: a systematic review

    Authors: Zhengcong Yin, Daniel W. Goldberg, Binbin Lin, Bing Zhou, Diya Li, Andong Ma, Ziqian Ming, Heng Cai, Zhe Zhang, Shaohua Wang, Shanzhen Gao, Joey Ying Lee, Xiao Li, Da Huo

    Abstract: Geocoding systems are widely used in both scientific research for spatial analysis and everyday life through location-based services. The quality of geocoded data significantly impacts subsequent processes and applications, underscoring the need for next-generation systems. In response to this demand, this review first examines the evolving requirements for geocoding inputs and outputs across vari…

    Submitted 24 March, 2025; originally announced March 2025.

  15. arXiv:2503.10701  [pdf, other]

    cs.CV cs.RO

    Video Individual Counting for Moving Drones

    Authors: Yaowu Fan, Jia Wan, Tao Han, Antoni B. Chan, Andy J. Ma

    Abstract: Video Individual Counting (VIC) has received increasing attention recently due to its importance in intelligent video surveillance. Existing works are limited in two aspects, i.e., dataset and method. Previous crowd counting datasets are captured with fixed or rarely moving cameras with relatively sparse individuals, restricting evaluation for a highly varying view and time in crowded scenes. Whi…

    Submitted 12 March, 2025; originally announced March 2025.

  16. arXiv:2503.10127  [pdf, other]

    cs.CV

    PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models

    Authors: Runze He, Bo Cheng, Yuhang Ma, Qingxiang Jia, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Liebucha Wu, Dawei Leng, Yuhui Yin

    Abstract: In this paper, we propose a unified layout planning and image generation model, PlanGen, which can pre-plan spatial layout conditions before generating images. Unlike previous diffusion-based models that treat layout planning and layout-to-image as two separate models, PlanGen jointly models the two tasks into one autoregressive transformer using only next-token prediction. PlanGen integrates layo…

    Submitted 30 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: 15 pages, 12 figures, project page: https://360cvgroup.github.io/PlanGen

  17. arXiv:2503.09242  [pdf, other]

    cs.CV

    NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers

    Authors: Yuhang Ma, Bo Cheng, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Liebucha Wu, Dawei Leng, Yuhui Yin

    Abstract: Flow-based transformer models for image generation have achieved state-of-the-art performance with larger model parameters, but their inference deployment cost remains high. To enhance inference performance while maintaining generation quality, we propose progressive rectified flow transformers. We divide the rectified flow into different stages according to resolution, using fewer transformer lay…

    Submitted 12 March, 2025; originally announced March 2025.

  18. arXiv:2503.08157  [pdf, other]

    cs.CV

    U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

    Authors: Zhanjie Zhang, Ao Ma, Ke Cao, Jing Wang, Shanyuan Liu, Yuhang Ma, Bo Cheng, Dawei Leng, Yuhui Yin

    Abstract: Ultra-high quality artistic style transfer refers to repainting an ultra-high quality content image using the style information learned from the style image. Existing artistic style transfer methods can be categorized into style reconstruction-based and content-style disentanglement-based style transfer approaches. Although these methods can generate some artistic stylized images, they still exhib…

    Submitted 11 March, 2025; originally announced March 2025.

  19. arXiv:2503.08153  [pdf, other]

    cs.CV

    WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation

    Authors: Jing Wang, Ao Ma, Ke Cao, Jun Zheng, Zhanjie Zhang, Jiasong Feng, Shanyuan Liu, Yuhang Ma, Bo Cheng, Dawei Leng, Yuhui Yin, Xiaodan Liang

    Abstract: Recent rapid advancements in text-to-video (T2V) generation, such as SoRA and Kling, have shown great potential for building world simulators. However, current T2V models struggle to grasp abstract physical principles and generate videos that adhere to physical laws. This challenge arises primarily from a lack of clear guidance on physical information due to a significant gap between abstract phys…

    Submitted 11 March, 2025; originally announced March 2025.

  20. arXiv:2503.02112  [pdf, other]

    cs.LG astro-ph.IM

    Building Machine Learning Challenges for Anomaly Detection in Science

    Authors: Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig, et al. (125 additional authors not shown)

    Abstract: Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c…

    Submitted 29 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 17 pages, 6 figures; to be submitted to Nature Communications

  21. arXiv:2502.14377  [pdf, other]

    cs.CV

    RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

    Authors: Ke Cao, Jing Wang, Ao Ma, Jiasong Feng, Zhanjie Zhang, Xuanhua He, Shanyuan Liu, Bo Cheng, Dawei Leng, Yuhui Yin, Jie Zhang

    Abstract: The Diffusion Transformer plays a pivotal role in advancing text-to-image and text-to-video generation, owing primarily to its inherent scalability. However, existing controlled diffusion transformer methods incur significant parameter and computational overheads and suffer from inefficient resource allocation due to their failure to account for the varying relevance of control information across…

    Submitted 23 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: Homepage: https://360cvgroup.github.io/RelaCtrl/ Github: https://github.com/360CVGroup/RelaCtrl

  22. arXiv:2502.10381  [pdf, ps, other]

    cs.LG stat.ML

    Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data

    Authors: Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: Class imbalance remains a major challenge in machine learning, especially in multi-class problems with long-tailed distributions. Existing methods, such as data resampling, cost-sensitive techniques, and logistic loss modifications, though popular and often effective, lack solid theoretical foundations. As an example, we demonstrate that cost-sensitive methods are not Bayes-consistent. This paper…

    Submitted 25 June, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

    Comments: ICML 2025

  23. arXiv:2502.01925  [pdf, ps, other]

    cs.CL cs.CR cs.LG

    PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

    Authors: Avery Ma, Yangchen Pan, Amir-massoud Farahmand

    Abstract: Many-shot jailbreaking circumvents the safety alignment of LLMs by exploiting their ability to process long input sequences. To achieve this, the malicious target prompt is prefixed with hundreds of fabricated conversational exchanges between the user and the model. These exchanges are randomly sampled from a pool of unsafe question-answer pairs, making it appear as though the model has already co…

    Submitted 12 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025 (Spotlight). Code: https://github.com/averyma/pandas

  24. Unraveling quantum phase estimation: exploring the impact of multi-photon interference on the quantum Fisher information

    Authors: Annameng Ma, Agustina G. Magnoni, Miguel A. Larotonda, Laura T. Knoll

    Abstract: Quantum interference is known to become extinct with distinguishing information, as illustrated by the ubiquitous double-slit experiment or the two-photon HOM effect. In the former case single particle interference is destroyed with which-path information while in the latter bunching interference tails-off as photons become distinguishable. It has been observed that when more than two particles ar…

    Submitted 25 April, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: 13 pages, 6 figures

  25. arXiv:2501.12427  [pdf, other]

    cs.LG cs.AI

    SafePowerGraph-HIL: Real-Time HIL Validation of Heterogeneous GNNs for Bridging Sim-to-Real Gap in Power Grids

    Authors: Aoxiang Ma, Salah Ghamizi, Jun Cao, Pedro Rodriguez

    Abstract: As machine learning (ML) techniques gain prominence in power system research, validating these methods' effectiveness under real-world conditions requires real-time hardware-in-the-loop (HIL) simulations. HIL simulation platforms enable the integration of computational models with physical devices, allowing rigorous testing across diverse scenarios critical to system resilience and reliability. In…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: 5 pages, 5 figures

  26. arXiv:2501.11570  [pdf, other]

    cs.SD cs.IR cs.LG eess.AS

    Uncertainty Estimation in the Real World: A Study on Music Emotion Recognition

    Authors: Karn N. Watcharasupat, Yiwei Ding, T. Aleksandra Ma, Pavan Seshadri, Alexander Lerch

    Abstract: Any data annotation for subjective tasks shows potential variations between individuals. This is particularly true for annotations of emotional responses to musical stimuli. While older approaches to music emotion recognition systems frequently addressed this uncertainty problem through probabilistic modeling, modern systems based on neural networks tend to ignore the variability and focus only on…

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: To be presented as a Findings paper at the 2025 European Conference on Information Retrieval (ECIR)

  27. arXiv:2501.07272  [pdf, other]

    quant-ph physics.optics

    A Large-Scale Reconfigurable Multiplexed Quantum Photonic Network

    Authors: Natalia Herrera Valencia, Annameng Ma, Suraj Goel, Saroch Leedumrongwatthanakun, Francesco Graffitti, Alessandro Fedrizzi, Will McCutcheon, Mehul Malik

    Abstract: Entanglement distribution in quantum networks will enable next-generation technologies for quantum-secured communications, distributed quantum computing and sensing. Future quantum networks will require dense connectivity, allowing multiple users to share entanglement in a reconfigurable and multiplexed manner, while long-distance connections are established through the teleportation of entangleme…

    Submitted 27 January, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

  28. arXiv:2501.02932  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Predicting band gap from chemical composition: A simple learned model for a material property with atypical statistics

    Authors: Andrew Ma, Owen Dugan, Marin Soljačić

    Abstract: In solid-state materials science, substantial efforts have been devoted to the calculation and modeling of the electronic band gap. While a wide range of ab initio methods and machine learning algorithms have been created that can predict this quantity, the development of new computational approaches for studying the band gap remains an active area of research. Here we introduce a simple machine l…

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 9 pages, 4 figures

  29. An efficient unsupervised classification model for galaxy morphology: Voting clustering based on coding from ConvNeXt large model

    Authors: Guanwen Fang, Yao Dai, Zesen Lin, Chichun Zhou, Jie Song, Yizhou Gu, Xiaotong Guo, Anqi Mao, Xu Kong

    Abstract: In this work, we update the unsupervised machine learning (UML) step by proposing an algorithm based on ConvNeXt large model coding to improve the efficiency of unlabeled galaxy morphology classifications. The method can be summarized into three key aspects as follows: (1) a convolutional autoencoder is used for image denoising and reconstruction and the rotational invariance of the model is impro…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: Accepted by A&A; 12 pages, 12 figures

  30. arXiv:2412.19552  [pdf, ps, other]

    physics.med-ph eess.IV

    Contrast-Optimized Basis Functions for Self-Navigated Motion Correction in Quantitative MRI

    Authors: Elisa Marchetto, Sebastian Flassbeck, Andrew Mao, Jakob Assländer

    Abstract: Purpose: The long scan times of quantitative MRI techniques make motion artifacts more likely. For MR-Fingerprinting-like approaches, this problem can be addressed with self-navigated retrospective motion correction based on reconstructions in a singular value decomposition (SVD) subspace. However, the SVD promotes high signal intensity in all tissues, which limits the contrast between tissue type…

    Submitted 17 June, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

  31. arXiv:2412.16434  [pdf, other]

    cs.DC

    SYMPHONY: Improving Memory Management for LLM Inference Workloads

    Authors: Saurabh Agarwal, Anyong Mao, Aditya Akella, Shivaram Venkataraman

    Abstract: Large Language Models (LLMs) are increasingly being deployed in applications such as chatbots, code editors, and conversational agents. A key feature of LLMs is their ability to engage in multi-turn interactions with humans or external tools, enabling a wide range of tasks. Each new request in a multi-turn interaction depends on the intermediate state, specifically the key-value (K,V) caches, from…

    Submitted 20 December, 2024; originally announced December 2024.

  32. arXiv:2412.09314  [pdf, other]

    astro-ph.GA

    A first glimpse at the MeerKAT DEEP2 field at S-band

    Authors: S. Ranchod, J. D. Wagenveld, H. -R. Klöckner, O. Wucknitz, R. P. Deane, S. S. Sridhar, E. Barr, S. Buchner, F. Camilo, A. Damas-Segovia, C. Kasemann, M. Kramer, L. S. Legodi, S. A. Mao, K. Menten, I. Rammala, M. R. Rugel, G. Wieching

    Abstract: We present the first widefield extragalactic continuum catalogue with the MeerKAT S-band (2.5 GHz), of the radio-selected DEEP2 field. The combined image over the S1 (1.96 - 2.84 GHz) and S4 (2.62 - 3.50 GHz) sub-bands has an angular resolution of 6.8''$\times$3.6'' (4.0''$\times$2.4'') at a robust weighting of $R = 0.3$ ($R=-0.5$) and a sensitivity of 4.7 (7.5) $μ$Jy beam$^{-1}$ with an on-source…

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: 16 pages, 12 figures, 7 tables, Accepted for publication in MNRAS

  33. arXiv:2412.04400  [pdf]

    physics.chem-ph

    Enhanced Sampling of Protein Conformational Changes via True Reaction Coordinates from Energy Relaxation

    Authors: Huiyu Li, Ao Ma

    Abstract: The bottleneck in enhanced sampling lies in finding collective variables (CVs) that can effectively accelerate protein conformational changes. True reaction coordinates (tRCs) that can predict the committor are considered the optimal CVs, but identifying them requires unbiased natural reactive trajectories, which, paradoxically, depend on effective enhanced sampling. Using the generalized work fun…

    Submitted 5 December, 2024; originally announced December 2024.

  34. arXiv:2410.16644  [pdf]

    cs.AI

    CKSP: Cross-species Knowledge Sharing and Preserving for Universal Animal Activity Recognition

    Authors: Axiu Mao, Meilu Zhu, Zhaojin Guo, Zheng He, Tomas Norton, Kai Liu

    Abstract: Deep learning techniques are dominating automated animal activity recognition (AAR) tasks with wearable sensors due to their high performance on large-scale labelled data. However, current deep learning-based AAR models are trained solely on datasets of individual animal species, constraining their applicability in practice and performing poorly when training data are limited. In this study, we pr…

    Submitted 21 October, 2024; originally announced October 2024.

  35. arXiv:2410.14324  [pdf, other]

    cs.CV

    HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation

    Authors: Bo Cheng, Yuhang Ma, Liebucha Wu, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Dawei Leng, Yuhui Yin

    Abstract: The task of layout-to-image generation involves synthesizing images based on the captions of objects and their spatial positions. Existing methods still struggle in complex layout generation, where common bad cases include object missing, inconsistent lighting, conflicting view angles, etc. To effectively address these issues, we propose a Hierarchical Controllable (HiCo) diffusi…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  36. arXiv:2410.13395  [pdf, ps, other]

    math.NA

    Reverse Quantile-RK and its Application to Quantile-RK

    Authors: Emeric Battaglia, Anna Ma

    Abstract: When solving linear systems $Ax=b$, $A$ and $b$ are given, but the measurements $b$ often contain corruptions. Inspired by recent work on the quantile-randomized Kaczmarz method, we propose an acceleration of the randomized Kaczmarz method using quantile information. We show that the proposed acceleration converges faster than the randomized Kaczmarz algorithm. In addition, we show that our propos…

    Submitted 17 October, 2024; originally announced October 2024.

  37. arXiv:2410.12926  [pdf, other]

    cs.CV

    DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation

    Authors: Meilu Zhu, Axiu Mao, Jun Liu, Yixuan Yuan

    Abstract: Integrating low-rank adaptation (LoRA) with federated learning (FL) has received widespread attention recently, aiming to adapt pretrained foundation models (FMs) to downstream medical tasks via privacy-preserving decentralized training. However, owing to the direct combination of LoRA and FL, current methods generally undergo two problems, i.e., aggregation deviation, and differential privacy (DP…

    Submitted 16 October, 2024; originally announced October 2024.

  38. arXiv:2410.11747  [pdf, other]

    astro-ph.GA

    Cloud properties in simulated galactic winds

    Authors: Orlando Warren, Evan E. Schneider, S. Alwin Mao, Matthew W. Abruzzo

    Abstract: In this work, we investigate the properties of a population of cool clouds in simulated galaxy outflows. Using data from the CGOLS isolated galaxy simulations, we generate catalogues of $\sim 10^5$ clouds. We describe the impact of two different supernova feedback models -- a centrally concentrated starburst and disk-wide distributed star formation -- on the resulting cloud population. In both cas…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 24 pages, 18 figures, submitted to The Astrophysical Journal

  39. arXiv:2410.02081  [pdf, other]

    cs.LG

    MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters

    Authors: Aitian Ma, Dongsheng Luo, Mo Sha

    Abstract: Recently, there has been a growing interest in Long-term Time Series Forecasting (LTSF), which involves predicting long-term future values by analyzing a large amount of historical time-series data to identify patterns and trends. There exist significant challenges in LTSF due to its complex temporal dependencies and high computational demands. Although Transformer-based models offer high forecast…

    Submitted 2 October, 2024; originally announced October 2024.

  40. arXiv:2410.02070  [pdf, other]

    cs.LG

    MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting

    Authors: Aitian Ma, Dongsheng Luo, Mo Sha

    Abstract: Long-term Time Series Forecasting (LTSF) is critical for numerous real-world applications, such as electricity consumption planning, financial forecasting, and disease propagation analysis. LTSF requires capturing long-range dependencies between inputs and outputs, which poses significant challenges due to complex temporal dynamics and high computational demands. While linear models reduce model c…

    Submitted 2 October, 2024; originally announced October 2024.

  41. arXiv:2409.17123  [pdf, other]

    math.CO

    On the Bivariate Characteristic Polynomial of the Shuffle Lattice

    Authors: Annabel Ma

    Abstract: The shuffle lattice was introduced by Greene in 1988 as an idealized model for DNA mutation, when he revealed remarkable combinatorial properties of this structure. In this paper, we prove an explicit formula for the $M$-triangle of the shuffle lattice, a bivariate refinement of the characteristic polynomial, as conjectured by McConville and Mühle in 2022, and find a relation between the $M$-trian…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 21 pages, 3 figures

  42. arXiv:2409.07730  [pdf, other]

    eess.AS cs.IR cs.LG cs.SD

    Music auto-tagging in the long tail: A few-shot approach

    Authors: T. Aleksandra Ma, Alexander Lerch

    Abstract: In the realm of digital music, using tags to efficiently organize and retrieve music from extensive databases is crucial for music catalog owners. Human tagging by experts is labor-intensive but mostly accurate, whereas automatic tagging through supervised learning has approached satisfying accuracy but is restricted to a predefined set of training tags. Few-shot learning offers a viable solution…

    Submitted 16 September, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: Published in Audio Engineering Society NY Show 2024 as a Peer Reviewed (Category 1) paper; typos corrected

    ACM Class: H.3.3

  43. arXiv:2409.04005  [pdf, other]

    cs.CV

    Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task

    Authors: Jing Wang, Ao Ma, Jiasong Feng, Dawei Leng, Yuhui Yin, Xiaodan Liang

    Abstract: The global self-attention mechanism in diffusion transformers involves redundant computation due to the sparse and redundant nature of visual information, and the attention map of tokens within a spatial window shows significant similarity. To address this redundancy, we propose the Proxy-Tokenized Diffusion Transformer (PT-DiT), which employs sparse representative token attention (where the numbe…

    Submitted 4 October, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

  44. arXiv:2408.17086  [pdf]

    physics.bio-ph physics.chem-ph

    Reaction Coordinates are Optimal Channels of Energy Flow

    Authors: Ao Ma, Huiyu Li

    Abstract: Reaction coordinates (RCs) are the few essential coordinates of a protein that control its functional processes, such as allostery, enzymatic reaction, and conformational change. They are critical for understanding protein function and provide optimal enhanced sampling of protein conformational changes and states. Since the pioneering works in the late 1990s, identifying the correct and objectivel…

    Submitted 30 August, 2024; originally announced August 2024.

  45. arXiv:2408.13547  [pdf, other]

    math.NA math.OC math.ST

    Frontal Slice Approaches for Tensor Linear Systems

    Authors: Hengrui Luo, Anna Ma

    Abstract: Inspired by the row and column action methods for solving large-scale linear systems, in this work, we explore the use of frontal slices for solving tensor linear systems. In particular, this paper presents a novel approach for using frontal slices of a tensor $\mathcal{A}$ to solve tensor linear systems $\mathcal{A} * \mathcal{X} = \mathcal{B}$ where $*$ denotes the t-product. In addition, we con…

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 41 pages, 10 figures

    MSC Class: 15A69; 15A72; 65F10

  46. arXiv:2408.08189  [pdf, other]

    cs.CV

    FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

    Authors: Jiasong Feng, Ao Ma, Jing Wang, Bo Cheng, Xiaodan Liang, Dawei Leng, Yuhui Yin

    Abstract: Synthesizing motion-rich and temporally consistent videos remains a challenge in artificial intelligence, especially when dealing with extended durations. Existing text-to-video (T2V) models commonly employ spatial cross-attention for text control, equivalently guiding different frame generations without frame-specific textual guidance. Thus, the model's capacity to comprehend the temporal logic c…

    Submitted 16 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  47. arXiv:2408.08105  [pdf, other]

    cs.CV cs.AI

    Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Discern Causal Links Across Modalities

    Authors: Zhiyuan Li, Heng Wang, Dongnan Liu, Chaoyi Zhang, Ao Ma, Jieting Long, Weidong Cai

    Abstract: Multimodal Large Language Models (MLLMs) have showcased exceptional Chain-of-Thought (CoT) reasoning ability in complex textual inference tasks including causal reasoning. However, will these causalities remain straightforward when crucial hints hide in visual details? If not, what factors might influence cross-modal generalization? Whether we can effectively enhance their capacity for robust caus…

    Submitted 25 May, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: ACL2025 Findings

  48. arXiv:2407.18496  [pdf, other]

    cs.CL cs.LG

    Towards More Accurate Prediction of Human Empathy and Emotion in Text and Multi-turn Conversations by Combining Advanced NLP, Transformers-based Networks, and Linguistic Methodologies

    Authors: Manisha Singh, Divy Sharma, Alonso Ma, Nora Goldfine

    Abstract: Based on the WASSA 2022 Shared Task on Empathy Detection and Emotion Classification, we predict the level of empathic concern and personal distress displayed in essays. For the first stage of this project we implemented a Feed-Forward Neural Network using sentence-level embeddings as features. We experimented with four different embedding models for generating the inputs to the neural network. The…

    Submitted 26 July, 2024; originally announced July 2024.

  49. arXiv:2407.18471  [pdf, other]

    cs.CL cs.IR cs.LG

    Constructing the CORD-19 Vaccine Dataset

    Authors: Manisha Singh, Divy Sharma, Alonso Ma, Bridget Tyree, Margaret Mitchell

    Abstract: We introduce new dataset 'CORD-19-Vaccination' to cater to scientists specifically looking into COVID-19 vaccine-related research. This dataset is extracted from CORD-19 dataset [Wang et al., 2020] and augmented with new columns for language detail, author demography, keywords, and topic per paper. Facebook's fastText model is used to identify languages [Joulin et al., 2016]. To establish author d…

    Submitted 25 July, 2024; originally announced July 2024.

  50. The dispersion measure and rotation measure from fast radio burst host galaxies based on the IllustrisTNG50 simulation

    Authors: Timea Orsolya Kovacs, Sui Ann Mao, Aritra Basu, Yik Ki Ma, Laura G. Spitler, Charles R. H. Walker

    Abstract: Fast radio bursts (FRB) will become important cosmological tools, as the number of observed FRBs is increasing rapidly with more surveys being carried out. A large sample of FRBs with dispersion measures (DM) and rotation measures (RM) can be used to study the intergalactic magnetic field. However, the observed DM and RM of FRBs have multiple contributors which must be quantified to obtain the int…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 24 pages, 15 figures. Accepted for publication in A&A

    Journal ref: A&A 690, A47 (2024)