Skip to main content

Showing 1–50 of 68 results for author: Sun, E

.
  1. arXiv:2505.18882  [pdf, ps, other

    cs.CY

    Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach

    Authors: Yuchen Wu, Edward Sun, Kaijie Zhu, Jianxun Lian, Jose Hernandez-Orallo, Aylin Caliskan, Jindong Wang

    Abstract: Large language models (LLMs) typically generate identical or similar responses for all users given the same prompt, posing serious safety risks in high-stakes applications where user vulnerabilities differ widely. Existing safety evaluations primarily rely on context-independent metrics - such as factuality, bias, or toxicity - overlooking the fact that the same response may carry divergent risks… ▽ More

    Submitted 29 May, 2025; v1 submitted 24 May, 2025; originally announced May 2025.

  2. arXiv:2505.10151  [pdf, other

    cs.RO

    Training People to Reward Robots

    Authors: Endong Sun, Yuqing Zhu, Matthew Howard

    Abstract: Learning from demonstration (LfD) is a technique that allows expert teachers to teach task-oriented skills to robotic systems. However, the most effective way of guiding novice teachers to approach expert-level demonstrations quantitatively for specific teaching tasks remains an open question. To this end, this paper investigates the use of machine teaching (MT) to guide novice teachers to improve… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 6 pages

  3. Enhancing Product Search Interfaces with Sketch-Guided Diffusion and Language Agents

    Authors: Edward Sun

    Abstract: The rapid progress in diffusion models, transformers, and language agents has unlocked new possibilities, yet their potential in user interfaces and commercial applications remains underexplored. We present Sketch-Search Agent, a novel framework that transforms the image search experience by integrating a multimodal language agent with freehand sketches as control signals for diffusion models. Usi… ▽ More

    Submitted 21 March, 2025; originally announced April 2025.

    Comments: Companion Proceedings of the ACM Web Conference 2025

  4. arXiv:2504.00901  [pdf, other

    cs.CV

    A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and Opportunities

    Authors: Enzhe Sun, Yongchuan Cui, Peng Liu, Jining Yan

    Abstract: Hardware limitations and satellite launch costs make direct acquisition of high temporal-spatial resolution remote sensing imagery challenging. Remote sensing spatiotemporal fusion (STF) technology addresses this problem by merging high temporal but low spatial resolution imagery with high spatial but low temporal resolution imagery to efficiently generate high spatiotemporal resolution satellite… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  5. arXiv:2503.19456  [pdf, ps, other

    cs.DS

    Online Stochastic Matching with Unknown Arrival Order: Beating $0.5$ against the Online Optimum

    Authors: Enze Sun, Zhihao Gavin Tang, Yifan Wang

    Abstract: We study the online stochastic matching problem. Against the offline benchmark, Feldman, Gravin, and Lucier (SODA 2015) designed an optimal $0.5$-competitive algorithm. A recent line of work, initiated by Papadimitriou, Pollner, Saberi, and Wajc (MOR 2024), focuses on designing approximation algorithms against the online optimum. The online benchmark allows positive results surpassing the $0.5$ ra… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: To appear in the 57th Annual ACM Symposium on Theory of Computing (STOC 2025)

  6. arXiv:2503.18684  [pdf, other

    cs.RO cs.AI

    Efficient Continual Adaptation of Pretrained Robotic Policy with Online Meta-Learned Adapters

    Authors: Ruiqi Zhu, Endong Sun, Guanhe Huang, Oya Celiktutan

    Abstract: Continual adaptation is essential for general autonomous agents. For example, a household robot pretrained with a repertoire of skills must still adapt to unseen tasks specific to each household. Motivated by this, building upon parameter-efficient fine-tuning in language models, prior works have explored lightweight adapters to adapt pretrained policies, which can preserve learned features from t… ▽ More

    Submitted 27 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Project link: https://ricky-zhu.github.io/OMLA/

  7. arXiv:2412.20138  [pdf, ps, other

    q-fin.TR cs.AI cs.CE cs.LG

    TradingAgents: Multi-Agents LLM Financial Trading Framework

    Authors: Yijia Xiao, Edward Sun, Di Luo, Wei Wang

    Abstract: Significant progress has been made in automated problem-solving using societies of agents powered by large language models (LLMs). In finance, efforts have largely focused on single-agent systems handling specific tasks or multi-agent frameworks independently gathering data. However, the multi-agent systems' potential to replicate real-world trading firms' collaborative dynamics remains underexplo… ▽ More

    Submitted 3 June, 2025; v1 submitted 28 December, 2024; originally announced December 2024.

    Comments: Tauric Research @ https://github.com/TauricResearch; Oral @ Multi-Agent AI in the Real World

  8. arXiv:2412.07386  [pdf, other

    cs.CL

    Algorithmic Phase Transitions in Language Models: A Mechanistic Case Study of Arithmetic

    Authors: Alan Sun, Ethan Sun, Warren Shepard

    Abstract: Zero-shot capabilities of large language models make them powerful tools for solving a range of tasks without explicit training. It remains unclear, however, how these models achieve such performance, or why they can zero-shot some tasks but not others. In this paper, we shed some light on this phenomenon by defining and investigating algorithmic stability in language models -- changes in problem-… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 10 pages, 5 figures

  9. arXiv:2411.08900  [pdf, other

    q-bio.GN cs.AI cs.CE cs.LG q-bio.BM

    RNA-GPT: Multimodal Generative System for RNA Sequence Understanding

    Authors: Yijia Xiao, Edward Sun, Yiqiao Jin, Wei Wang

    Abstract: RNAs are essential molecules that carry genetic information vital for life, with profound implications for drug development and biotechnology. Despite this importance, RNA research is often hindered by the vast literature available on the topic. To streamline this process, we introduce RNA-GPT, a multi-modal RNA chat model designed to simplify RNA discovery by leveraging extensive RNA literature.… ▽ More

    Submitted 29 October, 2024; originally announced November 2024.

    Comments: Machine Learning for Structural Biology Workshop, NeurIPS 2024

  10. arXiv:2411.08349  [pdf

    physics.app-ph

    Flexible Thermoelectric Active Cooling Garment to Combat Extreme Heat

    Authors: Tianshi Feng, Jiedong Wang, Ethan Sun, Antonio Di Buono, Renkun Chen

    Abstract: With the increasing frequency, intensity, and duration of extreme heat events due to climate change, heat-related diseases or even mortality have become more prevalent. An efficient personal cooling strategy can mitigate heat stress by regulating the skin temperature within the thermal comfort zone. However, lightweight, wearable, and sustainable cooling garments are unavailable today. Here, we de… ▽ More

    Submitted 1 December, 2024; v1 submitted 13 November, 2024; originally announced November 2024.

  11. arXiv:2410.21790  [pdf, other

    stat.AP stat.ME

    Reconstructing East Asian Temperatures from 1368 to 1911 Using Historical Documents, Climate Models, and Data Assimilation

    Authors: Eric Sun, Kuan-hui Elaine Lin, Wan-Ling Tseng, Pao K. Wang, Hsin-Cheng Huang

    Abstract: We propose a novel approach for reconstructing annual temperatures in East Asia from 1368 to 1911, leveraging the Reconstructed East Asian Climate Historical Encoded Series (REACHES). The lack of instrumental data during this period poses significant challenges to understanding past climate conditions. REACHES digitizes historical documents from the Ming and Qing dynasties of China, converting qua… ▽ More

    Submitted 18 January, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: 28 pages, 16 figures, 1 table

    MSC Class: 62P12

  12. arXiv:2410.10238  [pdf, other

    cs.CV cs.AI

    ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization

    Authors: Jiawei Liu, Fanrui Zhang, Jiaying Zhu, Esther Sun, Qiang Zhang, Zheng-Jun Zha

    Abstract: Multimodal Large Language Models (MLLMs), such as GPT4o, have shown strong capabilities in visual reasoning and explanation generation. However, despite these strengths, they face significant challenges in the increasingly critical task of Image Forgery Detection and Localization (IFDL). Moreover, existing IFDL methods are typically limited to the learning of low-level semantic-agnostic clues and… ▽ More

    Submitted 6 January, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: 16 pages, 14 figures

  13. Beyond CCDs: Characterization of sCMOS detectors for optical astronomy

    Authors: Aditya Khandelwal, Sarik Jeram, Ryan Dungee, Albert W. K. Lau, Allison Lau, Ethen Sun, Phil Van-Lane, Shaojie Chen, Aaron Tohuvavohu, Ting S. Li

    Abstract: Modern scientific complementary metal-oxide semiconductor (sCMOS) detectors provide a highly competitive alternative to charge-coupled devices (CCDs), the latter of which have historically been dominant in optical imaging. sCMOS boast comparable performances to CCDs with faster frame rates, lower read noise, and a higher dynamic range. Furthermore, their lower production costs are shifting the ind… ▽ More

    Submitted 6 December, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: SPIE Astronomical Telescopes + Instrumentation, Proceedings Volume 13103, X-Ray, Optical, and Infrared Detectors for Astronomy XI; 131030R (2024)

  14. arXiv:2409.15563  [pdf, other

    cs.RO

    Using Machine Teaching to Boost Novices' Robot Teaching Skill

    Authors: Yuqing Zhu, Endong Sun, Matthew Howard

    Abstract: Recent evidence has shown that, contrary to expectations, it is difficult for users, especially novices, to teach robots tasks through LfD. This paper introduces a framework that leverages MT algorithms to train novices to become better teachers of robots, and verifies whether such teaching ability is retained beyond the period of training and generalises such that novices teach robots more effect… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  15. arXiv:2409.13913  [pdf, other

    cs.CL cs.SD eess.AS

    Target word activity detector: An approach to obtain ASR word boundaries without lexicon

    Authors: Sunit Sivasankaran, Eric Sun, Jinyu Li, Yan Huang, Jing Pan

    Abstract: Obtaining word timestamp information from end-to-end (E2E) ASR models remains challenging due to the lack of explicit time alignment during training. This issue is further complicated in multilingual models. Existing methods, either rely on lexicons or introduce additional tokens, leading to scalability issues and increased computational costs. In this work, we propose a new approach to estimate w… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: Submitted to ICASSP 2025

  16. arXiv:2408.12524  [pdf, ps, other

    cs.DS cs.GT

    Stochastic Online Correlated Selection

    Authors: Ziyun Chen, Zhiyi Huang, Enze Sun

    Abstract: We study Stochastic Online Correlated Selection (SOCS), a family of online rounding algorithms for Non-IID Stochastic Online Submodular Welfare Maximization and special cases such as Online Stochastic Matching, Stochastic AdWords, and Stochastic Display Ads. At each step, the algorithm sees an online item's type and fractional allocation, then immediately allocates it to an agent. We propose a met… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  17. arXiv:2408.11363  [pdf, other

    cs.AI cs.CE cs.LG q-bio.BM

    ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding

    Authors: Yijia Xiao, Edward Sun, Yiqiao Jin, Qifan Wang, Wei Wang

    Abstract: Understanding biological processes, drug development, and biotechnological advancements requires a detailed analysis of protein structures and functions, a task that is inherently complex and time-consuming in traditional protein research. To streamline this process, we introduce ProteinGPT, a state-of-the-art multimodal large language model for proteins that enables users to upload protein sequen… ▽ More

    Submitted 17 April, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: Spotlight, Machine Learning for Genomics Explorations @ ICLR 2025

  18. arXiv:2407.14212  [pdf, other

    cs.SD cs.CL eess.AS

    Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2

    Authors: Chun Xu, En-Wei Sun

    Abstract: An increasing number of Chinese people are troubled by different degrees of visual impairment, which has made the modal conversion between a single image or video frame in the visual field and the audio expressing the same information a research hotspot. Deep learning technologies such as OCR+Vocoder and Im2Wav enable English audio synthesis or image-to-sound matching in a self-supervised manner.… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  19. arXiv:2407.04973  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts

    Authors: Yijia Xiao, Edward Sun, Tianyu Liu, Wei Wang

    Abstract: We propose LogicVista, an evaluation benchmark that assesses the integrated logical reasoning capabilities of multimodal large language models (MLLMs) in Visual contexts. Recent advancements in MLLMs have demonstrated various fascinating abilities, from crafting poetry based on an image to performing mathematical reasoning. However, there is still a lack of systematic evaluation of MLLMs' proficie… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: LogicVista benchmarks the logical reasoning of multimodal large language models in visual tasks

  20. arXiv:2405.02243  [pdf, other

    cs.RO

    Towards Improving Learning from Demonstration Algorithms via MCMC Methods

    Authors: Carl Qi, Edward Sun, Harry Zhang

    Abstract: Behavioral cloning, or more broadly, learning from demonstrations (LfD) is a priomising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own drawbacks, limiting its efficacy in real robot setups. In this work, we take one step towards improving learning from demonstration algorithms by leveraging impl… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2207.04638, arXiv:2204.03597 by other authors

  21. arXiv:2312.17673  [pdf, other

    cs.CR cs.AI cs.CL

    Jatmo: Prompt Injection Defense by Task-Specific Finetuning

    Authors: Julien Piet, Maha Alrashed, Chawin Sitawarin, Sizhe Chen, Zeming Wei, Elizabeth Sun, Basel Alomair, David Wagner

    Abstract: Large Language Models (LLMs) are attracting significant research attention due to their instruction-following abilities, allowing users and developers to leverage LLMs for a variety of tasks. However, LLMs are vulnerable to prompt-injection attacks: a class of attacks that hijack the model's instruction-following abilities, changing responses to prompts to undesired, possibly malicious ones. In th… ▽ More

    Submitted 8 January, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 6 figures

  22. arXiv:2311.05623  [pdf, other

    astro-ph.IM

    The 4m International Liquid Mirror Telescope: a brief history and some preliminary scientific results

    Authors: Jean Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun

    Abstract: The present article is based upon an invited talk delivered at the occasion of the inauguration of the 4m International Liquid Mirror Telescope (ILMT) which took place in Devasthal (ARIES, Uttarakhand, India) on 21st of March 2023. We present hereafter a short history of the liquid mirror telescopes and in particular of the 4m ILMT which is the first liquid mirror telescope entirely dedicated to a… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 14 pages, 21 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  23. arXiv:2311.05622  [pdf, other

    astro-ph.IM astro-ph.GA

    SunPhot: Preparations for an upcoming quasar variability survey with the International Liquid Mirror Telescope

    Authors: Ethen Sun, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Jean Surdej

    Abstract: Recent research suggests a correlation between the variability and intrinsic brightness of quasars. If calibrated, this could lead to the use of quasars on the cosmic distance ladder, but this work is currently limited by lack of quasar light curve data with high cadence and precision. The Python photometric data pipeline SunPhot is being developed as part of preparations for an upcoming quasar va… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  24. arXiv:2311.05621  [pdf, other

    astro-ph.GA astro-ph.IM

    Surface Brightness Properties of LSB Galaxies with the International Liquid Mirror Telescope

    Authors: Jiuyang Fu, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Low surface brightness (LSB) galaxies make up a significant fraction of the luminosity density of the local universe. Their low surface brightness suggests a different formation and evolution process compared to more-typical high-surface-brightness galaxies. This study presents an analysis of LSB galaxies found in images obtained by the International Liquid Mirror Telescope during the observation… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  25. arXiv:2311.05620  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR

    Survey of Variables with the ILMT

    Authors: Baldeep Grewal, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Paul Hickson, Kuntal Misra, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Nestled in the mountains of Northern India, is a 4-metre rotating dish of liquid mercury. Over a 10-year period, the International Liquid Mirror Telescope (ILMT) will survey 117 square degrees of sky, to study the astrometric and photometric variability of all detected objects. One of the scientific programs will be a survey of variable stars. The data gathered will be used to construct a comprehe… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  26. arXiv:2311.05619  [pdf, other

    astro-ph.IM astro-ph.CO

    Observation of mulitply imaged quasars with the 4-m ILMT

    Authors: Talat Akhunov, Bhavya Ailawadhi, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Gravitationally lensed quasars (GLQs) are known to potentially provide an independent way of determining the value of the Hubble-Lemaître parameter $H_{0}$, to probe the dark matter content of lensing galaxies and to resolve tiny structures in distant active galactic nuclei. That is why multiply imaged quasars are one of the main drivers for a photometric monitoring with the 4-m International Liqu… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  27. arXiv:2311.05618  [pdf, other

    astro-ph.IM astro-ph.GA

    Follow-up strategy of ILMT discovered supernovae

    Authors: Brajesh Kumar, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) facility continuously scans the same sky strip ($\sim$22$^\prime$ wide) on each night with a fixed pointing towards the zenith direction. It is possible to detect hundreds of supernovae (SNe) each year by implementing an optimal image subtraction technique on consecutive night images. Prompt monitoring of ILMT-detected SNe is planned under the se… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  28. arXiv:2311.05617  [pdf, other

    astro-ph.IM

    Astrometric and photometric calibrators for the 4-m International Liquid Mirror Telescope

    Authors: Naveen Dukiya, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) is a 4-meter class survey telescope. It achieved its first light on 29$^{\rm th}$ April 2022 and is now undergoing the commissioning phase. It scans the sky in a fixed \ang{;22;} wide strip centred at the declination of $+$\ang{29;21;41.4} and works in \emph{Time Delay Integration (TDI)} mode. We present a full catalog of sources in the ILMT strip d… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  29. arXiv:2311.05616  [pdf, other

    astro-ph.IM

    A year-long representation of the ILMT observations in different coordinate systems

    Authors: Monalisa Dubey, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Kuntal Misra, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) is the first optical survey telescope in India that performs zenithal observations of a 22$'$ wide strip of the sky. To determine the portion of the sky covered by the ILMT during the entire year, we represent the ILMT Field of View (FoV) in three different coordinate systems - galactic, ecliptic, and equatorial. We adopt a constant declination o… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  30. arXiv:2311.05615  [pdf, other

    astro-ph.IM

    The 4m International Liquid Mirror Telescope project

    Authors: Jean Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun

    Abstract: The International Liquid Mirror Telescope (ILMT) project is a scientific collaboration in observational astrophysics between the Li{è}ge Institute of Astrophysics and Geophysics (Li{è}ge University, Belgium), the Aryabatta Research Institute of observational sciencES (ARIES, Nainital, India) and several Canadian universities (British Columbia, Laval, Montr{é}al, Toronto, Victoria and York). Meanwh… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  31. arXiv:2311.05614  [pdf, other

    astro-ph.IM

    Serendipitous Detection of Orbital Debris by the International Liquid Mirror Telescope: First Results

    Authors: Paul Hickson, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Orbital debris presents a growing risk to space operations, and is becoming a significant source of contamination of astronomical images. Much of the debris population is uncatalogued, making the impact more difficult to assess. We present initial results from the first ten nights of commissioning observations with the International Liquid Mirror Telescope, in which images were examined for streak… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  32. arXiv:2311.04718  [pdf, other

    astro-ph.IM astro-ph.EP

    Detection and Identification of Asteroids with the 4-m ILMT

    Authors: Anna Pospieszalska-Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: A very unique strength of the Devasthal Observatory is its capability of detecting optical transients with the 4-m International Liquid Mirror Telescope (ILMT) and to rapidly follow them up using the 1.3-m Devasthal Fast Optical Telescope (DFOT) and/or the 3.6-m Devasthal Optical Telescope (DOT), installed right next to it. In this context, we have inspected 20 fields observed during 9 consecutive… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  33. arXiv:2311.04717  [pdf, other

    astro-ph.IM

    Accessibility of the ILMT survey data

    Authors: Kuntal Misra, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) continuously scans a 22$'$ wide strip of the zenithal sky and records the images in three broadband filters (g', r' and i') using a 4K$\times$4K CCD camera. In about 10--12 hours of observations during a single night, $\sim$15 GB of data volume is generated. The raw images resulting from the observations in October--November 2022 have been pre-pr… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  34. arXiv:2311.04716  [pdf, other

    astro-ph.IM

    Automated transient detection in the context of the 4m ILMT

    Authors: Kumar Pranshu, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Ethen Sun, Jean Surdej

    Abstract: In the era of sky surveys like Palomar Transient Factory (PTF), Zwicky Transient Facility (ZTF) and the upcoming Vera Rubin Observatory (VRO) and ILMT, a plethora of image data will be available. ZTF scans the sky with a field of view of 48 deg$^{2}$ and VRO will have a FoV of 9.6 deg$^{2}$ but with a much larger aperture. The 4m ILMT covers a 22$'$ wide strip of the sky. Being a zenith telescope,… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 9 pages, 3 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  35. arXiv:2311.04713  [pdf, other

    astro-ph.IM

    An automated photometric pipeline for the ILMT data

    Authors: Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) is a 4-meter survey telescope continuously observing towards the zenith in the SDSS g', r', and i' bands. This survey telescope is designed to detect various astrophysical transients (for example, supernovae) and very faint objects like multiply-imaged quasars and low surface brightness galaxies. A single scan of a 22$'$ strip of sky contains a larg… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  36. arXiv:2311.04712  [pdf, other

    astro-ph.IM

    Necessity of a TDI optical corrector for ILMT observations

    Authors: Vibhore Negi, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) has recently become operational at the Devasthal Observatory of ARIES, Nainital, India. The ILMT observes in the Time delay integration (TDI) mode where the images are formed by electronically stepping the charges over the pixels of the CCD, along a column. Observations near the zenith impose certain constraints dependent on the latitude such as ima… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  37. arXiv:2308.06533  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Knowledge Distilled Ensemble Model for sEMG-based Silent Speech Interface

    Authors: Wenqiang Lai, Qihan Yang, Ye Mao, Endong Sun, Jiangnan Ye

    Abstract: Voice disorders affect millions of people worldwide. Surface electromyography-based Silent Speech Interfaces (sEMG-based SSIs) have been explored as a potential solution for decades. However, previous works were limited by small vocabularies and manually extracted features from raw data. To address these limitations, we propose a lightweight deep learning knowledge-distilled ensemble model for sEM… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures

  38. arXiv:2308.01839  [pdf, other

    q-bio.QM cs.CV q-bio.GN stat.AP stat.ML

    Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data

    Authors: Rong Ma, Eric D. Sun, David Donoho, James Zou

    Abstract: Single-cell data integration can provide a comprehensive molecular view of cells, and many algorithms have been developed to remove unwanted technical or biological variations and integrate heterogeneous single-cell datasets. Despite their wide usage, existing methods suffer from several fundamental limitations. In particular, we lack a rigorous statistical test for whether two high-dimensional si… ▽ More

    Submitted 29 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the National Academy of Sciences, 2024, 121(10) e2313719121

  39. arXiv:2307.16332  [pdf

    eess.AS

    Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

    Authors: Eric Sun, Jinyu Li, Jian Xue, Yifan Gong

    Abstract: In end-to-end automatic speech recognition system, one of the difficulties for language expansion is the limited paired speech and text training data. In this paper, we propose a novel method to generate augmented samples with unpaired speech feature segments and text data for model pre-training, which has the advantage of low cost without using additional speech data. When mixing 20,000 hours aug… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  40. arXiv:2307.09377  [pdf, other

    cs.LG

    Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading

    Authors: Vikram Duvvur, Aashay Mehta, Edward Sun, Bo Wu, Ken Yew Chan, Jeff Schneider

    Abstract: The use of machine learning in algorithmic trading systems is increasingly common. In a typical set-up, supervised learning is used to predict the future prices of assets, and those predictions drive a simple trading and execution strategy. This is quite effective when the predictions have sufficient signal, markets are liquid, and transaction costs are low. However, those conditions often do not… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  41. arXiv:2306.17241  [pdf, other

    cs.DS

    Improved Algorithms for Online Rent Minimization Problem Under Unit-Size Jobs

    Authors: Enze Sun, Zonghan Yang, Yuhao Zhang

    Abstract: We consider the Online Rent Minimization problem, where online jobs with release times, deadlines, and processing times must be scheduled on machines that can be rented for a fixed length period of $T$. The objective is to minimize the number of machine rents. This problem generalizes the Online Machine Minimization problem where machines can be rented for an infinite period, and both problems hav… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: To appear in the 31st Annual European Symposium on Algorithms (ESA 2023)

  42. arXiv:2306.16656  [pdf

    physics.med-ph

    Motion robust MR fingerprinting scan to image neonates with prenatal opioid exposure

    Authors: Dan Ma, Chaitra Badve, Jessie EP Sun, Siyuan Hu, Xiaofeng Wang, Yong Chen, Ameya Nayate, Michael Wien, Douglas Martin, Lynn T Singer, Jared C. Durieux, Chris Flask, Deanne Wilson Costello

    Abstract: Background: A noninvasive and sensitive imaging tool is needed to assess the fast-evolving baby brain. However, using MRI to study non-sedated babies faces roadblocks, including high scan failure rates due to subjects motion and the lack of quantitative measures for assessing potential developmental delays. This feasibility study explores whether MR Fingerprinting scans can provide motion-robust a… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  43. arXiv:2306.09360  [pdf, other

    nucl-ex hep-ex hep-ph nucl-th

    Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab

    Authors: A. Accardi, P. Achenbach, D. Adhikari, A. Afanasev, C. S. Akondi, N. Akopov, M. Albaladejo, H. Albataineh, M. Albrecht, B. Almeida-Zamora, M. Amaryan, D. Androić, W. Armstrong, D. S. Armstrong, M. Arratia, J. Arrington, A. Asaturyan, A. Austregesilo, H. Avagyan, T. Averett, C. Ayerbe Gayoso, A. Bacchetta, A. B. Balantekin, N. Baltzell, L. Barion , et al. (419 additional authors not shown)

    Abstract: This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron… ▽ More

    Submitted 24 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Updates to the list of authors; Preprint number changed from theory to experiment; Updates to sections 4 and 6, including additional figures

    Report number: JLAB-PHY-23-3840

  44. arXiv:2303.00786  [pdf

    cs.CL eess.AS

    Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

    Authors: Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

    Abstract: We propose gated language experts and curriculum training to enhance multilingual transformer transducer models without requiring language identification (LID) input from users during inference. Our method incorporates a gating mechanism and LID loss, enabling transformer experts to learn language-specific information. By combining gated transformer experts with shared transformer layers, we const… ▽ More

    Submitted 7 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  45. arXiv:2211.02809  [pdf, other

    cs.CL cs.SD eess.AS

    LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

    Authors: Peidong Wang, Eric Sun, Jian Xue, Yu Wu, Long Zhou, Yashesh Gaur, Shujie Liu, Jinyu Li

    Abstract: Automatic speech recognition (ASR) and speech translation (ST) can both use neural transducers as the model structure. It is thus possible to use a single transducer model to perform both tasks. In real-world applications, such joint ASR and ST models may need to be streaming and do not require source language identification (i.e. language-agnostic). In this paper, we propose LAMASSU, a streaming… ▽ More

    Submitted 19 October, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: INTERSPEECH 2023

  46. arXiv:2211.02499  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

    Authors: Jian Xue, Peidong Wang, Jinyu Li, Eric Sun

    Abstract: In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which can transcribe or translate multiple spoken languages into texts of the target language. The backbone of SM2 is Transformer Transducer, which has high streaming capability. Instead of human labeled speech translation (ST) data, SM2 models are trained using weakly supervised data generated by convert… ▽ More

    Submitted 5 July, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

  47. arXiv:2210.13711  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    A Spectral Method for Assessing and Combining Multiple Data Visualizations

    Authors: Rong Ma, Eric D. Sun, James Zou

    Abstract: Dimension reduction and data visualization aim to project a high-dimensional dataset to a low-dimensional space while capturing the intrinsic structures in the data. It is an indispensable part of modern data science, and many dimensional reduction and visualization algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it critically important… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Under revision of Nature Communications

  48. arXiv:2210.06507  [pdf, ps, other

    cs.GT

    Better Approximation for Interdependent SOS Valuations

    Authors: Pinyan Lu, Enze Sun, Chenghan Zhou

    Abstract: Submodular over signal (SOS) defines a family of interesting functions for which there exist truthful mechanisms with constant approximation to the social welfare for agents with interdependent valuations. The best-known truthful auction is of $4$-approximation and a lower bound of 2 was proved. We propose a new and simple truthful mechanism to achieve an approximation ratio of 3.315.

    Submitted 12 October, 2022; originally announced October 2022.

  49. arXiv:2204.01418  [pdf, ps, other

    cs.DS

    Online Ordinal Problems: Optimality of Comparison-based Algorithms and their Cardinal Complexity

    Authors: Nick Gravin, Enze Sun, Zhihao Gavin Tang

    Abstract: We consider ordinal online problems, i.e., tasks that only require pairwise comparisons between elements of the input. A classic example is the secretary problem and the game of googol, as well as its multiple combinatorial extensions such as $(J,K)$-secretary, $2$-sided game of googol, ordinal-competitive matroid secretary. A natural approach to these tasks is to use ordinal algorithms that at ea… ▽ More

    Submitted 11 October, 2023; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: To appear at FOCS 2023. Abstract shortened to meet arXiv requirements

  50. arXiv:2112.05820  [pdf, other

    cs.CL cs.AI cs.LG eess.AS

    Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

    Authors: Kenichi Kumatani, Robert Gmyr, Felipe Cruz Salinas, Linquan Liu, Wei Zuo, Devang Patel, Eric Sun, Yu Shi

    Abstract: The sparsely-gated Mixture of Experts (MoE) can magnify a network capacity with a little computational complexity. In this work, we investigate how multi-lingual Automatic Speech Recognition (ASR) networks can be scaled up with a simple routing algorithm in order to achieve better accuracy. More specifically, we apply the sparsely-gated MoE technique to two types of networks: Sequence-to-Sequence… ▽ More

    Submitted 4 January, 2022; v1 submitted 10 December, 2021; originally announced December 2021.