Skip to main content

Showing 201–250 of 1,977 results for author: Lee, T

.
  1. arXiv:2405.10725  [pdf, other

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Nishan Pantha, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Irina Gerasimov, Armin Mehrabian, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi , et al. (11 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this insight, we developed INDUS, a comprehensive suite of LLMs tailored for the closely-related domains of Earth science, biology, phys… ▽ More

    Submitted 30 October, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: EMNLP 2024 (Industry Track)

  2. arXiv:2405.09879  [pdf, other

    cs.CV cs.AI

    Generative Unlearning for Any Identity

    Authors: Juwon Seo, Sung-Hoon Lee, Tae-Young Lee, Seungjun Moon, Gyeong-Moon Park

    Abstract: Recent advances in generative models trained on large-scale datasets have made it possible to synthesize high-quality samples across various domains. Moreover, the emergence of strong inversion networks enables not only a reconstruction of real-world images but also the modification of attributes through various editing methods. However, in certain domains related to privacy issues, e.g., human fa… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 15 pages, 17 figures, 10 tables, CVPR 2024 Poster

  3. arXiv:2405.06424  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation

    Authors: JoonHo Lee, Jae Oh Woo, Juree Seok, Parisa Hassanzadeh, Wooseok Jang, JuYoun Son, Sima Didari, Baruch Gutow, Heng Hao, Hankyu Moon, Wenjun Hu, Yeong-Dae Kwon, Taehee Lee, Seungjai Min

    Abstract: Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for t… ▽ More

    Submitted 31 January, 2025; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  4. arXiv:2405.05550  [pdf, other

    astro-ph.IM astro-ph.CO

    The Simons Observatory: Design, integration, and testing of the small aperture telescopes

    Authors: Nicholas Galitzki, Tran Tsan, Jake Spisak, Michael Randall, Max Silva-Feaver, Joseph Seibert, Jacob Lashner, Shunsuke Adachi, Sean M. Adkins, Thomas Alford, Kam Arnold, Peter C. Ashton, Jason E. Austermann, Carlo Baccigalupi, Andrew Bazarko, James A. Beall, Sanah Bhimani, Bryce Bixler, Gabriele Coppi, Lance Corbett, Kevin D. Crowley, Kevin T. Crowley, Samuel Day-Weiss, Simon Dicker, Peter N. Dow , et al. (55 additional authors not shown)

    Abstract: The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT… ▽ More

    Submitted 10 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  5. arXiv:2405.04830  [pdf, other

    astro-ph.IM physics.ins-det

    A Method of Measuring TES Complex ETF Response in Frequency-domain Multiplexed Readout by Single Sideband Power Modulation

    Authors: Yu Zhou, Tijmen de Haan, Hiroki Akamatsu, Daisuke Kaneko, Masashi Hazumi, Masaya Hasegawa, Aritoki Suzuki, Adrian T. Lee

    Abstract: The digital frequency domain multiplexing (DfMux) technique is widely used for astrophysical instruments with large detector arrays. Detailed detector characterization is required for instrument calibration and systematics control. We conduct the TES complex electrothermal-feedback (ETF) response measurement with the DfMux readout system as follows. By injecting a single sideband signal, we induce… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures, accepted to Journal of Low Temperature Physics

  6. arXiv:2405.03141  [pdf, other

    eess.IV cs.AI cs.CV physics.med-ph

    Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation

    Authors: Yihao Zhou, Timothy Tin-Yan Lee, Kelly Ka-Lee Lai, Chonglin Wu, Hin Ting Lau, De Yang, Chui-Yi Chan, Winnie Chiu-Wing Chu, Jack Chun-Yiu Cheng, Tsz-Ping Lam, Yong-Ping Zheng

    Abstract: The current clinical gold standard for evaluating adolescent idiopathic scoliosis (AIS) is X-ray radiography, using Cobb angle measurement. However, the frequent monitoring of the AIS progression using X-rays poses a challenge due to the cumulative radiation exposure. Although 3D ultrasound has been validated as a reliable and radiation-free alternative for scoliosis assessment, the process of mea… ▽ More

    Submitted 6 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  7. arXiv:2405.01062  [pdf, ps, other

    math.DG math.AP

    Ancient mean curvature flows with finite total curvature

    Authors: Kyeongsu Choi, Jiuzhou Huang, Taehun Lee

    Abstract: We construct an $I$-family of ancient graphical mean curvature flows over a minimal hypersurface in $\mathbb{R}^{n+1}$ of finite total curvature with the Morse index $I$ by establishing exponentially fast convergence in terms of $|x|^2-t$. As a corollary, we show that these ancient flows have finite total curvature and finite mass drop. Moreover, one family of these flows is mean convex by a point… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: All comments are welcome

  8. arXiv:2404.19125  [pdf, other

    math.AG

    Finite distance problem on the moduli of non-Kähler Calabi--Yau $\partial\bar{\partial}$-threefolds

    Authors: Tsung-Ju Lee

    Abstract: In this article, we study the finite distance problem with respect to the period-map metric on the moduli of non-Kähler Calabi--Yau $\partial\bar{\partial}$-threefolds via Hodge theory. We extended C.-L. Wang's finite distance criterion for one-parameter degenerations to the present setting. As a byproduct, we also obtained a sufficient condition for a non-Kähler Calabi--Yau to support the… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 36 pages. Comments are welcome!

    MSC Class: 32Q25 (Primary) 32G05; 14C30 (Secondary)

  9. arXiv:2404.17709  [pdf, other

    stat.ML cs.LG

    Low-rank Matrix Bandits with Heavy-tailed Rewards

    Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

    Abstract: In stochastic low-rank matrix bandit, the expected reward of an arm is equal to the inner product between its feature matrix and some unknown $d_1$ by $d_2$ low-rank parameter matrix $Θ^*$ with rank $r \ll d_1\wedge d_2$. While all prior studies assume the payoffs are mixed with sub-Gaussian noises, in this work we loosen this strict assumption and consider the new problem of \underline{low}-rank… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: The 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  10. arXiv:2404.16702  [pdf, other

    hep-lat hep-ph

    Generalized boost transformations in finite volumes and application to Hamiltonian methods

    Authors: Yan Li, Jia-Jun Wu, T. -S. H. Lee, R. D. Young

    Abstract: The investigation of hadron interactions within lattice QCD has been facilitated by the well-known quantisation condition, linking scattering phase shifts to finite-volume energies. Additionally, the ability to utilise systems at finite total boosts has been pivotal in smoothly charting the energy-dependent behaviour of these phase shifts. The existing implementations of the quantization condition… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 30 pages, 5 figures

    Report number: ADP-24-07/T1246

  11. arXiv:2404.14219  [pdf, other

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Jyoti Aneja, Hany Awadalla, Ahmed Awadallah, Ammar Ahmad Awan, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Qin Cai, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Weizhu Chen, Yen-Chun Chen, Yi-Ling Chen, Hao Cheng, Parul Chopra, Xiyang Dai , et al. (104 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. Our training dataset is a scaled-up version… ▽ More

    Submitted 30 August, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 24 pages

  12. Fidelitous Augmentation of Human Accelerometric Data for Deep Learning

    Authors: Tracey K. M. Lee, H. W. Chan, K. H. Leo, Effie Chew, L. Zhao, Saeid Sanei

    Abstract: Time series (TS) data have consistently been in short supply, yet their demand remains high for training systems in prediction, modeling, classification, and various other applications. Synthesis can serve to expand the sample population, yet it is crucial to maintain the statistical characteristics between the synthesized and the original TS : this ensures consistent sampling of data for both tra… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  13. arXiv:2404.11104  [pdf, other

    cs.CV

    Object Remover Performance Evaluation Methods using Class-wise Object Removal Images

    Authors: Changsuk Oh, Dongseok Shim, Taekbeom Lee, H. Jin Kim

    Abstract: Object removal refers to the process of erasing designated objects from an image while preserving the overall appearance, and it is one area where image inpainting is widely used in real-world applications. The performance of an object remover is quantitatively evaluated by measuring the quality of object removal results, similar to how the performance of an image inpainter is gauged. Current work… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  14. AutoGFI: Streamlined Generalized Fiducial Inference for Modern Inference Problems in Models with Additive Errors

    Authors: Wei Du, Jan Hannig, Thomas C. M. Lee, Yi Su, Chunzhe Zhang

    Abstract: The concept of fiducial inference was introduced by R. A. Fisher in the 1930s to address the perceived limitations of Bayesian inference, particularly the need for subjective prior distributions in cases with limited prior information. However, Fisher's fiducial approach lost favor due to complications, especially in multi-parameter problems. With renewed interest in fiducial inference in the 2000… ▽ More

    Submitted 24 December, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  15. arXiv:2404.06466  [pdf, other

    cs.LG stat.ML

    Hyperparameter Selection in Continual Learning

    Authors: Thomas L. Lee, Sigrid Passano Hellan, Linus Ericsson, Elliot J. Crowley, Amos Storkey

    Abstract: In continual learning (CL) -- where a learner trains on a stream of data -- standard hyperparameter optimisation (HPO) cannot be applied, as a learner does not have access to all of the data at the same time. This has prompted the development of CL-specific HPO frameworks. The most popular way to tune hyperparameters in CL is to repeatedly train over the whole data stream with different hyperparam… ▽ More

    Submitted 14 March, 2025; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint, 16 pages

  16. arXiv:2404.03188  [pdf

    eess.IV cs.CV cs.LG

    Classification of Nasopharyngeal Cases using DenseNet Deep Learning Architecture

    Authors: W. S. H. M. W. Ahmad, M. F. A. Fauzi, M. K. Abdullahi, Jenny T. H. Lee, N. S. A. Basry, A Yahaya, A. M. Ismail, A. Adam, Elaine W. L. Chan, F. S. Abas

    Abstract: Nasopharyngeal carcinoma (NPC) is one of the understudied yet deadliest cancers in South East Asia. In Malaysia, the prevalence is identified mainly in Sarawak, among the ethnic of Bidayuh. NPC is often late-diagnosed because it is asymptomatic at the early stage. There are several tissue representations from the nasopharynx biopsy, such as nasopharyngeal inflammation (NPI), lymphoid hyperplasia (… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: This article has been accepted in the Journal of Engineering Science and Technology (JESTEC) and awaiting publication

  17. arXiv:2404.02153  [pdf, other

    astro-ph.CO

    Mass calibration of DES Year-3 clusters via SPT-3G CMB cluster lensing

    Authors: B. Ansarinejad, S. Raghunathan, T. M. C. Abbott, P. A. R. Ade, M. Aguena, O. Alves, A. J. Anderson, F. Andrade-Oliveira, M. Archipley, L. Balkenhol, K. Benabed, A. N. Bender, B. A. Benson, E. Bertin, F. Bianchini, L. E. Bleem, S. Bocquet, F. R. Bouchet, D. Brooks, L. Bryant, D. L. Burke, E. Camphuis, J. E. Carlstrom, A. Carnero Rosell, J. Carretero , et al. (120 additional authors not shown)

    Abstract: We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey,… ▽ More

    Submitted 12 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 23 pages, 9 figures, accepted for publication in JCAP. Minor changes and corrections have been made relative to v1

  18. arXiv:2404.01351  [pdf, other

    cs.LG cs.AI cs.CV

    AETTA: Label-Free Accuracy Estimation for Test-Time Adaptation

    Authors: Taeckyung Lee, Sorn Chottananurak, Taesik Gong, Sung-Ju Lee

    Abstract: Test-time adaptation (TTA) has emerged as a viable solution to adapt pre-trained models to domain shifts using unlabeled test data. However, TTA faces challenges of adaptation failures due to its reliance on blind adaptation to unknown test samples in dynamic scenarios. Traditional methods for out-of-distribution performance estimation are limited by unrealistic assumptions in the TTA context, suc… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  19. arXiv:2404.00420  [pdf, other

    cs.SE cs.LG

    Learning Service Selection Decision Making Behaviors During Scientific Workflow Development

    Authors: Xihao Xie, Jia Zhang, Rahul Ramachandran, Tsengdar J. Lee, Seungwon Lee

    Abstract: Increasingly, more software services have been published onto the Internet, making it a big challenge to recommend services in the process of a scientific workflow composition. In this paper, a novel context-aware approach is proposed to recommending next services in a workflow development process, through learning service representation and service selection decision making behaviors from workflo… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 14 pages, 8 figures. arXiv admin note: text overlap with arXiv:2205.11771

  20. arXiv:2404.00376  [pdf, other

    cs.CL

    Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks

    Authors: Hyunjae Kim, Hyeon Hwang, Jiwoo Lee, Sihyeon Park, Dain Kim, Taewhoo Lee, Chanwoong Yoon, Jiwoong Sohn, Donghee Choi, Jaewoo Kang

    Abstract: While recent advancements in commercial large language models (LM) have shown promising results in medical tasks, their closed-source nature poses significant privacy and security concerns, hindering their widespread use in the medical field. Despite efforts to create open-source models, their limited parameters often result in insufficient multi-step reasoning capabilities required for solving co… ▽ More

    Submitted 30 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Added new LLaMA-3-based models and experiments on NEJM case challenges

  21. arXiv:2404.00234   

    cs.CV

    Grid Diffusion Models for Text-to-Video Generation

    Authors: Taegyeong Lee, Soyeong Kwon, Taehwan Kim

    Abstract: Recent advances in the diffusion models have significantly improved text-to-image generation. However, generating videos from text is a more challenging task than generating images from text, due to the much larger dataset and higher computational cost required. Most existing video generation methods use either a 3D U-Net architecture that considers the temporal dimension or autoregressive generat… ▽ More

    Submitted 30 December, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: This paper is being withdrawn due to issues of misconduct in the experiments presented in Table 1 and 5. We recognize this as an ethical concern and sincerely apologize to the research community for any inconvenience it may have caused

  22. arXiv:2403.19985  [pdf, other

    cs.CV

    Stable Surface Regularization for Fast Few-Shot NeRF

    Authors: Byeongin Joung, Byeong-Uk Lee, Jaesung Choe, Ukcheol Shin, Minjun Kang, Taeyeop Lee, In So Kweon, Kuk-Jin Yoon

    Abstract: This paper proposes an algorithm for synthesizing novel views under few-shot setup. The main concept is to develop a stable surface regularization technique called Annealing Signed Distance Function (ASDF), which anneals the surface in a coarse-to-fine manner to accelerate convergence speed. We observe that the Eikonal loss - which is a widely known geometric regularization - requires dense traini… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 3DV 2024

  23. arXiv:2403.19456  [pdf, other

    cs.CV cs.GR cs.MM

    Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization

    Authors: Yu Xu, Fan Tang, Juan Cao, Yuxin Zhang, Oliver Deussen, Weiming Dong, Jintao Li, Tong-Yee Lee

    Abstract: Personalized generation paradigms empower designers to customize visual intellectual properties with the help of textual descriptions by tuning or adapting pre-trained text-to-image models on a few images. Recent works explore approaches for concurrently customizing both content and detailed visual style appearance. However, these existing approaches often generate images where the content and sty… ▽ More

    Submitted 31 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  24. arXiv:2403.19146  [pdf, ps, other

    cs.DS cs.DC math.OC

    Improving the Bit Complexity of Communication for Distributed Convex Optimization

    Authors: Mehrdad Ghadiri, Yin Tat Lee, Swati Padmanabhan, William Swartworth, David Woodruff, Guanghao Ye

    Abstract: We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: To appear in STOC '24. Abstract shortened to meet the arXiv limits. Comments welcome!

  25. arXiv:2403.18421  [pdf, other

    cs.CL cs.AI

    BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

    Authors: Elliot Bolton, Abhinav Venigalla, Michihiro Yasunaga, David Hall, Betty Xiong, Tony Lee, Roxana Daneshjou, Jonathan Frankle, Percy Liang, Michael Carbin, Christopher D. Manning

    Abstract: Models such as GPT-4 and Med-PaLM 2 have demonstrated impressive performance on a wide variety of biomedical NLP tasks. However, these models have hundreds of billions of parameters, are computationally expensive to run, require users to send their input data over the internet, and are trained on unknown data sources. Can smaller, more targeted models compete? To address this question, we build an… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 23 pages

  26. arXiv:2403.17925  [pdf, other

    astro-ph.CO

    Testing the $\mathbfΛ$CDM Cosmological Model with Forthcoming Measurements of the Cosmic Microwave Background with SPT-3G

    Authors: K. Prabhu, S. Raghunathan, M. Millea, G. Lynch, P. A. R. Ade, E. Anderes, A. J. Anderson, B. Ansarinejad, M. Archipley, L. Balkenhol, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, P. M. Chichura, T. -L. Chou, A. Coerver , et al. (76 additional authors not shown)

    Abstract: We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, i… ▽ More

    Submitted 9 September, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 26 pages; 13 figures; Accepted for publication in ApJ; Minor edits have been made

  27. Calibration of detector time constant with a thermal source for the POLARBEAR-2A CMB polarization experiment

    Authors: S. Takatori, M. Hasegawa, M. Hazumi, D. Kaneko, N. Katayama, A. T. Lee, S. Takakura, T. Tomaru, T. Adkins, D. Barron, Y. Chinone, K. T. Crowley, T. de Haan, T. Elleflot, N. Farias, C. Feng, T. Fujino, J. C. Groh, H. Hirose, F. Matsuda, H. Nishino, Y. Segawa, P. Siritanasak, A. Suzuki, K. Yamada

    Abstract: The Simons Array (SA) project is a ground-based Cosmic Microwave Background (CMB) polarization experiment. The SA observes the sky using three telescopes, and POLARBEAR-2A (PB-2A) is the receiver system on the first telescope. For the ground-based experiment, atmospheric fluctuation is the primary noise source that could cause polarization leakage. In the PB-2A receiver system, a continuously rota… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Proceedings of the 15th Asia Pacific Physics Conference (APPC15)

  28. arXiv:2403.16510  [pdf, other

    cs.CV

    Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

    Authors: Ziyao Huang, Fan Tang, Yong Zhang, Xiaodong Cun, Juan Cao, Jintao Li, Tong-Yee Lee

    Abstract: Despite the remarkable process of talking-head-based avatar-creating solutions, directly generating anchor-style videos with full-body motions remains challenging. In this study, we propose Make-Your-Anchor, a novel system necessitating only a one-minute video clip of an individual for training, subsequently enabling the automatic generation of anchor-style videos with precise torso and hand movem… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: accepted at CVPR2024

  29. arXiv:2403.15341  [pdf, other

    cs.AI cs.MA

    Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

    Authors: Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

    Abstract: With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand. However, existing approaches for training collaborative agents often require defined and known reward signals and cannot address the problem of teaming with unknown agents that often have latent objectives/re… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  30. Certain functional identities on division rings of characteristic two

    Authors: Münevver Pınar Eroğlu, Tsiu-Kwen Lee, Jheng-Huei Lin

    Abstract: Let $D$ be a noncommutative division ring. In a recent paper, Lee and Lin proved that if $\text{char}\, D\ne 2$, the only solution of additive maps $f, g$ on $D$ satisfying the identity $f(x) = x^n g(x^{-1})$ on $D\setminus \{0\}$ with $n\ne 2$ a positive integer is the trivial case, that is, $f=0$ and $g=0$. Applying Hua's identity and the theory of functional and generalized polynomial identitie… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    MSC Class: 16R60 (Primary) 16R50; 16K40 (Secondary)

    Journal ref: J. Algebra 657 (2024) 363-378

  31. arXiv:2403.11393  [pdf, ps, other

    math.RT math-ph

    Branching algebras for the general linear Lie superalgebra

    Authors: Soo Teck Lee, Ruibin Zhang

    Abstract: We develop an algebraic approach to the branching of representations of the general linear Lie superalgebra $\mathfrak{gl}_{p|q}({\mathbb C})$, by constructing certain super commutative algebras whose structure encodes the branching rules. Using this approach, we derive the branching rules for restricting any irreducible polynomial representation $V$ of $\mathfrak{gl}_{p|q}({\mathbb C})$ to a regu… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 35 pages

    MSC Class: 05E10; 15A75; 20G05; 22E46

  32. arXiv:2403.10041  [pdf, other

    cs.RO cs.AI

    Towards Embedding Dynamic Personas in Interactive Robots: Masquerading Animated Social Kinematics (MASK)

    Authors: Jeongeun Park, Taemoon Jeong, Hyeonseong Kim, Taehyun Byun, Seungyoon Shin, Keunjun Choi, Jaewoon Kwon, Taeyoon Lee, Matthew Pan, Sungjoon Choi

    Abstract: This paper presents the design and development of an innovative interactive robotic system to enhance audience engagement using character-like personas. Built upon the foundations of persona-driven dialog agents, this work extends the agent's application to the physical realm, employing robots to provide a more captivating and interactive experience. The proposed system, named the Masquerading Ani… ▽ More

    Submitted 7 October, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted at Robotics and Automation Letters

  33. arXiv:2403.08272  [pdf, other

    cs.CL

    RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in education is expanding, yet empirical analyses of large-scale and real-world interactions between students and AI systems still remain limited. Addressing this gap, we present RECIPE4U (RECIPE for University), a dataset sourced from a semester-long experiment with 212 college students in English as Foreign Language (EFL) writing courses. During the study, studen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.13243

  34. arXiv:2403.08133  [pdf, other

    eess.SP cs.AI cs.IT

    Physics-Inspired Deep Learning Anti-Aliasing Framework in Efficient Channel State Feedback

    Authors: Yu-Chien Lin, Yan Xin, Ta-Sung Lee, Charlie, Zhang, Zhi Ding

    Abstract: Acquiring downlink channel state information (CSI) at the base station is vital for optimizing performance in massive Multiple input multiple output (MIMO) Frequency-Division Duplexing (FDD) systems. While deep learning architectures have been successful in facilitating UE-side CSI feedback and gNB-side recovery, the undersampling issue prior to CSI feedback is often overlooked. This issue, which… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  35. First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel{'}dovich Effect from the South Pole Telescope and {\it Herschel}-SPIRE Observations

    Authors: S. Raghunathan, P. A. R. Ade, A. J. Anderson, B. Ansarinejad, M. Archipley, J. E. Austermann, L. Balkenhol, J. A. Beall, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, J. Bock, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, H. C. Chiang, P. M. Chichura, T. -L. Chou, R. Citron , et al. (99 additional authors not shown)

    Abstract: We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel{'}dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and {\it Herschel}-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ i… ▽ More

    Submitted 15 August, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures (3 in main text and 2 in Appendix); Accepted for publication in PRL; Some texts have been moved to Appendix; Minor change in Fig. 2 to include nomalization; Data products and plotting scripts can be downloaded from https://github.com/sriniraghunathan/kSZ_4pt_SPT_SPIRE

    Journal ref: Phys. Rev. Lett. 133, 121004 (2024)

  36. Exploration of the polarization angle variability of the Crab Nebula with POLARBEAR and its application to the search for axion-like particles

    Authors: Shunsuke Adachi, Tylor Adkins, Carlo Baccigalupi, Yuji Chinone, Kevin T. Crowley, Josquin Errard, Giulio Fabbian, Chang Feng, Takuro Fujino, Masaya Hasegawa, Masashi Hazumi, Oliver Jeong, Daisuke Kaneko, Brian Keating, Akito Kusaka, Adrian T. Lee, Anto I. Lonappan, Yuto Minami, Masaaki Murata, Lucio Piccirillo, Christian L. Reichardt, Praween Siritanasak, Jacob Spisak, Satoru Takakura, Grant P. Teply , et al. (1 additional authors not shown)

    Abstract: The Crab Nebula, also known as Tau A, is a polarized astronomical source at millimeter wavelengths. It has been used as a stable light source for polarization angle calibration in millimeter-wave astronomy. However, it is known that its intensity and polarization vary as a function of time at a variety of wavelengths. Thus, it is of interest to verify the stability of the millimeter-wave polarizat… ▽ More

    Submitted 19 September, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 24 pages, 19 figures, 5 tables

  37. arXiv:2403.01958  [pdf, other

    nucl-th

    Dynamical Model of $J/Ψ$ photo-production on the nucleon

    Authors: S. Sakinah, T. -S. H. Lee, Ho-Meoyng Choi

    Abstract: A dynamical model based on a phenomenological charm quark-nucleon($c$-N) potential $v_{cN}$ and the Pomeron-exchange mechanism is constructed to investigate the $J/Ψ$ photo-production on the nucleon from threshold to invariant mass $W=300$ GeV. The $J/Ψ$-N potential,$V_{J/ΨN}(r)$,is constructed by folding $v_{cN}$ into the wavefunction $Φ_{J/Ψ}(c\bar{c})$ of $J/Ψ$ within a Constituent Quark Model(… ▽ More

    Submitted 10 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 14 pages, 20 figures

  38. arXiv:2403.01749  [pdf, other

    cs.CL

    Differentially Private Synthetic Data via Foundation Model APIs 2: Text

    Authors: Chulin Xie, Zinan Lin, Arturs Backurs, Sivakanth Gopi, Da Yu, Huseyin A Inan, Harsha Nori, Haotian Jiang, Huishuai Zhang, Yin Tat Lee, Bo Li, Sergey Yekhanin

    Abstract: Text data has become extremely valuable due to the emergence of machine learning algorithms that learn from it. A lot of high-quality text data generated in the real world is private and therefore cannot be shared or used freely due to privacy concerns. Generating synthetic replicas of private text data with a formal privacy guarantee, i.e., differential privacy (DP), offers a promising and scalab… ▽ More

    Submitted 23 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: ICML'24 Spotlight

  39. arXiv:2403.00160  [pdf, other

    astro-ph.GA

    A far-ultraviolet-driven photoevaporation flow observed in a protoplanetary disk

    Authors: Olivier Berné, Emilie Habart, Els Peeters, Ilane Schroetter, Amélie Canin, Ameek Sidhu, Ryan Chown, Emeric Bron, Thomas J. Haworth, Pamela Klaassen, Boris Trahin, Dries Van De Putte, Felipe Alarcón, Marion Zannese, Alain Abergel, Edwin A. Bergin, Jeronimo Bernard-Salas, Christiaan Boersma, Jan Cami, Sara Cuadrado, Emmanuel Dartois, Daniel Dicken, Meriem Elyajouri, Asunción Fuente, Javier R. Goicoechea , et al. (121 additional authors not shown)

    Abstract: Most low-mass stars form in stellar clusters that also contain massive stars, which are sources of far-ultraviolet (FUV) radiation. Theoretical models predict that this FUV radiation produces photo-dissociation regions (PDRs) on the surfaces of protoplanetary disks around low-mass stars, impacting planet formation within the disks. We report JWST and Atacama Large Millimetere Array observations of… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Journal ref: Science, 383, 6686, 2024

  40. arXiv:2402.19374  [pdf, ps, other

    math.RA

    The $X$-semiprimeness of Rings

    Authors: Grigore Călugăreanu, Tsiu-Kwen Lee, Jerzy Matczuk

    Abstract: For a nonempty subset $X$ of a ring $R$, the ring $R$ is called $X$-semiprime if, given $a\in R$, $aXa=0$ implies $a=0$. This provides a proper class of semiprime rings. First, we clarify the relationship between idempotent semiprime and unit-semiprime rings. Secondly, given a Lie ideal $L$ of a ring $R$, we offer a criterion for $R$ to be $L$-semiprime. For a prime ring $R$, we characterizes Lie… ▽ More

    Submitted 9 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Comments welcome

    MSC Class: 16N60; 16U10; 16U40; 16S50

  41. arXiv:2402.12071  [pdf, other

    cs.CL cs.AI

    EmoBench: Evaluating the Emotional Intelligence of Large Language Models

    Authors: Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M. C. Lee, Rada Mihalcea, Minlie Huang

    Abstract: Recent advances in Large Language Models (LLMs) have highlighted the need for robust, comprehensive, and challenging benchmarks. Yet, research on evaluating their Emotional Intelligence (EI) is considerably limited. Existing benchmarks have two major shortcomings: first, they mainly focus on emotion recognition, neglecting essential EI capabilities such as emotion regulation and thought facilitati… ▽ More

    Submitted 17 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Main Conference

  42. arXiv:2402.11450  [pdf, other

    cs.RO

    Learning to Learn Faster from Human Feedback with Language Model Predictive Control

    Authors: Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore , et al. (25 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for o… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  43. arXiv:2402.10415  [pdf, other

    astro-ph.GA astro-ph.SR

    GTC Spectroscopic Surveys of Planetary Nebulae in the Milky Way and M31

    Authors: Xuan Fang, Haomiao Huang, Martin A. Guerrero, Letizia Stanghellini, Ruben Garcia-Benito, Ting-Hui Lee, Yong Zhang

    Abstract: We report spectroscopic surveys of planetary nebulae (PNe) in the Milky Way and Andromeda (M31), using the 10.4-m Gran Telescopio Canarias (GTC). The spectra are of high quality and cover the whole optical range, mostly from 3650 Å to beyond 1 micron, enabling detection of nebular emission lines critical for spectral analysis as well as photoionization modeling. We obtained GTC spectra of 24 compa… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures, in production; Proceedings of the IAUS384: "Planetary Nebulae: a Universal Toolbox in the Era of Precision Astrophysics", Krakow, Poland, September 4-8, 2023

  44. arXiv:2402.07872  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

    Authors: Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter

    Abstract: Vision language models (VLMs) have shown impressive capabilities across a variety of tasks, from logical reasoning to visual understanding. This opens the door to richer interaction with the world, for example robotic control. However, VLMs produce only textual outputs, while robotic control and other spatial tasks require outputting continuous coordinates, actions, or trajectories. How can we ena… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  45. Eigenmode Decomposition Method for Full-Wave Modeling of Microring Resonators

    Authors: Yuriy Akimov, Aswin Alexander Eapen, Shiyang Zhu, Doris K. T. Ng, Nanxi Li, Woon Leng Loh, Lennon Y. T. Lee, Alagappan Gandhi, Aravind P. Anthur

    Abstract: We develop a theoretical predictive model for an all-pass ring resonator that enables the most complete description of linear coupling regimes. The model is based on eigenmode decomposition of Maxwell's equations with full account of the confined and leaky modes, as opposed to the existing phenomenological methods restricted to the confined modes only. This model enables quantitative description o… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 8 pages, 11 figures

    Journal ref: Physical Review A 109, 043514 (2024)

  46. arXiv:2401.17800  [pdf, other

    cs.SD cs.MM eess.AS

    Dance-to-Music Generation with Encoder-based Textual Inversion

    Authors: Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu

    Abstract: The seamless integration of music with dance movements is essential for communicating the artistic intent of a dance piece. This alignment also significantly improves the immersive quality of gaming experiences and animation productions. Although there has been remarkable advancement in creating high-fidelity music from textual descriptions, current methodologies mainly focus on modulating overall… ▽ More

    Submitted 12 September, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures, SIGGRAPH ASIA 2024

  47. arXiv:2401.17762  [pdf, other

    physics.flu-dyn

    Spreading and engulfment of a viscoelastic film onto a Newtonian droplet

    Authors: Chunheng Zhao, Taehun Lee, Andreas Carlson

    Abstract: We use the conservative phase-field lattice Boltzmann method to investigate the dynamics when a Newtonian droplet comes in contact with an immiscible viscoelastic liquid film. The dynamics of the three liquid phases are explored through numerical simulations, with a focus on illustrating the contact line dynamics and the viscoelastic effects described by the Oldroyd-B model. The droplet dynamics a… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  48. arXiv:2401.16731  [pdf, other

    cs.CL cs.AI

    Towards Generating Informative Textual Description for Neurons in Language Models

    Authors: Shrayani Mondal, Rishabh Garodia, Arbaaz Qureshi, Taesung Lee, Youngja Park

    Abstract: Recent developments in transformer-based language models have allowed them to capture a wide variety of world knowledge that can be adapted to downstream tasks with limited resources. However, what pieces of information are understood in these models is unclear, and neuron-level contributions in identifying them are largely unknown. Conventional approaches in neuron explainability either depend on… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: AAAI 2024

  49. arXiv:2401.15413  [pdf

    physics.bio-ph

    Hyperphosphorylation-Induced Phase Transition in Vesicle Delivery Dynamics of Motor Proteins in Neuronal Cells

    Authors: Eunsang Lee, Donghee Kim, Yo Han Song, Kyujin Shin, Sanggeun Song, Minho Lee, Yeongchang Goh, Mi Hee Lim, Ji-Hyun Kim, Jaeyoung Sung, Kang Taek Lee

    Abstract: Synaptic vesicle transport by motor proteins along microtubules is a crucial active process underlying neuronal communication. It is known that microtubules are destabilized by tau-hyperphosphorylation, which causes tau proteins to detach from microtubules and form neurofibril tangles. However, how tau-phosphorylation affects transport dynamics of motor proteins on the microtubule remains unknown.… ▽ More

    Submitted 23 April, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

  50. arXiv:2401.14200  [pdf, other

    hep-lat physics.app-ph physics.comp-ph

    Speeding up Fermionic Lattice Calculations with Photonic Accelerated Inverters

    Authors: Felipe Attanasio, Marc Bauer, Jelle Dijkstra, Timoteo Lee, Jan M. Pawlowski, Wolfram Pernice

    Abstract: Lattice field theory (LFT) is the standard non-perturbative method to perform numerical calculations of quantum field theory. However, the typical bottleneck of fermionic lattice calculations is the inversion of the Dirac matrix. This inversion is solved by iterative methods, like the conjugate gradient algorithm, where matrix-vector multiplications (MVMs) are the main operation. Photonic integrat… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 10 pages, 8 figures