Skip to main content

Showing 1–50 of 104 results for author: Chiang, T

.
  1. arXiv:2506.01241  [pdf, ps, other

    cs.CL

    ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

    Authors: Jie Ruan, Inderjeet Nair, Shuyang Cao, Amy Liu, Sheza Munir, Micah Pollens-Dempsey, Tiffany Chiang, Lucy Kates, Nicholas David, Sihan Chen, Ruxin Yang, Yuqian Yang, Jasmine Gump, Tessa Bialek, Vivek Sankaran, Margo Schlanger, Lu Wang

    Abstract: This paper introduces ExpertLongBench, an expert-level benchmark containing 11 tasks from 9 domains that reflect realistic expert workflows and applications. Beyond question answering, the application-driven tasks in ExpertLongBench demand long-form outputs that can exceed 5,000 tokens and strict adherence to domain-specific requirements. Notably, each task in ExpertLongBench includes a rubric, de… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  2. arXiv:2504.03756  [pdf, other

    cs.LG cs.CV

    Semi-Self Representation Learning for Crowdsourced WiFi Trajectories

    Authors: Yu-Lin Kuo, Yu-Chee Tseng, Ting-Hui Chiang, Yan-Ann Chen

    Abstract: WiFi fingerprint-based localization has been studied intensively. Point-based solutions rely on position annotations of WiFi fingerprints. Trajectory-based solutions, however, require end-position annotations of WiFi trajectories, where a WiFi trajectory is a multivariate time series of signal features. A trajectory dataset is much larger than a pointwise dataset as the number of potential traject… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted by VTC2025-Spring

  3. arXiv:2504.01002  [pdf, other

    cs.CL cs.AI

    Token embeddings violate the manifold hypothesis

    Authors: Michael Robinson, Sourya Dey, Tony Chiang

    Abstract: A full understanding of the behavior of a large language model (LLM) requires our understanding of its input token space. If this space differs from our assumptions, our understanding of and conclusions about the LLM will likely be flawed. We elucidate the structure of the token embeddings both empirically and theoretically. We present a novel statistical test assuming that the neighborhood around… ▽ More

    Submitted 28 May, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

    Comments: 27 pages, 6 figures, 9 tables

    MSC Class: 53Z50; 62H15

  4. arXiv:2503.20903  [pdf, other

    cs.LG cs.AI

    Assessing Generative Models for Structured Data

    Authors: Reilly Cannon, Nicolette M. Laird, Caesar Vazquez, Andy Lin, Amy Wagler, Tony Chiang

    Abstract: Synthetic tabular data generation has emerged as a promising method to address limited data availability and privacy concerns. With the sharp increase in the performance of large language models in recent years, researchers have been interested in applying these models to the generation of tabular data. However, little is known about the quality of the generated tabular data from large language mo… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  5. arXiv:2502.11276  [pdf, other

    cs.CL cs.LG

    The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval

    Authors: Ting-Rui Chiang, Dani Yogatama

    Abstract: The Rotary Position Embedding (RoPE) is widely used in the attention heads of many large language models (LLM). It rotates dimensions in the query and the key vectors by different angles according to their positions in the input sequence. For long context modeling, the range of positions may vary a lot, and thus RoPE rotates some dimensions by a great range of angles. We hypothesize that the wide… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  6. arXiv:2502.11191  [pdf, ps, other

    cs.CR cs.AI cs.CL

    Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training

    Authors: Yao-Ching Yu, Tsun-Han Chiang, Cheng-Wei Tsai, Chien-Ming Huang, Wen-Kwang Tsao

    Abstract: Large Language Models (LLMs) have shown remarkable advancements in specialized fields such as finance, law, and medicine. However, in cybersecurity, we have noticed a lack of open-source datasets, with a particular lack of high-quality cybersecurity pretraining corpora, even though much research indicates that LLMs acquire their knowledge during pretraining. To address this, we present a comprehen… ▽ More

    Submitted 1 June, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

  7. arXiv:2502.11153  [pdf, other

    quant-ph cond-mat.stat-mech math-ph

    SVM/SVR Kernels as Quantum Propagators

    Authors: Nan-Hong Kuo, Tsung-Wei Chiang, Renata Wong

    Abstract: In this work, we establish the equivalence between Support Vector Machine (SVM) kernels and quantum Green's functions. Drawing on the analogy between margin maximization in SVMs and action extremization in Lagrangian mechanics, we show that many standard kernels correspond naturally to Green's functions and that this correspondence arises from the inversion of physical operators. We further demons… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  8. arXiv:2501.09968  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    In-plane anisotropy of charge density wave fluctuations in 1$T$-TiSe$_2$

    Authors: Xuefei Guo, Anshul Kogar, Jans Henke, Felix Flicker, Fernando de Juan, Stella X. -L. Sun, Issam Khayr, Yingying Peng, Sangjun Lee, Matthew J. Krogstad, Stephan Rosenkranz, Raymond Osborn, Jacob P. C. Ruff, David B. Lioi, Goran Karapetrov, Daniel J. Campbell, Johnpierre Paglione, Jasper van Wezel, Tai C. Chiang, Peter Abbamonte

    Abstract: We report measurements of anisotropic triple-$q$ charge density wave (CDW) fluctuations in the transition metal dichalcogenide 1$T$-TiSe$_2$ over a large volume of reciprocal space with X-ray diffuse scattering. Above the transition temperature, $T_{\text{CDW}}$, the out-of-plane diffuse scattering is characterized by rod-like structures which indicate that the CDW fluctuations in neighboring laye… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

  9. arXiv:2411.14746  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Measurement of the dynamic charge susceptibility near the charge density wave transition in ErTe$_3$

    Authors: Dipanjan Chaudhuri, Qianni Jiang, Xuefei Guo, Jin Chen, Caitlin S. Kengle, Farzaneh Hoveyda-Marashi, Camille Bernal-Choban, Niels de Vries, Tai-Chang Chiang, Eduardo Fradkin, Ian R. Fisher, Peter Abbamonte

    Abstract: A charge density wave (CDW) is a phase of matter characterized by a periodic modulation of the valence electron density accompanied by a distortion of the lattice structure. The microscopic details of CDW formation are closely tied to the dynamic charge susceptibility, $χ(q,ω)$, which describes the behavior of electronic collective modes. Despite decades of extensive study, the behavior of… ▽ More

    Submitted 18 March, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

  10. arXiv:2411.11164  [pdf, other

    cond-mat.str-el hep-ph

    Conformally invariant charge fluctuations in a strange metal

    Authors: Xuefei Guo, Jin Chen, Farzaneh Hoveyda-Marashi, Simon L. Bettler, Dipanjan Chaudhuri, Caitlin S. Kengle, John A. Schneeloch, Ruidan Zhang, Genda Gu, Tai-Chang Chiang, Alexei M. Tsvelik, Thomas Faulkner, Philip W. Phillips, Peter Abbamonte

    Abstract: The strange metal is a peculiar phase of matter in which the electron scattering rate, $τ^{-1} \sim k_B T/\hbar$, which determines the electrical resistance, is universal across a wide family of materials and determined only by fundamental constants. In 1989, theorists hypothesized that this universality would manifest as scale-invariant behavior in the dynamic charge susceptibility, $χ''(q,ω)$. H… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: 14 pages, 4 figures + supplementary data

  11. arXiv:2411.03192  [pdf, other

    astro-ph.GA astro-ph.CO hep-ph

    The tidal evolution of anisotropic subhaloes: A new pathway to creating isotropic and cored satellites

    Authors: Barry T. Chiang, Frank C. van den Bosch, Hsi-Yu Schive

    Abstract: It is common practice, both in dynamical modelling and in idealised numerical simulations, to assume that galaxies and/or dark matter haloes are spherical and have isotropic velocity distributions, such that their distribution functions are ergodic. However, there is no good reason to assume that this assumption is accurate. In this paper we use idealised $N$-body simulations to study the tidal ev… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 16 pages, 16 figures

  12. arXiv:2410.19808  [pdf, other

    cs.CV cs.AI

    LocateBench: Evaluating the Locating Ability of Vision Language Models

    Authors: Ting-Rui Chiang, Joshua Robinson, Xinyan Velocity Yu, Dani Yogatama

    Abstract: The ability to locate an object in an image according to natural language instructions is crucial for many real-world applications. In this work we propose LocateBench, a high-quality benchmark dedicated to evaluating this ability. We experiment with multiple prompting approaches, and measure the accuracy of several large vision language models. We find that even the accuracy of the strongest mode… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: We release the dataset at https://usc-tamagotchi.github.io/locate-bench/

  13. arXiv:2409.04178  [pdf, other

    cs.CV

    Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

    Authors: Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu, Chun-Wei Huang, Tsung-Chih Chiang, Quan Kong, Norimasa Kobori, Chun-Yi Lee

    Abstract: Scene coordinate regression (SCR) methods have emerged as a promising area of research due to their potential for accurate visual localization. However, many existing SCR approaches train on samples from all image regions, including dynamic objects and texture-less areas. Utilizing these areas for optimization during training can potentially hamper the overall performance and efficiency of the mod… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: ECCV2024

  14. arXiv:2408.10437  [pdf, other

    cs.LG cs.AI

    Understanding Generative AI Content with Embedding Models

    Authors: Max Vargas, Reilly Cannon, Andrew Engel, Anand D. Sarwate, Tony Chiang

    Abstract: Constructing high-quality features is critical to any quantitative data analysis. While feature engineering was historically addressed by carefully hand-crafting data representations based on domain expertise, deep neural networks (DNNs) now offer a radically different approach. DNNs implicitly engineer features by transforming their input data into hidden feature vectors called embeddings. For em… ▽ More

    Submitted 22 February, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

  15. arXiv:2406.17741  [pdf, other

    cs.CV cs.AI

    Point-SAM: Promptable 3D Segmentation Model for Point Clouds

    Authors: Yuchen Zhou, Jiayuan Gu, Tung Yen Chiang, Fanbo Xiang, Hao Su

    Abstract: The development of 2D foundation models for image segmentation has been significantly advanced by the Segment Anything Model (SAM). However, achieving similar success in 3D models remains a challenge due to issues such as non-unified data formats, poor model scalability, and the scarcity of labeled data with diverse masks. To this end, we propose a 3D promptable segmentation model Point-SAM, focus… ▽ More

    Submitted 2 December, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  16. arXiv:2406.16709  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Evidence of directional structural superlubricity and Lévy flights in a van der Waals heterostructure

    Authors: Maxime Le Ster, Paweł Krukowski, Maciej Rogala, Paweł Dabrowski, Iaroslav Lutsyk, Klaudia Toczek, Krzysztof Podlaski, Tefvik O. Mendeş, Francesca Genuzio, Andrea Locatelli, Guan Bian, Tai-Chang Chiang, Simon A. Brown, Paweł J. Kowalczyk

    Abstract: Structural superlubricity is a special frictionless contact in which two crystals are in incommensurate arrangement such that relative in-plane translation is associated with vanishing energy barrier crossing. So far, it has been realized in multilayer graphene and other van der Waals two-dimensional crystals with hexagonal or triangular crystalline symmetries, leading to isotropic frictionless co… ▽ More

    Submitted 16 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures (including Supplementary Information and Supplementary Figures)

  17. arXiv:2406.08307  [pdf, other

    stat.ML cs.LG

    Measuring training variability from stochastic optimization using robust nonparametric testing

    Authors: Sinjini Banerjee, Tim Marrinan, Reilly Cannon, Tony Chiang, Anand D. Sarwate

    Abstract: Deep neural network training often involves stochastic optimization, meaning each run will produce a different model. This implies that hyperparameters of the training process, such as the random seed itself, can potentially have significant influence on the variability in the trained models. Measuring model quality by summary statistics, such as test accuracy, can obscure this dependence. We prop… ▽ More

    Submitted 15 April, 2025; v1 submitted 12 June, 2024; originally announced June 2024.

  18. arXiv:2403.09845  [pdf, other

    astro-ph.GA astro-ph.CO hep-ph

    Galactic disc heating by density granulation in fuzzy dark matter simulations

    Authors: Hsun-Yeong Yang, Barry T. Chiang, Guan-Ming Su, Hsi-Yu Schive, Tzihong Chiueh, Jeremiah P. Ostriker

    Abstract: Fuzzy dark matter (FDM), an attractive dark matter candidate comprising ultralight bosons (axions) with a particle mass $m_a\sim10^{-22}$ eV, is motivated by the small-scale challenges of cold dark matter and features a kpc-size de Broglie wavelength. Quantum wave interference inside an FDM halo gives rise to stochastically fluctuating density granulation; the resulting gravitational perturbations… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 20 pages, 26 figures; Accepted for publication in MNRAS

  19. arXiv:2402.10424  [pdf, other

    cs.CL cs.AI

    Understanding In-Context Learning with a Pelican Soup Framework

    Authors: Ting-Rui Chiang, Dani Yogatama

    Abstract: Many existing theoretical analyses of in-context learning for natural language processing are based on latent variable models that leaves gaps between theory and practice. We aim to close these gaps by proposing a theoretical framework, the Pelican Soup Framework. In this framework, we introduce (1) the notion of a common sense knowledge base, (2) a general formalism for natural language classific… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  20. arXiv:2311.09615  [pdf, other

    cs.CL

    On Retrieval Augmentation and the Limitations of Language Model Training

    Authors: Ting-Rui Chiang, Xinyan Velocity Yu, Joshua Robinson, Ollie Liu, Isabelle Lee, Dani Yogatama

    Abstract: Augmenting a language model (LM) with $k$-nearest neighbors ($k$NN) retrieval on its training data alone can decrease its perplexity, though the underlying reasons for this remain elusive. In this work, we rule out one previously posited possibility -- the "softmax bottleneck." We then create a new dataset to evaluate LM generalization ability in the setting where training data contains additional… ▽ More

    Submitted 2 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024

  21. arXiv:2310.18612  [pdf, other

    cs.LG

    Efficient kernel surrogates for neural network-based regression

    Authors: Saad Qadeer, Andrew Engel, Amanda Howard, Adam Tsou, Max Vargas, Panos Stinis, Tony Chiang

    Abstract: Despite their immense promise in performing a variety of learning tasks, a theoretical understanding of the limitations of Deep Neural Networks (DNNs) has so far eluded practitioners. This is partly due to the inability to determine the closed forms of the learned functions, making it harder to study their generalization properties on unseen datasets. Recent work has shown that randomly initialize… ▽ More

    Submitted 24 January, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 35 pages. software used to reach results available upon request, approved for release by Pacific Northwest National Laboratory

    Report number: PNNL-SA-191858 MSC Class: 68T07; 65M99

  22. arXiv:2310.16261  [pdf, other

    cs.CL

    The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining

    Authors: Ting-Rui Chiang, Dani Yogatama

    Abstract: We analyze the masked language modeling pretraining objective function from the perspective of the distributional hypothesis. We investigate whether better sample efficiency and the better generalization capability of models pretrained with masked language modeling can be attributed to the semantic similarity encoded in the pretraining data's distributional property. Via a synthetic dataset, our a… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  23. arXiv:2310.13898  [pdf, other

    q-bio.OT

    Computational and Systems Biology Advances to Enable Bioagent-Agnostic Signatures

    Authors: Andy Lin, Cameron Torres, Errett C. Hobbs, Jaydeep Bardhan, Stephen B. Aley, Charles T. Spencer, Karen L. Taylor, Tony Chiang

    Abstract: Enumerated threat agent lists have long driven biodefense priorities. The global SARS-CoV-2 pandemic demonstrated the limitations of searching for known threat agents as compared to a more agnostic approach. Recent technological advances are enabling agent-agnostic biodefense, especially through the integration of multi-modal observations of host-pathogen interactions directed by a human immunolog… ▽ More

    Submitted 28 February, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  24. arXiv:2310.13836  [pdf, other

    cs.LG cs.CL

    Foundation Model's Embedded Representations May Detect Distribution Shift

    Authors: Max Vargas, Adam Tsou, Andrew Engel, Tony Chiang

    Abstract: Sampling biases can cause distribution shifts between train and test datasets for supervised learning tasks, obscuring our ability to understand the generalization capacity of a model. This is especially important considering the wide adoption of pre-trained foundational neural networks -- whose behavior remains poorly understood -- for transfer learning (TL) tasks. We present a case study for TL… ▽ More

    Submitted 2 February, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 17 pages, 8 figures, 5 tables

  25. arXiv:2310.07724  [pdf, other

    cs.RO cs.AI cs.LG

    Visual Forecasting as a Mid-level Representation for Avoidance

    Authors: Hsuan-Kung Yang, Tsung-Chih Chiang, Ting-Ru Liu, Chun-Wei Huang, Jou-Min Liu, Chun-Yi Lee

    Abstract: The challenge of navigation in environments with dynamic objects continues to be a central issue in the study of autonomous agents. While predictive methods hold promise, their reliance on precise state information makes them less practical for real-world implementation. This study presents visual forecasting as an innovative alternative. By introducing intuitive visual cues, this approach project… ▽ More

    Submitted 17 September, 2023; originally announced October 2023.

    Comments: Tsung-Chih Chiang, Ting-Ru Liu, Chun-Wei Huang, and Jou-Min Liu contributed equally to this work; This work has been submitted to the IEEE for possible publication

  26. arXiv:2310.00541  [pdf, other

    stat.ML cs.LG

    Robust Nonparametric Hypothesis Testing to Understand Variability in Training Neural Networks

    Authors: Sinjini Banerjee, Reilly Cannon, Tim Marrinan, Tony Chiang, Anand D. Sarwate

    Abstract: Training a deep neural network (DNN) often involves stochastic optimization, which means each run will produce a different model. Several works suggest this variability is negligible when models have the same performance, which in the case of classification is test accuracy. However, models with similar test accuracy may not be computing the same function. We propose a new measure of closeness bet… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  27. arXiv:2309.15328  [pdf, other

    cs.LG

    Exploring Learned Representations of Neural Networks with Principal Component Analysis

    Authors: Amit Harlev, Andrew Engel, Panos Stinis, Tony Chiang

    Abstract: Understanding feature representation for deep neural networks (DNNs) remains an open question within the general field of explainable AI. We use principal component analysis (PCA) to study the performance of a k-nearest neighbors classifier (k-NN), nearest class-centers classifier (NCC), and support vector machines on the learned layer-wise representations of a ResNet-18 trained on CIFAR-10. We sh… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures

  28. arXiv:2307.11684  [pdf, other

    cs.LG

    Minibatching Offers Improved Generalization Performance for Second Order Optimizers

    Authors: Eric Silk, Swarnita Chakraborty, Nairanjana Dasgupta, Anand D. Sarwate, Andrew Lumsdaine, Tony Chiang

    Abstract: Training deep neural networks (DNNs) used in modern machine learning is computationally expensive. Machine learning scientists, therefore, rely on stochastic first-order methods for training, coupled with significant hand-tuning, to obtain good performance. To better understand performance variability of different stochastic algorithms, including second-order methods, we conduct an empirical study… ▽ More

    Submitted 25 May, 2023; originally announced July 2023.

    Comments: 14 pages, 6 figures, 5 tables

  29. arXiv:2306.03681  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Consistency between reflection M-EELS and optical spectroscopy measurements of the long-wavelength density response of Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$

    Authors: Jin Chen, Xuefei Guo, Christian Boyd, Simon Bettler, Caitlin Kengle, Dipanjan Chaudhuri, Farzaneh Hoveyda, Ali Husain, John Schneeloch, Genda Gu, Philip Phillips, Bruno Uchoa, Tai-Chang Chiang, Peter Abbamonte

    Abstract: The density fluctuation spectrum captures many fundamental properties of strange metals. Using momentum-resolved electron energy-loss spectroscopy (M-EELS), we recently showed that the density response of the strange metal Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$ (Bi-2212) at large momentum, $q$, exhibits a constant-in-frequency continuum [Mitrano, PNAS $\textbf{115}$, 5392 (2018); Husain, PRX $\textbf{9}$,… ▽ More

    Submitted 13 December, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 9 pages, 7 figures; copy editing, improved figure resolution

  30. arXiv:2305.14585  [pdf, other

    cs.LG

    Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

    Authors: Andrew Engel, Zhichao Wang, Natalie S. Frank, Ioana Dumitriu, Sutanay Choudhury, Anand Sarwate, Tony Chiang

    Abstract: A recent trend in explainable AI research has focused on surrogate modeling, where neural networks are approximated as simpler ML algorithms such as kernel machines. A second trend has been to utilize kernel functions in various explain-by-example or data attribution tasks. In this work, we combine these two trends to analyze approximate empirical neural tangent kernels (eNTK) for data attribution… ▽ More

    Submitted 11 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, 3 tables Updated 3/11/2024 various additions/clarifications after ICLR review. Accepted as a Spotlight paper at ICLR 2024

  31. arXiv:2303.02971  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Observation of 2D Weyl Fermion States in Epitaxial Bismuthene

    Authors: Qiangsheng Lu, P. V. Sreenivasa Reddy, Hoyeon Jeon, Alessandro R. Mazza, Matthew Brahlek, Weikang Wu, Shengyuan A. Yang, Jacob Cook, Clayton Conner, Xiaoqian Zhang, Amarnath Chakraborty, Yueh-Ting Yao, Hung-Ju Tien, Chun-Han Tseng, Po-Yuan Yang, Shang-Wei Lien, Hsin Lin, Tai-Chang Chiang, Giovanni Vignale, An-Ping Li, Tay-Rong Chang, Rob G. Moore, Guang Bian

    Abstract: A two-dimensional (2D) Weyl semimetal featuring a spin-polarized linear band dispersion and a nodal Fermi surface is a new topological phase of matter. It is a solid-state realization of Weyl fermions in an intrinsic 2D system. The nontrivial topology of 2D Weyl cones guarantees the existence of a new form of topologically protected boundary states, Fermi string edge states. In this work, we repor… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 5 figures

  32. arXiv:2303.02731  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality

    Authors: Hsuan-Kung Yang, Tsung-Chih Chiang, Jou-Min Liu, Ting-Ru Liu, Chun-Wei Huang, Tsu-Ching Hsiao, Chun-Yi Lee

    Abstract: In the context of autonomous navigation, effectively conveying abstract navigational cues to agents in dynamic environments presents significant challenges, particularly when navigation information is derived from diverse modalities such as both vision and high-level language descriptions. To address this issue, we introduce a novel technique termed `Virtual Guidance,' which is designed to visuall… ▽ More

    Submitted 14 March, 2025; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: Tsung-Chih Chiang, Jou-Min Liu, Ting-Ru Liu, and Chun-Wei Huang contributed equally to this work; This work has been submitted to the IEEE for possible publication

  33. arXiv:2303.00256  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Influence of Structural Defects on Charge Density Waves in 1T-TaS2

    Authors: I. Lutsyk, K. Szalowski, P. Krukowski, P. Dabrowski, M. Rogala, W. Kozlowski, M. Le Ster, M. Piskorski, D. A. Kowalczyk, W. Rys, R. Dunal, A. Nadolska, K. Toczek, P. Przybysz, E. Lacinska, J. Binder, A. Wysmolek, N. Olszowska, J. J. Kolodziej, M. Gmitra, T. Hattori, Y. Kuwahara, G. Bian, T. -C. Chiang, P. J. Kowalczyk

    Abstract: The influence of intrinsic defects of 1T-TaS2 on charge density waves (CDW) is studied using scanning tunneling microscopy and spectroscopy (STM, STS), angle-resolved photoelectron spectroscopy (ARPES), and density functional theory (DFT). We identify several types of structural defects and find that most have a local character limited to the single CDW site, with single exception which effectivel… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 25 pages + 5 pages in SI, 6 figures + 6 figures in SI

  34. Dimensional crossover and symmetry transformation of the charge density waves in VSe2

    Authors: P. Chen, Y. -H. Chan, R. -Y. Liu, H. T. Zhang, Q. Gao, A. -V. Fedorov, M. Y. Chou, T. -C. Chiang

    Abstract: Collective phenomena in solids can be sensitive to the dimensionality of the system; a case of special interest is VSe2, which shows a (r7 x r3) charge density wave (CDW) in the single layer with the three-fold symmetry in the normal phase spontaneously broken, in contrast to the (4 x 4) in-plane CDW in the bulk. Angle-resolved photoemission spectroscopy (ARPES) from VSe2 ranging from a single lay… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: Phys. Rev. B 105, L161404 (2022)

  35. arXiv:2302.03876  [pdf

    cond-mat.mes-hall

    Evidence of high-temperature exciton condensation in a two-dimensional semimetal

    Authors: Qiang Gao, Yang-hao Chan, Yuzhe Wang, Haotian Zhang, Jinxu Pu, Shengtao Cui, Yichen Yang, Zhengtai Liu, Dawei Shen, Zhe Sun, Juan Jiang, Tai C. Chiang, Peng Chen

    Abstract: Electrons and holes can spontaneously form excitons and condense in a semimetal or semiconductor, as predicted decades ago. This type of Bose condensation can happen at much higher temperatures in comparison with dilute atomic gases. Two-dimensional (2D) materials with reduced Coulomb screening around the Fermi level are promising for realizing such a system. Here we report a change in the band st… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Journal ref: Nat. Commun. 14, 994 (2023)

  36. arXiv:2301.00402  [pdf

    cond-mat.mes-hall

    Edge States of α-Bismuthene Nanostructures

    Authors: Sara Salehitaleghani, Tobias Maerkl, Pawel J Kowalczyk, Maxime Le Ster, Xiaoxiong Wang, Guang Bian, Tai-Chang Chiang, Simon A Brown

    Abstract: We present a systematic investigation of the edge states of two-dimensional α-bismuthene (α-Bi) structures self-assembled on HOPG substrates, using scanning tunnelling microscopy and scanning tunnelling spectroscopy. The measurements are carried out for 3ML, 5ML and 7ML thick Bi structures. Our spectroscopy studies reveal clear features at the edges of the 5ML and 7ML thick structures, and the pos… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

    Comments: 17 pages, 6 figures

    Journal ref: 2D Materials 10, 015020 (2023)

  37. arXiv:2212.14801  [pdf, other

    cs.CV

    ExReg: Wide-range Photo Exposure Correction via a Multi-dimensional Regressor with Attention

    Authors: Tzu-Hao Chiang, Hao-Chien Hsueh, Ching-Chun Hsiao, Ching-Chun Huang

    Abstract: Photo exposure correction is widely investigated, but fewer studies focus on correcting under and over-exposed images simultaneously. Three issues remain open to handle and correct under and over-exposed images in a unified way. First, a locally-adaptive exposure adjustment may be more flexible instead of learning a global mapping. Second, it is an ill-posed problem to determine the suitable expos… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 12 pages, 8 figures

  38. arXiv:2211.07452  [pdf, other

    astro-ph.GA astro-ph.CO hep-ph

    Can ultralight dark matter explain the age-velocity dispersion relation of the Milky Way disc: A revised and improved treatment

    Authors: Barry T. Chiang, Jeremiah P. Ostriker, Hsi-Yu Schive

    Abstract: Ultralight axion-like particles $m_a \sim 10^{-22}$ eV, or Fuzzy Dark Matter (FDM), behave comparably to cold dark matter (CDM) on cosmological scales and exhibit a kpc-size de Broglie wavelength capable of alleviating established (sub-)galactic-scale problems of CDM. Substructures inside an FDM halo incur gravitational potential perturbations, resulting in stellar heating sufficient to account fo… ▽ More

    Submitted 5 December, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 19 pages, 9 figures; Accepted for publication in MNRAS. Matches published version

  39. arXiv:2211.06506  [pdf, other

    cs.LG stat.ML

    Spectral Evolution and Invariance in Linear-width Neural Networks

    Authors: Zhichao Wang, Andrew Engel, Anand Sarwate, Ioana Dumitriu, Tony Chiang

    Abstract: We investigate the spectral properties of linear-width feed-forward neural networks, where the sample size is asymptotically proportional to network width. Empirically, we show that the spectra of weight in this high dimensional regime are invariant when trained by gradient descent for small constant learning rates; we provide a theoretical justification for this observation and prove the invarian… ▽ More

    Submitted 7 November, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted by NeurIPS 2023

  40. arXiv:2210.15026  [pdf, other

    cond-mat.str-el

    Anharmonic multiphonon origin of the valence plasmon in SrTi1-xNbxO3

    Authors: Caitlin S. Kengle, Samantha I. Rubeck, Melinda Rak, Jin Chen, Faren Hoveyda, Simon Bettler, Ali Husain, Matteo Mitrano, Alexander Edelman, Peter Littlewood, Tai-Chang Chiang, Fahad Mahmood, Peter Abbamonte

    Abstract: Doped SrTi1-xNbxO3 exhibits superconductivity and a mid-infrared optical response reminiscent of copper-oxide superconductors. Strangely, its plasma frequency, omega_p, increases by a factor of ~3 when cooling from 300 K to 20 K, without any accepted explanation. Here, we present momentum-resolved electron energy loss spectroscopy (M-EELS) measurements of SrTi1-xNbxO3 at nonzero momentum, q. We fi… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures

  41. arXiv:2207.12551  [pdf, other

    cs.CL

    DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

    Authors: Jessica Huynh, Ting-Rui Chiang, Jeffrey Bigham, Maxine Eskenazi

    Abstract: Dialog system developers need high-quality data to train, fine-tune and assess their systems. They often use crowdsourcing for this since it provides large quantities of data from many workers. However, the data may not be of sufficiently good quality. This can be due to the way that the requester presents a task and how they interact with the workers. This paper introduces DialCrowd 2.0 to help r… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Published at LREC 2022

  42. arXiv:2205.12372  [pdf, other

    cs.LG

    TorchNTK: A Library for Calculation of Neural Tangent Kernels of PyTorch Models

    Authors: Andrew Engel, Zhichao Wang, Anand D. Sarwate, Sutanay Choudhury, Tony Chiang

    Abstract: We introduce torchNTK, a python library to calculate the empirical neural tangent kernel (NTK) of neural network models in the PyTorch framework. We provide an efficient method to calculate the NTK of multilayer perceptrons. We compare the explicit differentiation implementation against autodifferentiation implementations, which have the benefit of extending the utility of the library to any archi… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 19 pages, 5 figures

  43. arXiv:2110.08130  [pdf, other

    cs.CL

    Breaking Down Multilingual Machine Translation

    Authors: Ting-Rui Chiang, Yi-Pei Chen, Yi-Ting Yeh, Graham Neubig

    Abstract: While multilingual training is now an essential ingredient in machine translation (MT) systems, recent work has demonstrated that it has different effects in different multilingual settings, such as many-to-one, one-to-many, and many-to-many learning. These training settings expose the encoder and the decoder in a machine translation model with different data distributions. In this paper, we exami… ▽ More

    Submitted 3 April, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022 Findings

  44. arXiv:2110.05665  [pdf, other

    cs.CL

    Are you doing what I say? On modalities alignment in ALFRED

    Authors: Ting-Rui Chiang, Yi-Ting Yeh, Ta-Chung Chi, Yau-Shian Wang

    Abstract: ALFRED is a recently proposed benchmark that requires a model to complete tasks in simulated house environments specified by instructions in natural language. We hypothesize that key to success is accurately aligning the text modality with visual inputs. Motivated by this, we inspect how well existing models can align these modalities using our proposed intrinsic metric, boundary adherence score (… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted by Novel Ideas in Learning-to-Learn through Interaction at EMNLP 2021

  45. arXiv:2110.05301  [pdf, other

    cs.CL

    On a Benefit of Mask Language Modeling: Robustness to Simplicity Bias

    Authors: Ting-Rui Chiang

    Abstract: Despite the success of pretrained masked language models (MLM), why MLM pretraining is useful is still a qeustion not fully answered. In this work we theoretically and empirically show that MLM pretraining makes models robust to lexicon-level spurious features, partly answer the question. We theoretically show that, when we can model the distribution of a spurious feature $Π$ conditioned on the co… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Work in progress

  46. arXiv:2110.04907  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Observation of Unpinned Two-Dimensional Dirac States in Antimony Single Layers with Phosphorene Structure

    Authors: Qiangsheng Lu, Matthew Snyder, Kyle Y. Chen, Xiaoqian Zhang, Jacob Cook, Duy Tung Nguyen, P. V. Sreenivasa Reddy, Tay-Rong Chang, Pawel J. Kowalczyk, Simon A. Brown, Tai-Chang Chiang, Shengyuan A. Yang, Guang Bian

    Abstract: The discovery of graphene has stimulated enormous interest in two-dimensional (2D) electron gas with linear band structure. 2D Dirac materials possess many intriguing physical properties such as high carrier mobility and zero-energy Landau level thanks to the relativistic dispersion and chiral spin/pseudospin texture. 2D Dirac states discovered so far are exclusively pinned at high-symmetry points… ▽ More

    Submitted 18 October, 2021; v1 submitted 10 October, 2021; originally announced October 2021.

    Comments: 5 figures

    Journal ref: Nature Communications 13:4603 (2022)

  47. arXiv:2109.14144  [pdf, other

    cs.CL cs.LG

    Improving Dialogue State Tracking by Joint Slot Modeling

    Authors: Ting-Rui Chiang, Yi-Ting Yeh

    Abstract: Dialogue state tracking models play an important role in a task-oriented dialogue system. However, most of them model the slot types conditionally independently given the input. We discover that it may cause the model to be confused by slot types that share the same data type. To mitigate this issue, we propose TripPy-MRF and TripPy-LSTM that models the slots jointly. Our results show that they ar… ▽ More

    Submitted 14 November, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Accepted to the 3rd Workshop on NLP for ConvAI in EMNLP 2021

  48. arXiv:2109.08705  [pdf, other

    cs.CL cs.LG

    Relating Neural Text Degeneration to Exposure Bias

    Authors: Ting-Rui Chiang, Yun-Nung Chen

    Abstract: This work focuses on relating two mysteries in neural-based text generation: exposure bias, and text degeneration. Despite the long time since exposure bias was mentioned and the numerous studies for its remedy, to our knowledge, its impact on text generation has not yet been verified. Text degeneration is a problem that the widely-used pre-trained language model GPT-2 was recently found to suffer… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted by BlackBoxNLP at EMNLP 2021

  49. arXiv:2108.10874  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Kramers-Weyl fermions in the chiral charge density wave material (TaSe$_4$)$_2$I

    Authors: Soyeun Kim, Robert C. McKay, Nina Bielinski, Chengxi Zhao, Meng-Kai Lin, Joseph A. Hlevyack, Xuefei Guo, Sung-Kwan Mo, Peter Abbamonte, Tai-Chang Chiang, André Schleife, Daniel P. Shoemaker, Barry Bradlyn, Fahad Mahmood

    Abstract: The quasi-one-dimensional chiral charge density wave (CDW) material (TaSe$_4$)$_2$I has been recently predicted to host Kramers-Weyl (KW) fermions which should exist in the vicinity of high symmetry points in the Brillouin zone in chiral materials with strong spin-orbit coupling. However, direct spectroscopic evidence of KW fermions is limited. Here we use helicity-dependent laser-based angle reso… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

  50. arXiv:2106.07137  [pdf, other

    cs.CL

    Why Can You Lay Off Heads? Investigating How BERT Heads Transfer

    Authors: Ting-Rui Chiang, Yun-Nung Chen

    Abstract: The huge size of the widely used BERT family models has led to recent efforts about model distillation. The main goal of distillation is to create a task-agnostic pre-trained model that can be fine-tuned on downstream tasks without fine-tuning its full-sized version. Despite the progress of distillation, to what degree and for what reason a task-agnostic model can be created from distillation has… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.