Skip to main content

Showing 1–50 of 363 results for author: Cox, D

.
  1. arXiv:2505.23604  [pdf, ps, other

    cs.CL cs.AI cs.SE

    Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

    Authors: Guangtao Zeng, Maohao Shen, Delin Chen, Zhenting Qi, Subhro Das, Dan Gutfreund, David Cox, Gregory Wornell, Wei Lu, Zhang-Wei Hong, Chuang Gan

    Abstract: Language models (LMs) perform well on standardized coding benchmarks but struggle with real-world software engineering tasks such as resolving GitHub issues in SWE-Bench, especially when model parameters are less than 100B. While smaller models are preferable in practice due to their lower computational cost, improving their performance remains challenging. Existing approaches primarily rely on su… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.22482  [pdf, other

    physics.optics

    Inline calibration of spatial light modulators in nonlinear microscopy

    Authors: Daniël W. S. Cox, Harish Sasikumar, Ivo M. Vellekoop

    Abstract: We present a method for calibrating the response of a phase-only spatial light modulator in nonlinear microscopy. Our method uses the microscope image itself as calibration measurement and requires no additional hardware components. Our method is adapted to the nonlinear signals encountered in multi-photon excitation fluorescence microscopes, and works well even under low light conditions and with… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 10 pages, 5 figures

  3. arXiv:2505.10276  [pdf, ps, other

    cond-mat.mes-hall physics.optics

    Chiral near-field control of quantum light generation using magneto-optical graphene

    Authors: Mikkel Have Eriksen, Joel D. Cox

    Abstract: We theoretically explore strategies to actively control photon emission from quantum light sources by leveraging the large magneto-optical response of graphene. The quantum electrodynamic response of graphene -- characterized by the Purcell factor and the Lamb shift of a proximal emitter -- is analyzed for extended two-dimensional sheets, one-dimensional nanoribbons, and zero-dimensional nanodisks… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 16 pages, 4 figures

  4. arXiv:2505.08398  [pdf, other

    cond-mat.mes-hall

    Nonlocal electrodynamics of two-dimensional anisotropic magneto-plasmons

    Authors: A. J. Chaves, Line Jelver, D. R. da Costa, Joel D. Cox, N. Asger Mortensen, Nuno M. R. Peres

    Abstract: We present a hydrodynamic model, grounded in Madelung's formalism, to describe collective electronic motion in anisotropic materials. This model incorporates nonlocal contributions from the Thomas-Fermi quantum pressure and quantum effects arising from the Bohm potential. We derive analytical expressions for the magnetoplasmon dispersion and nonlocal optical conductivity. To demonstrate the applic… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  5. arXiv:2505.04572  [pdf, other

    cs.RO

    Stow: Robotic Packing of Items into Fabric Pods

    Authors: Nicolas Hudson, Josh Hooks, Rahul Warrier, Curt Salisbury, Ross Hartley, Kislay Kumar, Bhavana Chandrashekhar, Paul Birkmeyer, Bosch Tang, Matt Frost, Shantanu Thakar, Tony Piaskowy, Petter Nilsson, Josh Petersen, Neel Doshi, Alan Slatter, Ankit Bhatia, Cassie Meeker, Yuechuan Xue, Dylan Cox, Alex Kyriazis, Bai Lou, Nadeem Hasan, Asif Rana, Nikhil Chacko , et al. (12 additional authors not shown)

    Abstract: This paper presents a compliant manipulation system capable of placing items onto densely packed shelves. The wide diversity of items and strict business requirements for high producing rates and low defect generation have prohibited warehouse robotics from performing this task. Our innovations in hardware, perception, decision-making, motion planning, and control have enabled this system to perfo… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  6. arXiv:2504.12397  [pdf, other

    cs.LG cs.AI

    Activated LoRA: Fine-tuned LLMs for Intrinsics

    Authors: Kristjan Greenewald, Luis Lastras, Thomas Parnell, Vraj Shah, Lucian Popa, Giulio Zizzo, Chulaka Gunasekara, Ambrish Rawat, David Cox

    Abstract: Low-Rank Adaptation (LoRA) has emerged as a highly efficient framework for finetuning the weights of large foundation models, and has become the go-to method for data-driven customization of LLMs. Despite the promise of highly customized behaviors and capabilities, switching between relevant LoRAs in a multiturn setting is inefficient, as the key-value (KV) cache of the entire turn history must be… ▽ More

    Submitted 23 May, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  7. Roadmap for Photonics with 2D Materials

    Authors: F. Javier García de Abajo, D. N. Basov, Frank H. L. Koppens, Lorenzo Orsini, Matteo Ceccanti, Sebastián Castilla, Lorenzo Cavicchi, Marco Polini, P. A. D. Gonçalves, A. T. Costa, N. M. R. Peres, N. Asger Mortensen, Sathwik Bharadwaj, Zubin Jacob, P. J. Schuck, A. N. Pasupathy, Milan Delor, M. K. Liu, Aitor Mugarza, Pablo Merino, Marc G. Cuxart, Emigdio Chávez-Angel, Martin Svec, Luiz H. G. Tizei, Florian Dirnberger , et al. (123 additional authors not shown)

    Abstract: Triggered by the development of exfoliation and the identification of a wide range of extraordinary physical properties in self-standing films consisting of one or few atomic layers, two-dimensional (2D) materials such as graphene, transition metal dichalcogenides (TMDs), and other van der Waals (vdW) crystals currently constitute a wide research field protruding in multiple directions in combinat… ▽ More

    Submitted 14 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

    Comments: 199 pages, 42 figures, 1154 references

  8. arXiv:2503.00519  [pdf, other

    cond-mat.mes-hall physics.optics

    Roadmap on Nonlocality in Photonic Materials and Metamaterials

    Authors: Francesco Monticone, N. Asger Mortensen, Antonio I. Fernández-Domínguez, Yu Luo, Xuezhi Zheng, Christos Tserkezis, Jacob B. Khurgin, Tigran V. Shahbazyan, André J. Chaves, Nuno M. R. Peres, Gino Wegner, Kurt Busch, Huatian Hu, Fabio Della Sala, Pu Zhang, Cristian Ciracì, Javier Aizpurua, Antton Babaze, Andrei G. Borisov, Xue-Wen Chen, Thomas Christensen, Wei Yan, Yi Yang, Ulrich Hohenester, Lorenz Huber , et al. (41 additional authors not shown)

    Abstract: Photonic technologies continue to drive the quest for new optical materials with unprecedented responses. A major frontier in this field is the exploration of nonlocal (spatially dispersive) materials, going beyond the local, wavevector-independent assumption traditionally made in optical material modeling. On one end, the growing interest in plasmonic, polaritonic and quantum materials has reveal… ▽ More

    Submitted 28 March, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

  9. arXiv:2502.20204  [pdf, other

    cs.IR cs.CL

    Granite Embedding Models

    Authors: Parul Awasthy, Aashka Trivedi, Yulong Li, Mihaela Bornea, David Cox, Abraham Daniels, Martin Franz, Gabe Goodhart, Bhavani Iyer, Vishwajeet Kumar, Luis Lastras, Scott McCarley, Rudra Murthy, Vignesh P, Sara Rosenthal, Salim Roukos, Jaydeep Sen, Sukriti Sharma, Avirup Sil, Kate Soule, Arafat Sultan, Radu Florian

    Abstract: We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities. This report provides the technical details of training these highly effective 12 layer embedding models, along with their efficient 6 layer distilled counterparts. Extensive… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  10. arXiv:2502.09927  [pdf, other

    cs.CV cs.AI

    Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

    Authors: Granite Vision Team, Leonid Karlinsky, Assaf Arbelle, Abraham Daniels, Ahmed Nassar, Amit Alfassi, Bo Wu, Eli Schwartz, Dhiraj Joshi, Jovana Kondic, Nimrod Shabtay, Pengyuan Li, Roei Herzig, Shafiq Abedin, Shaked Perek, Sivan Harary, Udi Barzelay, Adi Raz Goldfarb, Aude Oliva, Ben Wieles, Bishwaranjan Bhattacharjee, Brandon Huang, Christoph Auer, Dan Gutfreund, David Beymer , et al. (38 additional authors not shown)

    Abstract: We introduce Granite Vision, a lightweight large language model with vision capabilities, specifically designed to excel in enterprise use cases, particularly in visual document understanding. Our model is trained on a comprehensive instruction-following dataset, including document-related tasks, such as content extraction from tables, charts, diagrams, sketches, and infographics, as well as gener… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  11. arXiv:2502.02508  [pdf, ps, other

    cs.CL cs.AI

    Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

    Authors: Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory Wornell, Subhro Das, David Cox, Chuang Gan

    Abstract: Large language models (LLMs) have demonstrated remarkable reasoning capabilities across diverse domains. Recent studies have shown that increasing test-time computation enhances LLMs' reasoning capabilities. This typically involves extensive sampling at inference time guided by an external LLM verifier, resulting in a two-player system. Despite external guidance, the effectiveness of this system d… ▽ More

    Submitted 2 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  12. arXiv:2501.04535  [pdf

    quant-ph physics.app-ph

    Roadmap on Atomic-scale Semiconductor Devices

    Authors: Steven R. Schofield, Andrew J. Fisher, Eran Ginossar, Joseph W. Lyding, Richard Silver, Fan Fei, Pradeep Namboodiri, Jonathan Wyrick, M. G. Masteghin, D. C. Cox, B. N. Murdin, S. K Clowes, Joris G. Keizer, Michelle Y. Simmons, Holly G. Stemp, Andrea Morello, Benoit Voisin, Sven Rogge, Robert A. Wolkow, Lucian Livadaru, Jason Pitters, Taylor J. Z. Stock, Neil J. Curson, Robert E. Butera, Tatiana V. Pavlova , et al. (25 additional authors not shown)

    Abstract: Spin states in semiconductors provide exceptionally stable and noise-resistant environments for qubits, positioning them as optimal candidates for reliable quantum computing technologies. The proposal to use nuclear and electronic spins of donor atoms in silicon, introduced by Kane in 1998, sparked a new research field focused on the precise positioning of individual impurity atoms for quantum dev… ▽ More

    Submitted 22 January, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

    Comments: 94 pages

    Journal ref: Nano Futures 9 012001 (2025)

  13. arXiv:2412.12970  [pdf, other

    math.CO

    Graph Burning On Large $p$-Caterpillars

    Authors: Danielle Cox, M. E. Messinger, Kerry Ojakian

    Abstract: Graph burning models the spread of information or contagion in a graph. At each time step, two events occur: neighbours of already burned vertices become burned, and a new vertex is chosen to be burned. The big conjecture is known as the {\it burning number conjecture}: for any connected graph on $n$ vertices, all $n$ vertices can be burned after at most $\lceil \sqrt{n}\ \rceil$ time steps. It is… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    MSC Class: 05C57

  14. arXiv:2411.09377  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Chiral Light-Matter Interactions with Thermal Magnetoplasmons in Graphene Nanodisks

    Authors: Mikkel Have Eriksen, Juan R. Deop-Ruano, Joel D. Cox, Alejandro Manjavacas

    Abstract: We investigate the emergence of self-hybridized thermal magnetoplasmons in doped graphene nanodisks at finite temperatures when subjected to an external magnetic field. Using a semianalytical approach, which fully describes the eigenmodes and polarizability of the graphene nanodisks, we show that the hybridization originates from the coupling of transitions between thermally populated Landau level… ▽ More

    Submitted 30 December, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 22 pages, 11 figures

  15. arXiv:2410.03745  [pdf, ps, other

    math.AG math.NT

    Notes on Three Formulas of Abel

    Authors: David A. Cox

    Abstract: These notes explore three amazing formulas proved by Abel in his 1826 Paris memoir on what we now call Abelian integrals. We discuss the first two formulas from the point of view of symbolic computation and explain their connection to residues and partial fractions. The third formula arises from the first two and is related to the genus and lattice points in the Newton polygon.

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 49 pages, 4 figures

    MSC Class: Primary 01A55; 14K20; Secondary 14Q05; 68W30

  16. arXiv:2409.04565  [pdf, other

    physics.optics

    Orthonormalization of phase-only basis functions

    Authors: Daniël W. S. Cox, Ivo M. Vellekoop

    Abstract: Orthonormal bases serve as a powerful mathematical tool in theoretical and experimental optics. However, producing arbitrary optical fields in real-world experiments is limited by the hardware, which in many cases involves a phase-only spatial light modulator. Since most basis functions also have a varying amplitude component, they cannot be represented truthfully. We present a general method to c… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 12 pages, 6 figures

  17. arXiv:2408.14798  [pdf, other

    cond-mat.mes-hall physics.optics

    Nonlinear thermoplasmonics in graphene nanostructures

    Authors: Line Jelver, Joel D. Cox

    Abstract: The linear electronic dispersion relation of graphene endows the atomically thin carbon layer with a large intrinsic optical nonlinearity, with regard to both parametric and photothermal processes. While plasmons in graphene nanostructures can further enhance nonlinear optical phenomena, boosting resonances to the technologically relevant mid- and near-infrared (IR) spectral regime necessitates pa… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 12 pages, 7 figures

  18. arXiv:2408.13359  [pdf, other

    cs.CL cs.AI cs.LG

    Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

    Authors: Yikang Shen, Matthew Stallone, Mayank Mishra, Gaoyuan Zhang, Shawn Tan, Aditya Prasad, Adriana Meza Soria, David D. Cox, Rameswar Panda

    Abstract: Finding the optimal learning rate for language model pretraining is a challenging task. This is not only because there is a complicated correlation between learning rate, batch size, number of training tokens, model size, and other hyperparameters but also because it is prohibitively expensive to perform a hyperparameter search for large language models with Billions or Trillions of parameters. Re… ▽ More

    Submitted 11 September, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

  19. arXiv:2407.13739  [pdf, other

    cs.AI cs.CL cs.SE

    Scaling Granite Code Models to 128K Context

    Authors: Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn, Tim Bula, Mayank Mishra, Adriana Meza Soria, Gaoyuan Zhang, Aditya Prasad, Yikang Shen, Saptha Surendran, Shanmukha Guttula, Hima Patel, Parameswaran Selvam, Xuan-Hong Dang, Yan Koyfman, Atin Sood, Rogerio Feris, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda

    Abstract: This paper introduces long-context Granite code models that support effective context windows of up to 128K tokens. Our solution for scaling context length of Granite 3B/8B code models from 2K/4K to 128K consists of a light-weight continual pretraining by gradually increasing its RoPE base frequency with repository-level file packing and length-upsampled long-context data. Additionally, we also re… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  20. arXiv:2407.05467  [pdf, other

    cs.DC cs.AI

    The infrastructure powering IBM's Gen AI model development

    Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla, Lan Hoang, Danny Barnett, I-Hsin Chung, Apoorve Mohan, Ming-Hung Chen, Lixiang Luo, Robert Walkup, Constantinos Evangelinos, Shweta Salaria, Marc Dombrowa, Yoonho Park, Apo Kayi, Liran Schour, Alim Alim, Ali Sydney, Pavlos Maniotis, Laurent Schares, Bernard Metzler, Bengi Karacali-Akyamac, Sophia Wen, Tatsuhiro Chiba , et al. (122 additional authors not shown)

    Abstract: AI Infrastructure plays a key role in the speed and cost-competitiveness of developing and deploying advanced AI models. The current demand for powerful AI infrastructure for model training is driven by the emergence of generative AI and foundational models, where on occasion thousands of GPUs must cooperate on a single training job for the model to be trained in a reasonable time. Delivering effi… ▽ More

    Submitted 13 January, 2025; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: Corresponding Authors: Talia Gershon, Seetharami Seelam,Brian Belgodere, Milton Bonilla

  21. arXiv:2407.00121  [pdf, other

    cs.LG cs.AI cs.CL

    Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

    Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras , et al. (1 additional authors not shown)

    Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  22. arXiv:2406.12034  [pdf, other

    cs.CL cs.LG

    Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

    Authors: Junmo Kang, Leonid Karlinsky, Hongyin Luo, Zhen Wang, Jacob Hansen, James Glass, David Cox, Rameswar Panda, Rogerio Feris, Alan Ritter

    Abstract: We present Self-MoE, an approach that transforms a monolithic LLM into a compositional, modular system of self-specialized experts, named MiXSE (MiXture of Self-specialized Experts). Our approach leverages self-specialization, which constructs expert modules using self-generated synthetic data, each equipping a shared base LLM with distinct domain-specific capabilities, activated via self-optimize… ▽ More

    Submitted 7 October, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  23. Wavefront Threading Enables Effective High-Level Synthesis

    Authors: Blake Pelton, Adam Sapek, Ken Eguro, Daniel Lo, Alessandro Forin, Matt Humphrey, Jinwen Xi, David Cox, Rajas Karandikar, Johannes de Fine Licht, Evgeny Babin, Adrian Caulfield, Doug Burger

    Abstract: Digital systems are growing in importance and computing hardware is growing more heterogeneous. Hardware design, however, remains laborious and expensive, in part due to the limitations of conventional hardware description languages (HDLs) like VHDL and Verilog. A longstanding research goal has been programming hardware like software, with high-level languages that can generate efficient hardware… ▽ More

    Submitted 10 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to PLDI'24

  24. arXiv:2405.17258  [pdf, other

    cs.LG cs.AI

    $\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning

    Authors: Runqian Wang, Soumya Ghosh, David Cox, Diego Antognini, Aude Oliva, Rogerio Feris, Leonid Karlinsky

    Abstract: Low-rank adapters (LoRA) and their variants are popular parameter-efficient fine-tuning (PEFT) techniques that closely match full model fine-tune performance while requiring only a small number of additional parameters. These additional LoRA parameters are specific to the base model being adapted. When the base model needs to be deprecated and replaced with a new one, all the associated LoRA modul… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  25. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  26. Practical considerations for high-fidelity wavefront shaping experiments

    Authors: Bahareh Mastiani, Daniël W. S. Cox, Ivo M. Vellekoop

    Abstract: Wavefront shaping is a technique for directing light through turbid media. The theoretical aspects of wavefront shaping are well understood, and under near-ideal experimental conditions, accurate predictions for the expected signal enhancement can be given. In practice, however, there are many experimental factors that negatively affect the outcome of the experiment. Here, we present a comprehensi… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 23 pages, 11 figures, submitted for publication in Journal of Physics Photonics

  27. arXiv:2403.01081  [pdf, other

    cs.CL cs.LG

    LAB: Large-Scale Alignment for ChatBots

    Authors: Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Kai Xu, David D. Cox, Akash Srivastava

    Abstract: This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training. Leveraging a taxonomy-guided synthetic data generation process and a multi-phase tuning framework, LAB significantly reduces reliance on expensive human annotations and proprietary models like GPT-… ▽ More

    Submitted 29 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Corresponding Author: Akash Srivastava. Equal Contribution: Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Akash Srivastava, Code: https://github.com/instructlab

  28. Generation of entangled waveguided photon pairs by free electrons

    Authors: Theis P. Rasmussen, Álvaro Rodríguez Echarri, Joel D. Cox, F. Javier García de Abajo

    Abstract: Entangled photon pairs are a key resource in future quantum-optical communication and information technologies. While high-power laser light propagating in bulk nonlinear optical crystals is conventionally used to generate entangled photons that are routed into optical configurations, such schemes suffer from low efficiency due to the weak intrinsic nonlinear optical response of known materials an… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Journal ref: Sci. Adv. 10, eadn6312 (2024)

  29. arXiv:2311.13363  [pdf, other

    physics.optics

    Model-based aberration corrected microscopy inside a glass tube

    Authors: D. W. S. Cox, T. Knop, I. M. Vellekoop

    Abstract: Microscope objectives achieve near diffraction-limited performance only when used under the conditions they are designed for. In non-standard geometries, such as thick cover slips or curved surfaces, severe aberrations arise, inevitably impairing high-resolution imaging. Correcting such large aberrations using standard adaptive optics can be challenging: existing solutions are either not suited fo… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 9 pages, 3 figures, 1 table. Submitted to Optics Express

  30. arXiv:2310.07654  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Audio-Visual Neural Syntax Acquisition

    Authors: Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David Cox, David Harwath, Yang Zhang, Karen Livescu, James Glass

    Abstract: We study phrase structure induction from visually-grounded speech. The core idea is to first segment the speech waveform into sequences of word segments, and subsequently induce phrase structure using the inferred segment-level continuous representations. We present the Audio-Visual Neural Syntax Learner (AV-NSL) that learns phrase structure by listening to audio and looking at images, without eve… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  31. arXiv:2310.05910  [pdf, other

    cs.CL cs.AI cs.LG

    SALMON: Self-Alignment with Instructable Reward Models

    Authors: Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan

    Abstract: Supervised Fine-Tuning (SFT) on response demonstrations combined with Reinforcement Learning from Human Feedback (RLHF) constitutes a powerful paradigm for aligning LLM-based AI agents. However, a significant limitation of such an approach is its dependency on high-quality human annotations, making its application to intricate tasks challenging due to difficulties in obtaining consistent response… ▽ More

    Submitted 9 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Previous Title: SALMON: Self-Alignment with Principle-Following Reward Models. Accepted to ICLR 2024. Project page: https://github.com/IBM/SALMON

  32. arXiv:2310.03409  [pdf

    cond-mat.mtrl-sci physics.acc-ph physics.app-ph

    Detection Sensitivity Limit of Hundreds of Atoms with X-Ray Fluorescence Microscopy

    Authors: Mateus G. Masteghin, Toussaint Gervais, Steven K. Clowes, David C. Cox, Veronika Zelyk, Ajith Pattammattel, Yong S. Chu, Nikola Kolev, Taylor Z. Stock, Neil Curson, Paul G. Evans, Michael Stuckelberger, Benedict N. Murdin

    Abstract: We report X-ray fluorescence (XRF) imaging of nanoscale inclusions of impurities for quantum technology. A very bright diffraction-limited focus of the X-ray beam produces very high sensitivity and resolution. We investigated gallium (Ga) dopants in silicon (Si) produced by a focused ion beam (FIB). These dopants might provide 3/2-spin qubits or p-type electrical contacts and quantum dots. We find… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures

  33. arXiv:2310.00160  [pdf, other

    cs.CL cs.AI

    Self-Specialization: Uncovering Latent Expertise within Large Language Models

    Authors: Junmo Kang, Hongyin Luo, Yada Zhu, Jacob Hansen, James Glass, David Cox, Alan Ritter, Rogerio Feris, Leonid Karlinsky

    Abstract: Recent works have demonstrated the effectiveness of self-alignment in which a large language model is aligned to follow general instructions using instructional data generated from the model itself starting from a handful of human-written seeds. Instead of general alignment, in this work, we focus on self-alignment for expert domain specialization (e.g., biomedicine, finance). As a preliminary, we… ▽ More

    Submitted 5 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: ACL 2024 (Findings; Long Paper)

  34. arXiv:2308.09134  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics

    Nonlocal effects in atom-plasmon interactions

    Authors: Mikkel Have Eriksen, Christos Tserkezis, N. Asger Mortensen, Joel D. Cox

    Abstract: Nonlocal and quantum mechanical phenomena in noble metal nanostructures become increasingly crucial when the relevant length scales in hybrid nanostructures reach the few-nanometer regime. In practice, such mesoscopic effects at metal-dielectric interfaces can be described using exemplary surface-response functions (SRFs) embodied by the Feibelman $d$-parameters. Here we show that SRFs dramaticall… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 9 pages, 5 figures

    Journal ref: Nanophotonics 13, 2741 (2024)

  35. arXiv:2307.08477  [pdf

    cond-mat.mtrl-sci physics.optics

    Quantum-mechanical effects in photoluminescence from thin crystalline gold films

    Authors: Alan R. Bowman, Álvaro Rodríguez Echarri, Fatemeh Kiani, Fadil Iyikanat, Ted V. Tsoulos, Joel D. Cox, Ravishankar Sundararaman, F. Javier García de Abajo, Giulia Tagliabue

    Abstract: Luminescence constitutes a unique source of insight into hot carrier processes in metals, including those in plasmonic nanostructures used for sensing and energy applications. However, being weak in nature, metal luminescence remains poorly understood, its microscopic origin strongly debated, and its potential for unravelling nanoscale carrier dynamics largely unexploited. Here, we reveal quantum-… ▽ More

    Submitted 25 September, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Main text 21 pages and 4 figures. Supplemental Information 33 pages and 17 figures

    Journal ref: Light. Sci. Appl. 13, 91 (2024)

  36. arXiv:2307.07153  [pdf, other

    nucl-ex astro-ph.SR nucl-th

    Nuclear Level Density and $γ$-ray Strength Function of $^{67}\mathrm{Ni}$ and the impact on the i-process

    Authors: V. W. Ingeberg, S. Siem, M. Wiedeking, A. Choplin, S. Goriely, L. Siess, K. J. Abrahams, K. Arnswald, F. Bello Garrote, D. L. Bleuel, J. Cederkäll, T. L. Christoffersen, D. M. Cox, H. De Witte, L. P. Gaffney, A. Görgen, C. Henrich, A. Illana, P. Jones, B. V. Kheswa, T. Kröll, S. N. T. Majola, K. L. Malatji, J. Ojala, J. Pakarinen , et al. (7 additional authors not shown)

    Abstract: Proton-$γ$ coincidences from $(\mathrm{d},\mathrm{p})$ reactions between a $^{66}\mathrm{Ni}$ beam and a deuterated polyethylene target have been analyzed with the inverse-Oslo method to find the nuclear level density (NLD) and $γ$-ray strength function ($γ$SF) of $^{67}\mathrm{Ni}$. The $^{66}\mathrm{Ni}(n,γ)$ capture cross section has been calculated using the Hauser-Feshbach model in TALYS usin… ▽ More

    Submitted 14 November, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Submitted to Phys. Rev. C

    Journal ref: Phys. Rev. C 111, 015803 (2025)

  37. Simultaneous $γ$-ray and electron spectroscopy of $^{182,184,186}$Hg isotopes

    Authors: M. Stryjczyk, B. Andel, J. G. Cubiss, K. Rezynkina, T. R. Rodríguez, J. E. García-Ramos, A. N. Andreyev, J. Pakarinen, P. Van Duppen, S. Antalic, T. Berry, M. J. G. Borge, C. Clisu, D. M. Cox, H. De Witte, L. M. Fraile, H. O. U. Fynbo, L. P. Gaffney, L. J. Harkness-Brennan, M. Huyse, A. Illana, D. S. Judson, J. Konki, J. Kurcewicz, I. Lazarus , et al. (26 additional authors not shown)

    Abstract: Background: The mercury isotopes around $N=104$ are a well-known example of nuclei exhibiting shape coexistence. Mixing of configurations can be studied by measuring the monopole strength $ρ^2(E0)$, however, currently the experimental information is scarce and lacks precision, especially for the $I^π\rightarrow I^π$ ($I \neq 0$) transitions. Purpose: The goals of this study were to increase the pr… ▽ More

    Submitted 6 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 21 pages, 30 figures, accepted for publication in Physical Review C

    Journal ref: Phys. Rev. C 108, 014308 (2023)

  38. arXiv:2305.09532  [pdf, other

    cond-mat.mes-hall physics.optics

    Plasmons in phosphorene nanoribbons

    Authors: Line Jelver, Joel D. Cox

    Abstract: Phosphorene has emerged as an atomically-thin platform for optoelectronics and nanophotonics due to its excellent nonlinear optical properties and the possibility of actively tuning light-matter interactions through electrical doping. While phosphorene is a two-dimensional semiconductor, plasmon resonances characterized by pronounced anisotropy and strong optical confinement are anticipated to eme… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 12 pages, 7 figures

    Journal ref: ACS Nano 17, 20043 (2023)

  39. arXiv:2305.03047  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

    Authors: Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan

    Abstract: Recent AI-assistant agents, such as ChatGPT, predominantly rely on supervised fine-tuning (SFT) with human annotations and reinforcement learning from human feedback (RLHF) to align the output of large language models (LLMs) with human intentions, ensuring they are helpful, ethical, and reliable. However, this dependence can significantly constrain the true potential of AI-assistant agents due to… ▽ More

    Submitted 2 December, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023 (Spotlight). Project page: https://github.com/IBM/Dromedary

  40. arXiv:2304.03767  [pdf, other

    cs.CV

    Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

    Authors: Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan

    Abstract: Humans, even at a very early age, can learn visual concepts and understand geometry and layout through active interaction with the environment, and generalize their compositions to complete tasks described by natural languages in novel scenes. To mimic such capability, we propose Embodied Concept Learner (ECL) in an interactive 3D environment. Specifically, a robot agent can ground visual concepts… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: CoRL 2022

  41. arXiv:2303.00980  [pdf, other

    cs.LG

    Learning to Grow Pretrained Models for Efficient Transformer Training

    Authors: Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Daniel Cox, Zhangyang Wang, Yoon Kim

    Abstract: Scaling transformers has led to significant breakthroughs in many domains, leading to a paradigm in which larger versions of existing models are trained and released on a periodic basis. New instances of such models are typically trained completely from scratch, despite the fact that they are often just scaled-up versions of their smaller counterparts. How can we use the implicit knowledge in the… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: International Conference on Learning Representations (ICLR), 2023

  42. arXiv:2302.05941  [pdf, other

    cs.SE cs.AI

    Rapid Development of Compositional AI

    Authors: Lee Martie, Jessie Rosenberg, Veronique Demers, Gaoyuan Zhang, Onkar Bhardwaj, John Henning, Aditya Prasad, Matt Stallone, Ja Young Lee, Lucy Yip, Damilola Adesina, Elahe Paikari, Oscar Resendiz, Sarah Shaw, David Cox

    Abstract: Compositional AI systems, which combine multiple artificial intelligence components together with other application components to solve a larger problem, have no known pattern of development and are often approached in a bespoke and ad hoc style. This makes development slower and harder to reuse for future applications. To support the full rapid development cycle of compositional AI applications,… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted to ICSE 2023, NIER track

    Journal ref: 2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Technologies Results Track (ICSE-NIER), Melbourne, Australia, 2023, pp. (forthcoming)

  43. Nonlinear photoluminescence in gold thin films

    Authors: A. Rodríguez Echarri, F. Iyikanat, S. Boroviks, N. Asger Mortensen, Joel D. Cox, F. Javier García de Abajo

    Abstract: Promising applications in photonics are driven by the ability to fabricate crystal-quality metal thin films of controlled thickness down to a few nanometers. In particular, these materials exhibit a highly nonlinear response to optical fields owing to the induced ultrafast electron dynamics, which is however poorly understood on such mesoscopic length scales. Here, we reveal a new mechanism that c… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 20 pages, 6 figures, 64 references

    Journal ref: ACS Photonics 10, 2918 (2023)

  44. arXiv:2211.09790  [pdf, other

    cs.LG cs.AI cs.CV

    ConStruct-VL: Data-Free Continual Structured VL Concepts Learning

    Authors: James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David Cox, Diyi Yang, Zsolt Kira, Rogerio Feris, Leonid Karlinsky

    Abstract: Recently, large-scale pre-trained Vision-and-Language (VL) foundation models have demonstrated remarkable capabilities in many zero-shot downstream tasks, achieving competitive results for recognizing objects defined by as little as short text prompts. However, it has also been shown that VL models are still brittle in Structured VL Concept (SVLC) reasoning, such as the ability to recognize object… ▽ More

    Submitted 30 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)

  45. arXiv:2207.12134  [pdf, other

    quant-ph cond-mat.mes-hall physics.atom-ph physics.optics

    Optoelectronic control of atomic bistability with graphene

    Authors: Mikkel Have Eriksen, Jakob E. Olsen, Christian Wolff, Joel D. Cox

    Abstract: We explore the emergence and active control of optical bistability in a two-level atom near a graphene sheet. Our theory incorporates self-interaction of the optically-driven atom and its coupling to electromagnetic vacuum modes, both of which are sensitive to the electrically-tunable interband transition threshold in graphene. We show that electro-optical bistability and hysteresis can manifest i… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 16 pages, 7 figures

    Journal ref: Phys. Rev. Lett. 129, 253602 (2022)

  46. arXiv:2207.05122  [pdf, other

    quant-ph cond-mat.mes-hall physics.optics

    Nonlinear quantum logic with colliding graphene plasmons

    Authors: Giuseppe Calajò, Philipp K. Jenke, Lee A. Rozema, Philip Walther, Darrick E. Chang, Joel D. Cox

    Abstract: Graphene has emerged as a promising platform to bring nonlinear quantum optics to the nanoscale, where a large intrinsic optical nonlinearity enables long-lived and actively tunable plasmon polaritons to strongly interact. Here we theoretically study the collision between two counter-propagating plasmons in a graphene nanoribbon, where transversal subwavelength confinement endows propagating plasm… ▽ More

    Submitted 18 March, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 13 pages, 5 figures

    Journal ref: Phys. Rev. Research 5, 013188, (2023)

  47. arXiv:2206.12248  [pdf, ps, other

    math.CO

    Existence of Optimally-Greatest Digraphs for Strongly Connected Node Reliability

    Authors: Danielle Cox, Kyle MacKeigan, Emily Wright

    Abstract: In this paper, we introduce a new model to study network reliability with node failures. This model, strongly connected node reliability, is the directed variant of node reliability and measures the probability that the operational vertices induce a subdigraph that is strongly connected. If we are restricted to directed graphs with $n$ vertices and $n+1\leq m\leq 2n-3$ or $m=2n$ arcs, an optimally… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 24 pages, 17 figures

    MSC Class: 05C31 ACM Class: G.2.2; F.2.2

  48. arXiv:2206.10176  [pdf, other

    astro-ph.GA astro-ph.SR

    Hot methanol in the [BHB2007] 11 protobinary system: hot corino versus shock origin? : FAUST V

    Authors: C. Vastel, F. Alves, C. Ceccarelli, M. Bouvier, I. Jimenez-Serra, T. Sakai, P. Caselli, L. Evans, F. Fontani, R. Le Gal, C. J. Chandler, B. Svoboda, L. Maud, C. Codella, N. Sakai, A. Lopez-Sepulcre, G. Moellenbrock, Y. Aikawa, N. Balucani, E. Bianchi, G. Busquet, E. Caux, S. Charnley, N. Cuello, M. De Simone , et al. (41 additional authors not shown)

    Abstract: Methanol is a ubiquitous species commonly found in the molecular interstellar medium. It is also a crucial seed species for the building-up of the chemical complexity in star forming regions. Thus, understanding how its abundance evolves during the star formation process and whether it enriches the emerging planetary system is of paramount importance. We used new data from the ALMA Large Program F… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 17 pages, accepted in A&A

    Journal ref: A&A 664, A171 (2022)

  49. arXiv:2206.00100  [pdf, other

    cs.CV cs.CL

    VALHALLA: Visual Hallucination for Machine Translation

    Authors: Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Chen, Rogerio Feris, David Cox, Nuno Vasconcelos

    Abstract: Designing better machine translation systems by considering auxiliary inputs such as images has attracted much attention in recent years. While existing methods show promising performance over the conventional text-only translation systems, they typically require paired text and image as input during inference, which limits their applicability to real-world scenarios. In this paper, we introduce a… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: CVPR 2022

  50. arXiv:2205.12720  [pdf, other

    cond-mat.mes-hall physics.optics

    Giant enhancement of third-harmonic generation in graphene-metal heterostructures

    Authors: Irati Alonso Calafell, Lee A. Rozema, David Alcaraz Iranzo, Alessandro Trenti, Joel D. Cox, Avinash Kumar, Hlib Bieliaiev, Sebastian Nanot, Cheng Peng, Dmitri K. Efetov, Jin Yong Hong, Jing Kong, Dirk R. Englund, F. Javier García de Abajo, Frank H. L. Koppens, Philp Walther

    Abstract: Nonlinear nanophotonics leverages engineered nanostructures to funnel light into small volumes and intensify nonlinear optical processes with spectral and spatial control. Due to its intrinsically large and electrically tunable nonlinear optical response, graphene is an especially promising nanomaterial for nonlinear optoelectronic applications. Here we report on exceptionally strong optical nonli… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Journal ref: Nature Nanotechnology 16, 318-324, (2021)