Skip to main content

Showing 1–25 of 25 results for author: Iuzzolino, M

.
  1. arXiv:2503.22152  [pdf, other

    cs.CV cs.AI

    EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos

    Authors: Yuxuan Li, Vijay Veerabadran, Michael L. Iuzzolino, Brett D. Roads, Asli Celikyilmaz, Karl Ridgeway

    Abstract: We introduce EgoToM, a new video question-answering benchmark that extends Theory-of-Mind (ToM) evaluation to egocentric domains. Using a causal ToM model, we generate multi-choice video QA instances for the Ego4D dataset to benchmark the ability to predict a camera wearer's goals, beliefs, and next actions. We study the performance of both humans and state of the art multimodal large language mod… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  2. arXiv:2502.19410  [pdf, other

    cs.HC cs.AI

    Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices

    Authors: Xinru Wang, Mengjie Yu, Hannah Nguyen, Michael Iuzzolino, Tianyi Wang, Peiqi Tang, Natasha Lynova, Co Tran, Ting Zhang, Naveen Sendhilnathan, Hrvoje Benko, Haijun Xia, Tanya Jonker

    Abstract: Large Language Models (LLMs) have shown remarkable potential in recommending everyday actions as personal AI assistants, while Explainable AI (XAI) techniques are being increasingly utilized to help users understand why a recommendation is given. Personal AI assistants today are often located on ultra-small devices such as smartwatches, which have limited screen space. The verbosity of LLM-generat… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  3. arXiv:2307.12854  [pdf, other

    cs.CV

    Multiscale Video Pretraining for Long-Term Activity Forecasting

    Authors: Reuben Tan, Matthias De Lange, Michael Iuzzolino, Bryan A. Plummer, Kate Saenko, Karl Ridgeway, Lorenzo Torresani

    Abstract: Long-term activity forecasting is an especially challenging research problem because it requires understanding the temporal relationships between observed actions, as well as the variability and complexity of human activities. Despite relying on strong supervision via expensive human annotations, state-of-the-art forecasting approaches often generalize poorly to unseen data. To alleviate this issu… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  4. arXiv:2307.05784  [pdf, other

    cs.CV cs.AI

    EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video

    Authors: Matthias De Lange, Hamid Eghbalzadeh, Reuben Tan, Michael Iuzzolino, Franziska Meier, Karl Ridgeway

    Abstract: In egocentric action recognition a single population model is typically trained and subsequently embodied on a head-mounted device, such as an augmented reality headset. While this model remains static for new users and environments, we introduce an adaptive paradigm of two phases, where after pretraining a population model, the model adapts on-device and online to the user's experience. This sett… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint

  5. arXiv:2304.09179  [pdf, other

    cs.CV cs.AI

    Pretrained Language Models as Visual Planners for Human Assistance

    Authors: Dhruvesh Patel, Hamid Eghbalzadeh, Nitin Kamra, Michael Louis Iuzzolino, Unnat Jain, Ruta Desai

    Abstract: In our pursuit of advancing multi-modal AI assistants capable of guiding users to achieve complex multi-step goals, we propose the task of "Visual Planning for Assistance (VPA)". Given a succinct natural language goal, e.g., "make a shelf", and a video of the user's progress so far, the aim of VPA is to devise a plan, i.e., a sequence of actions such as "sand shelf", "paint shelf", etc. to realize… ▽ More

    Submitted 26 August, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted at ICCV 2023

  6. arXiv:2302.05330  [pdf, other

    cs.CV cs.AI cs.LG

    Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks

    Authors: Weichao Mao, Ruta Desai, Michael Louis Iuzzolino, Nitin Kamra

    Abstract: Given video demonstrations and paired narrations of an at-home procedural task such as changing a tire, we present an approach to extract the underlying task structure -- relevant actions and their temporal dependencies -- via action-centric task graphs. Learnt structured representations from our method, Action Dynamics Task Graphs (ADTG), can then be used for understanding such tasks in unseen vi… ▽ More

    Submitted 11 January, 2023; originally announced February 2023.

    Comments: AAAI 2023 Workshop on User-Centric Artificial Intelligence for Assistance in At-Home Tasks

  7. arXiv:2112.06857  [pdf, other

    astro-ph.IM

    Software solutions for numerical modeling of wide-field telescopes

    Authors: Salvatore Savarese, Pietro Schipani, Giulio Capasso, Mirko Colapietro, Sergio D'Orsi, Marcella Iuzzolino, Laurent Marty, Francesco Perrotta, Giacomo Basile

    Abstract: This paper presents an integrated modeling software to analyze the PSF of wide-field telescopes affected by misalignments. Even relatively small misalignments in the optical system of a telescope can significantly deteriorate the image quality by introducing large aberrations. In particular, wide-field telescopes are critically affected by these errors, insomuch that usually a closed-loop active o… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 4 pages, 3 figures, ADASS 2021 Conference

  8. arXiv:2109.05675  [pdf, other

    cs.CV cs.LG stat.ML

    Online Unsupervised Learning of Visual Representations and Categories

    Authors: Mengye Ren, Tyler R. Scott, Michael L. Iuzzolino, Michael C. Mozer, Richard Zemel

    Abstract: Real world learning scenarios involve a nonstationary distribution of classes with sequential dependencies among the samples, in contrast to the standard machine learning formulation of drawing samples independently from a fixed, typically uniform distribution. Furthermore, real world interactions demand learning on-the-fly from few or no class labels. In this work, we propose an unsupervised mode… ▽ More

    Submitted 28 May, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: Technical report, 32 pages

  9. arXiv:2102.09808  [pdf, other

    cs.LG cs.CV

    Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss

    Authors: Michael L. Iuzzolino, Michael C. Mozer, Samy Bengio

    Abstract: Although deep feedforward neural networks share some characteristics with the primate visual system, a key distinction is their dynamics. Deep nets typically operate in serial stages wherein each layer completes its computation before processing begins in subsequent layers. In contrast, biological systems have cascaded dynamics: information propagates from neurons at all layers in parallel but tra… ▽ More

    Submitted 2 November, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  10. arXiv:2007.04546  [pdf, other

    cs.LG cs.CV stat.ML

    Wandering Within a World: Online Contextualized Few-Shot Learning

    Authors: Mengye Ren, Michael L. Iuzzolino, Michael C. Mozer, Richard S. Zemel

    Abstract: We aim to bridge the gap between typical human and machine-learning environments by extending the standard framework of few-shot learning to an online, continual setting. In this setting, episodes do not have separate training and testing phases, and instead models are evaluated online while learning novel classes. As in the real world, where the presence of spatiotemporal context helps us retriev… ▽ More

    Submitted 22 April, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: ICLR 2021

  11. arXiv:2004.00762  [pdf, other

    cs.LG cs.HC stat.ML

    In Automation We Trust: Investigating the Role of Uncertainty in Active Learning Systems

    Authors: Michael L. Iuzzolino, Tetsumichi Umada, Nisar R. Ahmed, Danielle A. Szafir

    Abstract: We investigate how different active learning (AL) query policies coupled with classification uncertainty visualizations affect analyst trust in automated classification systems. A current standard policy for AL is to query the oracle (e.g., the analyst) to refine labels for datapoints where the classifier has the highest uncertainty. This is an optimal policy for the automation system as it yields… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

  12. arXiv:2002.10562  [pdf, ps, other

    astro-ph.EP astro-ph.SR

    The GAPS Programme at TNG XXI -- A GIARPS case-study of known young planetary candidates: confirmation of HD 285507 b and refutation of AD Leo b

    Authors: I. Carleo, L. Malavolta, A. F. Lanza, M. Damasso, S. Desidera, F. Borsa, M. Mallonn, M. Pinamonti, R. Gratton, E. Alei, S. Benatti, L. Mancini, J. Maldonado, K. Biazzo, M. Esposito, G. Frustagli, E. González-Álvarez, G. Micela, G. Scandariato, A. Sozzetti, L. Affer, A. Bignamini, A. S. Bonomo, R. Claudi, R. Cosentino , et al. (45 additional authors not shown)

    Abstract: The existence of hot Jupiters is still not well understood. Two main channels are thought to be responsible for their current location: a smooth planet migration through the proto-planetary disk or the circularization of an initial high eccentric orbit by tidal dissipation leading to a strong decrease of the semimajor axis. Different formation scenarios result in different observable effects, such… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Journal ref: A&A 638, A5 (2020)

  13. arXiv:1911.08670  [pdf, other

    cs.CV cs.LG

    MMTM: Multimodal Transfer Module for CNN Fusion

    Authors: Hamid Reza Vaezi Joze, Amirreza Shaban, Michael L. Iuzzolino, Kazuhito Koishida

    Abstract: In late fusion, each modality is processed in a separate unimodal Convolutional Neural Network (CNN) stream and the scores of each modality are fused at the end. Due to its simplicity late fusion is still the predominant approach in many state-of-the-art multimodal applications. In this paper, we present a simple neural network module for leveraging the knowledge from multiple modalities in convol… ▽ More

    Submitted 30 March, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

  14. Experimental characterization of modal noise in multimode fibers for astronomical spectrometers

    Authors: E. Oliva, M. Rainer, A. Tozzi, N. Sanna, M. Iuzzolino, A. Brucalassi

    Abstract: Starting from our puzzling on-sky experience with the GIANO-TNG spectrometer we set up an infrared high resolution spectrometer in our laboratory and used this instrument to characterize the modal noise generated in fibers of different types (circular and octagonal) and sizes. Our experiment includes two conventional scrambling systems for fibers: a mechanical agitator and an optical double scramb… ▽ More

    Submitted 23 October, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 7 pages, 6 figures, accepted by Astronomy and Astrophysics

    Journal ref: A&A 632, A21 (2019)

  15. arXiv:1906.03504  [pdf, other

    cs.LG cs.NE stat.ML

    Convolutional Bipartite Attractor Networks

    Authors: Michael Iuzzolino, Yoram Singer, Michael C. Mozer

    Abstract: In human perception and cognition, a fundamental operation that brains perform is interpretation: constructing coherent neural states from noisy, incomplete, and intrinsically ambiguous evidence. The problem of interpretation is well matched to an early and often overlooked architecture, the attractor network---a recurrent neural net that performs constraint satisfaction, imputation of missing fea… ▽ More

    Submitted 26 September, 2019; v1 submitted 8 June, 2019; originally announced June 2019.

  16. arXiv:1901.05599  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Virtual-to-Real-World Transfer Learning for Robots on Wilderness Trails

    Authors: Michael L. Iuzzolino, Michael E. Walker, Daniel Szafir

    Abstract: Robots hold promise in many scenarios involving outdoor use, such as search-and-rescue, wildlife management, and collecting data to improve environment, climate, and weather forecasting. However, autonomous navigation of outdoor trails remains a challenging problem. Recent work has sought to address this issue using deep learning. Although this approach has achieved state-of-the-art results, the d… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: iROS 2018

    Journal ref: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 576-582)

  17. arXiv:1808.03184  [pdf, other

    astro-ph.IM

    GIARPS: commissioning and first scientific results

    Authors: R. Claudi, S. Benatti, I. Carleo, A. Ghedina, J. Guerra, F. Ghinassi, A. Harutyunyan, G. Micela, E. Molinari, E. Oliva, M. Rainer, A. Tozzi, C. Baffa, A. Baruffolo, V. Biliotti, N. Buchschacher, M. Cecconi, R. Cosentino, G. Falcini, D. Fantinel, L. Fini, E. Giani, E. Gonzalez--Alvarez, M. Gonzalez, C. Gonzalez , et al. (20 additional authors not shown)

    Abstract: GIARPS (GIAno \& haRPS) is a project devoted to have on the same focal station of the Telescopio Nazionale Galileo (TNG) both high resolution spectrographs, HARPS-N (VIS) and GIANO-B (NIR), working simultaneously. This could be considered the first and unique worldwide instrument providing cross-dispersed echelle spectroscopy at a resolution of 50,000 in the NIR range and 115,000 in the VIS and ov… ▽ More

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: 10 pages, 11 figures, Telescopes and Astronomical instrumentation, SPIE Conf. 2018

  18. arXiv:1805.01281  [pdf, ps, other

    astro-ph.EP astro-ph.SR

    Multi-band high resolution spectroscopy rules out the hot Jupiter BD+20 1790b - First data from the GIARPS Commissioning

    Authors: I. Carleo, S. Benatti, A. F. Lanza, R. Gratton, R. Claudi, S. Desidera, G. N. Mace, S. Messina, N. Sanna, E. Sissa, A. Ghedina, F. Ghinassi, J. Guerra, A. Harutyunyan, G. Micela, E. Molinari, E. Oliva, A. Tozzi, C. Baffa, A. Baruffolo, A. Bignamini, N. Buchschacher, M. Cecconi, R. Cosentino, M. Endl , et al. (29 additional authors not shown)

    Abstract: Context. Stellar activity is currently challenging the detection of young planets via the radial velocity (RV) technique. Aims. We attempt to definitively discriminate the nature of the RV variations for the young active K5 star BD+20 1790, for which visible (VIS) RV measurements show divergent results on the existence of a substellar companion. Methods. We compare VIS data with high precision RVs… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.

    Comments: 12 pages, 7 figures

    Journal ref: A&A 613, A50 (2018)

  19. GIARPS: the unique VIS-NIR high precision radial velocity facility in this world

    Authors: Riccardo Claudi, Serena Benatti, Ilaria Carleo, Adriano Ghedina, Emilio Molinari, Ernesto Oliva, Andrea Tozzi, Andrea Baruffolo, Massimo Cecconi, Rosario Cosentino, Daniela Fantinel, Luca Fini, Francesca Ghinassi, Manuel Gonzalez, Raffaele Gratton, Jose Guerra, Avet Harutyunyan, Nauzet Hernandez, Marcella Iuzzolino, Marcello Lodi, Luca Malavolta, Jesus Maldonado, Giusi Micela, Nicoletta Sanna, Jose Sanjuan , et al. (8 additional authors not shown)

    Abstract: GIARPS (GIAno & haRPS) is a project devoted to have on the same focal station of the Telescopio Nazionale Galileo (TNG) both the high resolution spectrographs HARPS-N (VIS) and GIANO (NIR) working simultaneously. This could be considered the first and unique worldwide instrument providing cross-dispersed echelle spectroscopy at a high resolution (R=115,000 in the visual and R=50,000 in the IR) and… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: 8 pages, 5 figures, SPIE Conference Proceedings

  20. The new SOXS instrument for the ESO NTT

    Authors: P. Schipani, R. Claudi, S. Campana, A. Baruffolo, S. Basa, S. Basso, E. Cappellaro, E. Cascone, R. Cosentino, F. DAlessio, V. De Caprio, M. Della Valle, A. de Ugarte Postigo, S. DOrsi, R. Franzen, J. Fynbo, A. Gal-Yam, D. Gardiol, E. Giro, M. Hamuy, M. Iuzzolino, D. Loreggia, S. Mattila, M. Munari, G. Pignata , et al. (6 additional authors not shown)

    Abstract: SOXS (Son Of X-Shooter) will be a unique spectroscopic facility for the ESO-NTT 3.5-m telescope in La Silla (Chile), able to cover the optical/NIR band (350-1750 nm). The design foresees a high-efficiency spectrograph with a resolution-slit product of ~4,500, capable of simultaneously observing the complete spectral range 350 - 1750 nm with a good sensitivity, with light imaging capabilities in th… ▽ More

    Submitted 13 July, 2016; originally announced July 2016.

    Comments: 10 pages, submitted to SPIE Astronomical Telescopes & Instrumentation 2016, paper 9908-152

  21. GIANO-TNG spectroscopy of red supergiants in the young star cluster RSGC3

    Authors: L. Origlia, E. Oliva, N. Sanna, A. Mucciarelli, E. Dalessandro, S. Scuderi, C. Baffa, V. Biliotti, L. Carbonaro, G. Falcini, E. Giani, M. Iuzzolino, F. Massi, M. Sozzi, A. Tozzi, A. Ghedina, F. Ghinassi, M. Lodi, A. Harutyunyan, M. Pedani

    Abstract: The Scutum complex in the inner disk of the Galaxy has a number of young star clusters dominated by red supergiants that are heavily obscured by dust extinction and observable only at infrared wavelengths. These clusters are important tracers of the recent star formation and chemical enrichment history in the inner Galaxy. During the technical commissioning and as a first science verification of t… ▽ More

    Submitted 23 October, 2015; originally announced October 2015.

  22. Lines and continuum sky emission in the near infrared: observational constraints from deep high spectral resolution spectra with GIANO-TNG

    Authors: E. Oliva, L. Origlia, S. Scuderi, S. Benatti, I. Carleo, E. Lapenna, A. Mucciarelli, C. Baffa, V. Biliotti, L. Carbonaro, G. Falcini, E. Giani, M. Iuzzolino, F. Massi, N. Sanna, M. Sozzi, A Tozzi, A. Ghedina, F. Ghinassi, M. Lodi, A. Harutyunyan, M. Pedani

    Abstract: Aims Determining the intensity of lines and continuum airglow emission in the H-band is important for the design of faint-object infrared spectrographs. Existing spectra at low/medium resolution cannot disentangle the true sky-continuum from instrumental effects (e.g. diffuse light in the wings of strong lines). We aim to obtain, for the first time, a high resolution infrared spectrum deep enough… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.

    Comments: 7 pages, 4 figures, to be published in Astronomy & Astrophysics

    Journal ref: A&A 581, A47 (2015)

  23. arXiv:1407.3126  [pdf

    astro-ph.IM

    The fiber-fed preslit of GIANO at T.N.G

    Authors: A. Tozzi, E. Oliva, L. Origlia, C. Baffa, V. Biliotti, G. Falcini, E. Giani, M. Iuzzolino, F. Massi, N. Sanna, S. Scuderi, M. Sozzi

    Abstract: Giano is a Cryogenic Spectrograph located in T.N.G. (Spain) and commisioned in 2013. It works in the range 950-2500 nm with a resolving power of 50000. This instrument was designed and built for direct feeding from the telescope [2]. However, due to constraints imposed on the telescope interfacing during the pre-commissioning phase, it had to be positioned on the rotating building, far from the te… ▽ More

    Submitted 11 July, 2014; originally announced July 2014.

    Comments: 21 pages, 24 figures, 3 tables. Presented at SPIE Astronomical Telescope + Instrumentation 2014 (Ground-based and Airbone Instrumentation for Astronomy 5, 9147-360). To be published in Proceeding of SPIE Volume 9147

  24. arXiv:1407.3054  [pdf

    astro-ph.IM

    Updated optical design and trade-off study for MOONS, the Multi-Object Optical and Near Infrared spectrometer for the VLT

    Authors: E. Oliva, S. Todd, M. Cirasuolo, H. Schnetler, D. Lunney, P. Rees, A. Bianco, E. Diolaiti, D. Ferruzzi, M. Fisher, I. Guinouard, M. Iuzzolino, I. Parry, X. Sun, A. Tozzi, F. Vitali

    Abstract: This paper presents the latest optical design for the MOONS triple-arm spectrographs. MOONS will be a Multi-Object Optical and Near-infrared Spectrograph and will be installed on one of the European Southern Observatory (ESO) Very Large Telescopes (VLT). Included in this paper is a trade-off analysis of different types of collimators, cameras, dichroics and filters.

    Submitted 11 July, 2014; originally announced July 2014.

    Comments: 10 pages, 8 figures, 5 tables. Presented at SPIE Astronomical Telescope + Instrumentation 2014 (Ground-based and Airbone Instrumentation for Astronomy 5, 9147-84). To be published in Proceeding of SPIE Volume 9147

  25. arXiv:1407.3052  [pdf

    astro-ph.IM

    Preliminary results on the characterization and performances of ZBLAN fiber for infrared spectrographs

    Authors: M. Iuzzolino, A. Tozzi, N. Sanna, L. Zangrilli, E. Oliva

    Abstract: Present telescopes and future extremely large telescopes make use of fiber-fed spectrographs to observe at optical and infrared wavelengths. The use of fibers largely simplifies the interfacing of the spectrograph to the telescope. At a high spectral resolution (R>50,000) the fibers can be used to achieve very high spectral accuracy. GIANO is an infrared (0.95-2.5μm) high resolution (R=50,000) spe… ▽ More

    Submitted 11 July, 2014; originally announced July 2014.

    Comments: 11 pages, 5 figures, 1 table. Presented at SPIE Astronomical Telescope + Instrumentation 2014 (Ground-based and Airbone Instrumentation for Astronomy 5, 9147-231). To be published in Proceeding of SPIE Volume 9147