-
Direct Frequency-Mode-Stable Laser Amplification at Terahertz Burst Rates
Authors:
Vinzenz Stummer,
Tobias Flöry,
Matthias Schneller,
Markus Zeiler,
Audrius Pugžlys,
Andrius Baltuška
Abstract:
Generation of high-fidelity amplified pulse bursts with a regular interpulse interval yields, in the spectral domain, an equidistant pattern of narrowband spectral modes, similar to frequency combs produced by cw mode-locked lasers, but with greatly increased pulse energy. Despite their great potential for nonlinear spectroscopy, material processing, etc., such long frequency-stable bursts are dif…
▽ More
Generation of high-fidelity amplified pulse bursts with a regular interpulse interval yields, in the spectral domain, an equidistant pattern of narrowband spectral modes, similar to frequency combs produced by cw mode-locked lasers, but with greatly increased pulse energy. Despite their great potential for nonlinear spectroscopy, material processing, etc., such long frequency-stable bursts are difficult to generate and amplify because of prominent temporal intensity modulation even after strong dispersive pulse stretching. This study presents a burst generation method based on a master-oscillator regenerative-amplifier system that allows for chirped-pulse amplification (CPA) with high scalability in pulse number. A gradual smoothing of temporal intensity profiles at an increasing number of pulses is discovered, demonstrating an unexpected recovery of the CPA performance at terahertz (THz) intraburst repetition rates. In consequence, a self-referenced stable burst spectral peak structure with megahertz (MHz) peak width is generated, without risk of amplifier damage caused by interference of chirped pulses. This result eliminates limitations in burst amplification and paves the way for advancements in ultrashort-pulse burst technology, particularly for its use in nonlinear optical applications.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Spectral Peak Recovery in Parametrically Amplified THz-Repetition-Rate Bursts
Authors:
Vinzenz Stummer,
Tobias Flöry,
Matthias Schneller,
Edgar Kaksis,
Markus Zeiler,
Audrius Pugžlys,
Andrius Baltuška
Abstract:
Multi-photon resonant spectroscopies require tunable narrowband excitation to deliver spectral selectivity and, simultaneously, high temporal intensity to drive a nonlinear-optical process. These contradictory requirements are achievable with bursts of ultrashort pulses, which provides both high intensity and tunable narrowband peaks in the frequency domain arising from spectral interference. Howe…
▽ More
Multi-photon resonant spectroscopies require tunable narrowband excitation to deliver spectral selectivity and, simultaneously, high temporal intensity to drive a nonlinear-optical process. These contradictory requirements are achievable with bursts of ultrashort pulses, which provides both high intensity and tunable narrowband peaks in the frequency domain arising from spectral interference. However, femtosecond pulse bursts need special attention during their amplification [Optica 7, 1758 (2020)], which requires spectral peak suppression to increase the energy safely extractable from a chirped-pulse amplifier (CPA). Here, we present a method combining safe laser CPA, relying on spectral scrambling, with a parametric frequency converter that automatically restores the desired spectral peak structure and delivers narrow linewidths for bursts of ultrashort pulses at microjoule energies. The shown results pave the way to new high-energy ultrafast laser sources with controllable spectral selectivity.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Finding Task-Relevant Features for Few-Shot Learning by Category Traversal
Authors:
Hongyang Li,
David Eigen,
Samuel Dodge,
Matthew Zeiler,
Xiaogang Wang
Abstract:
Few-shot learning is an important area of research. Conceptually, humans are readily able to understand new concepts given just a few examples, while in more pragmatic terms, limited-example training situations are common in practice. Recent effective approaches to few-shot learning employ a metric-learning framework to learn a feature similarity comparison between a query (test) example, and the…
▽ More
Few-shot learning is an important area of research. Conceptually, humans are readily able to understand new concepts given just a few examples, while in more pragmatic terms, limited-example training situations are common in practice. Recent effective approaches to few-shot learning employ a metric-learning framework to learn a feature similarity comparison between a query (test) example, and the few support (training) examples. However, these approaches treat each support class independently from one another, never looking at the entire task as a whole. Because of this, they are constrained to use a single set of features for all possible test-time tasks, which hinders the ability to distinguish the most relevant dimensions for the task at hand. In this work, we introduce a Category Traversal Module that can be inserted as a plug-and-play module into most metric-learning based few-shot learners. This component traverses across the entire support set at once, identifying task-relevant features based on both intra-class commonality and inter-class uniqueness in the feature space. Incorporating our module improves performance considerably (5%-10% relative) over baseline systems on both mini-ImageNet and tieredImageNet benchmarks, with overall performance competitive with recent state-of-the-art systems.
△ Less
Submitted 27 May, 2019;
originally announced May 2019.
-
Visualizing and Understanding Convolutional Networks
Authors:
Matthew D Zeiler,
Rob Fergus
Abstract:
Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of t…
▽ More
Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. We also perform an ablation study to discover the performance contribution from different model layers. This enables us to find model architectures that outperform Krizhevsky \etal on the ImageNet classification benchmark. We show our ImageNet model generalizes well to other datasets: when the softmax classifier is retrained, it convincingly beats the current state-of-the-art results on Caltech-101 and Caltech-256 datasets.
△ Less
Submitted 28 November, 2013; v1 submitted 12 November, 2013;
originally announced November 2013.
-
Stochastic Pooling for Regularization of Deep Convolutional Neural Networks
Authors:
Matthew D. Zeiler,
Rob Fergus
Abstract:
We introduce a simple and effective method for regularizing large convolutional neural networks. We replace the conventional deterministic pooling operations with a stochastic procedure, randomly picking the activation within each pooling region according to a multinomial distribution, given by the activities within the pooling region. The approach is hyper-parameter free and can be combined with…
▽ More
We introduce a simple and effective method for regularizing large convolutional neural networks. We replace the conventional deterministic pooling operations with a stochastic procedure, randomly picking the activation within each pooling region according to a multinomial distribution, given by the activities within the pooling region. The approach is hyper-parameter free and can be combined with other regularization approaches, such as dropout and data augmentation. We achieve state-of-the-art performance on four image datasets, relative to other approaches that do not utilize data augmentation.
△ Less
Submitted 15 January, 2013;
originally announced January 2013.
-
ADADELTA: An Adaptive Learning Rate Method
Authors:
Matthew D. Zeiler
Abstract:
We present a novel per-dimension learning rate method for gradient descent called ADADELTA. The method dynamically adapts over time using only first order information and has minimal computational overhead beyond vanilla stochastic gradient descent. The method requires no manual tuning of a learning rate and appears robust to noisy gradient information, different model architecture choices, variou…
▽ More
We present a novel per-dimension learning rate method for gradient descent called ADADELTA. The method dynamically adapts over time using only first order information and has minimal computational overhead beyond vanilla stochastic gradient descent. The method requires no manual tuning of a learning rate and appears robust to noisy gradient information, different model architecture choices, various data modalities and selection of hyperparameters. We show promising results compared to other methods on the MNIST digit classification task using a single machine and on a large scale voice dataset in a distributed cluster environment.
△ Less
Submitted 22 December, 2012;
originally announced December 2012.
-
Differentiable Pooling for Hierarchical Feature Learning
Authors:
Matthew D. Zeiler,
Rob Fergus
Abstract:
We introduce a parametric form of pooling, based on a Gaussian, which can be optimized alongside the features in a single global objective function. By contrast, existing pooling schemes are based on heuristics (e.g. local maximum) and have no clear link to the cost function of the model. Furthermore, the variables of the Gaussian explicitly store location information, distinct from the appearance…
▽ More
We introduce a parametric form of pooling, based on a Gaussian, which can be optimized alongside the features in a single global objective function. By contrast, existing pooling schemes are based on heuristics (e.g. local maximum) and have no clear link to the cost function of the model. Furthermore, the variables of the Gaussian explicitly store location information, distinct from the appearance captured by the features, thus providing a what/where decomposition of the input signal. Although the differentiable pooling scheme can be incorporated in a wide range of hierarchical models, we demonstrate it in the context of a Deconvolutional Network model (Zeiler et al. ICCV 2011). We also explore a number of secondary issues within this model and present detailed experiments on MNIST digits.
△ Less
Submitted 30 June, 2012;
originally announced July 2012.