-
Magnified Image Spatial Spectrum (MISS) microscopy for nanometer and millisecond scale label-free imaging
Authors:
Hassaan Majeed,
Lihong Ma,
Young Jae Lee,
Mikhail Kandel,
Eunjung Min,
Woonggyu Jung,
Catherine Best-Popescu,
Gabriel Popescu
Abstract:
Label-free imaging of rapidly moving, sub-diffraction sized structures has important applications in both biology and material science, as it removes the limitations associated with fluorescence tagging. However, unlabeled nanoscale particles in suspension are difficult to image due to their transparency and fast Brownian motion. Here we describe a novel interferometric imaging technique referred to as Magnified Image Spatial Spectrum (MISS) microscopy, which overcomes these challenges. The MISS microscope provides quantitative phase information and enables dynamic light scattering investigations with an overall optical path length sensitivity of 0.95 nm at 833 frames per second acquisition rate. Using spatiotemporal filtering, we find that the sensitivity can be further pushed down to 0.001-0.01 nm. We demonstrate the instrument's capability through colloidal nanoparticle sizing down to 20 nm diameter and measurements of live neuron membrane dynamics. MISS microscopy is implemented as an upgrade module to an existing microscope, which converts it into a powerful light scattering instrument. Thus, we anticipate that MISS will be adopted broadly for both material and life sciences applications.
Submitted 21 January, 2018;
originally announced January 2018.
-
Video Object Detection with an Aligned Spatial-Temporal Memory
Authors:
Fanyi Xiao,
Yong Jae Lee
Abstract:
We introduce Spatial-Temporal Memory Networks for video object detection. At its core, a novel Spatial-Temporal Memory module (STMM) serves as the recurrent computation unit to model long-term temporal appearance and motion dynamics. The STMM's design enables full integration of pretrained backbone CNN weights, which we find to be critical for accurate detection. Furthermore, in order to tackle object motion in videos, we propose a novel MatchTrans module to align the spatial-temporal memory from frame to frame. Our method produces state-of-the-art results on the benchmark ImageNet VID dataset, and our ablative studies clearly demonstrate the contribution of our different design choices. We release our code and models at http://fanyix.cs.ucdavis.edu/project/stmn/project.html.
Submitted 26 July, 2018; v1 submitted 18 December, 2017;
originally announced December 2017.
-
Cross-Domain Self-supervised Multi-task Feature Learning using Synthetic Imagery
Authors:
Zhongzheng Ren,
Yong Jae Lee
Abstract:
In human learning, it is common to use multiple sources of information jointly. However, most existing feature learning approaches learn from only a single task. In this paper, we propose a novel multi-task deep network to learn generalizable high-level visual representations. Since multi-task learning requires annotations for multiple properties of the same training instance, we look to synthetic images to train our network. To overcome the domain difference between real and synthetic data, we employ an unsupervised feature space domain adaptation method based on adversarial learning. Given an input synthetic RGB image, our network simultaneously predicts its surface normal, depth, and instance contour, while also minimizing the feature space domain differences between real and synthetic data. Through extensive experiments, we demonstrate that our network learns more transferable representations compared to single-task baselines. Our learned representation produces state-of-the-art transfer learning results on PASCAL VOC 2007 classification and 2012 detection.
Submitted 24 November, 2017;
originally announced November 2017.
-
On-Chip Laser Power Delivery System for Dielectric Laser Accelerators
Authors:
Tyler W. Hughes,
Si Tan,
Zhexin Zhao,
Neil V. Sapra,
Yun Jo Lee,
Kenneth J. Leedle,
Huiyang Deng,
Yu Miao,
Dylan S. Black,
Minghao Qi,
Olav Solgaard,
James S. Harris,
Jelena Vuckovic,
Robert L. Byer,
Shanhui Fan
Abstract:
We propose an on-chip optical power delivery system for dielectric laser accelerators based on a fractal 'tree-branch' dielectric waveguide network. This system replaces experimentally demanding free-space manipulations of the driving laser beam with chip-integrated techniques based on precise nano-fabrication, enabling access to orders of magnitude increases in the interaction length and total energy gain for these miniature accelerators. Based on computational modeling, in the relativistic regime, our laser delivery system is estimated to provide 21 keV of energy gain over an acceleration length of 192 μm with a single laser input, corresponding to a 108 MV/m acceleration gradient. The system may achieve 1 MeV of energy gain over a distance less than 1 cm by sequentially illuminating 49 identical structures. These findings are verified by detailed numerical simulation and modeling of the subcomponents, and we provide a discussion of the main constraints, challenges, and relevant parameters with regard to on-chip laser coupling for dielectric laser accelerators.
Submitted 13 September, 2017;
originally announced September 2017.
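The figures quoted in the abstract above can be cross-checked with a few lines of arithmetic. This is a rough sketch only: it uses the rounded values from the abstract (21 keV, 192 μm, 49 structures), and it assumes the structures are placed back to back with no spacing, which the abstract does not state. The function and variable names are illustrative.

```python
# Rounded figures as quoted in the abstract (not the authors' exact simulation values).
ENERGY_GAIN_KEV = 21.0   # energy gain per structure
LENGTH_UM = 192.0        # acceleration length per structure
N_STRUCTURES = 49        # number of sequentially illuminated structures


def gradient_mv_per_m(gain_kev: float, length_um: float) -> float:
    """Average gradient for a unit-charge particle, in MV/m."""
    gain_ev = gain_kev * 1e3        # keV -> eV
    length_m = length_um * 1e-6     # micrometres -> metres
    return gain_ev / length_m / 1e6  # V/m -> MV/m


gradient = gradient_mv_per_m(ENERGY_GAIN_KEV, LENGTH_UM)
total_gain_mev = N_STRUCTURES * ENERGY_GAIN_KEV / 1e3   # keV -> MeV
# Assumption: zero gap between structures, so this is a lower bound on length.
total_length_cm = N_STRUCTURES * LENGTH_UM * 1e-4       # micrometres -> cm

print(f"gradient      ~ {gradient:.1f} MV/m")
print(f"total gain    ~ {total_gain_mev:.3f} MeV")
print(f"total length  ~ {total_length_cm:.4f} cm")
```

With the rounded inputs the gradient comes out slightly above the quoted 108 MV/m, presumably because the 21 keV figure is itself rounded. The 49-structure totals are consistent with the stated 1 MeV in under 1 cm.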
-
Spontaneous emission enhancement in strain-induced WSe2 monolayer based quantum light sources on metallic surfaces
Authors:
Laxmi Narayan Tripathi,
Oliver Iff,
Simon Betzold,
Monika Emmerling,
Kihwan Moon,
Young Jin Lee,
Soon-Hong Kwon,
Sven Höfling,
Christian Schneider
Abstract:
Atomic monolayers of transition metal dichalcogenides represent an emerging material platform for the implementation of ultra-compact quantum light emitters via strain engineering. In this framework, we discuss experimental results on the creation of strain-induced single photon sources using a WSe2 monolayer on a silver substrate, coated with a very thin dielectric layer. We identify quantum emitters which are formed at various locations in the sample. The emission is highly linearly polarized and stable in linewidth, and decay times down to 300 ps are observed. We provide numerical calculations of our monolayer-metal device platform to assess the strength of the radiative decay rate enhancement by the presence of the plasmonic structure. We believe that our results represent a crucial step towards the ultra-compact integration of high performance single photon sources in nanoplasmonic devices and circuits.
Submitted 22 March, 2018; v1 submitted 2 September, 2017;
originally announced September 2017.
-
Stationary waves and slowly moving features in the night upper clouds of Venus
Authors:
J. Peralta,
R. Hueso,
A. Sánchez-Lavega,
Y. J. Lee,
A. García-Muñoz,
T. Kouyama,
H. Sagawa,
T. M. Sato,
G. Piccioni,
S. Tellmann,
T. Imamura,
T. Satoh
Abstract:
At the cloud top level of Venus (65-70 km altitude) the atmosphere rotates 60 times faster than the underlying surface, a phenomenon known as superrotation. Whereas on Venus's dayside the cloud top motions are well determined and Venus general circulation models predict a mean zonal flow at the upper clouds similar on both day and nightside, the nightside circulation remains poorly studied except for the polar region. Here we report global measurements of the nightside circulation at the upper cloud level. We tracked individual features in thermal emission images at 3.8 and 5.0 $\mathrm{μm}$ obtained between 2006 and 2008 by the Visible and Infrared Thermal Imaging Spectrometer (VIRTIS-M) onboard Venus Express and in 2015 by ground-based measurements with the Medium-Resolution 0.8-5.5 Micron Spectrograph and Imager (SpeX) at the National Aeronautics and Space Administration Infrared Telescope Facility (NASA/IRTF). The zonal motions range from -110 to -60 m s$^{-1}$, consistent with those found for the dayside but with larger dispersion. Slow motions (-50 to -20 m s$^{-1}$) were also found and remain unexplained. In addition, abundant stationary wave patterns with zonal speeds from -10 to +10 m s$^{-1}$ dominate the night upper clouds and concentrate over the regions of higher surface elevation.
Submitted 12 February, 2018; v1 submitted 24 July, 2017;
originally announced July 2017.
-
Who Will Share My Image? Predicting the Content Diffusion Path in Online Social Networks
Authors:
Wenjian Hu,
Krishna Kumar Singh,
Fanyi Xiao,
Jinyoung Han,
Chen-Nee Chuah,
Yong Jae Lee
Abstract:
Content popularity prediction has been extensively studied due to its importance and interest for both users and hosts of social media sites like Facebook, Instagram, Twitter, and Pinterest. However, existing work mainly focuses on modeling popularity using a single metric such as the total number of likes or shares. In this work, we propose Diffusion-LSTM, a memory-based deep recurrent network that learns to recursively predict the entire diffusion path of an image through a social network. By combining user social features and image features, and encoding the diffusion path taken thus far with an explicit memory cell, our model predicts the diffusion path of an image more accurately compared to alternate baselines that either encode only image or social features, or lack memory. By mapping individual users to user prototypes, our model can generalize to new users not seen during training. Finally, we demonstrate our model's capability of generating diffusion trees, and show that the generated trees closely resemble ground-truth trees.
Submitted 29 November, 2017; v1 submitted 25 May, 2017;
originally announced May 2017.
-
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Authors:
Fanyi Xiao,
Leonid Sigal,
Yong Jae Lee
Abstract:
We propose a weakly-supervised approach that takes image-sentence pairs as input and learns to visually ground (i.e., localize) arbitrary linguistic phrases, in the form of spatial attention masks. Specifically, the model is trained with images and their associated image-level captions, without any explicit region-to-phrase correspondence annotations. To this end, we introduce an end-to-end model which learns visual groundings of phrases with two types of carefully designed loss functions. In addition to the standard discriminative loss, which enforces that attended image regions and phrases are consistently encoded, we propose a novel structural loss which makes use of the parse tree structures induced by the sentences. In particular, we ensure complementarity among the attention masks that correspond to sibling noun phrases, and compositionality of attention masks among the children and parent phrases, as defined by the sentence parse tree. We validate the effectiveness of our approach on the Microsoft COCO and Visual Genome datasets.
Submitted 3 May, 2017;
originally announced May 2017.
-
Identifying First-person Camera Wearers in Third-person Videos
Authors:
Chenyou Fan,
Jangwon Lee,
Mingze Xu,
Krishna Kumar Singh,
Yong Jae Lee,
David J. Crandall,
Michael S. Ryoo
Abstract:
We consider scenarios in which we wish to perform joint scene understanding, object tracking, activity recognition, and other tasks in environments in which multiple people are wearing body-worn cameras while a third-person static camera also captures the scene. To do this, we need to establish person-level correspondences across first- and third-person videos, which is challenging because the camera wearer is not visible from his/her own egocentric video, preventing the use of direct feature matching. In this paper, we propose a new semi-Siamese Convolutional Neural Network architecture to address this novel challenge. We formulate the problem as learning a joint embedding space for first- and third-person videos that considers both spatial- and motion-domain cues. A new triplet loss function is designed to minimize the distance between correct first- and third-person matches while maximizing the distance between incorrect ones. This end-to-end approach performs significantly better than several baselines, in part by learning the first- and third-person features optimized for matching jointly with the distance measure itself.
Submitted 20 April, 2017;
originally announced April 2017.
-
Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization
Authors:
Krishna Kumar Singh,
Yong Jae Lee
Abstract:
We propose 'Hide-and-Seek', a weakly-supervised framework that aims to improve object localization in images and action localization in videos. Most existing weakly-supervised methods localize only the most discriminative parts of an object rather than all relevant parts, which leads to suboptimal performance. Our key idea is to hide patches in a training image randomly, forcing the network to seek other relevant parts when the most discriminative part is hidden. Our approach only needs to modify the input image and can work with any network designed for object localization. During testing, we do not need to hide any patches. Our Hide-and-Seek approach obtains superior performance compared to previous methods for weakly-supervised object localization on the ILSVRC dataset. We also demonstrate that our framework can be easily extended to weakly-supervised action localization.
Submitted 23 December, 2017; v1 submitted 13 April, 2017;
originally announced April 2017.
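The patch-hiding idea in the abstract above (randomly hide patches of a training image, leave test images untouched) can be sketched as a simple input transform. This is a minimal illustration, not the authors' implementation: the function name `hide_patches`, the patch size, the hide probability, and the fill value are all assumptions chosen for the example.

```python
import numpy as np


def hide_patches(image, patch_size=16, hide_prob=0.5, fill_value=0.0, rng=None):
    """Return a copy of `image` with randomly chosen square patches overwritten.

    image      : array of shape (H, W) or (H, W, C)
    patch_size : side length of each grid cell in pixels (illustrative default)
    hide_prob  : probability of hiding each cell independently (illustrative default)
    fill_value : value written into hidden cells (an assumption for this sketch)
    """
    rng = np.random.default_rng() if rng is None else rng
    out = image.copy()  # leave the caller's array untouched
    height, width = image.shape[:2]
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            if rng.random() < hide_prob:
                # Slicing clips automatically at the image border.
                out[top:top + patch_size, left:left + patch_size] = fill_value
    return out


# Training-time use: apply the transform to each input image.
train_image = np.ones((64, 64, 3), dtype=np.float32)
augmented = hide_patches(train_image, rng=np.random.default_rng(0))

# Test-time use: per the abstract, no patches are hidden, so the image is used as-is.
test_image = train_image
```

Because the transform only touches the input, a sketch like this can sit in front of any localization network without changing the network itself, which matches the abstract's description.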
-
Interspecies Knowledge Transfer for Facial Keypoint Detection
Authors:
Maheen Rashid,
Xiuye Gu,
Yong Jae Lee
Abstract:
We present a method for localizing facial keypoints on animals by transferring knowledge gained from human faces. Instead of directly finetuning a network trained to detect keypoints on human faces to animal faces (which is sub-optimal since human and animal faces can look quite different), we propose to first adapt the animal images to the pre-trained human detection network by correcting for the differences in animal and human face shape. We first find the nearest human neighbors for each animal image using an unsupervised shape matching method. We use these matches to train a thin plate spline warping network to warp each animal face to look more human-like. The warping network is then jointly finetuned with a pre-trained human facial keypoint detection network using an animal dataset. We demonstrate state-of-the-art results on both horse and sheep facial keypoint detection, and significant improvement over simple finetuning, especially when training data is scarce. Additionally, we present a new dataset with 3717 images with horse face and facial keypoint annotations.
Submitted 13 April, 2017;
originally announced April 2017.
-
Venus cloud morphology and motions from ground-based images at the time of the Akatsuki orbit insertion
Authors:
A. Sánchez-Lavega,
J. Peralta,
J. M. Gómez-Forrellad,
R. Hueso,
S. Pérez-Hoyos,
I. Mendikoa,
J. F. Rojas,
T. Horinouchi,
Y. J. Lee,
S. Watanabe
Abstract:
We report Venus image observations around the two maximum elongations of the planet in June and October 2015. From these images we describe the global atmospheric dynamics and cloud morphology on the planet before the arrival of the JAXA Akatsuki mission on December 7th. The majority of the images were acquired at ultraviolet wavelengths (380-410 nm) using small telescopes. The Venus dayside was also observed with narrow band filters at other wavelengths (890 nm, 725-950 nm, 1.435 μm CO2 band) using the instrument PlanetCam-UPV/EHU at the 2.2m telescope in Calar Alto Observatory. In all cases, the lucky imaging methodology was used to improve the spatial resolution of the images over the atmospheric seeing. During the April-June period, the morphology of the upper cloud showed an irregular and chaotic texture with a well developed equatorial dark belt (afternoon hemisphere), whereas during October-December the dynamical regime was dominated by planetary-scale waves (Y-horizontal, C-reversed and ψ-horizontal features) formed by long streaks, and banding suggesting more stable conditions. Measurements of the zonal wind velocity with cloud tracking in the latitude range from 50$^{\circ}$N to 50$^{\circ}$S show agreement with retrievals from previous works.
Submitted 14 November, 2016;
originally announced November 2016.
-
End-to-End Localization and Ranking for Relative Attributes
Authors:
Krishna Kumar Singh,
Yong Jae Lee
Abstract:
We propose an end-to-end deep convolutional network to simultaneously localize and rank relative visual attributes, given only weakly-supervised pairwise image comparisons. Unlike previous methods, our network jointly learns the attribute's features, localization, and ranker. The localization module of our network discovers the most informative image region for the attribute, which is then used by the ranking module to learn a ranking model of the attribute. Our end-to-end framework also significantly speeds up processing and is much faster than previous methods. We show state-of-the-art ranking results on various relative attribute datasets, and our qualitative localization results clearly demonstrate our network's ability to learn meaningful image patches.
Submitted 8 August, 2016;
originally announced August 2016.
-
Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection
Authors:
Krishna Kumar Singh,
Fanyi Xiao,
Yong Jae Lee
Abstract:
The status quo approach to training object detectors requires expensive bounding box annotations. Our framework takes a markedly different direction: we transfer tracked object boxes from weakly-labeled videos to weakly-labeled images to automatically generate pseudo ground-truth boxes, which replace manually annotated bounding boxes. We first mine discriminative regions in the weakly-labeled image collection that frequently/rarely appear in the positive/negative images. We then match those regions to videos and retrieve the corresponding tracked object boxes. Finally, we design a Hough transform algorithm to vote for the best box to serve as the pseudo ground truth (GT) for each image, and use these boxes to train an object detector. Together, these lead to state-of-the-art weakly-supervised detection results on the PASCAL 2007 and 2010 datasets.
Submitted 19 April, 2016;
originally announced April 2016.
-
Joint Defogging and Demosaicking
Authors:
Y. J. Lee,
K. Hirakawa,
T. Q. Nguyen
Abstract:
Image defogging is a technique used extensively for enhancing the visual quality of images in bad weather conditions. Even though defogging algorithms have been well studied, defogging performance is degraded by demosaicking artifacts and sensor noise amplification in distant scenes. In order to improve the visual quality of restored images, we propose a novel approach to perform defogging and demosaicking simultaneously. We conclude that better defogging performance with fewer artifacts can be achieved when a defogging algorithm is combined with a demosaicking algorithm. We also demonstrate that the proposed joint algorithm has the benefit of suppressing noise amplification in distant scenes. In addition, we validate our theoretical analysis and observations on both synthesized datasets with ground truth fog-free images and natural scene datasets captured in a raw format.
Submitted 9 February, 2016;
originally announced February 2016.
-
Quantitative, Comparable Coherent Anti-Stokes Raman Scattering (CARS) Spectroscopy: Correcting Errors in Phase Retrieval
Authors:
Charles H. Camp Jr.,
Young Jong Lee,
Marcus T. Cicerone
Abstract:
Coherent anti-Stokes Raman scattering (CARS) microspectroscopy has demonstrated significant potential for biological and materials imaging. To date, however, the primary mechanism of disseminating CARS spectroscopic information is through pseudocolor imagery, which explicitly neglects a vast majority of the hyperspectral data. Furthermore, current paradigms in CARS spectral processing do not lend themselves to quantitative sample-to-sample comparability. The primary limitation stems from the need to accurately measure the so-called nonresonant background (NRB) that is used to extract the chemically-sensitive Raman information from the raw spectra. Measurement of the NRB on a pixel-by-pixel basis is a nontrivial task; thus, reference NRB from glass or water are typically utilized, resulting in error between the actual and estimated amplitude and phase. In this manuscript, we present a new methodology for extracting the Raman spectral features that significantly suppresses these errors through phase detrending and scaling. Classic methods of error-correction, such as baseline detrending, are demonstrated to be inaccurate and to simply mask the underlying errors. The theoretical justification is presented by re-developing the theory of phase retrieval via the Kramers-Kronig relation, and we demonstrate that these results are also applicable to maximum entropy method-based phase retrieval. This new error-correction approach is experimentally applied to glycerol spectra and tissue images, demonstrating marked consistency between spectra obtained using different NRB estimates, and between spectra obtained on different instruments. Additionally, in order to facilitate implementation of these approaches, we have made many of the tools described herein available free for download.
Submitted 23 July, 2015;
originally announced July 2015.
-
Fracture Toughness of Silicate Glasses: Insights from Molecular Dynamics Simulations
Authors:
Yingtian Yu,
Bu Wang,
Young Jea Lee,
Mathieu Bauchy
Abstract:
Understanding, predicting and eventually improving the resistance to fracture of silicate materials is of primary importance to design new glasses that would be tougher, while retaining their transparency. However, the atomic mechanism of the fracture in amorphous silicate materials is still a topic of debate. In particular, there is some controversy about the existence of ductility at the nano-scale during the crack propagation. Here, we present simulations of the fracture of three archetypical silicate glasses using molecular dynamics. We show that the methodology that is used provides realistic values of fracture energy and toughness. In addition, the simulations clearly suggest that silicate glasses can show different degrees of ductility, depending on their composition.
Submitted 21 June, 2015;
originally announced June 2015.
-
Predicting Important Objects for Egocentric Video Summarization
Authors:
Yong Jae Lee,
Kristen Grauman
Abstract:
We present a video summarization approach for egocentric or "wearable" camera data. Given hours of video, the proposed method produces a compact storyboard summary of the camera wearer's day. In contrast to traditional keyframe selection techniques, the resulting summary focuses on the most important objects and people with which the camera wearer interacts. To accomplish this, we develop region cues indicative of high-level saliency in egocentric video---such as the nearness to hands, gaze, and frequency of occurrence---and learn a regressor to predict the relative importance of any new region based on these cues. Using these predictions and a simple form of temporal event detection, our method selects frames for the storyboard that reflect the key object-driven happenings. We adjust the compactness of the final summary given either an importance selection criterion or a length budget; for the latter, we design an efficient dynamic programming solution that accounts for importance, visual uniqueness, and temporal displacement. Critically, the approach is neither camera-wearer-specific nor object-specific; that means the learned importance metric need not be trained for a given user or context, and it can predict the importance of objects and people that have never been seen previously. Our results on two egocentric video datasets show the method's promise relative to existing techniques for saliency and summarization.
Submitted 18 May, 2015;
originally announced May 2015.
-
Effects of thermal inflation on small scale density perturbations
Authors:
Sungwook E. Hong,
Hyung-Joo Lee,
Young Jae Lee,
Ewan D. Stewart,
Heeseung Zoe
Abstract:
In cosmological scenarios with thermal inflation, extra eras of moduli matter domination, thermal inflation and flaton matter domination exist between primordial inflation and the radiation domination of Big Bang nucleosynthesis. During these eras, cosmological perturbations on small scales can enter and re-exit the horizon, modifying the power spectrum on those scales. The largest modified scale, $k_\mathrm{b}$, touches the horizon size when the expansion changes from deflation to inflation at the transition from moduli domination to thermal inflation. We analytically calculate the evolution of perturbations from moduli domination through thermal inflation and evaluate the curvature perturbation on the constant radiation density hypersurface at the end of thermal inflation to determine the late time curvature perturbation. Our resulting transfer function suppresses the power spectrum by a factor $\sim 50$ at $k \gg k_\mathrm{b}$, with $k_\mathrm{b}$ corresponding to anywhere from megaparsec to subparsec scales depending on the parameters of thermal inflation. Thus, thermal inflation might be constrained or detected by small scale observations such as CMB distortions or 21cm hydrogen line observations.
Submitted 3 February, 2017; v1 submitted 31 March, 2015;
originally announced March 2015.
-
Weakly-supervised Discovery of Visual Pattern Configurations
Authors:
Hyun Oh Song,
Yong Jae Lee,
Stefanie Jegelka,
Trevor Darrell
Abstract:
The increasing prominence of weakly labeled data nurtures a growing demand for object detection methods that can cope with minimal supervision. We propose an approach that automatically identifies discriminative configurations of visual patterns that are characteristic of a given object class. We formulate the problem as a constrained submodular optimization problem and demonstrate the benefits of the discovered configurations in remedying mislocalizations and finding informative positive and negative training examples. Together, these lead to state-of-the-art weakly-supervised detection results on the challenging PASCAL VOC dataset.
Submitted 25 June, 2014;
originally announced June 2014.
-
The Physics of the B Factories
Authors:
A. J. Bevan,
B. Golob,
Th. Mannel,
S. Prell,
B. D. Yabsley,
K. Abe,
H. Aihara,
F. Anulli,
N. Arnaud,
T. Aushev,
M. Beneke,
J. Beringer,
F. Bianchi,
I. I. Bigi,
M. Bona,
N. Brambilla,
J. Brodzicka,
P. Chang,
M. J. Charles,
C. H. Cheng,
H. -Y. Cheng,
R. Chistov,
P. Colangelo,
J. P. Coleman,
A. Drutskoy
, et al. (2009 additional authors not shown)
Abstract:
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary version of the Physics of the B Factories book. This uses the notation alpha, beta, gamma for the angles of the Unitarity Triangle. The nominal version uses the notation phi_1, phi_2 and phi_3. Please cite this work as Eur. Phys. J. C74 (2014) 3026.
Submitted 31 October, 2015; v1 submitted 24 June, 2014;
originally announced June 2014.
-
High-Speed Coherent Raman Fingerprint Imaging of Biological Tissues
Authors:
Charles H. Camp Jr.,
Young Jong Lee,
John M. Heddleston,
Christopher M. Hartshorn,
Angela R. Hight Walker,
Jeremy N. Rich,
Justin D. Lathia,
Marcus T. Cicerone
Abstract:
We have developed a coherent Raman imaging platform using broadband coherent anti-Stokes Raman scattering (BCARS) that provides an unprecedented combination of speed, sensitivity, and spectral breadth. The system utilizes a unique configuration of laser sources that probes the Raman spectrum over 3,000 cm$^{-1}$ and generates an especially strong response in the typically weak Raman "fingerprint" region through heterodyne amplification of the anti-Stokes photons with a large nonresonant background (NRB) while maintaining high spectral resolution of $<$ 13 cm$^{-1}$. For histology and pathology, this system shows promise in highlighting major tissue components in a non-destructive, label-free manner. We demonstrate high-speed chemical imaging in two- and three-dimensional views of healthy murine liver and pancreas tissues and interfaces between xenograft brain tumors and the surrounding healthy brain matter.
Submitted 15 February, 2014; v1 submitted 13 February, 2014;
originally announced February 2014.
-
Axisymmetric Stokes equations in polygonal domains: regularity and finite element approximations
Authors:
Young Ju Lee,
Hengguang Li
Abstract:
We study the regularity and finite element approximation of the axisymmetric Stokes problem on a polygonal domain $Ω$. In particular, taking into account the singular coefficients in the equation and non-smoothness of the domain, we establish the well-posedness and full regularity of the solution in new weighted Sobolev spaces $\mathcal{K}^m_{μ, 1}(Ω)$. Using our a priori results, we give a specific construction of graded meshes on which the Taylor-Hood mixed method approximates singular solutions at the optimal convergence rate. Numerical tests are presented to confirm the theoretical results in the paper.
Submitted 19 June, 2012;
originally announced June 2012.
-
The Possibility of Inflation in Asymptotically Safe Gravity
Authors:
Sungwook E. Hong,
Young Jae Lee,
Heeseung Zoe
Abstract:
We examine the inflationary modes in the cubic curvature theories in the context of asymptotically safe gravity. On the phase space of the Hubble parameter, there exists a critical point which corresponds to the slow-roll inflation in the Einstein frame. Most of the e-foldings are attained around the critical point for each inflationary trajectory. If the coupling constants $g_i$ have the parametric relations generated as the power of the relative energy scale of inflation $H_0$ to the ultraviolet cutoff $Λ$, a successful inflation with more than 60 e-foldings occurs near the critical point.
Submitted 12 June, 2012; v1 submitted 30 August, 2011;
originally announced August 2011.
-
Nucleation of vacuum bubbles in Brans-Dicke type theory
Authors:
Hongsu Kim,
Bum-Hoon Lee,
Wonwoo Lee,
Young Jae Lee,
Dong-han Yeom
Abstract:
In this paper, we explore the nucleation of vacuum bubbles in the Brans-Dicke type theory of gravity. In the Euclidean signature, we evaluate the fields at the vacuum bubbles as solutions of the Euler-Lagrange equations of motion as well as the bubble nucleation probabilities by integrating the Euclidean action. We illustrate three possible ways to obtain vacuum bubbles: true vacuum bubbles for ω>-3/2, false vacuum bubbles for ω<-3/2, and false vacuum bubbles for ω>-3/2 when the vacuum energy of the false vacuum in the potential of the Einstein frame is less than that of the true vacuum. After the bubble is nucleated at the t=0 surface, we can smoothly interpolate the field combinations to some solutions in the Lorentzian signature and consistently continue their subsequent evolutions. Therefore, we conclude that, in general scalar-tensor theories like these Brans-Dicke type theories, which may include and represent certain features of string theory, vacuum bubbles come in false vacuum bubbles as well as in true vacuum bubbles, as long as a special condition is assumed on the potential.
Submitted 18 July, 2011; v1 submitted 27 November, 2010;
originally announced November 2010.
-
Multidimensional Divide-and-Conquer and Weighted Digital Sums
Authors:
Y. K. Cheung,
Philippe Flajolet,
Mordecai Golin,
C. Y. James Lee
Abstract:
This paper studies three types of functions arising separately in the analysis of algorithms that we analyze exactly using similar Mellin transform techniques. The first is the solution to a Multidimensional Divide-and-Conquer (MDC) recurrence that arises when solving problems on points in $d$-dimensional space. The second involves weighted digital sums. Write $n$ in its binary representation $n=(b_i b_{i-1}... b_1 b_0)_2$ and set $S_M(n) = \sum_{t=0}^i t^{\bar{M}} b_t 2^t$. We analyze the average $TS_M(n) = \frac{1}{n}\sum_{j<n} S_M(j)$. The third is a different variant of weighted digital sums. Write $n$ as $n=2^{i_1} + 2^{i_2} + ... + 2^{i_k}$ with $i_1 > i_2 > ... > i_k\geq 0$ and set $W_M(n) = \sum_{t=1}^k t^M 2^{i_t}$. We analyze the average $TW_M(n) = \frac{1}{n}\sum_{j<n} W_M(j)$.
We show that both the MDC functions and $TS_M(n)$ (with $d=M+1$) have solutions of the form $λ_d n \lg^{d-1}n + \sum_{m=0}^{d-2}(n\lg^m n)A_{d,m}(\lg n) + c_d,$ where $λ_d,c_d$ are constants and $A_{d,m}(u)$'s are periodic functions with period one (given by absolutely convergent Fourier series). We also show that $TW_M(n)$ has a solution of the form $n G_M(\lg n) + d_M \lg^M n + \sum_{d=0}^{M-1}(\lg^d n)G_{M,d}(\lg n),$ where $d_M$ is a constant, $G_M(u)$ and $G_{M,d}(u)$'s are again periodic functions with period one (given by absolutely convergent Fourier series).
Submitted 28 February, 2010;
originally announced March 2010.
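The third function in the abstract above, $W_M(n)$, and its average $TW_M(n)$ are defined explicitly enough to evaluate by brute force for small $n$. The following is a minimal sketch for checking small cases only; the function names are illustrative assumptions, and it does not reproduce the paper's Mellin transform analysis.

```python
from fractions import Fraction


def weighted_digital_sum(n: int, M: int) -> int:
    """W_M(n) = sum_{t=1}^{k} t^M * 2^{i_t}, where n = 2^{i_1} + ... + 2^{i_k}
    with i_1 > i_2 > ... > i_k >= 0 (the set bits of n, highest first)."""
    total = 0
    t = 0  # rank of the current set bit, counted from the most significant one
    for i in range(n.bit_length() - 1, -1, -1):
        if (n >> i) & 1:
            t += 1
            total += t**M * (1 << i)
    return total


def average_weighted_digital_sum(n: int, M: int) -> Fraction:
    """TW_M(n) = (1/n) * sum_{j<n} W_M(j), computed exactly as a fraction."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    return Fraction(sum(weighted_digital_sum(j, M) for j in range(n)), n)
```

For example, $5 = 2^2 + 2^0$ gives $W_1(5) = 1\cdot 4 + 2\cdot 1 = 6$, and with $M = 0$ every weight is 1, so $W_0(n) = n$.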
-
Analysis of Stability, Response and LQR Controller Design of a Small Scale Helicopter Dynamics
Authors:
Hardian Reza Dharmayanda,
Taesam Kang,
Young Jae Lee,
Sangkyung Sung
Abstract:
This paper presents how to use feedback controller with helicopter dynamics state space model. A simplified analysis is presented for controller design using LQR of small scale helicopters for axial and forward flights. Our approach is simple and gives the basic understanding about how to develop controller for solving the stability of linear helicopter flight dynamics.
Submitted 30 April, 2008;
originally announced April 2008.
-
Determinable Solutions for One-dimensional Quantum Potentials: Scattering, Quasi-bound and Bound State Problems
Authors:
Hwasung Lee,
Y. J. Lee
Abstract:
We derive analytic expressions of the recursive solutions to the Schrödinger equation by means of a cutoff potential technique for one-dimensional piecewise constant potentials. These solutions provide a method for accurately determining the transmission probabilities as well as the wave function in both the classically accessible and inaccessible regions for any barrier potential. It is also shown that the energy eigenvalues and the wave functions of bound states can be obtained for potential-well structures by exploiting this method. Calculational results of illustrative examples are shown in order to verify this method for treating barrier and potential-well problems.
Submitted 6 January, 2007; v1 submitted 19 November, 2006;
originally announced November 2006.
-
Observation of B+ -> p Lambdabar gamma
Authors:
Y. J. Lee,
M. -Z. Wang
Abstract:
We report the first observation of the radiative hyperonic B decay B+ -> p Lambdabar gamma, using a 140 fb^{-1} data sample recorded on the Upsilon(4S) resonance with the Belle detector at the KEKB asymmetric energy e+e- collider. The measured branching fraction is B(B+ -> p Lambdabar gamma) = (2.16 ^{+0.58}_{-0.53} \pm 0.20) \times 10^{-6}. We examine its M_pLambdabar distribution and observe a peak near threshold. This feature is expected by the short-distance b -> s gamma transition. A search for B+ -> p Sigmabar gamma yields no significant signal and we set a 90% confidence-level upper limit on the branching fraction of B(B+ -> p Sigmabar gamma) < 4.6 \times 10^{-6}.
Submitted 30 March, 2005; v1 submitted 25 March, 2005;
originally announced March 2005.
-
New analytic results for electroweak baryon number violation
Authors:
F. R. Klinkhamer,
Y. J. Lee
Abstract:
Real-time anomalous fermion number violation has been investigated for massless chiral fermions in spherically symmetric SU(2) Yang-Mills gauge field backgrounds which can be weakly dissipative or even nondissipative. Restricting consideration to spherically symmetric fermion fields, a relation has been found between the spectral flow of the Dirac Hamiltonian and two characteristics of the background gauge field. This new result may be relevant to electroweak baryon number violation in the early universe.
Submitted 25 October, 2001;
originally announced October 2001.
-
Spectral flow of chiral fermions in nondissipative Yang-Mills gauge field backgrounds
Authors:
F. R. Klinkhamer,
Y. J. Lee
Abstract:
Real-time anomalous fermion number violation is investigated for massless chiral fermions in spherically symmetric SU(2) Yang-Mills gauge field backgrounds which can be weakly dissipative or even nondissipative. Restricting consideration to spherically symmetric fermion fields, the zero-eigenvalue equation of the time-dependent effective Dirac Hamiltonian is studied in detail. For generic spherically symmetric SU(2) gauge fields in Minkowski spacetime, a relation is presented between the spectral flow and two characteristics of the background gauge field. These characteristics are the well-known ``winding factor,'' which is defined to be the change of the Chern-Simons number of the associated vacuum sector of the background gauge field, and a new ``twist factor,'' which can be obtained from the zero-eigenvalue equation of the effective Dirac Hamiltonian but is entirely determined by the background gauge field. For a particular class of (weakly dissipative) Luscher-Schechter gauge field solutions, the level crossings are calculated directly and nontrivial contributions to the spectral flow from both the winding factor and the twist factor are observed. The general result for the spectral flow may be relevant to electroweak baryon number violation in the early universe.
Submitted 6 September, 2001; v1 submitted 10 April, 2001;
originally announced April 2001.
-
Nuclear Density-Dependent Effective Coupling Constants in the Mean-Field Theory
Authors:
Jae Hwang Lee,
Young Jae Lee,
Suk-Joon Lee
Abstract:
It is shown that the equation of state of nuclear matter can be determined within the mean-field theory of $σω$ model provided only that the nucleon effective mass curve is given. We use a family of the possible nucleon effective mass curves that reproduce the empirical saturation point in the calculation of the nuclear binding energy curves in order to obtain density-dependent effective coupling constants. The resulting density-dependent coupling constants may be used to study a possible equation of state of nuclear system at high density or neutron matter. Within the constraints used in this paper to $M^*$ of nuclear matter at saturation point and zero density, neutron matter of large incompressibility is strongly bound at high density while soft neutron matter is weakly bound at low density. The study also exhibits the importance of surface vibration modes in the study of nuclear equation of state.
Submitted 8 October, 1996;
originally announced October 1996.