Skip to main content

Showing 1–49 of 49 results for author: Learned-Miller, E

.
  1. arXiv:2504.17626  [pdf, other

    cs.CV

    Improving Open-World Object Localization by Discovering Background

    Authors: Ashish Singh, Michael J. Jones, Kuan-Chuan Peng, Anoop Cherian, Moitreya Chatterjee, Erik Learned-Miller

    Abstract: Our work addresses the problem of learning to localize objects in an open-world setting, i.e., given the bounding box information of a limited number of object classes during training, the goal is to localize all objects, belonging to both the training and unseen classes in an image, during inference. Towards this end, recent work in this area has focused on improving the characterization of objec… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  2. arXiv:2502.17223  [pdf, other

    math.ST

    On the admissibility of bounds on the mean of discrete, scalar probability distributions from an iid sample

    Authors: Erik Learned-Miller

    Abstract: We address the problem of producing a lower bound for the mean of a discrete probability distribution, with known support over a finite set of real numbers, from an iid sample of that distribution. Up to a constant, this is equivalent to bounding the mean of a multinomial distribution (with known support) from a sample of that distribution. Our main contribution is to characterize the complete set… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 24 pages

  3. arXiv:2411.15931  [pdf, other

    cs.LG cs.CV cs.IT stat.AP stat.ML

    Improving Pre-trained Self-Supervised Embeddings Through Effective Entropy Maximization

    Authors: Deep Chakraborty, Yann LeCun, Tim G. J. Rudner, Erik Learned-Miller

    Abstract: A number of different architectures and loss functions have been applied to the problem of self-supervised learning (SSL), with the goal of developing embeddings that provide the best possible pre-training for as-yet-unknown, lightly supervised downstream tasks. One of these SSL criteria is to maximize the entropy of a set of embeddings in some compact space. But the goal of maximizing the embeddi… ▽ More

    Submitted 13 March, 2025; v1 submitted 24 November, 2024; originally announced November 2024.

    Comments: Published in Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025). A preliminary version of this work also appeared in the NeurIPS 2024 Workshop on Self-Supervised Learning: Theory and Practice

  4. arXiv:2311.02763  [pdf, ps, other

    math.ST cs.LG stat.ML

    Log-Concavity of Multinomial Likelihood Functions Under Interval Censoring Constraints on Frequencies or Their Partial Sums

    Authors: Bruce Levin, Erik Learned-Miller

    Abstract: We show that the likelihood function for a multinomial vector observed under arbitrary interval censoring constraints on the frequencies or their partial sums is completely log-concave by proving that the constrained sample spaces comprise M-convex subsets of the discrete simplex.

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 7 pages

  5. Machine Learning for Automated Mitral Regurgitation Detection from Cardiac Imaging

    Authors: Ke Xiao, Erik Learned-Miller, Evangelos Kalogerakis, James Priest, Madalina Fiterau

    Abstract: Mitral regurgitation (MR) is a heart valve disease with potentially fatal consequences that can only be forestalled through timely diagnosis and treatment. Traditional diagnosis methods are expensive, labor-intensive and require clinical expertise, posing a barrier to screening for MR. To overcome this impediment, we propose a new semi-supervised model for MR classification called CUSSP. CUSSP ope… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 12 pages including references and the appendix. 9 Figures, 2 tables. Accepted at MICCAI (Machine Learning for Automated Mitral Regurgitation Detection from Cardiac Imaging) 2023, Link to Springer at https://link.springer.com/chapter/10.1007/978-3-031-43990-2_23

    ACM Class: I.4.0; I.2.10

    Journal ref: In: Medical Image Computing and Computer Assisted Intervention - MICCAI 2023. pp. 236-246 (2023)

  6. arXiv:2309.08588  [pdf, other

    cs.CV cs.RO

    Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes

    Authors: Fabien Delattre, David Dirnfeld, Phat Nguyen, Stephen Scarano, Michael J. Jones, Pedro Miraldo, Erik Learned-Miller

    Abstract: We present an approach to estimating camera rotation in crowded, real-world scenes from handheld monocular video. While camera rotation estimation is a well-studied problem, no previous methods exhibit both high accuracy and acceptable speed in this setting. Because the setting is not addressed well by other datasets, we provide a new dataset and benchmark, with high-accuracy, rigorously verified… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Published at ICCV 2023

  7. arXiv:2306.17165  [pdf, other

    cs.CV cs.AI cs.LG

    An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training

    Authors: Zitian Chen, Mingyu Ding, Yikang Shen, Wei Zhan, Masayoshi Tomizuka, Erik Learned-Miller, Chuang Gan

    Abstract: We present a model that can perform multiple vision tasks and can be adapted to other downstream tasks efficiently. Despite considerable progress in multi-task learning, most efforts focus on learning from multi-label data: a single image set with multiple task labels. Such multi-label data sets are rare, small, and expensive. We say heterogeneous to refer to image sets with different task labels,… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  8. arXiv:2305.08962  [pdf, other

    cs.RO cs.CV

    Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface

    Authors: Shifan Zhu, Zhipeng Tang, Michael Yang, Erik Learned-Miller, Donghyun Kim

    Abstract: Our paper proposes a direct sparse visual odometry method that combines event and RGB-D data to estimate the pose of agile-legged robots during dynamic locomotion and acrobatic behaviors. Event cameras offer high temporal resolution and dynamic range, which can eliminate the issue of blurred RGB images during fast movements. This unique strength holds a potential for accurate pose estimation of ag… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 8 pages, 8 figures

  9. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee , et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  10. arXiv:2212.08066  [pdf, other

    cs.CV cs.AI cs.LG

    Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners

    Authors: Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik Learned-Miller, Chuang Gan

    Abstract: Optimization in multi-task learning (MTL) is more challenging than single-task learning (STL), as the gradient from different tasks can be contradictory. When tasks are related, it can be beneficial to share some parameters among them (cooperation). However, some tasks require additional parameters with expertise in a specific type of data or discrimination (specialization). To address the MTL cha… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  11. arXiv:2212.07900  [pdf, other

    cs.CV

    EVAL: Explainable Video Anomaly Localization

    Authors: Ashish Singh, Michael J. Jones, Erik Learned-Miller

    Abstract: We develop a novel framework for single-scene video anomaly localization that allows for human-understandable reasons for the decisions the system makes. We first learn general representations of objects and their motions (using deep networks) and then use these representations to build a high-level, location-dependent model of any particular scene. This model can be used to detect anomalies in ne… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  12. arXiv:2211.06780  [pdf, other

    cs.LG cs.CV

    Inv-SENnet: Invariant Self Expression Network for clustering under biased data

    Authors: Ashutosh Singh, Ashish Singh, Aria Masoomi, Tales Imbiriba, Erik Learned-Miller, Deniz Erdogmus

    Abstract: Subspace clustering algorithms are used for understanding the cluster structure that explains the dataset well. These methods are extensively used for data-exploration tasks in various areas of Natural Sciences. However, most of these methods fail to handle unwanted biases in datasets. For datasets where a data sample represents multiple attributes, naively applying any clustering approach can res… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

  13. arXiv:2204.09854  [pdf, other

    cs.CV

    Self-Supervised Learning to Guide Scientifically Relevant Categorization of Martian Terrain Images

    Authors: Tejas Panambur, Deep Chakraborty, Melissa Meyer, Ralph Milliken, Erik Learned-Miller, Mario Parente

    Abstract: Automatic terrain recognition in Mars rover images is an important problem not just for navigation, but for scientists interested in studying rock types, and by extension, conditions of the ancient Martian paleoclimate and habitability. Existing approaches to label Martian terrain either involve the use of non-expert annotators producing taxonomies of limited granularity (e.g. soil, sand, bedrock,… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Earthvision at CVPR Workshops 2022, Code and datasets are available at https://github.com/TejasPanambur/mastcam

    MSC Class: 68T07 ACM Class: I.2.10; I.4.8; I.5.1; I.5.3; J.2

  14. arXiv:2203.00115  [pdf, other

    cs.CV

    The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields

    Authors: Pia Bideau, Erik Learned-Miller, Cordelia Schmid, Karteek Alahari

    Abstract: Both a good understanding of geometrical concepts and a broad familiarity with objects lead to our excellent perception of moving objects. The human ability to detect and segment moving objects works in the presence of multiple objects, complex background geometry, motion of the observer and even camouflage. How humans perceive moving objects so reliably is a longstanding research question in comp… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

  15. arXiv:2112.13942  [pdf, other

    cs.CV

    PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

    Authors: Gopal Sharma, Bidya Dash, Aruni RoyChowdhury, Matheus Gadelha, Marios Loizou, Liangliang Cao, Rui Wang, Erik Learned-Miller, Subhransu Maji, Evangelos Kalogerakis

    Abstract: We present PriFit, a semi-supervised approach for label-efficient learning of 3D point cloud segmentation networks. PriFit combines geometric primitive fitting with point-based representation learning. Its key idea is to learn point representations whose clustering reveals shape regions that can be approximated well by basic geometric primitives, such as cuboids and ellipsoids. The learned point r… ▽ More

    Submitted 23 June, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  16. The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

    Authors: Cheng Gu, Erik Learned-Miller, Daniel Sheldon, Guillermo Gallego, Pia Bideau

    Abstract: Event cameras, inspired by biological vision systems, provide a natural and data efficient representation of visual information. Visual information is acquired in the form of events that are triggered by local brightness changes. Each pixel location of the camera's sensor records events asynchronously and independently with very high temporal resolution. However, because most brightness changes ar… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Journal ref: IEEE International Conference on Computer Vision 2021

  17. arXiv:2106.03163  [pdf, other

    math.ST

    Towards Practical Mean Bounds for Small Samples

    Authors: My Phan, Philip S. Thomas, Erik Learned-Miller

    Abstract: Historically, to bound the mean for small sample sizes, practitioners have had to choose between using methods with unrealistic assumptions about the unknown distribution (e.g., Gaussianity) and methods like Hoeffding's inequality that use weaker assumptions but produce much looser (wider) intervals. In 1969, Anderson (1969) proposed a mean confidence interval strictly better than or equal to Hoef… ▽ More

    Submitted 25 October, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: This is an extended work of our ICML 2021 paper "Towards Practical Mean Bounds for Small Samples"

  18. Passage Retrieval for Outside-Knowledge Visual Question Answering

    Authors: Chen Qu, Hamed Zamani, Liu Yang, W. Bruce Croft, Erik Learned-Miller

    Abstract: In this work, we address multi-modal information needs that contain text questions and images by focusing on passage retrieval for outside-knowledge visual question answering. This task requires access to outside knowledge, which in our case we define to be a large unstructured passage collection. We first conduct sparse retrieval with BM25 and study expanding the question with object names and im… ▽ More

    Submitted 9 May, 2021; originally announced May 2021.

    Comments: Accepted to SIGIR'21 as a short paper

  19. arXiv:2104.12820  [pdf, other

    cs.LG

    Universal Off-Policy Evaluation

    Authors: Yash Chandak, Scott Niekum, Bruno Castro da Silva, Erik Learned-Miller, Emma Brunskill, Philip S. Thomas

    Abstract: When faced with sequential decision-making problems, it is often useful to be able to predict what would happen if decisions were made using a new policy. Those predictions must often be based on data collected under some previously used decision-making rule. Many previous methods enable such off-policy (or counterfactual) estimation of the expected value of a performance measure called the return… ▽ More

    Submitted 2 November, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Accepted at Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021)

  20. arXiv:2103.17271  [pdf, other

    cs.CV

    DCVNet: Dilated Cost Volume Networks for Fast Optical Flow

    Authors: Huaizu Jiang, Erik Learned-Miller

    Abstract: The cost volume, capturing the similarity of possible correspondences across two input images, is a key ingredient in state-of-the-art optical flow approaches. When sampling correspondences to build the cost volume, a large neighborhood radius is required to deal with large displacements, introducing a significant computational burden. To address this, coarse-to-fine or recurrent processing of the… ▽ More

    Submitted 18 March, 2024; v1 submitted 31 March, 2021; originally announced March 2021.

  21. arXiv:2012.08707  [pdf, other

    cs.CV

    SID-NISM: A Self-supervised Low-light Image Enhancement Framework

    Authors: Lijun Zhang, Xiao Liu, Erik Learned-Miller, Hui Guan

    Abstract: When capturing images in low-light conditions, the images often suffer from low visibility, which not only degrades the visual aesthetics of images, but also significantly degenerates the performance of many computer vision algorithms. In this paper, we propose a self-supervised low-light image enhancement framework (SID-NISM), which consists of two components, a Self-supervised Image Decompositio… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 11 pages, 9 figures

  22. arXiv:2010.02430  [pdf, other

    cs.CV

    Shot in the Dark: Few-Shot Learning with No Base-Class Labels

    Authors: Zitian Chen, Subhransu Maji, Erik Learned-Miller

    Abstract: Few-shot learning aims to build classifiers for new classes from a small number of labeled examples and is commonly facilitated by access to examples from a distinct set of 'base classes'. The difference in data distribution between the test set (novel classes) and the base classes used to learn an inductive bias often results in poor generalization on the novel classes. To alleviate problems caus… ▽ More

    Submitted 22 April, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Learning from Limited or Imperfect Data (L2ID) Workshop @ CVPR 2021

  23. arXiv:2007.06995  [pdf, other

    cs.CV

    Improving Face Recognition by Clustering Unlabeled Faces in the Wild

    Authors: Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker

    Abstract: While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotation. Prior work has mostly been in controlled settings, where the labeled and unlabeled data sets have no overlapping identities by construction. This is not realistic in large-scale face reco… ▽ More

    Submitted 15 July, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  24. arXiv:2006.15056  [pdf, other

    cs.CV

    Cross-Supervised Object Detection

    Authors: Zitian Chen, Zhiqiang Shen, Jiahui Yu, Erik Learned-Miller

    Abstract: After learning a new object category from image-level annotations (with no object bounding boxes), humans are remarkably good at precisely localizing those objects. However, building good object localizers (i.e., detectors) currently requires expensive instance-level annotations. While some work has been done on learning detectors from weakly labeled samples (with only class labels), these detecto… ▽ More

    Submitted 28 June, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

  25. arXiv:2003.13834  [pdf, other

    cs.CV cs.GR cs.LG

    Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions

    Authors: Matheus Gadelha, Aruni RoyChowdhury, Gopal Sharma, Evangelos Kalogerakis, Liangliang Cao, Erik Learned-Miller, Rui Wang, Subhransu Maji

    Abstract: The problems of shape classification and part segmentation from 3D point clouds have garnered increasing attention in the last few years. Both of these problems, however, suffer from relatively small training sets, creating the need for statistically efficient methods to learn 3D shape representations. In this paper, we investigate the use of Approximate Convex Decompositions (ACD) as a self-super… ▽ More

    Submitted 4 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: First two authors had equal contribution. ECCV'20 version. 19 pages, 5 figures

    Journal ref: 16th European Conference on Computer Vision (ECCV 2020)

  26. arXiv:2001.03615  [pdf, other

    cs.CV

    In Defense of Grid Features for Visual Question Answering

    Authors: Huaizu Jiang, Ishan Misra, Marcus Rohrbach, Erik Learned-Miller, Xinlei Chen

    Abstract: Popularized as 'bottom-up' attention, bounding box (or region) based visual features have recently surpassed vanilla grid-based convolutional features as the de facto standard for vision and language tasks like visual question answering (VQA). However, it is not clear whether the advantages of regions (e.g. better localization) are the key reasons for the success of bottom-up attention. In this pa… ▽ More

    Submitted 2 April, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

    Journal ref: CVPR, 2020

  27. arXiv:1910.12361  [pdf, other

    cs.CV

    SENSE: a Shared Encoder Network for Scene-flow Estimation

    Authors: Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik Learned-Miller, Jan Kautz

    Abstract: We introduce a compact network for holistic scene flow estimation, called SENSE, which shares common encoder features among four closely-related tasks: optical flow estimation, disparity estimation from stereo, occlusion estimation, and semantic segmentation. Our key insight is that sharing features makes the network more compact, induces better feature representations, and can better exploit inte… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Comments: ICCV 2019 Oral

    Journal ref: International Conference on Computer Vision 2019

  28. arXiv:1905.06208  [pdf, other

    math.ST cs.LG math.PR

    A New Confidence Interval for the Mean of a Bounded Random Variable

    Authors: Erik Learned-Miller, Philip S. Thomas

    Abstract: We present a new method for constructing a confidence interval for the mean of a bounded random variable from samples of the random variable. We conjecture that the confidence interval has guaranteed coverage, i.e., that it contains the mean with high probability for all distributions on a bounded interval, for all samples sizes, and for all confidence levels. This new method provides confidence i… ▽ More

    Submitted 4 November, 2020; v1 submitted 15 May, 2019; originally announced May 2019.

  29. arXiv:1904.07305  [pdf, other

    cs.CV cs.LG

    Automatic adaptation of object detectors to new domains using self-training

    Authors: Aruni RoyChowdhury, Prithvijit Chakrabarty, Ashish Singh, SouYoung Jin, Huaizu Jiang, Liangliang Cao, Erik Learned-Miller

    Abstract: This work addresses the unsupervised adaptation of an existing object detector to a new target domain. We assume that a large number of unlabeled videos from this domain are readily available. We automatically obtain labels on the target data by using high-confidence detections from the existing detector, augmented with hard (misclassified) examples acquired by exploiting temporal cues using a tra… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Accepted at CVPR 2019

  30. arXiv:1904.05373  [pdf, other

    cs.CV cs.GR cs.LG

    Pixel-Adaptive Convolutional Neural Networks

    Authors: Hang Su, Varun Jampani, Deqing Sun, Orazio Gallo, Erik Learned-Miller, Jan Kautz

    Abstract: Convolutions are the fundamental building block of CNNs. The fact that their weights are spatially shared is one of the main reasons for their widespread use, but it also is a major limitation, as it makes convolutions content agnostic. We propose a pixel-adaptive convolution (PAC) operation, a simple yet effective modification of standard convolutions, in which the filter weights are multiplied w… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: CVPR 2019. Video introduction: https://youtu.be/gsQZbHuR64o

  31. arXiv:1902.05492  [pdf, other

    cs.CV

    Integrating Propositional and Relational Label Side Information for Hierarchical Zero-Shot Image Classification

    Authors: Colin Samplawski, Heesung Kwon, Erik Learned-Miller, Benjamin M. Marlin

    Abstract: Zero-shot learning (ZSL) is one of the most extreme forms of learning from scarce labeled data. It enables predicting that images belong to classes for which no labeled training instances are available. In this paper, we present a new ZSL framework that leverages both label attribute side information and a semantic label hierarchy. We present two methods, lifted zero-shot prediction and a custom c… ▽ More

    Submitted 14 February, 2019; originally announced February 2019.

  32. arXiv:1902.00626  [pdf

    cs.LG cs.CV stat.ML

    Nonparametric Curve Alignment

    Authors: Marwan Mattar, Michael Ross, Erik Learned-Miller

    Abstract: Congealing is a flexible nonparametric data-driven framework for the joint alignment of data. It has been successfully applied to the joint alignment of binary images of digits, binary images of object silhouettes, grayscale MRI images, color images of cars and faces, and 3D brain volumes. This research enhances congealing to practically and effectively apply it to curve data. We develop a paramet… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: 4 pages, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

  33. arXiv:1808.04285  [pdf, other

    cs.CV

    Unsupervised Hard Example Mining from Videos for Improved Object Detection

    Authors: SouYoung Jin, Aruni RoyChowdhury, Huaizu Jiang, Ashish Singh, Aditya Prasad, Deep Chakraborty, Erik Learned-Miller

    Abstract: Important gains have recently been obtained in object detection by using training objectives that focus on {\em hard negative} examples, i.e., negative examples that are currently rated as positive or ambiguous by the detector. These examples can strongly influence parameters when the network is trained to correct them. Unfortunately, they are often sparse in the training data, and are expensive t… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: 14 pages, 7 figures, accepted at ECCV 2018

  34. arXiv:1712.04850  [pdf, other

    cs.CV

    Self-Supervised Relative Depth Learning for Urban Scene Understanding

    Authors: Huaizu Jiang, Erik Learned-Miller, Gustav Larsson, Michael Maire, Greg Shakhnarovich

    Abstract: As an agent moves through the world, the apparent motion of scene elements is (usually) inversely proportional to their depth. It is natural for a learning agent to associate image patterns with the magnitude of their displacement over time: as the agent moves, faraway mountains don't move much; nearby trees move a lot. This natural relationship between the appearance of objects and their motion i… ▽ More

    Submitted 2 April, 2018; v1 submitted 13 December, 2017; originally announced December 2017.

  35. arXiv:1712.00080  [pdf, other

    cs.CV

    Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation

    Authors: Huaizu Jiang, Deqing Sun, Varun Jampani, Ming-Hsuan Yang, Erik Learned-Miller, Jan Kautz

    Abstract: Given two consecutive frames, video interpolation aims at generating intermediate frame(s) to form both spatially and temporally coherent video sequences. While most existing methods focus on single-frame interpolation, we propose an end-to-end convolutional neural network for variable-length multi-frame video interpolation, where the motion interpretation and occlusion reasoning are jointly model… ▽ More

    Submitted 13 July, 2018; v1 submitted 30 November, 2017; originally announced December 2017.

    Comments: CVPR 2018 version with minor revisions and supplementary material included. Project page http://jianghz.me/projects/superslomo

    Journal ref: CVPR 2018

  36. arXiv:1709.02458  [pdf, other

    cs.CV

    End-to-end Face Detection and Cast Grouping in Movies Using Erdős-Rényi Clustering

    Authors: SouYoung Jin, Hang Su, Chris Stauffer, Erik Learned-Miller

    Abstract: We present an end-to-end system for detecting and clustering faces by identity in full-length movies. Unlike works that start with a predefined set of detected faces, we consider the end-to-end problem of detection and clustering together. We make three separate contributions. First, we combine a state-of-the-art face detector with a generic tracker to extract high quality face tracklets. We then… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: to appear in ICCV 2017 (spotlight)

  37. arXiv:1704.07433  [pdf, other

    stat.ML cs.LG

    Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples

    Authors: Haw-Shiuan Chang, Erik Learned-Miller, Andrew McCallum

    Abstract: Self-paced learning and hard example mining re-weight training instances to improve learning accuracy. This paper presents two improved alternatives based on lightweight estimates of sample uncertainty in stochastic gradient descent (SGD): the variance in predicted probability of the correct class across iterations of mini-batch SGD, and the proximity of the correct class probability to the decisi… ▽ More

    Submitted 6 January, 2018; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: camera-ready version for NIPS 2017

  38. arXiv:1610.10033  [pdf, other

    cs.CV

    A Detailed Rubric for Motion Segmentation

    Authors: Pia Bideau, Erik Learned-Miller

    Abstract: Motion segmentation is currently an active area of research in computer Vision. The task of comparing different methods of motion segmentation is complicated by the fact that researchers may use subtly different definitions of the problem. Questions such as "Which objects are moving?", "What is background?", and "How can we use motion of the camera to segment objects, whether they are static or mo… ▽ More

    Submitted 31 October, 2016; originally announced October 2016.

  39. arXiv:1609.03947  [pdf, other

    cs.CV cs.RO

    Associating Grasp Configurations with Hierarchical Features in Convolutional Neural Networks

    Authors: Li Yang Ku, Erik Learned-Miller, Rod Grupen

    Abstract: In this work, we provide a solution for posturing the anthropomorphic Robonaut-2 hand and arm for grasping based on visual information. A mapping from visual features extracted from a convolutional neural network (CNN) to grasp points is learned. We demonstrate that a CNN pre-trained for image classification can be applied to a grasping task based on a small set of grasping examples. Our approach… ▽ More

    Submitted 25 July, 2017; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: 8 pages, 9 figures, IROS 2017

  40. arXiv:1606.03473  [pdf, other

    cs.CV

    Face Detection with the Faster R-CNN

    Authors: Huaizu Jiang, Erik Learned-Miller

    Abstract: The Faster R-CNN has recently demonstrated impressive results on various object detection benchmarks. By training a Faster R-CNN model on the large scale WIDER face dataset, we report state-of-the-art results on two widely used face detection benchmarks, FDDB and the recently released IJB-A.

    Submitted 10 June, 2016; originally announced June 2016.

    Comments: technical report

  41. arXiv:1604.00136  [pdf, other

    cs.CV

    It's Moving! A Probabilistic Model for Causal Motion Segmentation in Moving Camera Videos

    Authors: Pia Bideau, Erik Learned-Miller

    Abstract: The human ability to detect and segment moving objects works in the presence of multiple objects, complex background geometry, motion of the observer, and even camouflage. In addition to all of this, the ability to detect motion is nearly instantaneous. While there has been much recent progress in motion segmentation, it still appears we are far from human capabilities. In this work, we derive fro… ▽ More

    Submitted 1 April, 2016; originally announced April 2016.

  42. Background Modeling Using Adaptive Pixelwise Kernel Variances in a Hybrid Feature Space

    Authors: Manjunath Narayana, Allen Hanson, Erik Learned-Miller

    Abstract: Recent work on background subtraction has shown developments on two major fronts. In one, there has been increasing sophistication of probabilistic models, from mixtures of Gaussians at each pixel [7], to kernel density estimates at each pixel [1], and more recently to joint domainrange density estimates that incorporate spatial information [6]. Another line of work has shown the benefits of incre… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

    Comments: 8 pages, 4 figures, CVPR 2012 conference paper in CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  43. arXiv:1511.01627  [pdf, other

    cs.CV

    Background subtraction - separating the modeling and the inference

    Authors: Manjunath Narayana, Allen Hanson, Erik Learned-Miller

    Abstract: In its early implementations, background modeling was a process of building a model for the background of a video with a stationary camera, and identifying pixels that did not conform well to this model. The pixels that were not well-described by the background model were assumed to be moving objects. Many systems today maintain models for the foreground as well as the background, and these models… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

    Comments: 19 pages, 6 figures, Machine Vision and Applications journal

    Journal ref: Machine Vision and Applications July 2014, Volume 25, Issue 5, pp 1163-1174

  44. Coherent Motion Segmentation in Moving Camera Videos using Optical Flow Orientations

    Authors: Manjunath Narayana, Allen Hanson, Erik Learned-Miller

    Abstract: In moving camera videos, motion segmentation is commonly performed using the image plane motion of pixels, or optical flow. However, objects that are at different depths from the camera can exhibit different optical flows even if they share the same real-world motion. This can cause a depth-dependent segmentation of the scene. Our goal is to develop a segmentation algorithm that clusters pixels th… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

    Comments: 8 pages, 5 figures, in 2013 IEEE International Conference on Computer Vision (ICCV)

  45. arXiv:1506.01342  [pdf, other

    cs.CV

    One-to-many face recognition with bilinear CNNs

    Authors: Aruni RoyChowdhury, Tsung-Yu Lin, Subhransu Maji, Erik Learned-Miller

    Abstract: The recent explosive growth in convolutional neural network (CNN) research has produced a variety of new architectures for deep learning. One intriguing new architecture is the bilinear CNN (B-CNN), which has shown dramatic performance gains on certain fine-grained recognition problems [15]. We apply this new CNN to the challenging new face recognition benchmark, the IARPA Janus Benchmark A (IJB-A… ▽ More

    Submitted 28 March, 2016; v1 submitted 3 June, 2015; originally announced June 2015.

    Comments: Published version at WACV 2016

  46. arXiv:1505.00880  [pdf, other

    cs.CV cs.GR

    Multi-view Convolutional Neural Networks for 3D Shape Recognition

    Authors: Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller

    Abstract: A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should 3D shapes be represented with descriptors operating on their native 3D formats, such as voxel grid or polygon mesh, or can they be effectively represented with view-based descriptors? We address this question in the context of learning to recognize 3D shapes from a collection of their render… ▽ More

    Submitted 27 September, 2015; v1 submitted 5 May, 2015; originally announced May 2015.

    Comments: v1: Initial version. v2: An updated ModelNet40 training/test split is used; results with low-rank Mahalanobis metric learning are added. v3 (ICCV 2015): A second camera setup without the upright orientation assumption is added; some accuracy and mAP numbers are changed slightly because a small issue in mesh rendering related to specularities is fixed

  47. arXiv:1210.4892  [pdf

    cs.LG stat.ML

    Unsupervised Joint Alignment and Clustering using Bayesian Nonparametrics

    Authors: Marwan A. Mattar, Allen R. Hanson, Erik G. Learned-Miller

    Abstract: Joint alignment of a collection of functions is the process of independently transforming the functions so that they appear more similar to each other. Typically, such unsupervised alignment algorithms fail when presented with complex data sets arising from multiple modalities or make restrictive assumptions about the form of the functions or transformations, limiting their generality. We present… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-584-593

  48. arXiv:0907.0418  [pdf, ps, other

    cs.CV

    Bounding the Probability of Error for High Precision Recognition

    Authors: Andrew Kae, Gary B. Huang, Erik Learned-Miller

    Abstract: We consider models for which it is important, early in processing, to estimate some variables with high precision, but perhaps at relatively low rates of recall. If some variables can be identified with near certainty, then they can be conditioned upon, allowing further inference to be done efficiently. Specifically, we consider optical character recognition (OCR) systems that can be bootstrappe… ▽ More

    Submitted 2 July, 2009; originally announced July 2009.

    Report number: UM-CS-2009-031

  49. arXiv:cs/0504091  [pdf

    cs.IT

    A Probabilistic Upper Bound on Differential Entropy

    Authors: Joseph DeStefano, Erik Learned-Miller

    Abstract: A novel, non-trivial, probabilistic upper bound on the entropy of an unknown one-dimensional distribution, given the support of the distribution and a sample from that distribution, is presented. No knowledge beyond the support of the unknown distribution is required, nor is the distribution required to have a density. Previous distribution-free bounds on the cumulative distribution function of… ▽ More

    Submitted 21 April, 2005; originally announced April 2005.