Showing 1–46 of 46 results for author: Krishnan, D

  1. arXiv:2504.19916  [pdf, ps, other]

    cs.IT

    An Achievability Bound for Type-Based Unsourced Multiple Access

    Authors: Deekshith Pathayappilly Krishnan, Kaan Okumus, Khac-Hoang Ngo, Giuseppe Durisi

    Abstract: We derive an achievability bound to quantify the performance of a type-based unsourced multiple access system -- an information-theoretic model for grant-free multiple access with correlated messages. The bound extends available achievability results for the per-user error probability in the unsourced multiple access framework, where, different from our setup, message collisions are treated as err…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 8 pages, 1 figure. Extended version of a paper accepted for presentation at ISIT 2025

  2. arXiv:2411.14464  [pdf, ps, other]

    q-bio.QM cs.AI cs.LG q-bio.BM

    JESTR: Joint Embedding Space Technique for Ranking Candidate Molecules for the Annotation of Untargeted Metabolomics Data

    Authors: Apurva Kalia, Yan Zhou Chen, Dilip Krishnan, Soha Hassoun

    Abstract: Motivation: A major challenge in metabolomics is annotation: assigning molecular structures to mass spectral fragmentation patterns. Despite recent advances in molecule-to-spectra and in spectra-to-molecular fingerprint prediction (FP), annotation rates remain low. Results: We introduce in this paper a novel paradigm (JESTR) for annotation. Unlike prior approaches that explicitly construct molecul…

    Submitted 7 June, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

    Comments: 10 pages, 10 figures, 4 tables

  3. arXiv:2409.19233  [pdf, ps, other]

    nlin.SI nlin.PS

    Collisional Dynamics of Solitons and Pattern Formation in an Integrable Cross Coupled Nonlinear Schrodinger equation with constant background

    Authors: P. S. Vinayagam, D. Aravindha Krishnan, R. V. Kamaleshwaran, R. Radha

    Abstract: We investigate the dynamics arising out of the propagation of light pulses with different polarizations through a condensate (referred to as a constant background field) with cross coupling described by a coupled nonlinear Schrodinger equation (NLSE) type equation. We then employ Gauge and Darboux transformation approach to bring out the rich dynamics arising out of the background field and cross c…

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 14 pages, 6 figures, Accepted for Publication in Romanian Reports in Physics (2024)

    MSC Class: 37K40; 35Q51; 35Q55

  4. arXiv:2404.19552  [pdf, ps, other]

    cs.IT

    Type-Based Unsourced Multiple Access

    Authors: Khac-Hoang Ngo, Deekshith Pathayappilly Krishnan, Kaan Okumus, Giuseppe Durisi, Erik G. Ström

    Abstract: We generalize the type-based multiple access framework proposed by Mergen and Tong (2006) to the case of unsourced multiple access. In the proposed framework, each device tracks the state of a physical/digital process, quantizes this state, and communicates it to a common receiver through a shared channel in an uncoordinated manner. The receiver aims to estimate the type of the states, i.e., the s…

    Submitted 15 July, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: accepted to the 25th IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC); simulation code available at: https://github.com/khachoang1412/TUMA

  5. arXiv:2401.02957  [pdf, other]

    cs.CV

    Denoising Vision Transformers

    Authors: Jiawei Yang, Katie Z Luo, Jiefeng Li, Congyue Deng, Leonidas Guibas, Dilip Krishnan, Kilian Q Weinberger, Yonglong Tian, Yue Wang

    Abstract: We study a crucial yet often overlooked issue inherent to Vision Transformers (ViTs): feature maps of these models exhibit grid-like artifacts, which hurt the performance of ViTs in downstream dense prediction tasks such as semantic segmentation, depth prediction, and object discovery. We trace this issue down to the positional embeddings at the input stage. To mitigate this, we propose a two-stag…

    Submitted 22 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted to ECCV2024. Project website: https://jiawei-yang.github.io/DenoisingViT/

  6. arXiv:2312.17742  [pdf, other]

    cs.CV

    Learning Vision from Models Rivals Learning Vision from Data

    Authors: Yonglong Tian, Lijie Fan, Kaifeng Chen, Dina Katabi, Dilip Krishnan, Phillip Isola

    Abstract: We introduce SynCLR, a novel approach for learning visual representations exclusively from synthetic images and synthetic captions, without any real data. We synthesize a large dataset of image captions using LLMs, then use an off-the-shelf text-to-image model to generate multiple images corresponding to each synthetic caption. We perform visual representation learning on these synthetic images vi…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Code is available at https://github.com/google-research/syn-rep-learn

  7. arXiv:2312.04567  [pdf, other]

    cs.CV

    Scaling Laws of Synthetic Images for Model Training ... for Now

    Authors: Lijie Fan, Kaifeng Chen, Dilip Krishnan, Dina Katabi, Phillip Isola, Yonglong Tian

    Abstract: Recent significant advances in text-to-image models unlock the possibility of training vision systems using synthetic images, potentially overcoming the difficulty of collecting curated data at scale. It is unclear, however, how these models behave at scale, as more synthetic data is added to the training set. In this paper we study the scaling laws of synthetic images generated by state of the ar…

    Submitted 7 December, 2023; originally announced December 2023.

  8. arXiv:2312.00950  [pdf, other]

    cs.CV

    Improve Supervised Representation Learning with Masked Image Modeling

    Authors: Kaifeng Chen, Daniel Salz, Huiwen Chang, Kihyuk Sohn, Dilip Krishnan, Mojtaba Seyedhosseini

    Abstract: Training visual embeddings with labeled data supervision has been the de facto setup for representation learning in computer vision. Inspired by recent success of adopting masked image modeling (MIM) in self-supervised representation learning, we propose a simple yet effective setup that can easily integrate MIM into existing supervised training paradigms. In our design, in addition to the origina…

    Submitted 1 December, 2023; originally announced December 2023.

  9. arXiv:2310.03734  [pdf, other]

    cs.CV

    Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency

    Authors: Tianhong Li, Sangnie Bhardwaj, Yonglong Tian, Han Zhang, Jarred Barber, Dina Katabi, Guillaume Lajoie, Huiwen Chang, Dilip Krishnan

    Abstract: Current vision-language generative models rely on expansive corpora of paired image-text data to attain optimal performance and generalization capabilities. However, automatically collecting such data (e.g. via large-scale web scraping) leads to low quality and poor image-text correlation, while human annotation is more accurate but requires significant manual effort and expense. We introduce…

    Submitted 5 October, 2023; originally announced October 2023.

  10. arXiv:2307.05610  [pdf, other]

    cs.LG cs.AI cs.CV

    Substance or Style: What Does Your Image Embedding Know?

    Authors: Cyrus Rashtchian, Charles Herrmann, Chun-Sung Ferng, Ayan Chakrabarti, Dilip Krishnan, Deqing Sun, Da-Cheng Juan, Andrew Tomkins

    Abstract: Probes are small networks that predict properties of underlying data from embeddings, and they provide a targeted, effective way to illuminate the information contained in embeddings. While analysis through the use of probes has become standard in NLP, there has been much less exploration in vision. Image foundation models have primarily been evaluated for semantic content. Better understanding th…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 27 pages, 9 figures

  11. arXiv:2306.00984  [pdf, other]

    cs.CV

    StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners

    Authors: Yonglong Tian, Lijie Fan, Phillip Isola, Huiwen Chang, Dilip Krishnan

    Abstract: We investigate the potential of learning visual representations using synthetic images generated by text-to-image models. This is a natural question in the light of the excellent performance of such models in generating high-quality images. We consider specifically the Stable Diffusion, one of the leading open source text-to-image models. We show that (1) when the generative model is configured wi…

    Submitted 26 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: code is available at: https://github.com/google-research/syn-rep-learn

  12. arXiv:2306.00983  [pdf, other]

    cs.CV cs.AI

    StyleDrop: Text-to-Image Generation in Any Style

    Authors: Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

    Abstract: Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts. However, ambiguities inherent in natural language and out-of-distribution effects make it hard to synthesize image styles, that leverage a specific design pattern, texture or material. In this paper, we introduce StyleDrop, a method that enables the synthesis of images that faithfully follo…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Preprint. Project page at https://styledrop.github.io

  13. arXiv:2305.20088  [pdf, other]

    cs.CV cs.CL cs.LG

    Improving CLIP Training with Language Rewrites

    Authors: Lijie Fan, Dilip Krishnan, Phillip Isola, Dina Katabi, Yonglong Tian

    Abstract: Contrastive Language-Image Pre-training (CLIP) stands as one of the most effective and scalable methods for training transferable vision models using paired image and text data. CLIP models are trained using contrastive loss, which typically relies on data augmentations to prevent overfitting and shortcuts. However, in the CLIP training paradigm, data augmentations are exclusively applied to image…

    Submitted 28 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  14. arXiv:2302.11349  [pdf, other]

    cs.CV

    Steerable Equivariant Representation Learning

    Authors: Sangnie Bhardwaj, Willie McClinton, Tongzhou Wang, Guillaume Lajoie, Chen Sun, Phillip Isola, Dilip Krishnan

    Abstract: Pre-trained deep image representations are useful for post-training tasks such as classification through transfer learning, image retrieval, and object detection. Data augmentations are a crucial aspect of pre-training robust representations in both supervised and self-supervised settings. Data augmentations explicitly or implicitly promote invariance in the embedding space to the input image tran…

    Submitted 22 February, 2023; originally announced February 2023.

  15. arXiv:2301.00704  [pdf, other]

    cs.CV cs.AI cs.LG

    Muse: Text-To-Image Generation via Masked Generative Transformers

    Authors: Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan

    Abstract: We present Muse, a text-to-image Transformer model that achieves state-of-the-art image generation performance while being significantly more efficient than diffusion or autoregressive models. Muse is trained on a masked modeling task in discrete token space: given the text embedding extracted from a pre-trained large language model (LLM), Muse is trained to predict randomly masked image tokens. C…

    Submitted 2 January, 2023; originally announced January 2023.

  16. arXiv:2211.09117  [pdf, other]

    cs.CV

    MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

    Authors: Tianhong Li, Huiwen Chang, Shlok Kumar Mishra, Han Zhang, Dina Katabi, Dilip Krishnan

    Abstract: Generative modeling and representation learning are two key tasks in computer vision. However, these models are typically trained independently, which ignores the potential for each task to help the other, and leads to training and model maintenance overheads. In this work, we propose MAsked Generative Encoder (MAGE), the first framework to unify SOTA image generation and self-supervised represent…

    Submitted 29 June, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Update sponsor info

  17. arXiv:2210.16870  [pdf, other]

    cs.CV cs.LG

    A simple, efficient and scalable contrastive masked autoencoder for learning visual representations

    Authors: Shlok Mishra, Joshua Robinson, Huiwen Chang, David Jacobs, Aaron Sarna, Aaron Maschinot, Dilip Krishnan

    Abstract: We introduce CAN, a simple, efficient and scalable method for self-supervised learning of visual representations. Our framework is a minimal and conceptually clean synthesis of (C) contrastive learning, (A) masked autoencoders, and (N) the noise prediction approach used in diffusion models. The learning mechanisms are complementary to one another: contrastive learning shapes the embedding space ac…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Mishra and Robinson contributed equally

  18. The Lick Observatory Supernova Search follow-up program: photometry data release of 70 stripped-envelope supernovae

    Authors: WeiKang Zheng, Benjamin E. Stahl, Thomas de Jaeger, Alexei V. Filippenko, Shan-Qin Wang, Wen-Pei Gan, Thomas G. Brink, Ivan Altunin, Raphael Baer-Way, Andrew Bigley, Kyle Blanchard, Peter K. Blanchard, James Bradley, Samantha K. Cargill, Chadwick Casper, Teagan Chapman, Vidhi Chander, Sanyum Channa, Byung Yun Choi, Nick Choksi, Matthew Chu, Kelsey I. Clubb, Daniel P. Cohen, Paul A. Dalba, Asia deGraw, et al. (63 additional authors not shown)

    Abstract: We present BVRI and unfiltered Clear light curves of 70 stripped-envelope supernovae (SESNe), observed between 2003 and 2020, from the Lick Observatory Supernova Search (LOSS) follow-up program. Our SESN sample consists of 19 spectroscopically normal SNe Ib, two peculiar SNe Ib, six SN Ibn, 14 normal SNe Ic, one peculiar SN Ic, ten SNe Ic-BL, 15 SNe IIb, one ambiguous SN IIb/Ib/c, and two superlum…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted by MNRAS

  19. arXiv:2112.00319  [pdf, other]

    cs.CV cs.LG

    Object-Aware Cropping for Self-Supervised Learning

    Authors: Shlok Mishra, Anshul Shah, Ankan Bansal, Abhyuday Jagannatha, Janit Anjaria, Abhishek Sharma, David Jacobs, Dilip Krishnan

    Abstract: A core component of the recent success of self-supervised learning is cropping data augmentation, which selects sub-regions of an image to be used as positive views in the self-supervised loss. The underlying assumption is that randomly cropped and resized regions of a given image share information about the objects of interest, which the learned representation will capture. This assumption is mos…

    Submitted 6 April, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Journal ref: Transactions on Machine Learning Research 2022

  20. arXiv:2111.15121  [pdf, other]

    cs.CV

    Pyramid Adversarial Training Improves ViT Performance

    Authors: Charles Herrmann, Kyle Sargent, Lu Jiang, Ramin Zabih, Huiwen Chang, Ce Liu, Dilip Krishnan, Deqing Sun

    Abstract: Aggressive data augmentation is a key component of the strong generalization capabilities of Vision Transformer (ViT). One such data augmentation technique is adversarial training (AT); however, many prior works have shown that this often results in poor clean accuracy. In this work, we present pyramid adversarial training (PyramidAT), a simple and effective technique to improve ViT's overall perf…

    Submitted 2 September, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR22 (oral, best paper finalist). 33 pages, including references & supplementary material

  21. arXiv:2111.09467  [pdf, other]

    cs.LG q-bio.MN q-bio.QM

    CSI: Contrastive Data Stratification for Interaction Prediction and its Application to Compound-Protein Interaction Prediction

    Authors: Apurva Kalia, Dilip Krishnan, Soha Hassoun

    Abstract: Accurately predicting the likelihood of interaction between two objects (compound-protein sequence, user-item, author-paper, etc.) is a fundamental problem in Computer Science. Current deep-learning models rely on learning accurate representations of the interacting objects. Importantly, relationships between the interacting objects, or features of the interaction, offer an opportunity to partitio…

    Submitted 21 December, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 11 pages, submitted to BioInformatics

  22. arXiv:2108.06613  [pdf, other]

    cs.CV cs.LG

    Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

    Authors: Andrea Burns, Aaron Sarna, Dilip Krishnan, Aaron Maschinot

    Abstract: Disentangled visual representations have largely been studied with generative models such as Variational AutoEncoders (VAEs). While prior work has focused on generative methods for disentangled representation learning, these approaches do not scale to large datasets due to current limitations of generative models. Instead, we explore regularization methods with contrastive learning, which could re…

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: Accepted at the ICML 2021 Self-Supervised Learning for Reasoning and Perception Workshop

  23. arXiv:2107.13047  [pdf, other]

    cs.DB cs.CR cs.DC

    RingBFT: Resilient Consensus over Sharded Ring Topology

    Authors: Sajjad Rahnama, Suyash Gupta, Rohan Sogani, Dhruv Krishnan, Mohammad Sadoghi

    Abstract: The recent surge in federated data management applications has brought forth concerns about the security of underlying data and the consistency of replicas in the presence of malicious attacks. A prominent solution in this direction is to employ a permissioned blockchain framework that is modeled around traditional Byzantine Fault-Tolerant (BFT) consensus protocols. Any federated application expec…

    Submitted 23 March, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: In proceedings of EDBT 2022

  24. arXiv:2103.07470  [pdf, other]

    cs.LG

    Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers

    Authors: Piotr Teterwak, Chiyuan Zhang, Dilip Krishnan, Michael C. Mozer

    Abstract: A discriminatively trained neural net classifier can fit the training data perfectly if all information about its input other than class membership has been discarded prior to the output layer. Surprisingly, past research has discovered that some extraneous visual detail remains in the logit vector. This finding is based on inversion techniques that map deep embeddings back to images. We explore t…

    Submitted 21 July, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: Camera Ready ICML 2021

  25. arXiv:2005.10243  [pdf, other]

    cs.CV cs.LG

    What Makes for Good Views for Contrastive Learning?

    Authors: Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola

    Abstract: Contrastive learning between multiple views of the data has recently achieved state of the art performance in the field of self-supervised representation learning. Despite its success, the influence of different view choices has been less studied. In this paper, we use theoretical and empirical analysis to better understand the importance of view selection, and argue that we should reduce the mutu…

    Submitted 18 December, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: NeurIPS 2020. Project page: https://hobbitlong.github.io/InfoMin/

  26. arXiv:2004.11362  [pdf, other]

    cs.LG cs.CV stat.ML

    Supervised Contrastive Learning

    Authors: Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, Dilip Krishnan

    Abstract: Contrastive learning applied to self-supervised representation learning has seen a resurgence in recent years, leading to state of the art performance in the unsupervised training of deep image models. Modern batch contrastive approaches subsume or significantly outperform traditional contrastive losses such as triplet, max-margin and the N-pairs loss. In this work, we extend the self-supervised b…

    Submitted 10 March, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

  27. arXiv:2003.11539  [pdf, other]

    cs.CV cs.LG

    Rethinking Few-Shot Image Classification: a Good Embedding Is All You Need?

    Authors: Yonglong Tian, Yue Wang, Dilip Krishnan, Joshua B. Tenenbaum, Phillip Isola

    Abstract: The focus of recent meta-learning research has been on the development of learning algorithms that can quickly adapt to test time tasks with limited data and low computational cost. Few-shot learning is widely used as one of the standard benchmarks in meta-learning. In this work, we show that a simple baseline: learning a supervised or self-supervised representation on the meta-training set, follo…

    Submitted 17 June, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: First two authors contributed equally. Project Page: https://people.csail.mit.edu/yuewang/projects/rfs/ Code: http://github.com/WangYueFt/rfs/

  28. arXiv:1912.02178  [pdf, other]

    cs.LG stat.ML

    Fantastic Generalization Measures and Where to Find Them

    Authors: Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio

    Abstract: Generalization of deep networks has been of great interest in recent years, resulting in a number of theoretically and empirically motivated complexity measures. However, most papers proposing such measures study only a small set of models, leaving open the question of whether the conclusion drawn from those experiments would remain valid in other settings. We present the first large scale study o…

    Submitted 4 December, 2019; originally announced December 2019.

  29. arXiv:1910.10699  [pdf, other]

    cs.LG cs.CV stat.ML

    Contrastive Representation Distillation

    Authors: Yonglong Tian, Dilip Krishnan, Phillip Isola

    Abstract: Often we wish to transfer representational knowledge from one neural network to another. Examples include distilling a large network into a smaller one, transferring knowledge from one sensory modality to a second, or ensembling a collection of models into a single estimator. Knowledge distillation, the standard approach to these problems, minimizes the KL divergence between the probabilistic outp…

    Submitted 24 January, 2022; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: ICLR 2020. Project Page: http://hobbitlong.github.io/CRD/, Code: http://github.com/HobbitLong/RepDistiller. Typo fixed in the newest version

  30. arXiv:1909.11140  [pdf, other]

    astro-ph.SR astro-ph.CO astro-ph.HE

    Lick Observatory Supernova Search Follow-Up Program: Photometry Data Release of 93 Type Ia Supernovae

    Authors: Benjamin E. Stahl, WeiKang Zheng, Thomas de Jaeger, Alexei V. Filippenko, Andrew Bigley, Kyle Blanchard, Peter K. Blanchard, Thomas G. Brink, Samantha K. Cargill, Chadwick Casper, Sanyum Channa, Byung Yun Choi, Nick Choksi, Jason Chu, Kelsey I. Clubb, Daniel P. Cohen, Michael Ellison, Edward Falcon, Pegah Fazeli, Kiera Fuller, Mohan Ganeshalingam, Elinor L. Gates, Carolina Gould, Goni Halevi, Kevin T. Hayakawa, et al. (30 additional authors not shown)

    Abstract: We present BVRI and unfiltered light curves of 93 Type Ia supernovae (SNe Ia) from the Lick Observatory Supernova Search (LOSS) follow-up program conducted between 2005 and 2018. Our sample consists of 78 spectroscopically normal SNe Ia, with the remainder divided between distinct subclasses (three SN 1991bg-like, three SN 1991T-like, four SNe Iax, two peculiar, and three super-Chandrasekhar event…

    Submitted 24 September, 2019; originally announced September 2019.

    Comments: 29 pages, 13 figures, accepted for publication in MNRAS

  31. Full control of Co valence in isopolar LaCoO3 / LaTiO3 perovskite heterostructures via interfacial engineering

    Authors: Georgios Araizi-Kanoutas, Jaap Geessinck, Nicolas Gauquelin, Steef Smit, Xanthe Verbeek, Shrawan K. Mishra, Peter Bencok, Christoph Schlueter, Tien-Lin Lee, Dileep Krishnan, Jo Verbeeck, Guus Rijnders, Gertjan Koster, Mark S. Golden

    Abstract: We report charge-transfer up to a single electron per interfacial unit cell across non-polar heterointerfaces from the Mott insulator LaTiO3 to the charge transfer insulator LaCoO3. In high-quality bi- and tri-layer systems grown using pulsed laser deposition, soft X-ray absorption, dichroism and STEM-EELS are used to probe the cobalt 3d-electron count and provide an element-specific investigation…

    Submitted 11 September, 2019; originally announced September 2019.

    Journal ref: Phys. Rev. Materials 4, 026001 (2020)

  32. arXiv:1908.07007  [pdf, other]

    cs.CV

    Boundless: Generative Adversarial Networks for Image Extension

    Authors: Piotr Teterwak, Aaron Sarna, Dilip Krishnan, Aaron Maschinot, David Belanger, Ce Liu, William T. Freeman

    Abstract: Image extension models have broad applications in image editing, computational photography and computer graphics. While image inpainting has been extensively studied in the literature, it is challenging to directly apply the state-of-the-art inpainting methods to image extension as they tend to generate blurry or repetitive pixels with inconsistent semantics. We introduce semantic conditioning to…

    Submitted 19 August, 2019; originally announced August 2019.

  33. arXiv:1907.02610  [pdf, other]

    stat.ML cs.LG

    Adversarial Robustness through Local Linearization

    Authors: Chongli Qin, James Martens, Sven Gowal, Dilip Krishnan, Krishnamurthy Dvijotham, Alhussein Fawzi, Soham De, Robert Stanforth, Pushmeet Kohli

    Abstract: Adversarial training is an effective methodology for training deep neural networks that are robust against adversarial, norm-bounded perturbations. However, the computational cost of adversarial training grows prohibitively as the size of the model and number of input dimensions increase. Further, training against less expensive and therefore weaker adversaries produces models that are robust agai…

    Submitted 10 October, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

  34. arXiv:1906.05849  [pdf, other]

    cs.CV cs.LG

    Contrastive Multiview Coding

    Authors: Yonglong Tian, Dilip Krishnan, Phillip Isola

    Abstract: Humans view the world through many sensory channels, e.g., the long-wavelength light channel, viewed by the left eye, or the high-frequency vibrations channel, heard by the right ear. Each view is noisy and incomplete, but important factors, such as physics, geometry, and semantics, tend to be shared between all views (e.g., a "dog" can be seen, heard, and felt). We investigate the classic hypothe…

    Submitted 18 December, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Code: http://github.com/HobbitLong/CMC/

  35. arXiv:1906.03808  [pdf, other]

    cs.LG stat.ML

    A Closed-Form Learned Pooling for Deep Classification Networks

    Authors: Vighnesh Birodkar, Hossein Mobahi, Dilip Krishnan, Samy Bengio

    Abstract: In modern computer vision tasks, convolutional neural networks (CNNs) are indispensable for image classification tasks due to their efficiency and effectiveness. Part of their superiority compared to other architectures, comes from the fact that a single, local filter is shared across the entire image. However, there are scenarios where we may need to treat spatial locations in non-uniform manner.…

    Submitted 10 June, 2019; originally announced June 2019.

  36. arXiv:1811.00953  [pdf, other]

    cond-mat.mtrl-sci

    Influence of stoichiometry on interfacial conductance in LaAlO$_3$/SrTiO$_3$ grown by 90$^o$ off-axis sputtering

    Authors: Chunhai Yin, Dileep Krishnan, Nicolas Gauquelin, Jo Verbeeck, Jan Aarts

    Abstract: We report on the fabrication of conducting interfaces between LaAlO$_3$ and SrTiO$_3$ by 90$^o$ off-axis sputtering in an Ar atmosphere. At a growth pressure of 0.04 mbar the interface is metallic, with a carrier density of the order of $10^{13}$ cm$^{-2}$ at 3 K. By increasing the growth pressure, we observe an increase of the out-of-plane lattice constants of the LaAlO$_3$ films while the in-pla…

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: 4 pages, 4 figures; manuscript under review

  37. arXiv:1810.00113  [pdf, other]

    stat.ML cs.LG

    Predicting the Generalization Gap in Deep Networks with Margin Distributions

    Authors: Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

    Abstract: As shown in recent research, deep neural networks can perfectly fit randomly labeled data, but with very poor accuracy on held out data. This phenomenon indicates that loss functions such as cross-entropy are not a reliable indicator of generalization. This leads to the crucial question of how generalization gap should be predicted from the training data and network parameters. In this paper, we p…

    Submitted 12 June, 2019; v1 submitted 28 September, 2018; originally announced October 2018.

    Comments: Published in ICLR 2019

  38. arXiv:1803.05598  [pdf, other]

    stat.ML cs.LG

    Large Margin Deep Networks for Classification

    Authors: Gamaleldin F. Elsayed, Dilip Krishnan, Hossein Mobahi, Kevin Regan, Samy Bengio

    Abstract: We present a formulation of deep learning that aims at producing a large margin classifier. The notion of margin, minimum distance to a decision boundary, has served as the foundation of several theoretically profound and empirically successful results for both classification and regression tasks. However, most large margin algorithms are applicable only to shallow models with a preset feature rep…

    Submitted 3 December, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

  39. arXiv:1712.08232  [pdf, other]

    cs.CV

    Smart, Sparse Contours to Represent and Edit Images

    Authors: Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman

    Abstract: We study the problem of reconstructing an image from information stored at contour locations. We show that high-quality reconstructions with high fidelity to the source image can be obtained from sparse input, e.g., comprising less than $6\%$ of image pixels. This is a significant improvement over existing contour-based reconstruction methods that require much denser input to capture subtle textur…

    Submitted 9 April, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: Accepted to CVPR'18; Project page: contour2im.github.io

  40. arXiv:1701.04851  [pdf, other]

    cs.CV stat.ML

    Synthesizing Normalized Faces from Facial Identity Features

    Authors: Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman

    Abstract: We present a method for synthesizing a frontal, neutral-expression image of a person's face given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this invariance,…

    Submitted 17 October, 2017; v1 submitted 17 January, 2017; originally announced January 2017.

  41. arXiv:1612.05424  [pdf, other]

    cs.CV

    Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

    Authors: Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan

    Abstract: Collecting well-annotated image datasets to train modern machine learning algorithms is prohibitively expensive for many tasks. One appealing alternative is rendering synthetic data where ground-truth annotations are generated automatically. Unfortunately, models trained purely on rendered images often fail to generalize to real images. To address this shortcoming, prior work introduced unsupervis…

    Submitted 23 August, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: Final CVPR 2017 paper and supplementary material

  42. arXiv:1611.08573  [pdf]

    cs.DC

    The Marriage of Incremental and Approximate Computing

    Authors: Dhanya R Krishnan

    Abstract: Most data analytics systems that require low-latency execution and efficient utilization of computing resources, increasingly adopt two computational paradigms, namely, incremental and approximate computing. Incremental computation updates the output incrementally instead of re-computing everything from scratch for successive runs of a job with input changes. Approximate computation returns an app…

    Submitted 25 November, 2016; originally announced November 2016.

    Comments: http://dl.acm.org/citation.cfm?id=2883026

  43. arXiv:1608.06019  [pdf, other]

    cs.CV

    Domain Separation Networks

    Authors: Konstantinos Bousmalis, George Trigeorgis, Nathan Silberman, Dilip Krishnan, Dumitru Erhan

    Abstract: The cost of large scale data collection and annotation often makes the application of machine learning algorithms to new tasks or datasets prohibitively expensive. One approach circumventing this cost is training models on synthetic data where annotations are provided automatically. Despite their appeal, such models often fail to generalize from synthetic to real images, necessitating domain adapt…

    Submitted 21 August, 2016; originally announced August 2016.

    Comments: This work will be presented at NIPS 2016

  44. arXiv:1511.06811  [pdf, other]

    cs.LG cs.CV

    Learning visual groups from co-occurrences in space and time

    Authors: Phillip Isola, Daniel Zoran, Dilip Krishnan, Edward H. Adelson

    Abstract: We propose a self-supervised framework that learns to group visual entities based on their rate of co-occurrence in space and time. To model statistical dependencies between the entities, we set up a simple binary classification problem in which the goal is to predict if two visual primitives occur in the same spatial or temporal context. We apply this framework to three domains: learning patch af…

    Submitted 20 November, 2015; originally announced November 2015.

  45. arXiv:1311.4029  [pdf, other]

    cs.CV

    Blind Deconvolution with Non-local Sparsity Reweighting

    Authors: Dilip Krishnan, Joan Bruna, Rob Fergus

    Abstract: Blind deconvolution has made significant progress in the past decade. Most successful algorithms are classified either as Variational or Maximum a-Posteriori ($MAP$). In spite of the superior theoretical justification of variational techniques, carefully constructed $MAP$ algorithms have proven equally effective in practice. In this paper, we show that all successful $MAP$ and variational algorith…

    Submitted 16 June, 2014; v1 submitted 16 November, 2013; originally announced November 2013.

    Comments: 19 pages

  46. arXiv:0912.0982  [pdf]

    cs.SE

    Ethics Understanding of Software Professional In Risk Reducing Reusability Coding Using Inclusion Set Theory

    Authors: G. Singaravel, Dr. V. Palanisamy, Dr. A. Krishnan

    Abstract: The technical skill or ability of an individual is different to person in software developments of projects. So, it is necessary to identify the talent and attitude of an individual contribution can be uniformly distributed to the different phases of software development cycle. The line of code analysis metrics to understanding the various skills of the programmers in code development. By using…

    Submitted 5 December, 2009; originally announced December 2009.

    Comments: 5 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS November 2009, ISSN 1947 5500, http://sites.google.com/site/ijcsis/

    Report number: ISSN 1947 5500

    Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 6, No. 2, pp. 189-193, November 2009, USA