Skip to main content

Showing 1–50 of 50 results for author: Acuna, D

.
  1. arXiv:2506.15237  [pdf, ps, other

    cs.DL physics.data-an physics.soc-ph

    Dissecting the gender divide: Authorship and acknowledgment in scientific publications

    Authors: Keigo Kusumegi, Yukie Sano, Daniel E. Acuña

    Abstract: The issue of gender bias in scientific publications has been the subject of ongoing debate. One aspect of this debate concerns whether women receive equal credit for their contributions compared to men. Conventional wisdom suggests that women are more likely to be acknowledged than listed as co-authors, a role that carries greater prestige. Here, we analyze data from hundreds of thousands of scien… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 29 pages, 8 figures

  2. arXiv:2506.08927  [pdf, ps, other

    cs.CV cs.AI cs.CL

    Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions

    Authors: David Acuna, Ximing Lu, Jaehun Jung, Hyunwoo Kim, Amlan Kar, Sanja Fidler, Yejin Choi

    Abstract: Recent research in vision-language models (VLMs) has centered around the possibility of equipping them with implicit long-form chain-of-thought reasoning -- akin to the success observed in language models -- via distillation and reinforcement learning. But what about the non-reasoning models already trained and deployed across the internet? Should we simply abandon them, or is there hope for a sea… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2505.20161  [pdf, other

    cs.LG cs.AI cs.CL

    Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning

    Authors: Jaehun Jung, Seungju Han, Ximing Lu, Skyler Hallinan, David Acuna, Shrimai Prabhumoye, Mostafa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Yejin Choi

    Abstract: Effective generalization in language models depends critically on the diversity of their training data. Yet existing diversity metrics often fall short of this goal, relying on surface-level heuristics that are decoupled from model behavior. This motivates us to ask: What kind of diversity in training data actually drives generalization in language models -- and how can we measure and amplify it?… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  4. arXiv:2505.11718  [pdf, ps, other

    cs.AI

    REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning

    Authors: Pawin Taechoyotin, Daniel Acuna

    Abstract: AI-based peer review systems tend to produce shallow and overpraising suggestions compared to human feedback. Here, we evaluate how well a reasoning LLM trained with multi-objective reinforcement learning (REMOR) can overcome these limitations. We start by designing a multi-aspect reward function that aligns with human evaluation of reviews. The aspects are related to the review itself (e.g., crit… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 18 pages, 6 figures

  5. arXiv:2504.15362  [pdf, other

    cs.CV cs.CL cs.LG

    LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception

    Authors: Yuan-Hong Liao, Sven Elflein, Liu He, Laura Leal-Taixé, Yejin Choi, Sanja Fidler, David Acuna

    Abstract: Recent reasoning models through test-time scaling have demonstrated that long chain-of-thoughts can unlock substantial performance boosts in hard reasoning tasks such as math and code. However, the benefit of such long thoughts for system-2 reasoning is relatively less explored in other domains such as perceptual tasks where shallower, system-1 reasoning seems sufficient. In this paper, we introdu… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 24 pages, 10 figures, in submission. Project page: https://andrewliao11.github.io/LongPerceptualThoughts

  6. arXiv:2504.04383  [pdf, other

    cs.AI cs.CL cs.LG

    Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning

    Authors: Ximing Lu, Seungju Han, David Acuna, Hyunwoo Kim, Jaehun Jung, Shrimai Prabhumoye, Niklas Muennighoff, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Yejin Choi

    Abstract: Large reasoning models exhibit remarkable reasoning capabilities via long, elaborate reasoning trajectories. Supervised fine-tuning on such reasoning traces, also known as distillation, can be a cost-effective way to boost reasoning capabilities of student models. However, empirical observations reveal that these reasoning trajectories are often suboptimal, switching excessively between different… ▽ More

    Submitted 15 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

    Comments: Code and data will be publicly released upon internal approval

  7. arXiv:2409.09788  [pdf, other

    cs.CV cs.CL

    Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models

    Authors: Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna

    Abstract: Despite recent advances demonstrating vision-language models' (VLMs) abilities to describe complex relationships in images using natural language, their capability to quantitatively reason about object sizes and distances remains underexplored. In this work, we introduce a manually annotated benchmark, Q-Spatial Bench, with 271 questions across five categories designed for quantitative spatial rea… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

    Comments: 20 pages, 13 figures

  8. arXiv:2408.09702  [pdf, other

    cs.CV cs.AI cs.GR

    Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

    Authors: Ruofan Liang, Zan Gojcic, Merlin Nimier-David, David Acuna, Nandita Vijaykumar, Sanja Fidler, Zian Wang

    Abstract: The correct insertion of virtual objects in images of real-world scenes requires a deep understanding of the scene's lighting, geometry and materials, as well as the image formation process. While recent large-scale diffusion models have shown strong generative and inpainting capabilities, we find that current models do not sufficiently "understand" the scene shown in a single picture to generate… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: ECCV 2024, Project page: https://research.nvidia.com/labs/toronto-ai/DiPIR/

  9. Dataset Mention Extraction in Scientific Articles Using Bi-LSTM-CRF Model

    Authors: Tong Zeng, Daniel Acuna

    Abstract: Datasets are critical for scientific research, playing an important role in replication, reproducibility, and efficiency. Researchers have recently shown that datasets are becoming more important for science to function properly, even serving as artifacts of study themselves. However, citing datasets is not a common or standard practice in spite of recent efforts by data repositories and funding a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Journal ref: Rich Search and Discovery for Research Datasets, 2020, 158-165

  10. arXiv:2405.12840  [pdf, other

    cs.IR cs.DL cs.LG

    GotFunding: A grant recommendation system based on scientific articles

    Authors: Tong Zeng, Daniel E. Acuna

    Abstract: Obtaining funding is an important part of becoming a successful scientist. Junior faculty spend a great deal of time finding the right agencies and programs that best match their research profile. But what are the factors that influence the best publication--grant matching? Some universities might employ pre-award personnel to understand these factors, but not all institutions can afford to hire t… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the Association for Information Science and Technology (2020), Volume 57, Issue 1, e323

  11. Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models

    Authors: Tong Zeng, Daniel E. Acuna

    Abstract: Scientist learn early on how to cite scientific sources to support their claims. Sometimes, however, scientists have challenges determining where a citation should be situated -- or, even worse, fail to cite a source altogether. Automatically detecting sentences that need a citation (i.e., citation worthiness) could solve both of these issues, leading to more robust and well-constructed scientific… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Journal ref: Scientometrics 124, 399-428 (2020)

  12. arXiv:2404.10765  [pdf, other

    cs.CV

    RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

    Authors: Ashkan Mirzaei, Riccardo De Lutio, Seung Wook Kim, David Acuna, Jonathan Kelly, Sanja Fidler, Igor Gilitschenski, Zan Gojcic

    Abstract: Neural reconstruction approaches are rapidly emerging as the preferred representation for 3D scenes, but their limited editability is still posing a challenge. In this work, we propose an approach for 3D scene inpainting -- the task of coherently replacing parts of the reconstructed scene with desired content. Scene inpainting is an inherently ill-posed task as there exist many solutions that plau… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Project page: https://reffusion.github.io

  13. arXiv:2404.06510  [pdf, other

    cs.CV

    Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?

    Authors: Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna

    Abstract: Enhancing semantic grounding abilities in Vision-Language Models (VLMs) often involves collecting domain-specific training data, refining the network architectures, or modifying the training recipes. In this work, we venture into an orthogonal direction and explore whether VLMs can improve their semantic grounding by "receiving" feedback, without requiring in-domain data, fine-tuning, or modificat… ▽ More

    Submitted 26 May, 2025; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2025. 22 pages, 16 figures

  14. arXiv:2401.10268  [pdf

    cs.CY cs.AI cs.SI

    The complementary contributions of academia and industry to AI research

    Authors: Lizhen Liang, Han Zhuang, James Zou, Daniel E. Acuna

    Abstract: Artificial intelligence (AI) has seen fast paced development in industry and academia. However, striking recent advances by industry have stunned the field, inviting a fresh perspective on the role of academic research on this progress. Here, we characterize the impact and type of AI produced by both environments over the last 25 years and establish several patterns. We find that articles publishe… ▽ More

    Submitted 18 September, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 35 pages, 11 figures

  15. arXiv:2310.09453   

    cs.SI

    Effects of Same-Race Mentorship Preferences on Academic Performance and Survival

    Authors: Meijun Liu, Yi Bu, Daifeng Li, Ying Ding, Daniel E. Acuna

    Abstract: Same-race mentorship preference refers to mentors or mentees forming connections significantly influenced by a shared race. Although racial diversity in science has been well-studied and linked to favorable outcomes, the extent and effects of same-race mentorship preferences remain largely underexplored. Here, we analyze 465,355 mentor-mentee pairs from more than 60 research areas over the last 70… ▽ More

    Submitted 4 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 1. After further evaluating the race prediction method, we observed unsatisfactory accuracy and F1 scores. The study's findings could be impacted by these subpar predictions. 2. Our study incorporates both US and non-US samples, revealing that non-US samples may introduce outliers and distort the results. We recognize that the study's findings and conclusions might be affected by data quality

  16. arXiv:2310.03547  [pdf, other

    cond-mat.soft cond-mat.mtrl-sci physics.app-ph

    Auxetic Granular Metamaterials

    Authors: Daan Haver, Daniel Acuña, Shahram Janbaz, Edan Lerner, Gustavo Düring, Corentin Coulais

    Abstract: The flowing, jamming and avalanche behavior of granular materials is satisfyingly universal and vexingly hard to tune: a granular flow is typically intermittent and will irremediably jam if too confined. Here, we show that granular metamaterials made from particles with a negative Poisson's ratio yield more easily and flow more smoothly than ordinary granular materials. We first create a collectio… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  17. arXiv:2307.07487  [pdf, other

    cs.CV cs.LG

    DreamTeacher: Pretraining Image Backbones with Deep Generative Models

    Authors: Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler

    Abstract: In this work, we introduce a self-supervised feature representation learning framework DreamTeacher that utilizes generative networks for pre-training downstream image backbones. We propose to distill knowledge from a trained generative model into standard image backbones that have been well engineered for specific perception tasks. We investigate two types of knowledge distillation: 1) distilling… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Project page: https://research.nvidia.com/labs/toronto-ai/DreamTeacher/

  18. arXiv:2306.15804  [pdf

    physics.soc-ph cs.CY

    The Impact of Heterogeneous Shared Leadership in Scientific Teams

    Authors: Huimin Xu, Meijun Liu, Yi Bu, Shujing Sun, Yi Zhang, Chenwei Zhang, Daniel E. Acuna, Steven Gray, Eric Meyer, Ying Ding

    Abstract: Leadership is evolving dynamically from an individual endeavor to shared efforts. This paper aims to advance our understanding of shared leadership in scientific teams. We define three kinds of leaders, junior (10-15), mid (15-20), and senior (20+) based on career age. By considering the combinations of any two leaders, we distinguish shared leadership as heterogeneous when leaders are in differen… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  19. arXiv:2302.04832  [pdf, other

    cs.CV

    Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting

    Authors: Viraj Prabhu, David Acuna, Andrew Liao, Rafid Mahmood, Marc T. Law, Judy Hoffman, Sanja Fidler, James Lucas

    Abstract: Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We st… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  20. arXiv:2212.06933  [pdf, other

    cs.CL cs.AI cs.IR

    Paraphrase Identification with Deep Learning: A Review of Datasets and Methods

    Authors: Chao Zhou, Cheng Qiu, Lizhen Liang, Daniel E. Acuna

    Abstract: The rapid progress of Natural Language Processing (NLP) technologies has led to the widespread availability and effectiveness of text generation tools such as ChatGPT and Claude. While highly useful, these technologies also pose significant risks to the credibility of various media forms if they are employed for paraphrased plagiarism -- one of the most subtle forms of content misuse in scientific… ▽ More

    Submitted 7 October, 2024; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 45 pages, 6 figures, 7 tables, 143 references

  21. arXiv:2208.09480  [pdf, other

    cs.CV

    Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion

    Authors: Zian Wang, Wenzheng Chen, David Acuna, Jan Kautz, Sanja Fidler

    Abstract: We consider the challenging problem of outdoor lighting estimation for the goal of photorealistic virtual object insertion into photographs. Existing works on outdoor lighting estimation typically simplify the scene lighting into an environment map which cannot capture the spatially-varying lighting effects in outdoor scenes. In this work, we propose a neural approach that estimates the 5D HDR lig… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: Webpage: https://nv-tlabs.github.io/outdoor-ar/

    Journal ref: ECCV 2022

  22. arXiv:2207.01725  [pdf, other

    cs.CV cs.LG

    How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

    Authors: Rafid Mahmood, James Lucas, David Acuna, Daiqing Li, Jonah Philion, Jose M. Alvarez, Zhiding Yu, Sanja Fidler, Marc T. Law

    Abstract: Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with… ▽ More

    Submitted 13 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted to CVPR 2022

  23. arXiv:2206.09386  [pdf, other

    cs.LG cs.CV

    Scalable Neural Data Server: A Data Recommender for Transfer Learning

    Authors: Tianshi Cao, Sasha Doubov, David Acuna, Sanja Fidler

    Abstract: Absence of large-scale labeled data in the practitioner's target domain can be a bottleneck to applying machine learning algorithms in practice. Transfer learning is a popular strategy for leveraging additional data to improve the downstream performance, but finding the most relevant data to transfer from can be challenging. Neural Data Server (NDS), a search engine that recommends relevant data f… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: Neurips 2021

    Journal ref: Advances in Neural Information Processing Systems, Volume 34, pages 8984-8997, year 2021

  24. arXiv:2205.08756  [pdf

    cs.DL

    Team formation and team performance: The balance between team freshness and repeat collaboration

    Authors: Meijun Liu, Ajay Jaiswal, Yi Bu, Chao Min, Sijie Yang, Zhibo Liu, Daniel Daniel Acuña, Ying Ding

    Abstract: Incorporating fresh members in teams is considered a pathway to team creativity. However, whether freshness improves team performance or not remains unclear, as well as the optimal involvement of fresh members for team performance. This study uses a group of authors on the byline of a publication as a proxy for a scientific team. We extend an indicator, i.e., team freshness, to measure the extent… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  25. arXiv:2203.12800  [pdf

    cs.DL cs.IR

    Predicting the longevity of resources shared in scientific publications

    Authors: Daniel E. Acuna, Jian Jian, Tong Zeng, Lizhen Liang, Han Zhuang

    Abstract: Research has shown that most resources shared in articles (e.g., URLs to code or data) are not kept up to date and mostly disappear from the web after some years (Zeng et al., 2019). Little is known about the factors that differentiate and predict the longevity of these resources. This article explores a range of explanatory features related to the publication venue, authors, references, and where… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  26. arXiv:2202.05352  [pdf, other

    cs.LG cs.CV cs.GT

    Domain Adversarial Training: A Game Perspective

    Authors: David Acuna, Marc T Law, Guojun Zhang, Sanja Fidler

    Abstract: The dominant line of work in domain adaptation has focused on learning invariant representations using domain-adversarial training. In this paper, we interpret this approach from a game theoretical perspective. Defining optimal solutions in domain-adversarial training as a local Nash equilibrium, we show that gradient descent in domain-adversarial training can violate the asymptotic convergence gu… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: ICLR 2022

  27. arXiv:2201.08459  [pdf, other

    cs.LG cs.AI

    Federated Learning with Heterogeneous Architectures using Graph HyperNetworks

    Authors: Or Litany, Haggai Maron, David Acuna, Jan Kautz, Gal Chechik, Sanja Fidler

    Abstract: Standard Federated Learning (FL) techniques are limited to clients with identical network architectures. This restricts potential use-cases like cross-platform training or inter-organizational collaboration when both data privacy and architectural proprietary are required. We propose a new FL framework that accommodates heterogeneous client architecture by adopting a graph hypernetwork for paramet… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  28. arXiv:2111.07971  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation

    Authors: David Acuna, Jonah Philion, Sanja Fidler

    Abstract: Autonomous driving relies on a huge volume of real-world data to be labeled to high precision. Alternative solutions seek to exploit driving simulators that can generate large amounts of labeled data with a plethora of content variations. However, the domain gap between the synthetic and real data remains, raising the following important question: What are the best ways to utilize a self-driving s… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021; Project website: https://nv-tlabs.github.io/simulation-strategies/

  29. arXiv:2110.09663  [pdf

    cs.IR cs.DL

    EILEEN: A recommendation system for scientific publications and grants

    Authors: Daniel E. Acuna, Kartik Nagre, Priya Matnani

    Abstract: Finding relevant scientific articles is crucial for advancing knowledge. Recommendation systems are helpful for such purpose, although they have only been applied to science recently. This article describes EILEEN (Exploratory Innovator of LitEraturE Networks), a recommendation system for scientific publications and grants with open source code and datasets. We describe EILEEN's architecture for i… ▽ More

    Submitted 23 March, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: 16 pages, 3 figures, 2 tables

  30. arXiv:2106.11344  [pdf, other

    cs.LG cs.AI cs.CV

    f-Domain-Adversarial Learning: Theory and Algorithms

    Authors: David Acuna, Guojun Zhang, Marc T. Law, Sanja Fidler

    Abstract: Unsupervised domain adaptation is used in many machine learning applications where, during training, a model has access to unlabeled data in the target domain, and a related labeled dataset. In this paper, we introduce a novel and general domain-adversarial framework. Specifically, we derive a novel generalization bound for domain adaptation that exploits a new measure of discrepancy between distr… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  31. arXiv:2106.06487  [pdf, other

    cs.DL cs.CY

    A dataset of mentorship in science with semantic and demographic estimations

    Authors: Qing Ke, Lizhen Liang, Ying Ding, Stephen V. David, Daniel E. Acuna

    Abstract: Mentorship in science is crucial for topic choice, career decisions, and the success of mentees and mentors. Typically, researchers who study mentorship use article co-authorship and doctoral dissertation datasets. However, available datasets of this type focus on narrow selections of fields and miss out on early career and non-publication-related interactions. Here, we describe MENTORSHIP, a crow… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Data can be found at https://doi.org/10.5281/zenodo.4917086

  32. arXiv:2102.08431  [pdf, other

    cs.LG cs.GT

    Complex Momentum for Optimization in Games

    Authors: Jonathan Lorraine, David Acuna, Paul Vicol, David Duvenaud

    Abstract: We generalize gradient descent with momentum for optimization in differentiable games to have complex-valued momentum. We give theoretical motivation for our method by proving convergence on bilinear zero-sum games for simultaneous and alternating updates. Our method gives real-valued parameter updates, making it a drop-in replacement for standard optimizers. We empirically demonstrate that comple… ▽ More

    Submitted 1 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  33. arXiv:2101.12352  [pdf, other

    cond-mat.soft cond-mat.mtrl-sci

    Auxetic behavior on demand: a three steps recipe for new designs

    Authors: Daniel Acuna, Francisco Gutiérrez, Rodrigo Silva, Humberto Palza, Alvaro S. Nunez, Gustavo Düring

    Abstract: Despite their outstanding mechanical properties, with many industrial applications, a rational and systematic design of new and controlled auxetic materials remains poorly developed. Here a unified framework is established to describe bidimensional perfect auxetics with potential use in the design of new materials. Perfect auxetics are characterized by a Poisson's ratio $ν=-1$ over a finite strain… ▽ More

    Submitted 24 June, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: Supporting Video 1 at https://youtu.be/ErAPafo2GkA , Supporting Video 2 at https://youtu.be/KhGkYwG2Btw , Supporting Video 3 at https://youtu.be/SLD0-6K2o0g , Supporting Video 4 at https://youtu.be/gNYfVsA4KGk

  34. arXiv:2003.00878  [pdf, other

    cs.CV cs.LG stat.ML

    Estimating a Null Model of Scientific Image Reuse to Support Research Integrity Investigations

    Authors: Daniel E. Acuna, Ziyue Xiang

    Abstract: When there is a suspicious figure reuse case in science, research integrity investigators often find it difficult to rebut authors claiming that "it happened by chance". In other words, when there is a "collision" of image features, it is difficult to justify whether it appears rarely or not. In this article, we provide a method to predict the rarity of an image feature by statistically estimating… ▽ More

    Submitted 21 February, 2020; originally announced March 2020.

  35. arXiv:2001.07799  [pdf, other

    cs.CV eess.IV

    Scientific Image Tampering Detection Based On Noise Inconsistencies: A Method And Datasets

    Authors: Ziyue Xiang, Daniel E. Acuna

    Abstract: Scientific image tampering is a problem that affects not only authors but also the general perception of the research community. Although previous researchers have developed methods to identify tampering in natural images, these methods may not thrive under the scientific setting as scientific images have different statistics, format, quality, and intentions. Therefore, we propose a scientific-ima… ▽ More

    Submitted 4 March, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

  36. arXiv:2001.05917  [pdf, other

    cs.IR

    Assigning credit to scientific datasets using article citation networks

    Authors: Tong Zeng, Longfeng Wu, Sarah Bratt, Daniel E. Acuna

    Abstract: A citation is a well-established mechanism for connecting scientific artifacts. Citation networks are used by citation analysis for a variety of reasons, prominently to give credit to scientists' work. However, because of current citation practices, scientists tend to cite only publications, leaving out other types of artifacts such as datasets. Datasets then do not get appropriate credit even tho… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: PII: S1751-1577(19)30184-1

  37. arXiv:2001.02799  [pdf, other

    cs.CV cs.LG

    Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data

    Authors: Xi Yan, David Acuna, Sanja Fidler

    Abstract: Transfer learning has proven to be a successful technique to train deep learning models in the domains where little training data is available. The dominant approach is to pretrain a model on a large generic dataset such as ImageNet and finetune its weights on the target domain. However, in the new era of an ever-increasing number of massive datasets, selecting the relevant data for pretraining is… ▽ More

    Submitted 31 March, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

  38. arXiv:1912.10818  [pdf, other

    cs.CL cs.AI cs.CY cs.NE

    Artificial mental phenomena: Psychophysics as a framework to detect perception biases in AI models

    Authors: Lizhen Liang, Daniel E. Acuna

    Abstract: Detecting biases in artificial intelligence has become difficult because of the impenetrable nature of deep learning. The central difficulty is in relating unobservable phenomena deep inside models with observable, outside quantities that we can measure from inputs and outputs. For example, can we detect gendered perceptions of occupations (e.g., female librarian, male electrician) using questions… ▽ More

    Submitted 15 December, 2019; originally announced December 2019.

    Comments: FAT Conference 2020

  39. arXiv:1911.02712  [pdf

    cs.DL

    The effect of novelty on the future impact of scientific grants

    Authors: Han Zhuang, Daniel E. Acuna

    Abstract: Government funding agencies and foundations tend to perceive novelty as necessary for scientific impact and hence prefer to fund novel instead of incremental projects. Evidence linking novelty and the eventual impact of a grant is surprisingly scarce, however. Here, we examine this link by analyzing 920,000 publications funded by 170,000 grants from the National Science Foundation (NSF) and the Na… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

  40. arXiv:1910.02055  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Turtle Graphics for Modeling City Road Layouts

    Authors: Hang Chu, Daiqing Li, David Acuna, Amlan Kar, Maria Shugrina, Xinkai Wei, Ming-Yu Liu, Antonio Torralba, Sanja Fidler

    Abstract: We propose Neural Turtle Graphics (NTG), a novel generative model for spatial graphs, and demonstrate its applications in modeling city road layouts. Specifically, we represent the road layout using a graph where nodes in the graph represent control points and edges in the graph represent road segments. NTG is a sequential generative model parameterized by a neural network. It iteratively generate… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: ICCV-2019 Oral

  41. arXiv:1907.05740  [pdf, other

    cs.CV cs.LG

    Gated-SCNN: Gated Shape CNNs for Semantic Segmentation

    Authors: Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler

    Abstract: Current state-of-the-art methods for image segmentation form a dense image representation where the color, shape and texture information are all processed together inside a deep CNN. This however may not be ideal as they contain very different type of information relevant for recognition. Here, we propose a new two-stream CNN architecture for semantic segmentation that explicitly wires shape infor… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: Project Website: https://nv-tlabs.github.io/GSCNN/

  42. arXiv:1907.00962  [pdf, other

    cs.CL

    Claim Extraction in Biomedical Publications using Deep Discourse Model and Transfer Learning

    Authors: Titipat Achakulvisut, Chandra Bhagavatula, Daniel Acuna, Konrad Kording

    Abstract: Claims are a fundamental unit of scientific discourse. The exponential growth in the number of scientific publications makes automatic claim extraction an important problem for researchers who are overwhelmed by this information overload. Such an automated claim extraction system is useful for both manual and programmatic exploration of scientific knowledge. In this paper, we introduce a new datas… ▽ More

    Submitted 16 January, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: 11 pages, 6 figures

  43. arXiv:1904.11621  [pdf, other

    cs.CV cs.AI cs.GR

    Meta-Sim: Learning to Generate Synthetic Datasets

    Authors: Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler

    Abstract: Training models to high-end performance requires availability of large labeled datasets, which are expensive to get. The goal of our work is to automatically synthesize labeled datasets that are relevant for a downstream task. We propose Meta-Sim, which learns a generative model of synthetic scenes, and obtain images as well as its corresponding ground-truth via a graphics engine. We parametrize o… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.

    Comments: Webpage: https://nv-tlabs.github.io/meta-sim/

  44. arXiv:1904.07934  [pdf, other

    cs.CV cs.AI

    Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations

    Authors: David Acuna, Amlan Kar, Sanja Fidler

    Abstract: We tackle the problem of semantic boundary prediction, which aims to identify pixels that belong to object(class) boundaries. We notice that relevant datasets consist of a significant level of label noise, reflecting the fact that precise annotations are laborious to get and thus annotators trade-off quality with efficiency. We aim to learn sharp and precise semantic boundaries by explicitly reaso… ▽ More

    Submitted 9 June, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: Accepted as a CVPR 2019 oral paper (Project Page: https://nv-tlabs.github.io/STEAL/)

    Journal ref: CVPR 2019

  45. arXiv:1810.10093  [pdf, other

    cs.CV

    Structured Domain Randomization: Bridging the Reality Gap by Context-Aware Synthetic Data

    Authors: Aayush Prakash, Shaad Boochoon, Mark Brophy, David Acuna, Eric Cameracci, Gavriel State, Omer Shapira, Stan Birchfield

    Abstract: We present structured domain randomization (SDR), a variant of domain randomization (DR) that takes into account the structure and context of the scene. In contrast to DR, which places objects and distractors randomly according to a uniform probability distribution, SDR places objects and distractors randomly according to probability distributions that arise from the specific problem at hand. In t… ▽ More

    Submitted 18 August, 2020; v1 submitted 23 October, 2018; originally announced October 2018.

    Comments: ICRA 2019; for video, see https://youtu.be/1WdjWJYx9AY

  46. arXiv:1804.06516  [pdf, other

    cs.CV

    Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization

    Authors: Jonathan Tremblay, Aayush Prakash, David Acuna, Mark Brophy, Varun Jampani, Cem Anil, Thang To, Eric Cameracci, Shaad Boochoon, Stan Birchfield

    Abstract: We present a system for training deep neural networks for object detection using synthetic images. To handle the variability in real-world data, the system relies upon the technique of domain randomization, in which the parameters of the simulator$-$such as lighting, pose, object textures, etc.$-$are randomized in non-realistic ways to force the neural network to learn the essential features of th… ▽ More

    Submitted 23 April, 2018; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: CVPR 2018 Workshop on Autonomous Driving

  47. arXiv:1803.09693  [pdf, other

    cs.CV

    Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++

    Authors: David Acuna, Huan Ling, Amlan Kar, Sanja Fidler

    Abstract: Manually labeling datasets with object masks is extremely time consuming. In this work, we follow the idea of Polygon-RNN to produce polygonal annotations of objects interactively using humans-in-the-loop. We introduce several important improvements to the model: 1) we design a new CNN encoder architecture, 2) show how to effectively train the model with Reinforcement Learning, and 3) significantl… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

    Comments: Accepted to CVPR 2018 (http://www.cs.toronto.edu/polyrnn/)

  48. The Social Structure of Consensus in Scientific Review

    Authors: Misha Teplitskiy, Daniel Acuna, Aida Elamrani-Raoult, Konrad Kording, James Evans

    Abstract: Personal connections between creators and evaluators of scientific works are ubiquitous, and the possibility of bias ever-present. Although connections have been shown to bias prospective judgments of (uncertain) future performance, it is unknown whether such biases occur in the much more concrete task of assessing the scientific validity of already completed work, and if so, why. This study prese… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

    Journal ref: Research Policy. 2018

  49. Science Concierge: A fast content-based recommendation system for scientific publications

    Authors: Titipat Achakulvisut, Daniel E. Acuna, Tulakan Ruangrong, Konrad Kording

    Abstract: Finding relevant publications is important for scientists who have to cope with exponentially increasing numbers of scholarly material. Algorithms can help with this task as they help for music, movie, and product recommendations. However, we know little about the performance of these algorithms with scholarly material. Here, we develop an algorithm, and an accompanying Python library, that implem… ▽ More

    Submitted 11 May, 2016; v1 submitted 4 April, 2016; originally announced April 2016.

    Comments: 12 pages, 5 figures

  50. arXiv:1402.0422  [pdf, other

    stat.ML cs.IR cs.LG physics.soc-ph

    A high-reproducibility and high-accuracy method for automated topic classification

    Authors: Andrea Lancichinetti, M. Irmak Sirer, Jane X. Wang, Daniel Acuna, Konrad Körding, Luís A. Nunes Amaral

    Abstract: Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requires algorithms that extract and record metadata on unstructured text documents. Assigning topics to documents will enable intelligent search, statistical characterization, and meaningful classification. Latent Dirichlet allocation (LDA) is the state-of-the-art in topic classification. Here, we perf… ▽ More

    Submitted 3 February, 2014; originally announced February 2014.

    Comments: 23 pages, 24 figures