Skip to main content

Showing 1–50 of 55 results for author: Scott, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.01239  [pdf, ps, other

    cs.NI

    A Full-Stack Platform Architecture for Self-Organised Social Coordination

    Authors: Matthew Scott, Jeremy Pitt

    Abstract: To mitigate the restrictive centralising and monopolistic tendencies of platformisation, we aim to empower local communities by democratising platforms for self-organised social coordination. Our approach is to develop an open-source, full-stack architecture for platform development that supports ease of distribution and cloning, generativity, and a variety of hosting options. The architecture con… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 10 pages, 10 figures, 2 tables

  2. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  3. arXiv:2504.01046  [pdf, other

    stat.ML cs.IT cs.LG eess.SP math.PR

    Denoising guarantees for optimized sampling schemes in compressed sensing

    Authors: Yaniv Plan, Matthew S. Scott, Xia Sheng, Ozgur Yilmaz

    Abstract: Compressed sensing with subsampled unitary matrices benefits from \emph{optimized} sampling schemes, which feature improved theoretical guarantees and empirical performance relative to uniform subsampling. We provide, in a first of its kind in compressed sensing, theoretical guarantees showing that the error caused by the measurement noise vanishes with an increasing number of measurements for opt… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: 29 pages, 4 figures. Submitted for review to the SIAM Journal on Mathematics of Data Science (SIMODS). Author roles: Authors listed in alphabetic order. MS was primarily responsible for developing the theory, the sparsity-based numerics, and writing the paper. XS was primarily responsible for training the generative model and creating and presenting the related numerics

    MSC Class: 94A20; 94A12; 68T07

  4. arXiv:2411.16478  [pdf, other

    cs.LG cs.DB

    Distributed, communication-efficient, and differentially private estimation of KL divergence

    Authors: Mary Scott, Sayan Biswas, Graham Cormode, Carsten Maple

    Abstract: A key task in managing distributed, sensitive data is to measure the extent to which a distribution changes. Understanding this drift can effectively support a variety of federated learning and analytics tasks. However, in many practical settings sharing such information can be undesirable (e.g., for privacy concerns) or infeasible (e.g., for high communication costs). In this work, we describe no… ▽ More

    Submitted 28 November, 2024; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: 28 pages, 5 figures

  5. arXiv:2411.04579  [pdf, other

    cs.LG cs.DB

    Towards Robust Federated Analytics via Differentially Private Measurements of Statistical Heterogeneity

    Authors: Mary Scott, Graham Cormode, Carsten Maple

    Abstract: Statistical heterogeneity is a measure of how skewed the samples of a dataset are. It is a common problem in the study of differential privacy that the usage of a statistically heterogeneous dataset results in a significant loss of accuracy. In federated scenarios, statistical heterogeneity is more likely to happen, and so the above problem is even more pressing. We explore the three most promisin… ▽ More

    Submitted 28 November, 2024; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: 26 pages, 6 tables, 1 figure

  6. Bridging Research and Practice Through Conversation: Reflecting on Our Experience

    Authors: Mayra Russo, Mackenzie Jorgensen, Kristen M. Scott, Wendy Xu, Di H. Nguyen, Jessie Finocchiaro, Matthew Olckers

    Abstract: While some research fields have a long history of collaborating with domain experts outside academia, many quantitative researchers do not have natural avenues to meet experts in areas where the research is later deployed. We explain how conversations -- interviews without a specific research objective -- can bridge research and practice. Using collaborative autoethnography, we reflect on our expe… ▽ More

    Submitted 17 September, 2024; v1 submitted 25 August, 2024; originally announced September 2024.

    Comments: Accepted for publication at the fourth ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO'24)

  7. arXiv:2407.16496  [pdf, other

    cs.CY cs.AI cs.LG

    Articulation Work and Tinkering for Fairness in Machine Learning

    Authors: Miriam Fahimi, Mayra Russo, Kristen M. Scott, Maria-Esther Vidal, Bettina Berendt, Katharina Kinder-Kurlanda

    Abstract: The field of fair AI aims to counter biased algorithms through computational modelling. However, it faces increasing criticism for perpetuating the use of overly technical and reductionist methods. As a result, novel approaches appear in the field to address more socially-oriented and interdisciplinary (SOI) perspectives on fair AI. In this paper, we take this dynamic as the starting point to stud… ▽ More

    Submitted 28 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    ACM Class: K.4.3; I.2.0

  8. arXiv:2403.16329  [pdf, other

    cs.MA

    Social Deliberation vs. Social Contracts in Self-Governing Voluntary Organisations

    Authors: Matthew Scott, Asimina Mertzani, Ciske Smit, Stefan Sarkadi, Jeremy Pitt

    Abstract: Self-organising multi-agent systems regulate their components' behaviour voluntarily, according to a set of socially-constructed, mutually-agreed, and mutable social arrangements. In some systems, these arrangements may be applied with a frequency, at a scale and within implicit cost constraints such that performance becomes a pressing issue. This paper introduces the \textit{Megabike Scenario}, w… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: COINE@AAMAS2024

    ACM Class: I.2.11

  9. arXiv:2403.03897  [pdf, other

    cs.SE cs.CR

    Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing

    Authors: Asmita, Yaroslav Oliinyk, Michael Scott, Ryan Tsang, Chongzhou Fang, Houman Homayoun

    Abstract: BusyBox, an open-source software bundling over 300 essential Linux commands into a single executable, is ubiquitous in Linux-based embedded devices. Vulnerabilities in BusyBox can have far-reaching consequences, affecting a wide array of devices. This research, driven by the extensive use of BusyBox, delved into its analysis. The study revealed the prevalence of older BusyBox versions in real-worl… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  10. arXiv:2310.04984  [pdf, other

    cs.IT cs.LG eess.SP math.PR stat.ML

    Model-adapted Fourier sampling for generative compressed sensing

    Authors: Aaron Berk, Simone Brugiapaglia, Yaniv Plan, Matthew Scott, Xia Sheng, Ozgur Yilmaz

    Abstract: We study generative compressed sensing when the measurement matrix is randomly subsampled from a unitary matrix (with the DFT as an important special case). It was recently shown that $\textit{O}(kdn\| \boldsymbolα\|_{\infty}^{2})$ uniformly random Fourier measurements are sufficient to recover signals in the range of a neural network $G:\mathbb{R}^k \to \mathbb{R}^n$ of depth $d$, where each comp… ▽ More

    Submitted 17 November, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: 12 pages, 4 figures. Submitted to the NeurIPS 2023 Workshop on Deep Learning and Inverse Problems. This revision features additional attribution of work, aknowledgmenents, and a correction in definition 1.1

  11. arXiv:2309.06122  [pdf, other

    cond-mat.mtrl-sci cs.LG eess.IV

    A robust synthetic data generation framework for machine learning in High-Resolution Transmission Electron Microscopy (HRTEM)

    Authors: Luis Rangel DaCosta, Katherine Sytwu, Catherine Groschner, Mary Scott

    Abstract: Machine learning techniques are attractive options for developing highly-accurate automated analysis tools for nanomaterials characterization, including high-resolution transmission electron microscopy (HRTEM). However, successfully implementing such machine learning tools can be difficult due to the challenges in procuring sufficiently large, high-quality training datasets from experiments. In th… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  12. arXiv:2306.11853  [pdf, other

    cond-mat.mtrl-sci cs.LG eess.IV

    Generalization Across Experimental Parameters in Machine Learning Analysis of High Resolution Transmission Electron Microscopy Datasets

    Authors: Katherine Sytwu, Luis Rangel DaCosta, Mary C. Scott

    Abstract: Neural networks are promising tools for high-throughput and accurate transmission electron microscopy (TEM) analysis of nanomaterials, but are known to generalize poorly on data that is "out-of-distribution" from their training data. Given the limited set of image features typically seen in high-resolution TEM imaging, it is unclear which images are considered out-of-distribution from others. Here… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 11 pages, 5 figures

  13. Domain Adaptive Decision Trees: Implications for Accuracy and Fairness

    Authors: Jose M. Alvarez, Kristen M. Scott, Salvatore Ruggieri, Bettina Berendt

    Abstract: In uses of pre-trained machine learning models, it is a known issue that the target population in which the model is being deployed may not have been reflected in the source population with which the model was trained. This can result in a biased model when deployed, leading to a reduction in model performance. One risk is that, as the population changes, certain demographic groups will be under-s… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: *Both authors contributed equally to this work. Accepted at FAccT '23

    Journal ref: FAccT '23: the 2023 ACM Conference on Fairness, Accountability, and Transparency Chicago IL USA June 12 - 15, 2023

  14. arXiv:2301.00996  [pdf, other

    cs.DC cs.DS cs.PF

    Transactional Composition of Nonblocking Data Structures

    Authors: Wentao Cai, Haosen Wen, Michael L. Scott

    Abstract: This paper introduces nonblocking transaction composition (NBTC), a new methodology for atomic composition of nonblocking operations on concurrent data structures. Unlike previous software transactional memory (STM) approaches, NBTC leverages the linearizability of existing nonblocking structures, reducing the number of memory accesses that must be executed together, atomically, to only one per op… ▽ More

    Submitted 7 January, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  15. arXiv:2207.09340  [pdf, other

    cs.IT cs.LG eess.SP math.PR stat.ML

    A coherence parameter characterizing generative compressed sensing with Fourier measurements

    Authors: Aaron Berk, Simone Brugiapaglia, Babhru Joshi, Yaniv Plan, Matthew Scott, Özgür Yilmaz

    Abstract: In Bora et al. (2017), a mathematical framework was developed for compressed sensing guarantees in the setting where the measurement matrix is Gaussian and the signal structure is the range of a generative neural network (GNN). The problem of compressed sensing with GNNs has since been extensively analyzed when the measurement matrix and/or network weights follow a subgaussian distribution. We mov… ▽ More

    Submitted 9 November, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    MSC Class: 68T07; 60F10; 68P30; 94A08; 94A16

  16. Fairness in Agreement With European Values: An Interdisciplinary Perspective on AI Regulation

    Authors: Alejandra Bringas Colmenarejo, Luca Nannini, Alisa Rieger, Kristen M. Scott, Xuan Zhao, Gourab K. Patro, Gjergji Kasneci, Katharina Kinder-Kurlanda

    Abstract: With increasing digitalization, Artificial Intelligence (AI) is becoming ubiquitous. AI-based systems to identify, optimize, automate, and scale solutions to complex economic and societal problems are being proposed and implemented. This has motivated regulation efforts, including the Proposal of an EU AI Act. This interdisciplinary position paper considers various concerns surrounding fairness an… ▽ More

    Submitted 8 June, 2022; originally announced July 2022.

    Comments: In proceedings of AAAI/ACM Conference AIES 2022 (https://doi.org/10.1145/3514094.3534158)

  17. arXiv:2204.04250  [pdf, other

    cond-mat.mtrl-sci cs.CV eess.IV

    Understanding the Influence of Receptive Field and Network Complexity in Neural-Network-Guided TEM Image Analysis

    Authors: Katherine Sytwu, Catherine Groschner, Mary C. Scott

    Abstract: Trained neural networks are promising tools to analyze the ever-increasing amount of scientific image data, but it is unclear how to best customize these networks for the unique features in transmission electron micrographs. Here, we systematically examine how neural network architecture choices affect how neural networks segment, or pixel-wise separate, crystalline nanoparticles from amorphous ba… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 11 pages, 8 figures

  18. Aggregation and Transformation of Vector-Valued Messages in the Shuffle Model of Differential Privacy

    Authors: Mary Scott, Graham Cormode, Carsten Maple

    Abstract: Advances in communications, storage and computational technology allow significant quantities of data to be collected and processed by distributed devices. Combining the information from these endpoints can realize significant societal benefit but presents challenges in protecting the privacy of individuals, especially important in an increasingly regulated world. Differential privacy (DP) is a te… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 16 pages, 5 figures, in: IEEE Transactions on Information Forensics and Security (TIFS), 2022. arXiv admin note: substantial text overlap with arXiv:2112.05464

  19. arXiv:2112.05464  [pdf, other

    cs.CR

    Applying the Shuffle Model of Differential Privacy to Vector Aggregation

    Authors: Mary Scott, Graham Cormode, Carsten Maple

    Abstract: In this work we introduce a new protocol for vector aggregation in the context of the Shuffle Model, a recent model within Differential Privacy (DP). It sits between the Centralized Model, which prioritizes the level of accuracy over the secrecy of the data, and the Local Model, for which an improvement in trust is counteracted by a much higher noise requirement. The Shuffle Model was developed to… ▽ More

    Submitted 31 January, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: 17 pages, 3 figures, in: British International Conference on Databases (BICOD21), London, UK, 28 Mar 2022

  20. arXiv:2108.07755  [pdf, other

    cs.CV

    TOOD: Task-aligned One-stage Object Detection

    Authors: Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang

    Abstract: One-stage object detection is commonly implemented by optimizing two sub-tasks: object classification and localization, using heads with two parallel branches, which might lead to a certain level of spatial misalignment in predictions between the two tasks. In this work, we propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the two tasks in a learning-based manner. Fir… ▽ More

    Submitted 28 August, 2021; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: ICCV2021 Oral

  21. arXiv:2107.04324  [pdf, other

    cs.CV

    Mutually-aware Sub-Graphs Differentiable Architecture Search

    Authors: Haoxian Tan, Sheng Guo, Yujie Zhong, Matthew R. Scott, Weilin Huang

    Abstract: Differentiable architecture search is prevalent in the field of NAS because of its simplicity and efficiency, where two paradigms, multi-path algorithms and single-path methods, are dominated. Multi-path framework (e.g. DARTS) is intuitive but suffers from memory usage and training collapse. Single-path methods (e.g.GDAS and ProxylessNAS) mitigate the memory issue and shrink the gap between search… ▽ More

    Submitted 5 November, 2021; v1 submitted 9 July, 2021; originally announced July 2021.

  22. Fast Nonblocking Persistence for Concurrent Data Structures

    Authors: Wentao Cai, Haosen Wen, Vladimir Maksimovski, Mingzhe Du, Rafaello Sanna, Shreif Abdallah, Michael L. Scott

    Abstract: We present a fully lock-free variant of the recent Montage system for persistent data structures. Our variant, nbMontage, adds persistence to almost any nonblocking concurrent structure without introducing significant overhead or blocking of any kind. Like its predecessor, nbMontage is buffered durably linearizable: it guarantees that the state recovered in the wake of a crash will represent a con… ▽ More

    Submitted 8 August, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    MSC Class: 68W10; 68Q85; 68M15 ACM Class: D.1.3; B.2.3

  23. arXiv:2103.14003  [pdf, other

    cs.CV

    Rethinking Deep Contrastive Learning with Embedding Memory

    Authors: Haozhi Zhang, Xun Wang, Weilin Huang, Matthew R. Scott

    Abstract: Pair-wise loss functions have been extensively studied and shown to continuously improve the performance of deep metric learning (DML). However, they are primarily designed with intuition based on simple toy examples, and experimentally identifying the truly effective design is difficult in complicated, real-world cases. In this paper, we provide a new methodology for systematically studying weigh… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: Under review

  24. arXiv:2103.11587  [pdf, other

    cs.CV eess.IV

    Brain Image Synthesis with Unsupervised Multivariate Canonical CSC$\ell_4$Net

    Authors: Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao

    Abstract: Recent advances in neuroscience have highlighted the effectiveness of multi-modal medical data for investigating certain pathologies and understanding human cognition. However, obtaining full sets of different modalities is limited by various factors, such as long acquisition times, high examination costs and artifact suppression. In addition, the complexity, high dimensionality and heterogeneity… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures CVPR2021 oral

  25. arXiv:2101.04028  [pdf, other

    cs.CV

    Unchain the Search Space with Hierarchical Differentiable Architecture Search

    Authors: Guanting Liu, Yujie Zhong, Sheng Guo, Matthew R. Scott, Weilin Huang

    Abstract: Differentiable architecture search (DAS) has made great progress in searching for high-performance architectures with reduced computational cost. However, DAS-based methods mainly focus on searching for a repeatable cell structure, which is then stacked sequentially in multiple stages to form the networks. This configuration significantly reduces the search space, and ignores the importance of con… ▽ More

    Submitted 11 January, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: To appear in AAAI2021. Code is available

  26. arXiv:2009.13701  [pdf, other

    cs.DC cs.DS cs.PF

    Montage: A General System for Buffered Durably Linearizable Data Structures

    Authors: Haosen Wen, Wentao Cai, Mingzhe Du, Louis Jenkins, Benjamin Valpey, Michael L. Scott

    Abstract: The recent emergence of fast, dense, nonvolatile main memory suggests that certain long-lived data might remain in its natural pointer-rich format across program runs and hardware reboots. Operations on such data must be instrumented with explicit write-back and fence instructions to ensure consistency in the wake of a crash. Techniques to minimize the cost of this instrumentation are an active to… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    MSC Class: 68W10; 68Q85; 68M15 ACM Class: D.1.3; B.2.3

  27. arXiv:2007.12075  [pdf, other

    cs.CV

    Representation Sharing for Fast Object Detector Search and Beyond

    Authors: Yujie Zhong, Zelu Deng, Sheng Guo, Matthew R. Scott, Weilin Huang

    Abstract: Region Proposal Network (RPN) provides strong support for handling the scale variation of objects in two-stage object detection. For one-stage detectors which do not have RPN, it is more demanding to have powerful sub-networks capable of directly capturing objects of unknown sizes. To enhance such capability, we propose an extremely efficient neural architecture search method, named Fast And Diver… ▽ More

    Submitted 23 October, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: ECCV 2020 accepted

  28. arXiv:2004.06711  [pdf, other

    cs.CV

    Deformable Siamese Attention Networks for Visual Object Tracking

    Authors: Yuechen Yu, Yilei Xiong, Weilin Huang, Matthew R. Scott

    Abstract: Siamese-based trackers have achieved excellent performance on visual object tracking. However, the target template is not updated online, and the features of the target template and search image are computed independently in a Siamese architecture. In this paper, we propose Deformable Siamese Attention Networks, referred to as SiamAttn, by introducing a new Siamese attention mechanism that compute… ▽ More

    Submitted 24 March, 2021; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: CVPR 2020, with code available at: https://github.com/msight-tech/research-siamattn

  29. arXiv:2003.06718  [pdf, other

    cs.DC

    Understanding and Optimizing Persistent Memory Allocation

    Authors: Wentao Cai, Haosen Wen, H. Alan Beadle, Chris Kjellqvist, Mohammad Hedayati, Michael L. Scott

    Abstract: The proliferation of fast, dense, byte-addressable nonvolatile memory suggests that data might be kept in pointer-rich "in-memory" format across program runs and even process and system crashes. For full generality, such data requires dynamic memory allocation, and while the allocator could in principle "rolled into" each data structure, it is desirable to make it a separate abstraction. Toward… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

  30. arXiv:2003.05235  [pdf, other

    cs.CV

    Channel Interaction Networks for Fine-Grained Image Categorization

    Authors: Yu Gao, Xintong Han, Xun Wang, Weilin Huang, Matthew R. Scott

    Abstract: Fine-grained image categorization is challenging due to the subtle inter-class differences.We posit that exploiting the rich relationships between channels can help capture such differences since different channels correspond to different semantics. In this paper, we propose a channel interaction network (CIN), which models the channel-wise interplay both within an image and across images. For a s… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: AAAI 2020

  31. arXiv:2003.04132  [pdf, other

    cs.CV

    iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection

    Authors: Chenfan Zhuang, Xintong Han, Weilin Huang, Matthew R. Scott

    Abstract: Training an object detector on a data-rich domain and applying it to a data-poor one with limited performance drop is highly attractive in industry, because it saves huge annotation cost. Recent research on unsupervised domain adaptive object detection has verified that aligning data distributions between source and target images through adversarial learning is very useful. The key is when, where… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: AAAI 2020

  32. arXiv:2002.07471  [pdf, other

    cs.CV

    Knowledge Integration Networks for Action Recognition

    Authors: Shiwen Zhang, Sheng Guo, Limin Wang, Weilin Huang, Matthew R. Scott

    Abstract: In this work, we propose Knowledge Integration Networks (referred as KINet) for video action recognition. KINet is capable of aggregating meaningful context features which are of great importance to identifying an action, such as human information and scene context. We design a three-branch architecture consisting of a main branch for action recognition, and two auxiliary branches for human parsin… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in AAAI 2020

  33. arXiv:2002.07442  [pdf, other

    cs.CV

    V4D:4D Convolutional Neural Networks for Video-level Representation Learning

    Authors: Shiwen Zhang, Sheng Guo, Weilin Huang, Matthew R. Scott, Limin Wang

    Abstract: Most existing 3D CNNs for video representation learning are clip-based methods, and thus do not consider video-level temporal evolution of spatio-temporal features. In this paper, we propose Video-level 4D Convolutional Neural Networks, referred as V4D, to model the evolution of long-range spatio-temporal representation with 4D convolutions, and at the same time, to preserve strong 3D spatio-tempo… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in ICLR2020

  34. Machine Learning Pipeline for Segmentation and Defect Identification from High Resolution Transmission Electron Microscopy Data

    Authors: C. K. Groschner, Christina Choi, M. C. Scott

    Abstract: In the field of transmission electron microscopy, data interpretation often lags behind acquisition methods, as image processing methods often have to be manually tailored to individual datasets. Machine learning offers a promising approach for fast, accurate analysis of electron microscopy data. Here, we demonstrate a flexible two step pipeline for analysis of high resolution transmission electro… ▽ More

    Submitted 23 February, 2021; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 10 pages, 5 figures

  35. arXiv:1912.06798  [pdf, other

    cs.LG cs.CV

    Cross-Batch Memory for Embedding Learning

    Authors: Xun Wang, Haozhi Zhang, Weilin Huang, Matthew R. Scott

    Abstract: Mining informative negative instances are of central importance to deep metric learning (DML), however this task is intrinsically limited by mini-batch training, where only a mini-batch of instances is accessible at each iteration. In this paper, we identify a "slow drift" phenomena by observing that the embedding features drift exceptionally slow even as the model parameters are updating througho… ▽ More

    Submitted 20 April, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 Oral

  36. arXiv:1910.07954  [pdf, other

    cs.CV

    Convolutional Character Networks

    Authors: Linjie Xing, Zhi Tian, Weilin Huang, Matthew R. Scott

    Abstract: Recent progress has been made on developing a unified framework for joint text detection and recognition in natural images, but existing joint models were mostly built on two-stage framework by involving ROI pooling, which can degrade the performance on recognition task. In this work, we propose convolutional character networks, referred as CharNet, which is an one-stage model that can process two… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: To appear in ICCV 2019

  37. arXiv:1910.02624  [pdf, other

    cs.CV

    Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation

    Authors: Weifeng Ge, Sheng Guo, Weilin Huang, Matthew R. Scott

    Abstract: Weakly-supervised instance segmentation aims to detect and segment object instances precisely, given imagelevel labels only. Unlike previous methods which are composed of multiple offline stages, we propose Sequential Label Propagation and Enhancement Networks (referred as Label-PEnet) that progressively transform image-level labels to pixel-wise labels in a coarse-to-fine manner. We design four c… ▽ More

    Submitted 24 April, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Rectifiy some typos in Arxiv title

  38. arXiv:1909.11966  [pdf, other

    cs.CV

    Dual-Stream Pyramid Registration Network

    Authors: Miao Kang, Xiaojun Hu, Weilin Huang, Matthew R. Scott, Mauricio Reyes

    Abstract: We propose a Dual-Stream Pyramid Registration Network (referred as Dual-PRNet) for unsupervised 3D medical image registration. Unlike recent CNN-based registration approaches, such as VoxelMorph, which explores a single-stream encoder-decoder network to compute a registration fields from a pair of 3D volumes, we design a two-stream architecture able to compute multi-scale registration fields from… ▽ More

    Submitted 1 April, 2023; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Published in Medical Image Analysis, 2022

  39. arXiv:1906.05750  [pdf, other

    cs.CV

    The iMaterialist Fashion Attribute Dataset

    Authors: Sheng Guo, Weilin Huang, Xiao Zhang, Prasanna Srikhanta, Yin Cui, Yuan Li, Matthew R. Scott, Hartwig Adam, Serge Belongie

    Abstract: Large-scale image databases such as ImageNet have significantly advanced image classification and other visual recognition tasks. However much of these datasets are constructed only for single-label and coarse object-level classification. For real-world applications, multiple labels and fine-grained categories are often needed, yet very few such datasets exist publicly, especially those of large-s… ▽ More

    Submitted 14 June, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

  40. arXiv:1906.01811  [pdf, other

    cs.AI stat.AP

    The Stanford Acuity Test: A Precise Vision Test Using Bayesian Techniques and a Discovery in Human Visual Response

    Authors: Chris Piech, Ali Malik, Laura M Scott, Robert T Chang, Charles Lin

    Abstract: Chart-based visual acuity measurements are used by billions of people to diagnose and guide treatment of vision impairment. However, the ubiquitous eye exam has no mechanism for reasoning about uncertainty and as such, suffers from a well-documented reproducibility problem. In this paper we make two core contributions. First, we uncover a new parametric probabilistic model of visual acuity respons… ▽ More

    Submitted 21 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA. 2020

  41. arXiv:1904.06627  [pdf, other

    cs.CV

    Multi-Similarity Loss with General Pair Weighting for Deep Metric Learning

    Authors: Xun Wang, Xintong Han, Weilin Huang, Dengke Dong, Matthew R. Scott

    Abstract: A family of loss functions built on pair-based computation have been proposed in the literature which provide a myriad of solutions for deep metric learning. In this paper, we provide a general weighting framework for understanding recent pair-based loss functions. Our contributions are three-fold: (1) we establish a General Pair Weighting (GPW) framework, which casts the sampling problem of deep… ▽ More

    Submitted 22 March, 2020; v1 submitted 14 April, 2019; originally announced April 2019.

    Comments: Accepted CVPR 2019, rewrite main method to be more clear

    Report number: 13 pages, 4 figures, 7 tables, including supplementary materials

  42. Fast Intra-kernel Isolation and Security with IskiOS

    Authors: Spyridoula Gravani, Mohammad Hedayati, John Criswell, Michael L. Scott

    Abstract: The kernels of operating systems such as Windows, Linux, and MacOS are vulnerable to control-flow hijacking. Defenses exist, but many require efficient intra-address-space isolation. Execute-only memory, for example, requires read protection on code segments, and shadow stacks require protection from buffer overwrites. Intel's Protection Keys for Userspace (PKU) could, in principle, provide the in… ▽ More

    Submitted 2 August, 2021; v1 submitted 11 March, 2019; originally announced March 2019.

  43. arXiv:1902.01096  [pdf, other

    cs.CV

    Compatible and Diverse Fashion Image Inpainting

    Authors: Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry S. Davis

    Abstract: Visual compatibility is critical for fashion analysis, yet is missing in existing fashion image synthesis systems. In this paper, we propose to explicitly model visual compatibility through fashion image inpainting. To this end, we present Fashion Inpainting Networks (FiNet), a two-stage image-to-image generation framework that is able to perform compatible and diverse inpainting. Disentangling th… ▽ More

    Submitted 24 April, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  44. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  45. arXiv:1810.06951  [pdf, other

    cs.CV

    Deep Metric Learning with Hierarchical Triplet Loss

    Authors: Weifeng Ge, Weilin Huang, Dengke Dong, Matthew R. Scott

    Abstract: We present a novel hierarchical triplet loss (HTL) capable of automatically collecting informative training samples (triplets) via a defined hierarchical tree that encodes global context information. This allows us to cope with the main limitation of random sampling in training a conventional triplet loss, which is a central issue for deep metric learning. Our main contributions are two-fold. (i)… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Published in ECCV 2018

  46. arXiv:1808.01097  [pdf, other

    cs.CV

    CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

    Authors: Sheng Guo, Weilin Huang, Haozhi Zhang, Chenfan Zhuang, Dengke Dong, Matthew R. Scott, Dinglong Huang

    Abstract: We present a simple yet efficient approach capable of training deep neural networks on large-scale weakly-supervised web images, which are crawled raw from the Internet by using text queries, without any human annotation. We develop a principled learning strategy by leveraging curriculum learning, with the goal of handling a massive amount of noisy labels and data imbalance effectively. We design… ▽ More

    Submitted 18 October, 2018; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: Accepted to ECCV 2018. 16 pages, 5 figures, 5 tables

  47. arXiv:1610.08809  [pdf, other

    cs.DS q-bio.GN q-bio.QM

    Aligning coding sequences with frameshift extension penalties

    Authors: Safa Jammali, Esaie Kuitche, Ayoub Rachati, François Bélanger, Michelle Scott, Aïda Ouangraoua

    Abstract: Frameshift translation is an important phenomenon that contributes to the appearance of novel Coding DNA Sequences (CDS) and functions in gene evolution, by allowing alternative amino acid translations of genes coding regions. Frameshift translations can be identified by aligning two CDS, from a same gene or from homologous genes, while accounting for their codon structure. Two main classes of alg… ▽ More

    Submitted 13 April, 2017; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: 24 pages, 4 figures

    Journal ref: Algorithms for Molecular Biology, 2017, vol. 12, no 1, p. 10

  48. arXiv:1606.06873  [pdf, other

    cs.MM cs.HC

    Personality, Culture, and System Factors - Impact on Affective Response to Multimedia

    Authors: Sharath Chandra Guntuku, Michael James Scott, Gheorghita Ghinea, Weisi Lin

    Abstract: Whilst affective responses to various forms and genres of multimedia content have been well researched, precious few studies have investigated the combined impact that multimedia system parameters and human factors have on affect. Consequently, in this paper we explore the role that two primordial dimensions of human factors - personality and culture - in conjunction with system factors - frame ra… ▽ More

    Submitted 18 July, 2016; v1 submitted 22 June, 2016; originally announced June 2016.

  49. arXiv:1512.09022  [pdf

    cs.CY

    Cross-Cultural Differences in Students' Intention to Use RSS Feeds between Lebanon and the United Kingdom: A Multi-Group Invariance Analysis Based on the Technology Acceptance Model

    Authors: Ali Tarhini, Michael James Scott

    Abstract: Really Simple Syndication (RSS) offers a means for university students to receive timely updates from virtual learning environments. However, despite its utility, only 21% of students surveyed at a Lebanese university claim to have ever used the technology. To investigate whether a cultural influence is affecting intention to use RSS, the survey was extended to the British context to conduct a cro… ▽ More

    Submitted 30 December, 2015; originally announced December 2015.

    Comments: 15 pages, 2 figures, 10 tables

    ACM Class: K.3.1; J.4

    Journal ref: Electronic Journal of e-Learning, 13(1): 14-29, 2015

  50. Promoting Inclusive Design Practice at the Global Game Jam: A Pilot Evaluation

    Authors: Michael James Scott, Gheorghita Ghinea, Ian Hamilton

    Abstract: Games are a popular form of entertainment. However, many computer games present unnecessary barriers to players with sensory, motor and cognitive impairments. In order to overcome such pitfalls, an awareness of their impact and a willingness to apply inclusive design practice is often necessary. The Global Game Jam offers a potential avenue to promote inclusive design practices to students of game… ▽ More

    Submitted 18 September, 2014; originally announced September 2014.

    Comments: Presented at the 2014 IEEE Frontiers in Education Conference, 12 Pages, 1 Figure

    Journal ref: Frontiers in Education Conference (FIE), 2014 IEEE, 1-4