Showing 1–50 of 76 results for author: Qiao, M

Searching in archive cs.
  1. arXiv:2507.03460  [pdf, ps, other]

    cs.AI

    Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis

    Authors: Weitong Zhang, Mengyun Qiao, Chengqi Zang, Steven Niederer, Paul M Matthews, Wenjia Bai, Bernhard Kainz

    Abstract: Identifying the associations between imaging phenotypes and disease risk factors and outcomes is essential for understanding disease mechanisms and improving diagnosis and prognosis models. However, traditional approaches rely on human-driven hypothesis testing and selection of association factors, often overlooking complex, non-linear dependencies among imaging phenotypes and other multi-modal da…

    Submitted 4 July, 2025; originally announced July 2025.

  2. arXiv:2506.21171  [pdf, ps, other]

    eess.IV cs.CV

    Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations

    Authors: Jing Yang, Qunliang Xing, Mai Xu, Minglang Qiao

    Abstract: Joint Photographic Experts Group (JPEG) achieves data compression by quantizing Discrete Cosine Transform (DCT) coefficients, which inevitably introduces compression artifacts. Most existing JPEG quality enhancement methods operate in the pixel domain, suffering from the high computational costs of decoding. Consequently, direct enhancement of JPEG images in the DCT domain has gained increasing at…

    Submitted 26 June, 2025; originally announced June 2025.

  3. arXiv:2505.24759  [pdf, ps, other]

    q-bio.QM cs.AI cs.LG

    Unsupervised Evolutionary Cell Type Matching via Entropy-Minimized Optimal Transport

    Authors: Mu Qiao

    Abstract: Identifying evolutionary correspondences between cell types across species is a fundamental challenge in comparative genomics and evolutionary biology. Existing approaches often rely on either reference-based matching, which imposes asymmetry by designating one species as the reference, or projection-based matching, which may increase computational complexity and obscure biological interpretabilit…

    Submitted 30 May, 2025; originally announced May 2025.

  4. arXiv:2505.14190  [pdf, ps, other]

    cs.LG cs.AI

    $α$-GAN by Rényi Cross Entropy

    Authors: Ni Ding, Miao Qiao, Jiaxing Xu, Yiping Ke, Xiaoyu Zhang

    Abstract: This paper proposes $α$-GAN, a generative adversarial network using Rényi measures. The value function is formulated, by Rényi cross entropy, as an expected certainty measure incurred by the discriminator's soft decision as to where the sample is from, true population or the generator. The discriminator tries to maximize the Rényi certainty about sample source, while the generator wants to reduce…

    Submitted 20 May, 2025; originally announced May 2025.

  5. arXiv:2504.17271  [pdf, other]

    cs.CR

    Contrastive Learning for Continuous Touch-Based Authentication

    Authors: Mengyu Qiao, Yunpeng Zhai, Yang Wang

    Abstract: Smart mobile devices have become indispensable in modern daily life, where sensitive information is frequently processed, stored, and transmitted, posing critical demands for robust security controls. Given that touchscreens are the primary medium for human-device interaction, continuous user authentication based on touch behavior presents a natural and seamless security solution. While existing me…

    Submitted 24 April, 2025; originally announced April 2025.

  6. arXiv:2504.17223  [pdf, other]

    cs.CV

    Towards Generalizable Deepfake Detection with Spatial-Frequency Collaborative Learning and Hierarchical Cross-Modal Fusion

    Authors: Mengyu Qiao, Runze Tian, Yang Wang

    Abstract: The rapid evolution of deep generative models poses a critical challenge to deepfake detection, as detectors trained on forgery-specific artifacts often suffer significant performance degradation when encountering unseen forgeries. While existing methods predominantly rely on spatial domain analysis, frequency domain operations are primarily limited to feature-level augmentation, leaving frequency…

    Submitted 23 April, 2025; originally announced April 2025.

  7. arXiv:2504.13914  [pdf, other]

    cs.CL

    Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

    Authors: ByteDance Seed, Jiaze Chen, Tiantian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu, Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang, Ruofei Zhu, Zhecheng An, Zhihao Bai, Yu Bao, Xingyan Bin, Jiangjie Chen, Feng Chen, Hongmin Chen, et al. (249 additional authors not shown)

    Abstract: We introduce Seed1.5-Thinking, capable of reasoning through thinking before responding, resulting in improved performance on a wide range of benchmarks. Seed1.5-Thinking achieves 86.7 on AIME 2024, 55.0 on Codeforces and 77.3 on GPQA, demonstrating excellent reasoning abilities in STEM and coding. Beyond reasoning tasks, the method demonstrates notable generalization across diverse domains. For in…

    Submitted 29 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  8. arXiv:2503.14476  [pdf, other]

    cs.LG cs.CL

    DAPO: An Open-Source LLM Reinforcement Learning System at Scale

    Authors: Qiying Yu, Zheng Zhang, Ruofei Zhu, Yufeng Yuan, Xiaochen Zuo, Yu Yue, Weinan Dai, Tiantian Fan, Gaohong Liu, Lingjun Liu, Xin Liu, Haibin Lin, Zhiqi Lin, Bole Ma, Guangming Sheng, Yuxuan Tong, Chi Zhang, Mofan Zhang, Wang Zhang, Hang Zhu, Jinhua Zhu, Jiaze Chen, Jiangjie Chen, Chengyi Wang, Hongli Yu, et al. (10 additional authors not shown)

    Abstract: Inference scaling empowers LLMs with unprecedented reasoning ability, with reinforcement learning as the core technique to elicit complex reasoning. However, key technical details of state-of-the-art reasoning LLMs are concealed (such as in OpenAI o1 blog and DeepSeek R1 technical report), thus the community still struggles to reproduce their RL training results. We propose the $\textbf{D}$ecouple…

    Submitted 19 May, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: Project Page: https://dapo-sia.github.io/

  9. arXiv:2503.02384  [pdf, other]

    cs.LG

    Truthfulness of Decision-Theoretic Calibration Measures

    Authors: Mingda Qiao, Eric Zhao

    Abstract: Calibration measures quantify how much a forecaster's predictions violate calibration, which requires that forecasts are unbiased conditioning on the forecasted probabilities. Two important desiderata for a calibration measure are its decision-theoretic implications (i.e., downstream decision-makers that best-respond to the forecasts are always no-regret) and its truthfulness (i.e., a forecaster…

    Submitted 4 March, 2025; originally announced March 2025.

  10. arXiv:2502.09834  [pdf, other]

    cs.DS

    Optimal $k$-Secretary with Logarithmic Memory

    Authors: Mingda Qiao, Wei Zhang

    Abstract: We study memory-bounded algorithms for the $k$-secretary problem. The algorithm of Kleinberg (2005) achieves an optimal competitive ratio of $1 - O(1/\sqrt{k})$, yet a straightforward implementation requires $Ω(k)$ memory. Our main result is a $k$-secretary algorithm that matches the optimal competitive ratio using $O(\log k)$ words of memory. We prove this result by establishing a general reduc…

    Submitted 13 February, 2025; originally announced February 2025.

  11. arXiv:2501.15415  [pdf, other]

    cs.CV

    OCSU: Optical Chemical Structure Understanding for Molecule-centric Scientific Discovery

    Authors: Siqi Fan, Yuguang Xie, Bowen Cai, Ailin Xie, Gaochao Liu, Mu Qiao, Jie Xing, Zaiqing Nie

    Abstract: Understanding the chemical structure from a graphical representation of a molecule is a challenging image caption task that would greatly benefit molecule-centric scientific discovery. Variations in molecular images and caption subtasks pose a significant challenge in both image representation learning and task modeling. Yet, existing methods only focus on a specific caption task that translates a…

    Submitted 22 May, 2025; v1 submitted 26 January, 2025; originally announced January 2025.

  12. arXiv:2501.04882  [pdf, other]

    cs.GT cs.AI cs.LG stat.AP stat.ML

    Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity

    Authors: Yuan Gao, Mu Qiao

    Abstract: The growth in the use of online advertising to foster brand awareness over recent years is largely attributable to the ubiquity of social media. One pivotal technology contributing to the success of online brand advertising is frequency capping, a mechanism that enables marketers to control the number of times an ad is shown to a specific user. However, the very foundation of this technology is be…

    Submitted 8 January, 2025; originally announced January 2025.

  13. arXiv:2411.16624  [pdf, ps, other]

    cs.GT cs.DS cs.MA

    Leakage-Robust Bayesian Persuasion

    Authors: Nika Haghtalab, Mingda Qiao, Kunhe Yang

    Abstract: We introduce the concept of leakage-robust Bayesian persuasion. Situated between public persuasion [KG11, CCG23, Xu20] and private persuasion [AB19], leakage-robust persuasion considers a setting where one or more signals privately sent by a sender to the receivers may be leaked. We study the design of leakage-robust persuasion schemes and quantify the price of robustness using two formalisms: -…

    Submitted 25 November, 2024; originally announced November 2024.

  14. arXiv:2410.19245  [pdf, other]

    cs.SE cs.CV cs.MA

    MaCTG: Multi-Agent Collaborative Thought Graph for Automatic Programming

    Authors: Zixiao Zhao, Jing Sun, Zhe Hou, Zhiyuan Wei, Cheng-Hao Cai, Miao Qiao, Jin Song Dong

    Abstract: With the rapid advancement of Large Language Models (LLMs), LLM-based approaches have demonstrated strong problem-solving capabilities across various domains. However, in automatic programming, a single LLM is typically limited to function-level code generation, while multi-agent systems composed of multiple LLMs often suffer from inefficient task planning. This lack of structured coordination can…

    Submitted 21 April, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  15. A personalized time-resolved 3D mesh generative model for unveiling normal heart dynamics

    Authors: Mengyun Qiao, Kathryn A McGurk, Shuo Wang, Paul M. Matthews, Declan P O'Regan, Wenjia Bai

    Abstract: Understanding the structure and motion of the heart is crucial for diagnosing and managing cardiovascular diseases, the leading cause of global death. There is wide variation in cardiac shape and motion patterns, influenced by demographic, anthropometric and disease factors. Unravelling normal patterns of shape and motion, and understanding how each individual deviates from the norm, would facilit…

    Submitted 2 June, 2025; v1 submitted 20 September, 2024; originally announced September 2024.

    Comments: Accepted by Nature Machine Intelligence

    Journal ref: Nature Machine Intelligence 7 (2025) 800-811

  16. arXiv:2409.10944  [pdf, other]

    cs.LG cs.AI q-bio.NC

    Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification

    Authors: Jiaxing Xu, Kai He, Mengcheng Lan, Qingtian Bian, Wei Li, Tieying Li, Yiping Ke, Miao Qiao

    Abstract: Understanding neurological disorder is a fundamental problem in neuroscience, which often requires the analysis of brain networks derived from functional magnetic resonance imaging (fMRI) data. Despite the prevalence of Graph Neural Networks (GNNs) and Graph Transformers in various domains, applying them to brain networks faces challenges. Specifically, the datasets are severely impacted by the no…

    Submitted 17 September, 2024; originally announced September 2024.

  17. arXiv:2407.13979  [pdf, other]

    cs.LG cs.DS stat.ML

    Truthfulness of Calibration Measures

    Authors: Nika Haghtalab, Mingda Qiao, Kunhe Yang, Eric Zhao

    Abstract: We initiate the study of the truthfulness of calibration measures in sequential prediction. A calibration measure is said to be truthful if the forecaster (approximately) minimizes the expected penalty by predicting the conditional expectation of the next outcome, given the prior distribution of outcomes. Truthfulness is an important property of calibration measures, ensuring that the forecaster i…

    Submitted 20 November, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: To appear at NeurIPS 2024

  18. arXiv:2406.07006  [pdf, other]

    cs.CV

    MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

    Authors: Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, et al. (17 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Few-shot RAW Image Denoising Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  19. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag, et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  20. arXiv:2405.10246  [pdf, other]

    eess.IV cs.CV

    A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts

    Authors: Xinru Zhang, Ni Ou, Berke Doga Basaran, Marco Visentin, Mengyun Qiao, Renyang Gu, Cheng Ouyang, Yaou Liu, Paul M. Matthews, Chuyang Ye, Wenjia Bai

    Abstract: Brain lesion segmentation plays an essential role in neurological research and diagnosis. As brain lesions can be caused by various pathological alterations, different types of brain lesions tend to manifest with different characteristics on different imaging modalities. Due to this complexity, brain lesion segmentation methods are often developed in a task-specific manner. A specific segmentation…

    Submitted 16 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: The work has been early accepted by MICCAI 2024

  21. Synthesizing Realistic Data for Table Recognition

    Authors: Qiyu Hou, Jun Wang, Meixuan Qiao, Lujun Tian

    Abstract: To overcome the limitations and challenges of current automatic table data annotation methods and random table data synthesis approaches, we propose a novel method for synthesizing annotation data specifically designed for table recognition. This method utilizes the structure and content of existing complex tables, facilitating the efficient creation of tables that closely replicate the authentic…

    Submitted 9 July, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: ICDAR 2024

  22. arXiv:2403.05775  [pdf, other]

    cs.DS

    Scalable $k$-clique Densest Subgraph Search

    Authors: Xiaowei Ye, Miao Qiao, Rong-Hua Li, Qi Zhang, Guoren Wang

    Abstract: In this paper, we present a collection of novel and scalable algorithms designed to tackle the challenges inherent in the $k$-clique densest subgraph problem (\kcdsp) within network analysis. We propose \psctl, a novel algorithm based on the Frank-Wolfe approach for addressing \kcdsp, effectively solving a distinct convex programming problem. \psctl is able to approximate \kcdsp…

    Submitted 8 March, 2024; originally announced March 2024.

  23. arXiv:2402.15169  [pdf, ps, other]

    cs.GT cs.DS cs.MA

    Platforms for Efficient and Incentive-Aware Collaboration

    Authors: Nika Haghtalab, Mingda Qiao, Kunhe Yang

    Abstract: Collaboration is crucial for reaching collective goals. However, its effectiveness is often undermined by the strategic behavior of individual agents -- a fact that is captured by a high Price of Stability (PoS) in recent literature [Blum et al., 2021]. Implicit in the traditional PoS analysis is the assumption that agents have full knowledge of how their tasks relate to one another. We offer a ne…

    Submitted 20 November, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  24. arXiv:2402.10445  [pdf, other]

    cs.LG cs.DS stat.ML

    Collaborative Learning with Different Labeling Functions

    Authors: Yuyang Deng, Mingda Qiao

    Abstract: We study a variant of Collaborative PAC Learning, in which we aim to learn an accurate classifier for each of the $n$ data distributions, while minimizing the number of samples drawn from them in total. Unlike in the usual collaborative learning setup, it is not assumed that there exists a single classifier that is simultaneously accurate for all distributions. We show that, when the data distri…

    Submitted 22 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: To appear at ICML 2024; v2 and v3 included additional discussion on related work

  25. arXiv:2402.07458  [pdf, other]

    cs.LG cs.DS stat.ML

    On the Distance from Calibration in Sequential Prediction

    Authors: Mingda Qiao, Letian Zheng

    Abstract: We study a sequential binary prediction setting where the forecaster is evaluated in terms of the calibration distance, which is defined as the $L_1$ distance between the predicted values and the set of predictions that are perfectly calibrated in hindsight. This is analogous to a calibration measure recently proposed by Błasiok, Gopalan, Hu and Nakkiran (STOC 2023) for the offline setting. The ca…

    Submitted 27 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: To appear at COLT 2024; v2 fixed minor typos

  26. arXiv:2311.16416  [pdf, other]

    cs.DS cs.LG stat.ML

    A Combinatorial Approach to Robust PCA

    Authors: Weihao Kong, Mingda Qiao, Rajat Sen

    Abstract: We study the problem of recovering Gaussian data under adversarial corruptions when the noises are low-rank and the corruptions are on the coordinate level. Concretely, we assume that the Gaussian noises lie in an unknown $k$-dimensional subspace $U \subseteq \mathbb{R}^d$, and $s$ randomly chosen coordinates of each data point fall into the control of an adversary. This setting models the scenari…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: To appear at ITCS 2024

  27. arXiv:2308.09442  [pdf, other]

    cs.CE

    BioMedGPT: Open Multimodal Generative Pre-trained Transformer for BioMedicine

    Authors: Yizhen Luo, Jiahuan Zhang, Siqi Fan, Kai Yang, Yushuai Wu, Mu Qiao, Zaiqing Nie

    Abstract: Foundation models (FMs) have exhibited remarkable performance across a wide range of downstream tasks in many domains. Nevertheless, general-purpose FMs often face challenges when confronted with domain-specific problems, due to their limited access to the proprietary training data in a particular domain. In biomedicine, there are various biological modalities, such as molecules, proteins, and cel…

    Submitted 21 August, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 12 pages, 4 figures

  28. arXiv:2308.09026  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    LesionMix: A Lesion-Level Data Augmentation Method for Medical Image Segmentation

    Authors: Berke Doga Basaran, Weitong Zhang, Mengyun Qiao, Bernhard Kainz, Paul M. Matthews, Wenjia Bai

    Abstract: Data augmentation has become a de facto component of deep learning-based medical image segmentation methods. Most data augmentation techniques used in medical imaging focus on spatial and intensity transformations to improve the diversity of training images. They are often designed at the image level, augmenting the full image, and do not pay attention to specific abnormalities within the image. H…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 13 pages, 5 figures, 4 tables, MICCAI DALI Workshop 2023

  29. arXiv:2307.11133  [pdf, other]

    q-bio.NC cs.AI cs.LG

    Contrastive Graph Pooling for Explainable Classification of Brain Networks

    Authors: Jiaxing Xu, Qingtian Bian, Xinhang Li, Aihu Zhang, Yiping Ke, Miao Qiao, Wei Zhang, Wei Khang Jeremy Sim, Balázs Gulyás

    Abstract: Functional magnetic resonance imaging (fMRI) is a commonly used technique to measure neural activation. Its application has been particularly important in identifying underlying neurodegenerative conditions such as Parkinson's, Alzheimer's, and Autism. Recent analysis of fMRI data models the brain as a graph and extracts features by graph neural networks (GNNs). However, the unique characteristics…

    Submitted 6 September, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Journal ref: IEEE Transactions on Medical Imaging, vol. 43, no. 9, pp. 3292-3305, Sept. 2024

  30. arXiv:2307.08347  [pdf, other]

    cs.CV cs.AI cs.LG

    M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry Optimization

    Authors: Che Liu, Sibo Cheng, Chen Chen, Mengyun Qiao, Weitong Zhang, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Medical vision-language models enable co-learning and integrating features from medical imaging and clinical text. However, these models are not easy to train and the latent representation space can be complex. Here we propose a novel way for pre-training and regularising medical vision-language models. The proposed method, named Medical vision-language pre-training with Frozen language models and…

    Submitted 19 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI 2023

  31. Structure Diagram Recognition in Financial Announcements

    Authors: Meixuan Qiao, Jun Wang, Junfu Xiang, Qiyu Hou, Ruixuan Li

    Abstract: Accurately extracting structured data from structure diagrams in financial announcements is of great practical importance for building financial knowledge graphs and further improving the efficiency of various financial applications. First, we proposed a new method for recognizing structure diagrams in financial announcements, which can better detect and extract different types of connecting lines…

    Submitted 1 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: ICDAR 2023

  32. Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis

    Authors: Hadrien Reynaud, Mengyun Qiao, Mischa Dombrowski, Thomas Day, Reza Razavi, Alberto Gomez, Paul Leeson, Bernhard Kainz

    Abstract: Image synthesis is expected to provide value for the translation of machine learning methods into clinical practice. Fundamental problems like model robustness, domain transfer, causal modelling, and operator training become approachable through synthetic data. Especially, heavily operator-dependent modalities like Ultrasound imaging require robust frameworks for image and video generation. So far…

    Submitted 21 February, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Published in MICCAI 2023 proceedings. https://link.springer.com/chapter/10.1007/978-3-031-43999-5_14

  33. arXiv:2303.11376  [pdf, other]

    cs.LG cs.AI

    GNN-Ensemble: Towards Random Decision Graph Neural Networks

    Authors: Wenqi Wei, Mu Qiao, Divyesh Jadav

    Abstract: Graph Neural Networks (GNNs) have enjoyed widespread applications in graph-structured data. However, existing graph based applications commonly lack annotated data. GNNs are required to learn latent patterns from a limited amount of training data to perform inferences on a vast amount of test data. The increased complexity of GNNs, as well as a single point of model parameter initialization, usua…

    Submitted 20 March, 2023; originally announced March 2023.

  34. arXiv:2301.13098  [pdf, other]

    eess.IV cs.CV cs.LG

    CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy

    Authors: Mengyun Qiao, Shuo Wang, Huaqi Qiu, Antonio de Marvao, Declan P. O'Regan, Daniel Rueckert, Wenjia Bai

    Abstract: Two key questions in cardiac image analysis are to assess the anatomy and motion of the heart from images; and to understand how they are associated with non-imaging clinical factors such as gender, age and diseases. While the first question can often be addressed by image segmentation and motion tracking algorithms, our capability to model and to answer the second question is still limited. In th…

    Submitted 30 November, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Transactions on Medical Imaging

  35. arXiv:2212.02055  [pdf, other]

    cs.LG

    Graph Convolutional Neural Networks with Diverse Negative Samples via Decomposed Determinant Point Processes

    Authors: Wei Duan, Junyu Xuan, Maoying Qiao, Jie Lu

    Abstract: Graph convolutional networks (GCNs) have achieved great success in graph representation learning by extracting high-level features from nodes and their topology. Since GCNs generally follow a message-passing mechanism, each node aggregates information from its first-order neighbour to update its representation. As a result, the representations of nodes with edges between them should be positively…

    Submitted 6 September, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by IEEE TNNLS on 30-Aug-2023. arXiv admin note: text overlap with arXiv:2210.00728

  36. arXiv:2211.12421  [pdf, other]

    q-bio.NC cs.LG eess.IV

    Data-Driven Network Neuroscience: On Data Collection and Benchmark

    Authors: Jiaxing Xu, Yunhan Yang, David Tse Jung Huang, Sophi Shilpa Gururajapathy, Yiping Ke, Miao Qiao, Alan Wang, Haribalan Kumar, Josh McGeown, Eryn Kwon

    Abstract: This paper presents a comprehensive and quality collection of functional human brain network data for potential research in the intersection of neuroscience, machine learning, and graph analytics. Anatomical and functional MRI images have been used to understand the functional connectivity of the human brain and are particularly important in identifying underlying neurodegenerative conditions such…

    Submitted 29 October, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  37. arXiv:2210.16041  [pdf, other]

    cs.SI

    Centralization Problem for Opinion Convergence in Decentralized Networks

    Authors: Yiping Liu, Jiamou Liu, Bakhadyr Khoussainov, Miao Qiao, Bo Yan

    Abstract: This paper aims to provide a new perspective on the interplay between decentralization -- a prevalent character of multi-agent systems -- and centralization, i.e., the task of imposing central control to meet system-level goals. In particular, in the context of networked opinion dynamic model, the paper proposes and discusses a framework for centralization. More precisely, a decentralized network…

    Submitted 28 October, 2022; originally announced October 2022.

  38. arXiv:2210.02415  [pdf, other]

    cs.LG cs.DS stat.ML

    A Fourier Approach to Mixture Learning

    Authors: Mingda Qiao, Guru Guruganesh, Ankit Singh Rawat, Avinava Dubey, Manzil Zaheer

    Abstract: We revisit the problem of learning mixtures of spherical Gaussians. Given samples from mixture $\frac{1}{k}\sum_{j=1}^{k}\mathcal{N}(μ_j, I_d)$, the goal is to estimate the means $μ_1, μ_2, \ldots, μ_k \in \mathbb{R}^d$ up to a small error. The hardness of this learning problem can be measured by the separation $Δ$ defined as the minimum distance between all pairs of means. Regev and Vijayaraghava…

    Submitted 5 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022; v2 corrected author information

  39. Learning from the Dark: Boosting Graph Convolutional Neural Networks with Diverse Negative Samples

    Authors: Wei Duan, Junyu Xuan, Maoying Qiao, Jie Lu

    Abstract: Graph Convolutional Neural Networks (GCNs) have been generally accepted to be an effective tool for node representation learning. An interesting way to understand GCNs is to think of them as a message passing mechanism where each node updates its representation by accepting information from its neighbours (also known as positive samples). However, beyond these neighbouring nodes, graphs have a lar…

    Submitted 3 October, 2022; originally announced October 2022.

  40. arXiv:2210.00655  [pdf, other]

    cs.DS cs.GT

    Online Pen Testing

    Authors: Mingda Qiao, Gregory Valiant

    Abstract: We study a "pen testing" problem, in which we are given $n$ pens with unknown amounts of ink $X_1, X_2, \ldots, X_n$, and we want to choose a pen with the maximum amount of remaining ink in it. The challenge is that we cannot access each $X_i$ directly; we only get to write with the $i$-th pen until either a certain amount of ink is used, or the pen runs out of ink. In both cases, this testing red…

    Submitted 21 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: To appear at ITCS 2023; v2 added discussion on a closely related work of Awerbuch, Azar, Fiat, and Leighton (1996)

  41. arXiv:2208.13146  [pdf, other]

    eess.IV cs.CV cs.LG

    Generative Modelling of the Ageing Heart with Cross-Sectional Imaging and Clinical Data

    Authors: Mengyun Qiao, Berke Doga Basaran, Huaqi Qiu, Shuo Wang, Yi Guo, Yuanyuan Wang, Paul M. Matthews, Daniel Rueckert, Wenjia Bai

    Abstract: Cardiovascular disease, the leading cause of death globally, is an age-related disease. Understanding the morphological and functional changes of the heart during ageing is a key scientific question, the answer to which will help us define important risk factors of cardiovascular disease and monitor disease progression. In this work, we propose a novel conditional generative model to describe the…

    Submitted 10 October, 2022; v1 submitted 28 August, 2022; originally announced August 2022.

  42. arXiv:2208.02135  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Subject-Specific Lesion Generation and Pseudo-Healthy Synthesis for Multiple Sclerosis Brain Images

    Authors: Berke Doga Basaran, Mengyun Qiao, Paul M. Matthews, Wenjia Bai

    Abstract: Understanding the intensity characteristics of brain lesions is key for defining image-based biomarkers in neurological studies and for predicting disease burden and outcome. In this work, we present a novel foreground-based generative method for modelling the local lesion characteristics that can both generate synthetic lesions on healthy images and synthesize subject-specific pseudo-healthy imag…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 13 pages, 6 figures, 2022 MICCAI SASHIMI (Simulation and Synthesis in Medical Imaging) Workshop paper

  43. arXiv:2206.14431  [pdf, other]

    cs.DS cs.LG stat.ML

    Open Problem: Properly learning decision trees in polynomial time?

    Authors: Guy Blanc, Jane Lange, Mingda Qiao, Li-Yang Tan

    Abstract: The authors recently gave an $n^{O(\log\log n)}$ time membership query algorithm for properly learning decision trees under the uniform distribution (Blanc et al., 2021). The previous fastest algorithm for this problem ran in $n^{O(\log n)}$ time, a consequence of Ehrenfeucht and Haussler (1989)'s classic algorithm for the distribution-free setting. In this article we highlight the natural open pr…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 5 pages, to appear at the Open Problem sessions at COLT 2022

  44. arXiv:2206.00311  [pdf, other]

    cs.CV

    MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

    Authors: Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

    Abstract: Text images contain both visual and linguistic information. However, existing pre-training techniques for text recognition mainly focus on either visual representation learning or linguistic knowledge learning. In this paper, we propose a novel approach MaskOCR to unify vision and language pre-training in the classical encoder-decoder recognition framework. We adopt the masked image modeling appro…

    Submitted 9 October, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  45. arXiv:2204.09924  [pdf, other]

    cs.CV cs.MM

    Progressive Training of A Two-Stage Framework for Video Restoration

    Authors: Meisong Zheng, Qunliang Xing, Minglang Qiao, Mai Xu, Lai Jiang, Huaida Liu, Ying Chen

    Abstract: As a widely studied task, video restoration aims to enhance the quality of the videos with multiple potential degradations, such as noises, blurs and compression artifacts. Among video restorations, compressed video quality enhancement and video super-resolution are two of the main tasks with significant values in practical scenarios. Recently, recurrent neural networks and transformers attract in…

    Submitted 4 February, 2023; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: Winning two championships and one runner-up in the NTIRE 2022 challenge on super-resolution and quality enhancement of compressed video; Accepted to CVPRW 2022

  46. arXiv:2204.09314  [pdf, other]

    cs.CV

    NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

    Authors: Ren Yang, Radu Timofte, Meisong Zheng, Qunliang Xing, Minglang Qiao, Mai Xu, Lai Jiang, Huaida Liu, Ying Chen, Youcheng Ben, Xiao Zhou, Chen Fu, Pei Cheng, Gang Yu, Junyi Li, Renlong Wu, Zhilu Zhang, Wei Shang, Zhengyao Lv, Yunjin Chen, Mingcai Zhou, Dongwei Ren, Kai Zhang, Wangmeng Zuo, Pavel Ostyakov, et al. (54 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video. In this challenge, we proposed the LDV 2.0 dataset, which includes the LDV dataset (240 videos) and 95 additional videos. This challenge includes three tracks. Track 1 aims at enhancing the videos compressed by HEVC at a fixed QP. Track 2 and Track 3 target both the super-resolution and qua…

    Submitted 25 April, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  47. arXiv:2204.01489  [pdf, other]

    cs.CY cs.AI cs.SI

    Towards a New Science of Disinformation

    Authors: Claudio S. Pinhanez, German H. Flores, Marisa A. Vasconcelos, Mu Qiao, Nick Linck, Rogério de Paula, Yuya J. Ong

    Abstract: How can we best address the dangerous impact that deep learning-generated fake audios, photographs, and videos (a.k.a. deepfakes) may have in personal and societal life? We foresee that the availability of cheap deepfake technology will create a second wave of disinformation where people will receive specific, personalized disinformation through different channels, making the current approaches to…

    Submitted 17 March, 2022; originally announced April 2022.

  48. arXiv:2111.08567  [pdf, other]

    cs.CV

    Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos

    Authors: Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji

    Abstract: Visual and audio events simultaneously occur and both attract attention. However, most existing saliency prediction works ignore the influence of audio and only consider vision modality. In this paper, we propose a multitask learning method for visual-audio saliency prediction and sound source localization on multi-face video by leveraging visual, audio and face information. Specifically, we first…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 21 pages, 15 figures

  49. arXiv:2109.05287  [pdf, other]

    eess.IV cs.CV

    Dual-view Snapshot Compressive Imaging via Optical Flow Aided Recurrent Neural Network

    Authors: Ruiying Lu, Bo Chen, Guanliang Liu, Ziheng Cheng, Mu Qiao, Xin Yuan

    Abstract: Dual-view snapshot compressive imaging (SCI) aims to capture videos from two field-of-views (FoVs) using a 2D sensor (detector) in a single snapshot, achieving joint FoV and temporal compressive sensing, and thus enjoying the advantages of low-bandwidth, low-power, and low-cost. However, it is challenging for existing model-based decoding algorithms to reconstruct each individual scene, which usua…

    Submitted 11 September, 2021; originally announced September 2021.

  50. arXiv:2109.00637  [pdf, ps, other]

    cs.DS cs.CC cs.LG

    Properly learning decision trees in almost polynomial time

    Authors: Guy Blanc, Jane Lange, Mingda Qiao, Li-Yang Tan

    Abstract: We give an $n^{O(\log\log n)}$-time membership query algorithm for properly and agnostically learning decision trees under the uniform distribution over $\{\pm 1\}^n$. Even in the realizable setting, the previous fastest runtime was $n^{O(\log n)}$, a consequence of a classic algorithm of Ehrenfeucht and Haussler. Our algorithm shares similarities with practical heuristics for learning decision tr…

    Submitted 1 November, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 21 pages, to appear in FOCS 2021