Showing 1–31 of 31 results for author: Dan, J

  1. arXiv:2505.18191  [pdf, ps, other]

    eess.SP cs.AI cs.LG cs.PF

    SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference

    Authors: Jonathan Dan, Amirhossein Shahbazinia, Christodoulos Kechris, David Atienza

    Abstract: Reliable automatic seizure detection from long-term EEG remains a challenge, as current machine learning models often fail to generalize across patients or clinical settings. Manual EEG review remains the clinical standard, underscoring the need for robust models and standardized evaluation. To rigorously assess algorithm performance, we organized a challenge using a private dataset of continuous…

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2505.13100  [pdf, ps, other]

    cs.LG

    Time series saliency maps: explaining models across multiple domains

    Authors: Christodoulos Kechris, Jonathan Dan, David Atienza

    Abstract: Traditional saliency map methods, popularized in computer vision, highlight individual points (pixels) of the input that contribute the most to the model's output. However, in time-series they offer limited insights as semantically meaningful features are often found in other domains. We introduce Cross-domain Integrated Gradients, a generalization of Integrated Gradients. Our method enables featu…

    Submitted 19 May, 2025; originally announced May 2025.

  3. arXiv:2504.18768  [pdf, other]

    cs.GR cs.CV

    TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians

    Authors: Letian Huang, Dongwei Ye, Jialin Dan, Chengzhi Tao, Huiwen Liu, Kun Zhou, Bo Ren, Yuanqi Li, Yanwen Guo, Jie Guo

    Abstract: The emergence of neural and Gaussian-based radiance field methods has led to considerable advancements in novel view synthesis and 3D object reconstruction. Nonetheless, specular reflection and refraction continue to pose significant challenges due to the instability and incorrect overfitting of radiance fields to high-frequency light variations. Currently, even 3D Gaussian Splatting (3D-GS), as a…

    Submitted 1 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: accepted by SIGGRAPH 2025; https://letianhuang.github.io/transparentgs/

  4. arXiv:2502.07295  [pdf, other]

    cs.LG

    Treatment Effect Estimation for Exponential Family Outcomes using Neural Networks with Targeted Regularization

    Authors: Jiahong Li, Zeqin Yang, Jiayi Dan, Jixing Xu, Zhichao Zou, Peng Zhen, Jiecheng Guo

    Abstract: Neural Networks (NNs) have become a natural choice for treatment effect estimation due to their strong approximation capabilities. Nevertheless, how to design NN-based estimators with desirable properties, such as low bias and double robustness, remains a significant challenge. A common approach to address this is targeted regularization, which modifies the objective function of NNs. However…

    Submitted 11 February, 2025; originally announced February 2025.

  5. arXiv:2412.11399  [pdf]

    cs.LG eess.SP

    Quantifying Climate Change Impacts on Renewable Energy Generation: A Super-Resolution Recurrent Diffusion Model

    Authors: Xiaochong Dong, Jun Dan, Yingyun Sun, Yang Liu, Xuemin Zhang, Shengwei Mei

    Abstract: Driven by global climate change and the ongoing energy transition, the coupling between power supply capabilities and meteorological factors has become increasingly significant. Over the long term, accurately quantifying the power generation of renewable energy under the influence of climate change is essential for the development of sustainable power systems. However, due to interdisciplinary dif…

    Submitted 24 March, 2025; v1 submitted 15 December, 2024; originally announced December 2024.

  6. arXiv:2410.24066  [pdf, other]

    eess.AS eess.SP

    Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge

    Authors: Stefano Albini, Lara Orlandic, Jonathan Dan, Jérôme Thevenot, Tomas Teijeiro, Denisa Andreea Constantinescu, David Atienza

    Abstract: Continuous cough monitors can greatly aid doctors in home monitoring and treatment of respiratory diseases. Although many algorithms have been proposed, they still face limitations in data privacy and short-term monitoring. Edge-AI offers a promising solution by processing privacy-sensitive data near the source, but challenges arise in deploying resource-intensive algorithms on constrained devices…

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: 14 pages, 10 figures

  7. arXiv:2410.12312  [pdf, other]

    cs.CV cs.AI

    FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization

    Authors: Cheng Yu, Haoyu Xie, Lei Shang, Yang Liu, Jun Dan, Liefeng Bo, Baigui Sun

    Abstract: In the field of human-centric personalized image generation, the adapter-based method obtains the ability to customize and generate portraits by text-to-image training on facial data. This allows for identity-preserved personalization without additional fine-tuning in inference. Although there are improvements in efficiency and fidelity, there is often a significant performance decrease in test fo…

    Submitted 25 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 12 pages, 8 figures

  8. arXiv:2410.10587  [pdf, other]

    cs.CV cs.LG

    TopoFR: A Closer Look at Topology Alignment on Face Recognition

    Authors: Jun Dan, Yang Liu, Jiankang Deng, Haoyu Xie, Siyuan Li, Baigui Sun, Shan Luo

    Abstract: The field of face recognition (FR) has undergone significant advancements with the rise of deep learning. Recently, the success of unsupervised learning and graph neural networks has demonstrated the effectiveness of data structure information. Considering that the FR task can leverage large-scale training data, which intrinsically contains significant structure information, we aim to investigate…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024

  9. arXiv:2409.12771  [pdf, other]

    cs.CV cs.GR

    Spectral-GS: Taming 3D Gaussian Splatting with Spectral Entropy

    Authors: Letian Huang, Jie Guo, Jialin Dan, Ruoyu Fu, Shujie Wang, Yuanqi Li, Yanwen Guo

    Abstract: Recently, 3D Gaussian Splatting (3D-GS) has achieved impressive results in novel view synthesis, demonstrating high fidelity and efficiency. However, it easily exhibits needle-like artifacts, especially when increasing the sampling rate. Mip-Splatting tries to remove these artifacts with a 3D smoothing filter for frequency constraints and a 2D Mip filter for approximated supersampling. Unfortunate…

    Submitted 15 October, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  10. arXiv:2408.03223  [pdf, other]

    cs.LG

    Don't Think It Twice: Exploit Shift Invariance for Efficient Online Streaming Inference of CNNs

    Authors: Christodoulos Kechris, Jonathan Dan, Jose Miranda, David Atienza

    Abstract: Deep learning time-series processing often relies on convolutional neural networks with overlapping windows. This overlap allows the network to produce an output faster than the window length. However, it introduces additional computations. This work explores the potential to optimize computational efficiency during inference by exploiting convolution's shift-invariance properties to skip the calc…

    Submitted 6 August, 2024; originally announced August 2024.

  11. arXiv:2407.16556  [pdf, other]

    cs.LG eess.SP

    DC is all you need: describing ReLU from a signal processing standpoint

    Authors: Christodoulos Kechris, Jonathan Dan, Jose Miranda, David Atienza

    Abstract: Non-linear activation functions are crucial in Convolutional Neural Networks. However, until now they have not been well described in the frequency domain. In this work, we study the spectral behavior of ReLU, a popular activation function. We use the ReLU's Taylor expansion to derive its frequency domain behavior. We demonstrate that ReLU introduces higher frequency oscillations in the signal and…

    Submitted 11 May, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

  12. arXiv:2407.00737  [pdf, other]

    cs.CV

    LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation

    Authors: Mushui Liu, Yuhang Ma, Yang Zhen, Jun Dan, Yunlong Yu, Zeng Zhao, Zhipeng Hu, Bai Liu, Changjie Fan

    Abstract: Diffusion models have exhibited substantial success in text-to-image generation. However, they often encounter challenges when dealing with complex and dense prompts involving multiple objects, attribute binding, and long descriptions. In this paper, we propose a novel framework called LLM4GEN, which enhances the semantic understanding of text-to-image diffusion models by leveraging the r…

    Submitted 27 August, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 11 pages, 13 figures

  13. How to Count Coughs: An Event-Based Framework for Evaluating Automatic Cough Detection Algorithm Performance

    Authors: Lara Orlandic, Jonathan Dan, Jerome Thevenot, Tomas Teijeiro, Alain Sauty, David Atienza

    Abstract: Chronic cough disorders are widespread and challenging to assess because they rely on subjective patient questionnaires about cough frequency. Wearable devices running Machine Learning (ML) algorithms are promising for quantifying daily coughs, providing clinicians with objective metrics to track symptoms and evaluate treatments. However, there is a mismatch between state-of-the-art metrics for co…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.17519  [pdf, other]

    cond-mat.mtrl-sci

    Symmetry quantification and segmentation in STEM imaging through Zernike moments

    Authors: Jiadong Dan, Cheng Zhang, Xiaoxu Zhao, N. Duane Loh

    Abstract: We present a method using Zernike moments for quantifying rotational and reflectional symmetries in scanning transmission electron microscopy (STEM) images, aimed at improving structural analysis of materials at the atomic scale. This technique is effective against common imaging noises and is potentially suited for low-dose imaging and identifying quantum defects. We showcase its utility in the u…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 16 pages, 6 figures

  15. arXiv:2405.10530  [pdf, other]

    cs.CV

    CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation

    Authors: Mushui Liu, Jun Dan, Ziqian Lu, Yunlong Yu, Yingming Li, Xi Li

    Abstract: Due to the large-scale image size and object variations, current CNN-based and Transformer-based approaches for remote sensing image semantic segmentation are suboptimal for capturing long-range dependencies or are limited by high computational complexity. In this paper, we propose CM-UNet, comprising a CNN-based encoder for extracting local image features and a Mamba-based decoder for aggreg…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 5 pages, 6 figures

  16. KID-PPG: Knowledge Informed Deep Learning for Extracting Heart Rate from a Smartwatch

    Authors: Christodoulos Kechris, Jonathan Dan, Jose Miranda, David Atienza

    Abstract: Accurate extraction of heart rate from photoplethysmography (PPG) signals remains challenging due to motion artifacts and signal degradation. Although deep learning methods trained as a data-driven inference problem offer promising solutions, they often underutilize existing knowledge from the medical and signal processing community. In this paper, we address three shortcomings of deep learning mo…

    Submitted 9 October, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  17. arXiv:2403.01901  [pdf, other]

    cs.CV

    FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

    Authors: Chao Xu, Yang Liu, Jiazheng Xing, Weida Wang, Mingze Sun, Jun Dan, Tianxin Huang, Siyuan Li, Zhi-Qi Cheng, Ying Tai, Baigui Sun

    Abstract: In this paper, we abstract the process of people hearing speech, extracting meaningful cues, and creating various dynamically audio-consistent talking faces, termed Listening and Imagining, into the task of high-fidelity diverse talking faces generation from a single audio. Specifically, it involves two critical challenges: one is to effectively decouple identity, content, and emotion from entangl…

    Submitted 31 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  18. arXiv:2402.18117  [pdf, other]

    cs.CV cs.LG

    PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation

    Authors: Haoyu Xie, Changqi Wang, Jian Zhao, Yang Liu, Jun Dan, Chong Fu, Baigui Sun

    Abstract: Tremendous breakthroughs have been achieved in Semi-Supervised Semantic Segmentation (S4) through contrastive learning. However, due to limited annotations, the guidance on unlabeled images is generated by the model itself, which inevitably contains noise and disturbs the unsupervised training process. To address this issue, we propose a robust contrastive-based S4 framework, termed the Probabilist…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 19 pages, 11 figures

  19. arXiv:2402.13005  [pdf, other]

    eess.SP cs.LG

    SzCORE: A Seizure Community Open-source Research Evaluation framework for the validation of EEG-based automated seizure detection algorithms

    Authors: Jonathan Dan, Una Pale, Alireza Amirshahi, William Cappelletti, Thorir Mar Ingolfsson, Xiaying Wang, Andrea Cossettini, Adriano Bernini, Luca Benini, Sándor Beniczky, David Atienza, Philippe Ryvlin

    Abstract: The need for high-quality automated seizure detection algorithms based on electroencephalography (EEG) becomes ever more pressing with the increasing use of ambulatory and long-term EEG monitoring. Heterogeneity in validation methods of these algorithms influences the reported results and makes comprehensive evaluation and comparison challenging. This heterogeneity concerns in particular the choic…

    Submitted 8 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  20. arXiv:2311.16605  [pdf, other]

    cs.LG cs.AI

    LasTGL: An Industrial Framework for Large-Scale Temporal Graph Learning

    Authors: Jintang Li, Jiawang Dan, Ruofan Wu, Jing Zhou, Sheng Tian, Yunfei Liu, Baokun Wang, Changhua Meng, Weiqiang Wang, Yuchang Zhu, Liang Chen, Zibin Zheng

    Abstract: Over the past few years, graph neural networks (GNNs) have become powerful and practical tools for learning on (static) graph-structure data. However, many real-world applications, such as social networks and e-commerce, involve temporal graphs where nodes and edges are dynamically evolving. Temporal graph neural networks (TGNNs) have progressively emerged as an extension of GNNs to address time-e…

    Submitted 30 November, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Preprint; Work in progress

  21. arXiv:2310.11664  [pdf, other]

    cs.LG cs.AI

    Hetero$^2$Net: Heterophily-aware Representation Learning on Heterogenerous Graphs

    Authors: Jintang Li, Zheng Wei, Jiawang Dan, Jing Zhou, Yuchang Zhu, Ruofan Wu, Baokun Wang, Zhang Zhen, Changhua Meng, Hong Jin, Zibin Zheng, Liang Chen

    Abstract: Real-world graphs are typically complex, exhibiting heterogeneity in the global structure, as well as strong heterophily within local neighborhoods. While a growing body of literature has revealed the limitations of common graph neural networks (GNNs) in handling homogeneous graphs with heterophily, little work has been conducted on investigating the heterophily properties in the context of hetero…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint

  22. arXiv:2310.11281  [pdf, other]

    cs.LG

    Self-supervision meets kernel graph neural models: From architecture to augmentations

    Authors: Jiawang Dan, Ruofan Wu, Yunpeng Liu, Baokun Wang, Changhua Meng, Tengfei Liu, Tianyi Zhang, Ningtao Wang, Xing Fu, Qi Li, Weiqiang Wang

    Abstract: Graph representation learning has now become the de facto standard when handling graph-structured data, with the framework of message-passing graph neural networks (MPNN) being the most prevailing algorithmic tool. Despite its popularity, the family of MPNNs suffers from several drawbacks, such as limited transparency and expressivity. Recently, the idea of designing neural models on graphs using the theor…

    Submitted 17 October, 2023; originally announced October 2023.

  23. arXiv:2308.10133  [pdf, other]

    cs.CV cs.AI

    TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective

    Authors: Jun Dan, Yang Liu, Haoyu Xie, Jiankang Deng, Haoran Xie, Xuansong Xie, Baigui Sun

    Abstract: Vision Transformers (ViTs) have demonstrated powerful representation ability in various visual tasks thanks to their intrinsic data-hungry nature. However, we unexpectedly find that ViTs perform vulnerably when applied to face recognition (FR) scenarios with extremely large datasets. We investigate the reasons for this phenomenon and discover that the existing data augmentation approach and hard s…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  24. arXiv:2305.18325  [pdf]

    cond-mat.mtrl-sci

    A multiscale generative model to understand disorder in domain boundaries

    Authors: Jiadong Dan, Moaz Waqar, Ivan Erofeev, Kui Yao, John Wang, Stephen J. Pennycook, N. Duane Loh

    Abstract: A continuing challenge in atomic resolution microscopy is to identify significant structural motifs and their assembly rules in synthesized materials with limited observations. Here we propose and validate a simple and effective hybrid generative model capable of predicting unseen domain boundaries in a potassium sodium niobate thin film from only a small number of observations, without expensive…

    Submitted 22 May, 2023; originally announced May 2023.

  25. arXiv:2204.09803  [pdf, other]

    cs.LG cs.AI cs.CR

    GUARD: Graph Universal Adversarial Defense

    Authors: Jintang Li, Jie Liao, Ruofan Wu, Liang Chen, Zibin Zheng, Jiawang Dan, Changhua Meng, Weiqiang Wang

    Abstract: Graph convolutional networks (GCNs) have been shown to be vulnerable to small adversarial perturbations, which becomes a severe threat and largely limits their applications in security-critical scenarios. To mitigate such a threat, considerable research efforts have been devoted to increasing the robustness of GCNs against adversarial attacks. However, current defense approaches are typically desi…

    Submitted 12 August, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted by CIKM 2023. Code is publicly available at https://github.com/EdisonLeeeee/GUARD

  26. arXiv:2105.13667  [pdf, other]

    eess.SP

    Grouped Variable Selection for Generalized Eigenvalue Problems

    Authors: Jonathan Dan, Simon Geirnaert, Alexander Bertrand

    Abstract: Many problems require the selection of a subset of variables from a full set of optimization variables. The computational complexity of an exhaustive search over all possible subsets of variables is, however, prohibitively expensive, necessitating more efficient but potentially suboptimal search strategies. We focus on sparse variable selection for generalized Rayleigh quotient optimization and ge…

    Submitted 26 January, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Jonathan Dan and Simon Geirnaert contributed equally to this work

  27. arXiv:2005.11488  [pdf]

    cond-mat.mtrl-sci physics.data-an

    Learning Motifs and their Hierarchies in Atomic Resolution Microscopy

    Authors: Jiadong Dan, Xiaoxu Zhao, Shoucong Ning, Jiong Lu, Kian Ping Loh, N. Duane Loh, Stephen J. Pennycook

    Abstract: Progress in functional materials discovery has been accelerated by advances in high throughput materials synthesis and by the development of high-throughput computation. However, a complementary robust and high throughput structural characterization framework is still lacking. New methods and tools in the field of machine learning suggest that a highly automated high-throughput structural characte…

    Submitted 29 November, 2021; v1 submitted 23 May, 2020; originally announced May 2020.

  28. arXiv:1503.00800  [pdf]

    cs.IT

    IMAC: Impulsive-mitigation adaptive sparse channel estimation based on Gaussian-mixture model

    Authors: Tingping Zhang, Jingpei Dan, Guan Gui

    Abstract: Broadband frequency-selective fading channels are usually inherently sparse. By exploiting this sparsity, adaptive sparse channel estimation (ASCE) methods, e.g., reweighted L1-norm least mean square (RL1-LMS), can bring a performance gain if the additive noise satisfies the Gaussian assumption. In real communication environments, however, channel estimation performance is often deteriorated b…

    Submitted 2 March, 2015; originally announced March 2015.

    Comments: 12 pages, 10 figures, submitted for journal

  29. arXiv:1410.4035  [pdf, ps, other]

    physics.plasm-ph astro-ph.GA

    Conditions for supersonic bent Marshak waves

    Authors: Qiang Xu, Xiao-dong Ren, Jing Li, Jia-kun Dan, Kun-lun Wang, Shao-tong Zhou

    Abstract: The supersonic radiation diffusion approximation is a useful way to study radiation transport. Considering the bent Marshak wave theory in two dimensions, and a constant source temperature, we obtain the conditions for supersonic radiation diffusion: a Mach number $M>8(1+\sqrt{\epsilon})/3$ and an optical depth $\tau>1$. A large Mach number requires a high temperature, while a large opti…

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: 9 pages, 5 figures

  30. arXiv:1410.2567  [pdf, ps, other]

    physics.plasm-ph

    Significance of self magnetic field in long-distance collimation of laser-generated electron beams

    Authors: Shi Chen, Jiaofeng Huang, Yifei Niu, Jiakun Dan, Ziyu Chen, Jianfeng Li

    Abstract: Long-distance collimation of fast electron beams generated by laser-metallic-wire targets has been observed in recent experiments, while the mechanism behind this phenomenon remains unclear. In this work, we investigate in detail the laser-wire interaction processes with a simplified model and Classical Trajectory Monte Carlo simulations, and demonstrate the significance of the self magnetic field…

    Submitted 9 October, 2014; originally announced October 2014.

    Comments: 5 pages, 4 figures

  31. arXiv:1311.0074  [pdf, ps, other]

    physics.plasm-ph

    Magnetic Generation due to Mass Difference between Charge Carriers

    Authors: Shi Chen, JiaKun Dan, ZiYu Chen, JianFeng Li

    Abstract: The possibility of spontaneous magnetization due to the "asymmetry in mass" of charge carriers in a system is investigated. Analysis shows that when the masses of positive and negative charge carriers are identical, no magnetization is predicted. However, if the masses of the two species are different, a spontaneous magnetic field would appear, either due to the equipartition of magnetic energy or due t…

    Submitted 31 October, 2013; originally announced November 2013.