Skip to main content

Showing 51–100 of 141 results for author: Kwok, T

.
  1. arXiv:2305.10716  [pdf, other

    cs.LG cs.AI

    A Survey on Time-Series Pre-Trained Models

    Authors: Qianli Ma, Zhen Liu, Zhenjing Zheng, Ziyang Huang, Siying Zhu, Zhongzhong Yu, James T. Kwok

    Abstract: Time-Series Mining (TSM) is an important research area since it shows great potential in practical applications. Deep learning models that rely on massive labeled data have been utilized for TSM successfully. However, constructing a large-scale well-labeled dataset is difficult due to data annotation costs. Recently, pre-trained models have gradually attracted attention in the time series domain d… ▽ More

    Submitted 4 October, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted in the IEEE Transactions on Knowledge and Data Engineering (TKDE)

  2. arXiv:2303.18049  [pdf, other

    cs.CL

    No Place to Hide: Dual Deep Interaction Channel Network for Fake News Detection based on Data Augmentation

    Authors: Biwei Cao, Lulu Hua, Jiuxin Cao, Jie Gui, Bo Liu, James Tin-Yau Kwok

    Abstract: Online Social Network (OSN) has become a hotbed of fake news due to the low cost of information dissemination. Although the existing methods have made many attempts in news content and propagation structure, the detection of fake news is still facing two challenges: one is how to mine the unique key features and evolution patterns, and the other is how to tackle the problem of small samples to bui… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

  3. arXiv:2303.17255  [pdf, other

    cs.CV cs.CR eess.IV

    Fooling the Image Dehazing Models by First Order Gradient

    Authors: Jie Gui, Xiaofeng Cong, Chengwei Peng, Yuan Yan Tang, James Tin-Yau Kwok

    Abstract: The research on the single image dehazing task has been widely explored. However, as far as we know, no comprehensive study has been conducted on the robustness of the well-trained dehazing models. Therefore, there is no evidence that the dehazing networks can resist malicious attacks. In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robust… ▽ More

    Submitted 15 February, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: This paper is accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  4. arXiv:2303.02405  [pdf, other

    cs.LG cs.AI

    Decision Support System for Chronic Diseases Based on Drug-Drug Interactions

    Authors: Tian Bian, Yuli Jiang, Jia Li, Tingyang Xu, Yu Rong, Yi Su, Timothy Kwok, Helen Meng, Hong Cheng

    Abstract: Many patients with chronic diseases resort to multiple medications to relieve various symptoms, which raises concerns about the safety of multiple medication use, as severe drug-drug antagonism can lead to serious adverse effects or even death. This paper presents a Decision Support System, called DSSDDI, based on drug-drug interactions to support doctors prescribing decisions. DSSDDI contains thr… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Journal ref: ICDE2023

  5. arXiv:2301.05177  [pdf, other

    hep-ph hep-ex

    Searching for Heavy Neutral Leptons at A Future Muon Collider

    Authors: Tsz Hong Kwok, Lingfeng Li, Tao Liu, Ariel Rock

    Abstract: As the planning stages for a high energy muon collider enter a more concrete era, an important question arises as to what new physics could be uncovered. A TeV-scale muon collider is also a vector boson fusion (VBF) factory with a very clean background, and as such it is a promising environment to look for new physics that couples to the electroweak (EW) sector. In this paper, we explore the abili… ▽ More

    Submitted 19 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: 26 pages, 13 figures, 2 tables. v2: references updated, typos fixed

  6. Learning the Relation between Similarity Loss and Clustering Loss in Self-Supervised Learning

    Authors: Jidong Ge, Yuxiang Liu, Jie Gui, Lanting Fang, Ming Lin, James Tin-Yau Kwok, LiGuo Huang, Bin Luo

    Abstract: Self-supervised learning enables networks to learn discriminative features from massive data itself. Most state-of-the-art methods maximize the similarity between two augmentations of one image based on contrastive learning. By utilizing the consistency of two augmentations, the burden of manual annotations can be freed. Contrastive learning exploits instance-level information to learn robust feat… ▽ More

    Submitted 5 June, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

    Comments: This paper is accepted by IEEE Transactions on Image Processing

  7. arXiv:2301.01190  [pdf, other

    cond-mat.mtrl-sci

    Carbon in solution and the Charpy impact performance of medium Mn steels

    Authors: TWK Kwok, FF Worsnop, JO Douglas, D Dye

    Abstract: Carbon is a well known austenite stabiliser and can be used to alter the stacking fault energy and stability against martensitic transformation in medium Mn steels, producing a range of deformation mechanisms such as the Transformation Induced Plasticity (TRIP) or combined Twinning and Transformation Induced Plasticity (TWIP $+$ TRIP) effects. However, the effect of C beyond quasi-static tensile b… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  8. arXiv:2212.02433  [pdf, other

    hep-ph hep-ex

    Testing Lepton Flavor Universality at Future $Z$ Factories

    Authors: Tin Seng Manfred Ho, Xu-Hui Jiang, Tsz Hong Kwok, Lingfeng Li, Tao Liu

    Abstract: As one of the hypothetical principles in the Standard Model (SM), lepton flavor universality (LFU) should be tested with a precision as high as possible such that the physics violating this principle can be fully examined. The run of $Z$ factory at a future $e^+e^-$ collider such as CEPC or FCC-$ee$ provides a great opportunity to perform this task because of the large statistics and high reconstr… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: 63 pages, 27 figures

  9. arXiv:2211.15362  [pdf, other

    cs.CV cs.LG

    Exploring the Coordination of Frequency and Attention in Masked Image Modeling

    Authors: Jie Gui, Tuo Chen, Minjing Dong, Zhengqi Liu, Hao Luo, James Tin-Yau Kwok, Yuan Yan Tang

    Abstract: Recently, masked image modeling (MIM), which learns visual representations by reconstructing the masked patches of an image, has dominated self-supervised learning in computer vision. However, the pre-training of MIM always takes massive time due to the large-scale data and large-size backbones. We mainly attribute it to the random patch masking in previous MIM works, which fails to leverage the c… ▽ More

    Submitted 28 September, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  10. AlignVE: Visual Entailment Recognition Based on Alignment Relations

    Authors: Biwei Cao, Jiuxin Cao, Jie Gui, Jiayun Shen, Bo Liu, Lei He, Yuan Yan Tang, James Tin-Yau Kwok

    Abstract: Visual entailment (VE) is to recognize whether the semantics of a hypothesis text can be inferred from the given premise image, which is one special task among recent emerged vision and language understanding tasks. Currently, most of the existing VE approaches are derived from the methods of visual question answering. They recognize visual entailment by quantifying the similarity between the hypo… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: This paper is accepted for publication as a REGULAR paper in the IEEE Transactions on Multimedia

  11. Automated Dominative Subspace Mining for Efficient Neural Architecture Search

    Authors: Yaofo Chen, Yong Guo, Daihai Liao, Fanbing Lv, Hengjie Song, James Tin-Yau Kwok, Mingkui Tan

    Abstract: Neural Architecture Search (NAS) aims to automatically find effective architectures within a predefined search space. However, the search space is often extremely large. As a result, directly searching in such a large search space is non-trivial and also very time-consuming. To address the above issues, in each search step, we seek to limit the search space to a small but effective subspace to boo… ▽ More

    Submitted 6 June, 2024; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Published in IEEE TCSVT

  12. arXiv:2209.15522  [pdf, other

    cond-mat.mtrl-sci

    The mechanism of twin thickening and the elastic strain state of TWIP steel nanotwins

    Authors: T W J Kwok, T P McAuliffe, A K Ackerman, B H Savitzky, M Danaie, C Ophus, D Dye

    Abstract: A Twinning Induced Plasticity (TWIP) steel with a nominal composition of Fe-16.4Mn-0.9C-0.5Si-0.05Nb-0.05V was deformed to an engineering strain of 6\%. The strain around the deformation twins were mapped using the 4D-STEM technique. Strain mapping showed a large average elastic strain of approximately 6\% in the directions parallel and perpendicular to the twinning direction. However, the large a… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  13. arXiv:2209.13139  [pdf, other

    cs.CV cs.AI

    Searching a High-Performance Feature Extractor for Text Recognition Network

    Authors: Hui Zhang, Quanming Yao, James T. Kwok, Xiang Bai

    Abstract: Feature extractor plays a critical role in text recognition (TR), but customizing its architecture is relatively less explored due to expensive manual tweaking. In this work, inspired by the success of neural architecture search (NAS), we propose to search for suitable feature extractors. We design a domain-specific search space by exploring principles for having good feature extractors. The space… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  14. arXiv:2207.14443  [pdf, other

    cs.LG

    A Survey of Learning on Small Data: Generalization, Optimization, and Challenge

    Authors: Xiaofeng Cao, Weixin Bu, Shengjun Huang, Minling Zhang, Ivor W. Tsang, Yew Soon Ong, James T. Kwok

    Abstract: Learning on big data brings success for artificial intelligence (AI), but the annotation and training costs are expensive. In future, learning on small data that approximates the generalization ability of big data is one of the ultimate purposes of AI, which requires machines to recognize objectives and scenarios relying on small data as humans. A series of learning topics is going on this way suc… ▽ More

    Submitted 6 June, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

  15. arXiv:2206.15205  [pdf, other

    cs.LG

    Black-box Generalization of Machine Teaching

    Authors: Xiaofeng Cao, Yaming Guo, Ivor W. Tsang, James T. Kwok

    Abstract: Hypothesis-pruning maximizes the hypothesis updates for active learning to find those desired unlabeled data. An inherent assumption is that this learning manner can derive those updates into the optimal hypothesis. However, its convergence may not be guaranteed well if those incremental updates are negative and disordered. In this paper, we introduce a black-box teaching hypothesis… ▽ More

    Submitted 20 September, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

  16. arXiv:2204.05388  [pdf, other

    cond-mat.mtrl-sci

    The relative contributions of TWIP and TRIP to strength in fine grained medium-Mn steels

    Authors: T W J Kwok, P Gong, R Rose, D Dye

    Abstract: A medium Mn steel of composition Fe-4.8Mn-2.8Al-1.5Si-0.51C (wt.\%) was processed to obtain two different microstructures representing two different approaches in the hot rolling mill, resulting in equiaxed vs. a mixed equiaxed and lamellar microstructures. Both were found to exhibit a simultaneous TWIP$+$TRIP plasticity enhancing mechanism where deformation twins and $α'$-martensite formed indepe… ▽ More

    Submitted 19 August, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Edited after review, second round

  17. arXiv:2203.06168  [pdf, other

    cs.DS cs.DM math.CO math.PR

    Cheeger Inequalities for Vertex Expansion and Reweighted Eigenvalues

    Authors: Tsz Chiu Kwok, Lap Chi Lau, Kam Chuen Tung

    Abstract: The classical Cheeger's inequality relates the edge conductance $φ$ of a graph and the second smallest eigenvalue $λ_2$ of the Laplacian matrix. Recently, Olesker-Taylor and Zanetti discovered a Cheeger-type inequality $ψ^2 / \log |V| \lesssim λ_2^* \lesssim ψ$ connecting the vertex expansion $ψ$ of a graph $G=(V,E)$ and the maximum reweighted second smallest eigenvalue $λ_2^*$ of the Laplacian ma… ▽ More

    Submitted 19 September, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: 65 pages, 1 figure. Minor changes

  18. arXiv:2202.08625  [pdf, other

    cs.LG

    Revisiting Over-smoothing in BERT from the Perspective of Graph

    Authors: Han Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok

    Abstract: Recently over-smoothing phenomenon of Transformer-based models is observed in both vision and language fields. However, no existing work has delved deeper to further investigate the main cause of this phenomenon. In this work, we make the attempt to analyze the over-smoothing problem from the perspective of graph, where such problem was first discovered and explored. Intuitively, the self-attentio… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted by ICLR 2022 (Spotlight)

  19. arXiv:2112.12115  [pdf, other

    cond-mat.mtrl-sci

    Tailoring the Deformation Behaviour of a Medium Mn Steel through Isothermal Intercritical Annealing

    Authors: X. Xu, T. W. J. Kwok, P. Gong, D. Dye

    Abstract: A novel concept of varying the strain hardening rate of a medium Mn steel with 8 wt\% Mn by varying the duration of the intercritical anneal after hot rolling was explored. It was found that the stability of the austenite phase showed an inverse square root relationship with intercritical annealing duration and that the maximum strain hardening rate showed a linear relationship with austenite stab… ▽ More

    Submitted 4 April, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: Updated in response to reviewer comments

  20. arXiv:2112.01172  [pdf, other

    cond-mat.mtrl-sci

    Strengthening $κ$-carbide steels using residual dislocation content

    Authors: T. W. J. Kwok, K. M. Rahman, V. A. Vorontsov, D. Dye

    Abstract: A steel with nominal composition Fe-28Mn-8Al-1.0C in mass percent was hot rolled at two temperatures, 1100 \degree C and 850 \degree C and subsequently aged at 550 \degree C for 24 h. The lower temperature rolling resulted in a yield strength increment of 299 MPa while still retaining an elongation to failure of over 30\%. The large improvement in strength was attributed to an increase in residual… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

  21. A scale up study on chemical segregation and the effects on tensile properties in two medium Mn steel castings

    Authors: T. W. J. Kwok, C. Slater, X. Xu, C. Davis, D. Dye

    Abstract: Two ingots weighing 400 g and 5 kg with nominal compositions of Fe-8Mn-4Al-2Si-0.5C-0.07V-0.05Sn were produced to investigate the effect of processing variables on microstructure development. The larger casting has a cooling rate more representative of commercial production and provides an understanding of the potential challenges arising from casting-related segregation during efforts to scale up… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  22. arXiv:2109.08342  [pdf, other

    cs.LG

    Dropout's Dream Land: Generalization from Learned Simulators to Reality

    Authors: Zac Wellmer, James T. Kwok

    Abstract: A World Model is a generative model used to simulate an environment. World Models have proven capable of learning spatial and temporal representations of Reinforcement Learning environments. In some cases, a World Model offers an agent the opportunity to learn entirely inside of its own dream environment. In this work we explore improving the generalization capabilities from dream environments to… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: Published at ECML PKDD 2021

  23. arXiv:2107.00184  [pdf, other

    cs.AI

    Bilinear Scoring Function Search for Knowledge Graph Learning

    Authors: Yongqi Zhang, Quanming Yao, James Tin-Yau Kwok

    Abstract: Learning embeddings for entities and relations in knowledge graph (KG) have benefited many downstream tasks. In recent years, scoring functions, the crux of KG learning, have been human-designed to measure the plausibility of triples and capture different kinds of relations in KGs. However, as relations exhibit intricate patterns that are hard to infer before training, none of them consistently pe… ▽ More

    Submitted 4 March, 2022; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: TPAMI accepted

  24. arXiv:2106.06996  [pdf, other

    eess.IV cs.CV

    Pyramidal Dense Attention Networks for Lightweight Image Super-Resolution

    Authors: Huapeng Wu, Jie Gui, Jun Zhang, James T. Kwok, Zhihui Wei

    Abstract: Recently, deep convolutional neural network methods have achieved an excellent performance in image superresolution (SR), but they can not be easily applied to embedded devices due to large memory cost. To solve this problem, we propose a pyramidal dense attention network (PDAN) for lightweight image super-resolution in this paper. In our method, the proposed pyramidal dense learning can gradually… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  25. arXiv:2106.06966  [pdf, other

    eess.IV cs.CV

    Feedback Pyramid Attention Networks for Single Image Super-Resolution

    Authors: Huapeng Wu, Jie Gui, Jun Zhang, James T. Kwok, Zhihui Wei

    Abstract: Recently, convolutional neural network (CNN) based image super-resolution (SR) methods have achieved significant performance improvement. However, most CNN-based methods mainly focus on feed-forward architecture design and neglect to explore the feedback mechanism, which usually exists in the human visual system. In this paper, we propose feedback pyramid attention networks (FPAN) to fully exploit… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  26. arXiv:2106.06326  [pdf, other

    cs.LG

    TOHAN: A One-step Approach towards Few-shot Hypothesis Adaptation

    Authors: Haoang Chi, Feng Liu, Wenjing Yang, Long Lan, Tongliang Liu, Bo Han, William K. Cheung, James T. Kwok

    Abstract: In few-shot domain adaptation (FDA), classifiers for the target domain are trained with accessible labeled data in the source domain (SD) and few labeled data in the target domain (TD). However, data usually contain private information in the current era, e.g., data distributed on personal phones. Thus, the private information will be leaked if we directly access data in SD to train a target-domai… ▽ More

    Submitted 7 September, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  27. Microstructure evolution and tensile behaviour of a cold rolled 8 wt\% Mn medium manganese steel

    Authors: Thomas WJ Kwok, Peng Gong, Xin Xu, John Nutter, W Mark Rainforth, David Dye

    Abstract: A novel medium manganese steel named Novalloy with composition Fe-8.3Mn-3.8Al-1.8Si-0.5C-0.06V-0.05Sn was developed and thermomechanically processed through hot rolling and intercritical annealing. The steel possessed a yield strength of 1 GPa, tensile strength of 1.13 GPa and ductility of 41\%. In order to study the effect of cold rolling after intercritical annealing on subsequent tensile proper… ▽ More

    Submitted 29 July, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: Updated in response to reviewer comments

    Journal ref: Metall Mater Trans A, 2022

  28. arXiv:2102.12871  [pdf, other

    cs.LG

    SparseBERT: Rethinking the Importance Analysis in Self-attention

    Authors: Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu, Xiaodan Liang, Zhenguo Li, James T. Kwok

    Abstract: Transformer-based models are popularly used in natural language processing (NLP). Its core component, self-attention, has aroused widespread interest. To understand the self-attention mechanism, a direct method is to visualize the attention map of a pre-trained model. Based on the patterns observed, a series of efficient Transformers with different sparse attention masks have been proposed. From a… ▽ More

    Submitted 1 July, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Accepted by ICML 2021

  29. arXiv:2011.04406  [pdf, other

    cs.LG

    A Survey of Label-noise Representation Learning: Past, Present and Future

    Authors: Bo Han, Quanming Yao, Tongliang Liu, Gang Niu, Ivor W. Tsang, James T. Kwok, Masashi Sugiyama

    Abstract: Classical machine learning implicitly assumes that labels of the training data are sampled from a clean distribution, which can be too restrictive for real-world scenarios. However, statistical-learning-based methods may not train deep learning models robustly with these noisy labels. Therefore, it is urgent to design Label-Noise Representation Learning (LNRL) methods for robustly training deep mo… ▽ More

    Submitted 20 February, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: The draft is kept updating; any comments and suggestions are welcome

  30. arXiv:2008.06542  [pdf, other

    cs.LG stat.ML

    A Scalable, Adaptive and Sound Nonconvex Regularizer for Low-rank Matrix Completion

    Authors: Yaqing Wang, Quanming Yao, James T. Kwok

    Abstract: Matrix learning is at the core of many machine learning problems. A number of real-world applications such as collaborative filtering and text mining can be formulated as a low-rank matrix completion problem, which recovers incomplete matrix using low-rank assumptions. To ensure that the matrix solution has a low rank, a recent trend is to use nonconvex regularizers that adaptively penalize sing… ▽ More

    Submitted 22 February, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: WebConf 2021

  31. arXiv:2006.09117  [pdf, other

    eess.IV cs.CV cs.RO

    End-to-End Real-time Catheter Segmentation with Optical Flow-Guided Warping during Endovascular Intervention

    Authors: Anh Nguyen, Dennis Kundrat, Giulio Dagnino, Wenqiang Chi, Mohamed E. M. K. Abdelaziz, Yao Guo, YingLiang Ma, Trevor M. Y. Kwok, Celia Riga, Guang-Zhong Yang

    Abstract: Accurate real-time catheter segmentation is an important pre-requisite for robot-assisted endovascular intervention. Most of the existing learning-based methods for catheter segmentation and tracking are only trained on small-scale datasets or synthetic data due to the difficulties of ground-truth annotation. Furthermore, the temporal continuity in intraoperative imaging sequences is not fully uti… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: ICRA 2020

  32. arXiv:2004.03982  [pdf

    cond-mat.mtrl-sci

    4D-STEM elastic stress state characterisation of a TWIP steel nanotwin

    Authors: T P McAuliffe, A K Ackerman, B H Savitzky, T W J Kwok, M Danaie, C Ophus, D Dye

    Abstract: We measure the stress state in and around a deformation nanotwin in a twinning-induced plasticity (TWIP) steel. Using four-dimensional scanning transmission electron microscopy (4D-STEM), we measure the elastic strain field in a 68.2-by-83.1 nm area of interest with a scan step of 0.36 nm and a diffraction limit resolution of 0.73 nm. The stress field in and surrounding the twin matches the form e… ▽ More

    Submitted 20 October, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: After peer review comments and resubmission

  33. arXiv:1911.11322  [pdf, other

    cs.LG stat.ML

    Effective Decoding in Graph Auto-Encoder using Triadic Closure

    Authors: Han Shi, Haozheng Fan, James T. Kwok

    Abstract: The (variational) graph auto-encoder and its variants have been popularly used for representation learning on graph-structured data. While the encoder is often a powerful graph convolutional network, the decoder reconstructs the graph structure by only considering two nodes at a time, thus ignoring possible interactions among edges. On the other hand, structured prediction, which considers the who… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  34. arXiv:1911.09336  [pdf, other

    cs.LG stat.ML

    Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS

    Authors: Han Shi, Renjie Pi, Hang Xu, Zhenguo Li, James T. Kwok, Tong Zhang

    Abstract: Neural Architecture Search (NAS) has shown great potentials in finding better neural network designs. Sample-based NAS is the most reliable approach which aims at exploring the search space and evaluating the most promising architectures. However, it is computationally very costly. As a remedy, the one-shot approach has emerged as a popular technique for accelerating NAS using weight-sharing. Howe… ▽ More

    Submitted 24 November, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Accepted by NeurIPS 2020

  35. Design of a High Strength, High Ductility 12 wt% Mn Medium Manganese Steel With Hierarchical Deformation Behaviour

    Authors: T W J Kwok, K M Rahman, X Xu, I Bantounas, J F Kelleher, S Daswari, T Alam, R Banerjee, D Dye

    Abstract: A novel medium Mn steel of composition Fe-12Mn-4.8Al-2Si-0.32C-0.3V was manufactured with 1.09 GPa yield strength, 1.26 GPa tensile strength and 54% elongation. The thermomechanical process route was designed to be industrially translatable and consists of hot and then warm rolling before a 30 min intercritical anneal. The resulting microstructure comprised of coarse elongated austenite grains in… ▽ More

    Submitted 21 April, 2020; v1 submitted 20 August, 2019; originally announced August 2019.

    Comments: Updated on resubmission, minor clarifications

    Journal ref: Mater. Sci. Eng. A 782:139258, 2020

  36. arXiv:1905.10936  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Communication-Efficient Distributed Blockwise Momentum SGD with Error-Feedback

    Authors: Shuai Zheng, Ziyue Huang, James T. Kwok

    Abstract: Communication overhead is a major bottleneck hampering the scalability of distributed machine learning systems. Recently, there has been a surge of interest in using gradient compression to improve the communication efficiency of distributed neural network training. Using 1-bit quantization, signSGD with majority vote achieves a 32x reduction on communication cost. However, its convergence is base… ▽ More

    Submitted 28 October, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019

  37. arXiv:1905.09899  [pdf, other

    cs.LG math.OC stat.ML

    Blockwise Adaptivity: Faster Training and Better Generalization in Deep Learning

    Authors: Shuai Zheng, James T. Kwok

    Abstract: Stochastic methods with coordinate-wise adaptive stepsize (such as RMSprop and Adam) have been widely used in training deep neural networks. Despite their fast convergence, they can generalize worse than stochastic gradient descent. In this paper, by revisiting the design of Adagrad, we propose to split the network parameters into blocks, and use a blockwise adaptive stepsize. Intuitively, blockwi… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  38. arXiv:1904.03213  [pdf, ps, other

    cs.DS math.FA math.OC quant-ph

    Spectral analysis of matrix scaling and operator scaling

    Authors: Tsz Chiu Kwok, Lap Chi Lau, Akshay Ramachandran

    Abstract: We present a spectral analysis for matrix scaling and operator scaling. We prove that if the input matrix or operator has a spectral gap, then a natural gradient flow has linear convergence. This implies that a simple gradient descent algorithm also has linear convergence under the same assumption. The spectral gap condition for operator scaling is closely related to the notion of quantum expander… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

  39. General Convolutional Sparse Coding with Unknown Noise

    Authors: Yaqing Wang, James T. Kwok, Lionel M. Ni

    Abstract: Convolutional sparse coding (CSC) can learn representative shift-invariant patterns from multiple kinds of data. However, existing CSC methods can only model noises from Gaussian distribution, which is restrictive and unrealistic. In this paper, we propose a general CSC model capable of dealing with complicated unknown noise. The noise is now modeled by Gaussian mixture model, which can approximat… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

  40. arXiv:1811.09491  [pdf, other

    cs.LG cs.AI stat.ML

    Differential Private Stack Generalization with an Application to Diabetes Prediction

    Authors: Quanming Yao, Xiawei Guo, James T. Kwok, WeiWei Tu, Yuqiang Chen, Wenyuan Dai, Qiang Yang

    Abstract: To meet the standard of differential privacy, noise is usually added into the original data, which inevitably deteriorates the predicting performance of subsequent learning algorithms. In this paper, motivated by the success of improving predicting performance by ensemble learning, we propose to enhance privacy-preserving logistic regression by stacking. We show that this can be done either by sam… ▽ More

    Submitted 2 June, 2019; v1 submitted 23 November, 2018; originally announced November 2018.

  41. arXiv:1807.08725  [pdf, other

    cs.LG stat.ML

    FasTer: Fast Tensor Completion with Nonconvex Regularization

    Authors: Quanming Yao, James T Kwok, Bo Han

    Abstract: Low-rank tensor completion problem aims to recover a tensor from limited observations, which has many real-world applications. Due to the easy optimization, the convex overlapping nuclear norm has been popularly used for tensor completion. However, it over-penalizes top singular values and lead to biased estimations. In this paper, we propose to use the nonconvex regularizer, which can less penali… ▽ More

    Submitted 23 January, 2019; v1 submitted 23 July, 2018; originally announced July 2018.

  42. arXiv:1806.02927  [pdf, other

    cs.LG math.OC stat.ML

    Lightweight Stochastic Optimization for Minimizing Finite Sums with Infinite Data

    Authors: Shuai Zheng, James T. Kwok

    Abstract: Variance reduction has been commonly used in stochastic optimization. It relies crucially on the assumption that the data set is finite. However, when the data are imputed with random noise as in data augmentation, the perturbed data set be- comes essentially infinite. Recently, the stochastic MISO (S-MISO) algorithm is introduced to address this expected risk minimization problem. Though it conve… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: To appear in ICML 2018

  43. arXiv:1805.01891  [pdf, other

    cs.LG stat.ML

    Power Law in Sparsified Deep Neural Networks

    Authors: Lu Hou, James T. Kwok

    Abstract: The power law has been observed in the degree distributions of many biological neural networks. Sparse deep neural networks, which learn an economical representation from the data, resemble biological neural networks in many ways. In this paper, we study if these artificial networks also exhibit properties of the power law. Experimental results on two popular deep learning models, namely, multilay… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  44. arXiv:1804.10366  [pdf, other

    cs.CV

    Online Convolutional Sparse Coding with Sample-Dependent Dictionary

    Authors: Yaqing Wang, Quanming Yao, James T. Kwok, Lionel M. Ni

    Abstract: Convolutional sparse coding (CSC) has been popularly used for the learning of shift-invariant dictionaries in image and signal processing. However, existing methods have limited scalability. In this paper, instead of convolving with a dictionary shared by all samples, we propose the use of a sample-dependent dictionary in which filters are obtained as linear combinations of a small set of base fil… ▽ More

    Submitted 7 June, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

    Comments: Accepted by ICML-2018

  45. arXiv:1802.08635  [pdf, other

    cs.LG

    Loss-aware Weight Quantization of Deep Networks

    Authors: Lu Hou, James T. Kwok

    Abstract: The huge size of deep networks hinders their use in small computing devices. In this paper, we consider compressing the network by weight quantization. We extend a recently proposed loss-aware weight binarization scheme to ternarization, with possibly different scaling parameters for the positive and negative weights, and m-bit (where m > 2) quantization. Experiments on feedforward and recurrent n… ▽ More

    Submitted 10 May, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

  46. arXiv:1712.08451  [pdf

    physics.pop-ph physics.ed-ph

    Testing the validity of the Lorentz factor

    Authors: H Broomfield, J Hirst, Theodoros Vafeiadis, Markus Joos, A Singh, M Raven, T K Chung, J Harrow, T Kwok, J Li, K Tsui, A Tsui, R Perkins, H Mandelstam, D Khoo, J Southwell, J Martin-Halls, D Townsend, H Watson

    Abstract: Our proposed experiment aimed to test the validity of the Lorentz factor with two methods: The time of flight (TOF) of various particles at different momenta and the decay rate of pions at different momenta. Due to the high sensitivity required for the second method the results were inconclusive, therefore we report only on the results of the first method.

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: The authors of the paper are winners of the CERN Beamline for Schools <https://voisins.cern/en/offre/bl4s> 2016 competition. To be submitted to the journal Physics Education

  47. arXiv:1710.07205  [pdf, other

    math.NA math.OC

    Scalable Robust Matrix Factorization with Nonconvex Loss

    Authors: Quanming Yao, James T. Kwok

    Abstract: Robust matrix factorization (RMF), which uses the $\ell_1$-loss, often outperforms standard matrix factorization using the $\ell_2$-loss, particularly when outliers are present. The state-of-the-art RMF solver is the RMF-MM algorithm, which, however, cannot utilize data sparsity. Moreover, sometimes even the (convex) $\ell_1$-loss is not robust enough. In this paper, we propose the use of nonconve… ▽ More

    Submitted 23 September, 2018; v1 submitted 19 October, 2017; originally announced October 2017.

  48. arXiv:1710.02587  [pdf, ps, other

    cs.DS math.FA math.OA math.OC quant-ph

    The Paulsen Problem, Continuous Operator Scaling, and Smoothed Analysis

    Authors: Tsz Chiu Kwok, Lap Chi Lau, Yin Tat Lee, Akshay Ramachandran

    Abstract: The Paulsen problem is a basic open problem in operator theory: Given vectors $u_1, \ldots, u_n \in \mathbb R^d$ that are $ε$-nearly satisfying the Parseval's condition and the equal norm condition, is it close to a set of vectors $v_1, \ldots, v_n \in \mathbb R^d$ that exactly satisfy the Parseval's condition and the equal norm condition? Given $u_1, \ldots, u_n$, the squared distance (to the set… ▽ More

    Submitted 8 November, 2017; v1 submitted 6 October, 2017; originally announced October 2017.

    Comments: Added Subsection 1.4; Incorporated comments and fixed typos; Minor changes in various places

  49. arXiv:1708.01265  [pdf, other

    physics.ins-det hep-ex

    Seasonal Variation of the Underground Cosmic Muon Flux Observed at Daya Bay

    Authors: F. P. An, A. B. Balantekin, H. R. Band, M. Bishai, S. Blyth, D. Cao, G. F. Cao, J. Cao, Y. L. Chan, J. F. Chang, Y. Chang, H. S. Chen, Q. Y. Chen, S. M. Chen, Y. X. Chen, Y. Chen, J. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, A. Chukanov, J. P. Cummings, Y. Y. Ding, M. V. Diwan, M. Dolgareva , et al. (179 additional authors not shown)

    Abstract: The Daya Bay Experiment consists of eight identically designed detectors located in three underground experimental halls named as EH1, EH2, EH3, with 250, 265 and 860 meters of water equivalent vertical overburden, respectively. Cosmic muon events have been recorded over a two-year period. The underground muon rate is observed to be positively correlated with the effective atmospheric temperature… ▽ More

    Submitted 8 January, 2018; v1 submitted 3 August, 2017; originally announced August 2017.

    Comments: Updated to be identical to the published version

    Journal ref: JCAP01(2018)001

  50. arXiv:1708.00146  [pdf, other

    cs.LG cs.AI stat.ML

    Large-Scale Low-Rank Matrix Learning with Nonconvex Regularizers

    Authors: Quanming Yao, James T. Kwok, Taifeng Wang, Tie-Yan Liu

    Abstract: Low-rank modeling has many important applications in computer vision and machine learning. While the matrix rank is often approximated by the convex nuclear norm, the use of nonconvex low-rank regularizers has demonstrated better empirical performance. However, the resulting optimization problem is much more challenging. Recent state-of-the-art requires an expensive full SVD in each iteration. In… ▽ More

    Submitted 23 July, 2018; v1 submitted 31 July, 2017; originally announced August 2017.

    Comments: Accepted by TPAMI in 2018 (extension of ICDM-2015 conference paper arXiv:1512.00984)