Showing 1–36 of 36 results for author: Chang, K

Searching in archive stat.
  1. arXiv:2505.14999  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision

    Authors: Eric Hanchen Jiang, Haozheng Luo, Shengyuan Pang, Xiaomin Li, Zhenting Qi, Hengli Li, Cheng-Fu Yang, Zongyu Lin, Xinfeng Li, Hao Xu, Kai-Wei Chang, Ying Nian Wu

    Abstract: Mathematical reasoning presents a significant challenge for Large Language Models (LLMs), often requiring robust multi step logical consistency. While Chain of Thought (CoT) prompting elicits reasoning steps, it doesn't guarantee correctness, and improving reliability via extensive sampling is computationally costly. This paper introduces the Energy Outcome Reward Model (EORM), an effective, light…

    Submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2410.13111  [pdf, ps, other]

    cs.LG cs.CL stat.ML

    Controllable Generation via Locally Constrained Resampling

    Authors: Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

    Abstract: Autoregressive models have demonstrated an unprecedented ability at modeling the intricacies of natural language. However, they continue to struggle with generating complex outputs that adhere to logical constraints. Sampling from a fully-independent distribution subject to a constraint is hard. Sampling from an autoregressive distribution subject to a constraint is doubly hard: We have to contend…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.03905

  3. arXiv:2304.14236  [pdf, other]

    stat.AP stat.ME

    Guidance note on best statistical practices for TOAR analyses

    Authors: Kai-Lan Chang, Martin G. Schultz, Gerbrand Koren, Niklas Selke

    Abstract: The aim of this guidance note is to provide recommendations on best statistical practices and to ensure consistent communication of statistical analysis and associated uncertainty across TOAR publications. The scope includes approaches for reporting trends, a discussion of strengths and weaknesses of commonly used techniques, and calibrated language for the communication of uncertainty. The focus…

    Submitted 27 April, 2023; originally announced April 2023.

  4. arXiv:2210.03826  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation

    Authors: Jwala Dhamala, Varun Kumar, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

    Abstract: Several prior works have shown that language models (LMs) can generate text containing harmful social biases and stereotypes. While decoding algorithms play a central role in determining properties of LM generated text, their impact on the fairness of the generations has not been studied. We present a systematic analysis of the impact of decoding algorithms on LM fairness, and analyze the trade-of…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted at IEEE SLT 2022

  5. arXiv:2209.00076  [pdf, other]

    cs.CY cs.SI stat.AP

    Connecticut Redistricting Analysis

    Authors: Kyle Evans, Katherine T. Chang

    Abstract: Connecticut passed their new state House of Representatives district plan on November 18, 2021 and passed their new state Senate district plan on November 23, 2021. Each passed unanimously in their 9-person bipartisan Reapportionment Commission; however, the process has been criticized for legislators controlling the process and for the negotiations that serve to protect incumbents. This paper inv…

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 13 pages, 3 tables

  6. arXiv:2207.05461  [pdf, other]

    eess.SP stat.ML

    Parallel APSM for Fast and Adaptive Digital SIC in Full-Duplex Transceivers with Nonlinearity

    Authors: M. Hossein Attar, Omid Taghizadeh, Kaxin Chang, Ramez Askar, Matthias Mehlhose, Slawomir Stanczak

    Abstract: This paper presents a kernel-based adaptive filter that is applied for the digital domain self-interference cancellation (SIC) in a transceiver operating in full-duplex (FD) mode. In FD, the benefit of simultaneous transmission and receiving of signals comes at the price of strong self-interference (SI). In this work, we are primarily interested in suppressing the SI using an adaptive filter namel…

    Submitted 12 July, 2022; originally announced July 2022.

  7. arXiv:2201.11250  [pdf, other]

    cs.LG cs.AI cs.LO stat.ML

    Neuro-Symbolic Entropy Regularization

    Authors: Kareem Ahmed, Eric Wang, Kai-Wei Chang, Guy Van den Broeck

    Abstract: In structured prediction, the goal is to jointly predict many output variables that together encode a structured object -- a path in a graph, an entity-relation triple, or an ordering of objects. Such a large output space makes learning hard and requires vast amounts of labeled data. Different approaches leverage alternate sources of supervision. One approach -- entropy regularization -- posits th…

    Submitted 25 January, 2022; originally announced January 2022.

  8. arXiv:2011.14211  [pdf, other]

    cs.LG cs.CV stat.ML

    Curvature Regularization to Prevent Distortion in Graph Embedding

    Authors: Hongbin Pei, Bingzhe Wei, Kevin Chen-Chuan Chang, Chunxu Zhang, Bo Yang

    Abstract: Recent research on graph embedding has achieved success in various applications. Most graph embedding methods preserve the proximity in a graph into a manifold in an embedding space. We argue an important but neglected problem about this proximity-preserving strategy: Graph topology patterns, while preserved well into an embedding manifold by preserving proximity, may distort in the ambient embedd…

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: Published as a conference paper at NeurIPS 2020

  9. arXiv:2006.15437  [pdf, other]

    cs.LG cs.SI stat.ML

    GPT-GNN: Generative Pre-Training of Graph Neural Networks

    Authors: Ziniu Hu, Yuxiao Dong, Kuansan Wang, Kai-Wei Chang, Yizhou Sun

    Abstract: Graph neural networks (GNNs) have been demonstrated to be powerful in modeling graph-structured data. However, training GNNs usually requires abundant task-specific labeled data, which is often arduously expensive to obtain. One effective way to reduce the labeling effort is to pre-train an expressive GNN model on unlabeled data with self-supervision and then transfer the learned model to downstre…

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: Published on KDD 2020

  10. arXiv:2006.10836  [pdf, other]

    cs.LG cs.AI stat.ML

    An Integer Linear Programming Framework for Mining Constraints from Data

    Authors: Tao Meng, Kai-Wei Chang

    Abstract: Structured output prediction problems (e.g., sequential tagging, hierarchical multi-class classification) often involve constraints over the output label space. These constraints interact with the learned models to filter infeasible solutions and facilitate in building an accountable system. However, although constraints are useful, they are often based on hand-crafted rules. This raises a questio…

    Submitted 11 June, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 13 pages, published in ICML2021

  11. arXiv:2005.06251  [pdf, other]

    cs.CL cs.CV cs.LG stat.ML

    Mitigating Gender Bias Amplification in Distribution by Posterior Regularization

    Authors: Shengyu Jia, Tao Meng, Jieyu Zhao, Kai-Wei Chang

    Abstract: Advanced machine learning techniques have boosted the performance of natural language processing. Nevertheless, recent studies, e.g., Zhao et al. (2017) show that these techniques inadvertently capture the societal bias hidden in the corpus and further amplify it. However, their analysis is conducted only on models' top predictions. In this paper, we investigate the gender bias amplification issue…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 7 pages, 3 figures, published in ACL 2020

  12. arXiv:2005.00653  [pdf, ps, other]

    cs.SE cs.AI cs.LG stat.ML

    A Transformer-based Approach for Source Code Summarization

    Authors: Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

    Abstract: Generating a readable summary that describes the functionality of a program is known as source code summarization. In this task, learning code representation by modeling the pairwise relationship between code tokens to capture their long-range dependencies is crucial. To learn code representation for summarization, we explore the Transformer model that uses a self-attention mechanism and has shown…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: This paper is accepted at ACL2020

  13. arXiv:2003.09103  [pdf, other]

    cs.LG stat.ML

    Learning to simulate and design for structural engineering

    Authors: Kai-Hung Chang, Chin-Yi Cheng

    Abstract: The structural design process for buildings is time-consuming and laborious. To automate this process, structural engineers combine optimization methods with simulation tools to find an optimal design with minimal building mass subject to building regulations. However, structural engineers in practice often avoid optimization and compromise on a suboptimal design for the majority of buildings, due…

    Submitted 12 August, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: ICML2020

  14. arXiv:2003.02452  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Learning Meets Factorization: Learning to Recommend with Chain Graph Model

    Authors: Chaochao Chen, Kevin C. Chang, Qibing Li, Xiaolin Zheng

    Abstract: Recently latent factor model (LFM) has been drawing much attention in recommender systems due to its good performance and scalability. However, existing LFMs predict missing values in a user-item rating matrix only based on the known ones, and thus the sparsity of the rating matrix always limits their performance. Meanwhile, semi-supervised learning (SSL) provides an effective way to alleviate the…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted by TKDD

  15. arXiv:2002.12920  [pdf, other]

    cs.LG stat.ML

    Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond

    Authors: Kaidi Xu, Zhouxing Shi, Huan Zhang, Yihan Wang, Kai-Wei Chang, Minlie Huang, Bhavya Kailkhura, Xue Lin, Cho-Jui Hsieh

    Abstract: Linear relaxation based perturbation analysis (LiRPA) for neural networks, which computes provable linear bounds of output neurons given a certain amount of input perturbation, has become a core component in robustness verification and certified defense. The majority of LiRPA-based methods focus on simple feed-forward networks and need particular manual derivations and implementations when extende…

    Submitted 25 October, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

  16. arXiv:2002.06622  [pdf, other]

    cs.LG stat.ML

    Robustness Verification for Transformers

    Authors: Zhouxing Shi, Huan Zhang, Kai-Wei Chang, Minlie Huang, Cho-Jui Hsieh

    Abstract: Robustness verification that aims to formally certify the prediction behavior of neural networks has become an important tool for understanding model behavior and obtaining safety guarantees. However, previous methods can usually only handle neural networks with relatively simple architectures. In this paper, we consider the robustness verification problem for Transformers. Transformers have compl…

    Submitted 23 December, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  17. arXiv:2002.05287  [pdf, other]

    cs.LG cs.CV stat.ML

    Geom-GCN: Geometric Graph Convolutional Networks

    Authors: Hongbin Pei, Bingzhe Wei, Kevin Chen-Chuan Chang, Yu Lei, Bo Yang

    Abstract: Message-passing neural networks (MPNNs) have been successfully applied to representation learning on graphs in a variety of real-world applications. However, two fundamental weaknesses of MPNNs' aggregators limit their ability to represent graph-structured data: losing the structural information of nodes in neighborhoods and lacking the ability to capture long-range dependencies in disassortative…

    Submitted 13 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020

  18. arXiv:1912.12115  [pdf, other]

    cs.LG cs.DC stat.ML

    Split Learning for collaborative deep learning in healthcare

    Authors: Maarten G. Poirot, Praneeth Vepakomma, Ken Chang, Jayashree Kalpathy-Cramer, Rajiv Gupta, Ramesh Raskar

    Abstract: Shortage of labeled data has been holding the surge of deep learning in healthcare back, as sample sizes are often small, patient information cannot be shared openly, and multi-center collaborative studies are a burden to set up. Distributed machine learning methods promise to mitigate these problems. We argue for a split learning based approach and apply this distributed learning method for the f…

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: Workshop paper: 8 pages, 2 figures, 1 table

  19. arXiv:1911.03642  [pdf, other]

    cs.LG cs.CL stat.ML

    Towards Understanding Gender Bias in Relation Extraction

    Authors: Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

    Abstract: Recent developments in Neural Relation Extraction (NRE) have made significant strides towards Automated Knowledge Base Construction (AKBC). While much attention has been dedicated towards improvements in accuracy, there have been no attempts in the literature to our knowledge to evaluate social biases in NRE systems. We create WikiGenderBias, a distantly supervised dataset with a human annotated t…

    Submitted 8 August, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

  20. arXiv:1909.04288  [pdf, ps, other]

    cs.LG stat.ML

    BOSH: An Efficient Meta Algorithm for Decision-based Attacks

    Authors: Zhenxin Xiao, Puyudi Yang, Yuchen Jiang, Kai-Wei Chang, Cho-Jui Hsieh

    Abstract: Adversarial example generation becomes a viable method for evaluating the robustness of a machine learning model. In this paper, we consider hard-label black-box attacks (a.k.a. decision-based attacks), which is a challenging setting that generates adversarial examples based on only a series of black-box hard-label queries. This type of attacks can be used to attack discrete and complex models, su…

    Submitted 14 October, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

  21. arXiv:1905.13728  [pdf, other]

    cs.LG stat.ML

    Pre-Training Graph Neural Networks for Generic Structural Feature Extraction

    Authors: Ziniu Hu, Changjun Fan, Ting Chen, Kai-Wei Chang, Yizhou Sun

    Abstract: Graph neural networks (GNNs) are shown to be successful in modeling applications with graph structures. However, training an accurate GNN model requires a large collection of labeled data and expressive features, which might be inaccessible for some applications. To tackle this problem, we propose a pre-training framework that captures generic graph structural information that is transferable acro…

    Submitted 31 May, 2019; originally announced May 2019.

  22. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  23. arXiv:1809.01496  [pdf, other]

    cs.CL cs.LG stat.ML

    Learning Gender-Neutral Word Embeddings

    Authors: Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, Kai-Wei Chang

    Abstract: Word embedding models have become a fundamental component in a wide range of Natural Language Processing (NLP) applications. However, embeddings trained on human-generated corpora have been demonstrated to inherit strong gender stereotypes that reflect social constructs. To address this concern, in this paper, we propose a novel training procedure for learning gender-neutral word embeddings. Our a…

    Submitted 29 August, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  24. arXiv:1808.08270  [pdf, other]

    cs.LG cs.CL stat.ML

    Robust Text Classifier on Test-Time Budgets

    Authors: Md Rizwan Parvez, Tolga Bolukbasi, Kai-Wei Chang, Venkatesh Saligrama

    Abstract: We propose a generic and interpretable learning framework for building robust text classification model that achieves accuracy comparable to full models under test-time budget constraints. Our approach learns a selector to identify words that are relevant to the prediction tasks and passes them to the classifier for processing. The selector is trained jointly with the classifier and directly learn…

    Submitted 13 September, 2019; v1 submitted 24 August, 2018; originally announced August 2018.

    Comments: To appear at EMNLP-IJCAI 2019, 6 pages + 2 pages appendix

  25. arXiv:1711.10162  [pdf, other]

    cs.LG stat.ML

    Topological Recurrent Neural Network for Diffusion Prediction

    Authors: Jia Wang, Vincent W. Zheng, Zemin Liu, Kevin Chen-Chuan Chang

    Abstract: In this paper, we study the problem of using representation learning to assist information diffusion prediction on graphs. In particular, we aim at estimating the probability of an inactive node to be activated next in a cascade. Despite the success of recent deep learning methods for diffusion, we find that they often underexplore the cascade structure. We consider a cascade as not merely a seque…

    Submitted 28 November, 2017; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: In Proc. of The IEEE International Conference on Data Mining (ICDM '17), New Orleans, Louisiana, USA, 2017

  26. arXiv:1708.00049  [pdf, other]

    stat.ML cs.LG

    Interpretable Active Learning

    Authors: Richard L. Phillips, Kyu Hyun Chang, Sorelle A. Friedler

    Abstract: Active learning has long been a topic of study in machine learning. However, as increasingly complex and opaque models have become standard practice, the process of active learning, too, has become more opaque. There has been little investigation into interpreting what specific trends and patterns an active learning strategy may be exploring. This work expands on the Local Interpretable Model-agno…

    Submitted 23 June, 2018; v1 submitted 31 July, 2017; originally announced August 2017.

    Comments: 13 pages, 8 figures, presented at 2018 Conference on Fairness, Accountability, and Transparency (FAT*), New York, New York, USA. Proceedings of the 1st Conference on Fairness, Accountability and Transparency, PMLR 81:49-61, 2018

  27. arXiv:1707.09457  [pdf, other]

    cs.AI cs.CL cs.CV stat.ML

    Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

    Authors: Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, Kai-Wei Chang

    Abstract: Language is increasingly being used to define rich visual recognition problems with supporting image collections sourced from the web. Structured prediction models are used in these tasks to take advantage of correlations between co-occurring labels and visual input but risk inadvertently encoding social biases found in web corpora. In this work, we study data and models associated with multilabel…

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: 11 pages, published in EMNLP 2017

  28. DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks

    Authors: Zi Yin, Keng-hao Chang, Ruofei Zhang

    Abstract: Information extraction and user intention identification are central topics in modern query understanding and recommendation systems. In this paper, we propose DeepProbe, a generic information-directed interaction framework which is built around an attention-based sequence to sequence (seq2seq) recurrent neural network. DeepProbe can rephrase, evaluate, and even actively ask questions, leveraging…

    Submitted 1 March, 2018; v1 submitted 18 July, 2017; originally announced July 2017.

    Comments: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017

  29. arXiv:1705.05085  [pdf, other]

    cs.LG stat.ML

    Active Learning for Graph Embedding

    Authors: Hongyun Cai, Vincent W. Zheng, Kevin Chen-Chuan Chang

    Abstract: Graph embedding provides an efficient solution for graph analysis by converting the graph into a low-dimensional space which preserves the structure information. In contrast to the graph structure data, the i.i.d. node embedding can be processed efficiently in terms of both time and space. Current semi-supervised graph embedding algorithms assume the labelled nodes are given, which may not be alwa…

    Submitted 15 May, 2017; originally announced May 2017.

    Comments: Technical Report

  30. arXiv:1607.06520  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

    Authors: Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai

    Abstract: The blind application of machine learning runs the risk of amplifying biases present in data. Such a danger is facing us with word embedding, a popular framework to represent text data as vectors which has been used in many machine learning and natural language processing tasks. We show that even word embeddings trained on Google News articles exhibit female/male gender stereotypes to a disturbing…

    Submitted 21 July, 2016; originally announced July 2016.

  31. arXiv:1606.06121  [pdf, other]

    cs.CL cs.LG stat.ML

    Quantifying and Reducing Stereotypes in Word Embeddings

    Authors: Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai

    Abstract: Machine learning algorithms are optimized to model statistical properties of the training data. If the input data reflects stereotypes and biases of the broader society, then the output of the learning algorithm also captures these stereotypes. In this paper, we initiate the study of gender stereotypes in word embedding, a popular framework to represent text data. As their use becomes increa…

    Submitted 20 June, 2016; originally announced June 2016.

    Comments: presented at 2016 ICML Workshop on #Data4Good: Machine Learning in Social Good Applications, New York, NY

  32. Computer model calibration with large non-stationary spatial outputs: application to the calibration of a climate model

    Authors: Kai-Lan Chang, Serge Guillas

    Abstract: Bayesian calibration of computer models tunes unknown input parameters by comparing outputs with observations. For model outputs that are distributed over space, this becomes computationally expensive because of the output size. To overcome this challenge, we employ a basis representation of the model outputs and observations: we match these decompositions to carry out the calibration efficiently.…

    Submitted 4 October, 2018; v1 submitted 15 April, 2016; originally announced April 2016.

  33. arXiv:1602.08761  [pdf, other]

    stat.ML cs.CL cs.CV cs.LG

    Resource Constrained Structured Prediction

    Authors: Tolga Bolukbasi, Kai-Wei Chang, Joseph Wang, Venkatesh Saligrama

    Abstract: We study the problem of structured prediction under test-time budget constraints. We propose a novel approach applicable to a wide range of structured prediction problems in computer vision and natural language processing. Our approach seeks to adaptively generate computationally costly features during test-time in order to reduce the computational cost of prediction while maintaining prediction p…

    Submitted 7 June, 2016; v1 submitted 28 February, 2016; originally announced February 2016.

  34. arXiv:1509.07179  [pdf, other]

    cs.LG cs.CL stat.ML

    IllinoisSL: A JAVA Library for Structured Prediction

    Authors: Kai-Wei Chang, Shyam Upadhyay, Ming-Wei Chang, Vivek Srikumar, Dan Roth

    Abstract: IllinoisSL is a Java library for learning structured prediction models. It supports structured Support Vector Machines and structured Perceptron. The library consists of a core learning module and several applications, which can be executed from command-lines. Documentation is provided to guide users. In comparison to other structured learning libraries, IllinoisSL is efficient, general, and easy…

    Submitted 23 September, 2015; originally announced September 2015.

    Comments: http://cogcomp.cs.illinois.edu/software/illinois-sl

  35. arXiv:1506.02620  [pdf, other]

    stat.ML cs.DC cs.LG

    Distributed Training of Structured SVM

    Authors: Ching-pei Lee, Kai-Wei Chang, Shyam Upadhyay, Dan Roth

    Abstract: Training structured prediction models is time-consuming. However, most existing approaches only use a single machine, thus, the advantage of computing power and the capacity for larger data sets of multiple machines have not been exploited. In this work, we propose an efficient algorithm for distributedly training structured support vector machines based on a distributed block-coordinate descent m…

    Submitted 14 February, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: NIPS Workshop on Optimization for Machine Learning, 2015

  36. arXiv:1502.02206  [pdf, other]

    cs.LG stat.ML

    Learning to Search Better Than Your Teacher

    Authors: Kai-Wei Chang, Akshay Krishnamurthy, Alekh Agarwal, Hal Daumé III, John Langford

    Abstract: Methods for learning to search for structured prediction typically imitate a reference policy, with existing theoretical guarantees demonstrating low regret compared to that reference. This is unsatisfactory in many applications where the reference policy is suboptimal and the goal of learning is to improve upon it. Can learning to search work even when the reference is poor? We provide a new le…

    Submitted 20 May, 2015; v1 submitted 7 February, 2015; originally announced February 2015.

    Comments: In ICML 2015