Showing 1–3 of 3 results for author: Zhang, D T

Searching in archive cs.

  1. arXiv:1910.10937  [pdf, other]

    stat.ML cs.LG

    Online Boosting for Multilabel Ranking with Top-k Feedback

    Authors: Vinod Raman, Daniel T. Zhang, Young Hun Jung, Ambuj Tewari

    Abstract: We present online boosting algorithms for multilabel ranking with top-k feedback, where the learner only receives information about the top k items from the ranking it provides. We propose a novel surrogate loss function and unbiased estimator, allowing weak learners to update themselves with limited information. Using these techniques we adapt full information multilabel ranking algorithms (Jung…

    Submitted 19 October, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: Under review for AISTATS 2021. Fixed small errors throughout the manuscript and added new content comparing/contrasting various randomization procedures

  2. arXiv:1810.05290  [pdf, other]

    stat.ML cs.LG

    Online Multiclass Boosting with Bandit Feedback

    Authors: Daniel T. Zhang, Young Hun Jung, Ambuj Tewari

    Abstract: We present online boosting algorithms for multiclass classification with bandit feedback, where the learner only receives feedback about the correctness of its prediction. We propose an unbiased estimate of the loss using a randomized prediction, allowing the model to update its weak learners with limited information. Using the unbiased estimate, we extend two full information boosting algorithms…

    Submitted 25 February, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: Accepted in AISTATS 2019

  3. arXiv:1707.01591  [pdf, other]

    cs.LG stat.AP stat.ML

    A Data Science Approach to Understanding Residential Water Contamination in Flint

    Authors: Alex Chojnacki, Chengyu Dai, Arya Farahi, Guangsha Shi, Jared Webb, Daniel T. Zhang, Jacob Abernethy, Eric Schwartz

    Abstract: When the residents of Flint learned that lead had contaminated their water system, the local government made water-testing kits available to them free of charge. The city government published the results of these tests, creating a valuable dataset that is key to understanding the causes and extent of the lead contamination event in Flint. This is the nation's largest dataset on lead in a municipal…

    Submitted 5 July, 2017; originally announced July 2017.

    Comments: Applied Data Science track paper at KDD 2017. For associated promotional video, see https://www.youtube.com/watch?v=0g66ImaV8Ag