Skip to main content

Showing 1–9 of 9 results for author: Cardoso, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2111.10198  [pdf, other

    stat.AP cs.DB stat.ME

    Datasets for Online Controlled Experiments

    Authors: C. H. Bryan Liu, Ângelo Cardoso, Paul Couturier, Emma J. McCoy

    Abstract: Online Controlled Experiments (OCE) are the gold standard to measure impact and guide decisions for digital products and services. Despite many methodological advances in this area, the scarcity of public datasets and the lack of a systematic review and categorization hinder its development. We present the first survey and taxonomy for OCE datasets, which highlight the lack of a public dataset to… ▽ More

    Submitted 14 January, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. 17 pages, 2 figures, 2 tables. Dataset available on Open Science Framework: https://osf.io/64jsb/

  2. arXiv:1907.07723  [pdf, other

    cs.LG stat.ML

    Competing Against Equilibria in Zero-Sum Games with Evolving Payoffs

    Authors: Adrian Rivera Cardoso, Jacob Abernethy, He Wang, Huan Xu

    Abstract: We study the problem of repeated play in a zero-sum game in which the payoff matrix may change, in a possibly adversarial fashion, on each round; we call these Online Matrix Games. Finding the Nash Equilibrium (NE) of a two player zero-sum game is core to many problems in statistics, optimization, and economics, and for a fixed game matrix this can be easily reduced to solving a linear program. Bu… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: text overlap with arXiv:1806.08301

  3. arXiv:1905.10649  [pdf, ps, other

    cs.LG stat.ML

    Large Scale Markov Decision Processes with Changing Rewards

    Authors: Adrian Rivera Cardoso, He Wang, Huan Xu

    Abstract: We consider Markov Decision Processes (MDPs) where the rewards are unknown and may change in an adversarial manner. We provide an algorithm that achieves state-of-the-art regret bound of $O( \sqrt{τ(\ln|S|+\ln|A|)T}\ln(T))$, where $S$ is the state space, $A$ is the action space, $τ$ is the mixing time of the MDP, and $T$ is the number of periods. The algorithm's computational complexity is polynom… ▽ More

    Submitted 25 May, 2019; originally announced May 2019.

  4. arXiv:1810.00737  [pdf, ps, other

    cs.LG stat.ML

    Risk-Averse Stochastic Convex Bandit

    Authors: Adrian Rivera Cardoso, Huan Xu

    Abstract: Motivated by applications in clinical trials and finance, we study the problem of online convex optimization (with bandit feedback) where the decision maker is risk-averse. We provide two algorithms to solve this problem. The first one is a descent-type algorithm which is easy to implement. The second algorithm, which combines the ellipsoid method and a center point device, achieves (almost) optim… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

  5. arXiv:1807.04098  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    A Recurrent Neural Network Survival Model: Predicting Web User Return Time

    Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

    Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

  6. arXiv:1807.02290  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Differentially Private Online Submodular Optimization

    Authors: Adrian Rivera Cardoso, Rachel Cummings

    Abstract: In this paper we develop the first algorithms for online submodular minimization that preserve differential privacy under full information feedback and bandit feedback. A sequence of $T$ submodular functions over a collection of $n$ elements arrive online, and at each timestep the algorithm must choose a subset of $[n]$ before seeing the function. The algorithm incurs a cost equal to the function… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

  7. arXiv:1803.07679  [pdf, other

    stat.ML cs.CL cs.CV cs.IR cs.LG

    Product Characterisation towards Personalisation: Learning Attributes from Unstructured Data to Recommend Fashion Products

    Authors: Ângelo Cardoso, Fabio Daolio, Saúl Vargas

    Abstract: In this paper, we describe a solution to tackle a common set of challenges in e-commerce, which arise from the fact that new products are continually being added to the catalogue. The challenges involve properly personalising the customer experience, forecasting demand and planning the product range. We argue that the foundational piece to solve all of these problems is having consistent and detai… ▽ More

    Submitted 20 March, 2018; originally announced March 2018.

    Comments: Under submission

  8. arXiv:1706.09865  [pdf, other

    stat.ML cs.CY cs.LG

    Generalising Random Forest Parameter Optimisation to Include Stability and Cost

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

    Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest… ▽ More

    Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear in ECML-PKDD 2017

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

  9. arXiv:1703.02596  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    Customer Lifetime Value Prediction Using Embeddings

    Authors: Benjamin Paul Chamberlain, Angelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth

    Abstract: We describe the Customer LifeTime Value (CLTV) prediction system deployed at ASOS.com, a global online fashion retailer. CLTV prediction is an important problem in e-commerce where an accurate estimate of future value allows retailers to effectively allocate marketing spend, identify and nurture high value customers and mitigate exposure to losses. The system at ASOS provides daily estimates of th… ▽ More

    Submitted 6 July, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pages 1753-1762, 2017