Showing 1–17 of 17 results for author: Webster, C

  1. arXiv:2506.01297  [pdf, ps, other]

    cs.AI

    MobCLIP: Learning General-purpose Geospatial Representation at Scale

    Authors: Ya Wen, Jixuan Cai, Qiyao Ma, Linyan Li, Xinhua Chen, Chris Webster, Yulun Zhou

    Abstract: Representation learning of geospatial locations remains a core challenge in achieving general geospatial intelligence. Current embedding methods often lack versatility, limiting their utility across diverse tasks in both human and natural domains. We present MobCLIP, the first nationwide general-purpose location encoder, integrating an unprecedented diversity of data modalities through effective a…

    Submitted 3 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

  2. arXiv:2409.17165  [pdf, other]

    cs.IR cs.LG

    Mamba for Scalable and Efficient Personalized Recommendations

    Authors: Andrew Starnes, Clayton Webster

    Abstract: In this effort, we propose using Mamba for handling tabular data in personalized recommendation systems. We present FT-Mamba (Feature Tokenizer + Mamba), a novel hybrid model that replaces Transformer layers with Mamba layers within the FT-Transformer architecture, for handling tabular data in personalized recommendation systems. The Mamba model offers an efficient al…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 8 pages, 6 figures, 2 tables

    MSC Class: 68T07

  3. arXiv:2311.05363  [pdf, other]

    cs.LG q-bio.QM

    Beyond the training set: an intuitive method for detecting distribution shift in model-based optimization

    Authors: Farhan Damani, David H Brookes, Theodore Sternlieb, Cameron Webster, Stephen Malina, Rishi Jajoo, Kathy Lin, Sam Sinai

    Abstract: Model-based optimization (MBO) is increasingly applied to design problems in science and engineering. A common scenario involves using a fixed training set to train models, with the goal of designing new samples that outperform those present in the training data. A major challenge in this setting is distribution shift, where the distributions of training and design samples are different. While som…

    Submitted 9 November, 2023; originally announced November 2023.

  4. arXiv:2310.05324  [pdf, other]

    cs.LG

    Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks

    Authors: Andrew Starnes, Anton Dereventsov, Clayton Webster

    Abstract: In this effort, we consider the impact of regularization on the diversity of actions taken by policies generated from reinforcement learning agents trained using a policy gradient. Policy gradient agents are prone to entropy collapse, which means certain actions are seldom, if ever, selected. We augment the optimization objective function for the policy with terms constructed from various…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 8 pages, 3 figures, accepted to WAIN 2023

  5. arXiv:2303.15116  [pdf]

    cs.CL

    An ontology-aided, natural language-based approach for multi-constraint BIM model querying

    Authors: Mengtian Yin, Llewellyn Tang, Chris Webster, Shen Xu, Xiongyi Li, Huaquan Ying

    Abstract: Being able to efficiently retrieve the required building information is critical for construction project stakeholders to carry out their engineering and management activities. Natural language interface (NLI) systems are emerging as a time and cost-effective way to query Building Information Models (BIMs). However, the existing methods cannot logically combine different constraints to perform fin…

    Submitted 27 March, 2023; originally announced March 2023.

  6. arXiv:2211.11869  [pdf, other]

    cs.LG cs.AI math.NA math.OC

    Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

    Authors: Anton Dereventsov, Andrew Starnes, Clayton G. Webster

    Abstract: This effort is focused on examining the behavior of reinforcement learning systems in personalization environments and detailing the differences in policy entropy associated with the type of learning algorithm utilized. We demonstrate that Policy Optimization agents often possess low-entropy policies during training, which in practice results in agents prioritizing certain actions and avoiding oth…

    Submitted 27 April, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  7. arXiv:2112.13141  [pdf, other]

    cs.LG cs.AI math.NA

    On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

    Authors: Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster

    Abstract: In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means algorithm as well as conventional choices of the network architectures and optimization algorithms. Numerical examples demonstrate the efficiency of different RL proced…

    Submitted 24 December, 2021; originally announced December 2021.

  8. arXiv:2106.03934  [pdf, other]

    cs.LG cs.AI

    Offline Policy Comparison under Limited Historical Agent-Environment Interactions

    Authors: Anton Dereventsov, Joseph D. Daws Jr., Clayton Webster

    Abstract: We address the challenge of policy evaluation in real-world applications of reinforcement learning systems where the available historical data is limited due to ethical, practical, or security considerations. This constrained distribution of data samples often leads to biased policy evaluation estimates. To remedy this, we propose that instead of policy evaluation, one should perform policy compar…

    Submitted 7 June, 2021; originally announced June 2021.

  9. arXiv:2006.10887  [pdf, other]

    math.OC cs.LG

    An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization

    Authors: Anton Dereventsov, Clayton G. Webster, Joseph D. Daws Jr

    Abstract: In this work, we propose a novel adaptive stochastic gradient-free (ASGF) approach for solving high-dimensional nonconvex optimization problems based on function evaluations. We employ a directional Gaussian smoothing of the target function that generates a surrogate of the gradient and assists in avoiding bad local optima by utilizing nonlocal information of the loss landscape. Applying a determi…

    Submitted 15 January, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

  10. arXiv:2004.05873  [pdf, other]

    math.NA cs.CV math.OC

    Analysis of The Ratio of $\ell_1$ and $\ell_2$ Norms in Compressed Sensing

    Authors: Yiming Xu, Akil Narayan, Hoang Tran, Clayton G. Webster

    Abstract: We first propose a novel criterion that guarantees that an $s$-sparse signal is the local minimizer of the $\ell_1/\ell_2$ objective; our criterion is interpretable and useful in practice. We also give the first uniform recovery condition using a geometric characterization of the null space of the measurement matrix, and show that this condition is easily satisfied for a class of random matrices.…

    Submitted 27 January, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: 25 pages, 7 figures

  11. arXiv:1912.02302  [pdf, ps, other]

    math.NA cs.LG

    Analysis of Deep Neural Networks with Quasi-optimal polynomial approximation rates

    Authors: Joseph Daws, Clayton Webster

    Abstract: We show the existence of a deep neural network capable of approximating a wide class of high-dimensional approximations. The construction of the proposed neural network is based on a quasi-optimal polynomial approximation. We show that this network achieves an error rate that is sub-exponential in the number of polynomial functions, $M$, used in the polynomial approximation. The complexity of the…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 13 pages, submitted to MSML 2020

    MSC Class: 65D15

  12. arXiv:1910.02743  [pdf, ps, other]

    cs.LG stat.ML

    Neural network integral representations with the ReLU activation function

    Authors: Armenak Petrosyan, Anton Dereventsov, Clayton Webster

    Abstract: In this effort, we derive a formula for the integral representation of a shallow neural network with the ReLU activation function. We assume that the outer weights admit a finite $L_1$-norm with respect to Lebesgue measure on the sphere. For univariate target functions we further provide a closed-form formula for all possible representations. Additionally, in this case our formula allows one to exp…

    Submitted 10 June, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

  13. A nonlocal feature-driven exemplar-based approach for image inpainting

    Authors: Viktor Reshniak, Jeremy Trageser, Clayton G. Webster

    Abstract: We present a nonlocal variational image completion technique which admits simultaneous inpainting of multiple structures and textures in a unified framework. The recovery of geometric structures is achieved by using general convolution operators as a measure of behavior within an image. These are combined with a nonlocal exemplar-based approach to exploit the self-similarity of an image in the sel…

    Submitted 3 December, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

    MSC Class: 68U10; 94A08; 65D18; 65K10 ACM Class: I.4.4

    Journal ref: SIAM J. Imaging Sci. 13(2020) 2140-2168

  14. Robust learning with implicit residual networks

    Authors: Viktor Reshniak, Clayton Webster

    Abstract: In this effort, we propose a new deep architecture utilizing residual blocks inspired by implicit discretization schemes. As opposed to the standard feed-forward networks, the outputs of the proposed implicit residual blocks are defined as the fixed points of the appropriately chosen nonlinear transformations. We show that this choice leads to the improved stability of both forward and backward pr…

    Submitted 22 February, 2021; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: Knowl. Extr. 2021, 3, 34-55

  15. arXiv:1905.10457  [pdf, other]

    cs.LG math.NA stat.ML

    A Polynomial-Based Approach for Architectural Design and Learning with Deep Neural Networks

    Authors: Joseph Daws Jr., Clayton G. Webster

    Abstract: In this effort we propose a novel approach for reconstructing multivariate functions from training data, by identifying both a suitable network architecture and an initialization using polynomial-based approximations. Training deep neural networks using gradient descent can be interpreted as moving the set of network parameters along the loss landscape in order to minimize the loss functional. The…

    Submitted 28 May, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: 11 pages, 6 figures, submitted to NeurIPS 2019, corrected several typos and included new examples

    MSC Class: 65D15

  16. arXiv:1905.10409  [pdf, other]

    cs.LG stat.ML

    Greedy Shallow Networks: An Approach for Constructing and Training Neural Networks

    Authors: Anton Dereventsov, Armenak Petrosyan, Clayton Webster

    Abstract: We present a greedy-based approach to construct an efficient single hidden layer neural network with the ReLU activation that approximates a target function. In our approach we obtain a shallow network by utilizing a greedy algorithm with the prescribed dictionary provided by the available training data and a set of possible inner weights. To facilitate the greedy selection process we employ an in…

    Submitted 30 September, 2021; v1 submitted 24 May, 2019; originally announced May 2019.

  17. arXiv:1811.08778  [pdf, other]

    math.NA cs.IT

    Reconstruction of jointly sparse vectors via manifold optimization

    Authors: Armenak Petrosyan, Hoang Tran, Clayton Webster

    Abstract: In this paper, we consider the challenge of reconstructing jointly sparse vectors from linear measurements. Firstly, we show that by utilizing the rank of the output data matrix we can reduce the problem to a full column rank case. This result reveals a reduction in the computational complexity of the original problem and enables a simple implementation of joint sparse recovery algorithms for full…

    Submitted 25 May, 2019; v1 submitted 21 November, 2018; originally announced November 2018.