Skip to main content

Showing 1–50 of 64 results for author: Young, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2503.09312  [pdf, ps, other

    q-bio.GN cs.LG

    Terrier: A Deep Learning Repeat Classifier

    Authors: Robert Turnbull, Neil D. Young, Edoardo Tescari, Lee F. Skerratt, Tiffany A. Kosch

    Abstract: Repetitive DNA sequences underpin genome architecture and evolutionary processes, yet they remain challenging to classify accurately. Terrier is a deep learning model designed to overcome these challenges by classifying repetitive DNA sequences using a publicly available, curated repeat sequence library trained under the RepeatMasker schema. Poor representation of taxa within repeat databases ofte… ▽ More

    Submitted 8 July, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

    Comments: 14 pages, 9 figures

    ACM Class: I.2

  3. arXiv:2501.07663  [pdf, other

    cs.CL

    Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning

    Authors: Karishma Thakrar, Nick Young

    Abstract: This paper explores the application of large language models (LLMs) to extract nuanced and complex job features from unstructured job postings. Using a dataset of 1.2 million job postings provided by AdeptID, we developed a robust pipeline to identify and classify variables such as remote work availability, remuneration structures, educational requirements, and work experience preferences. Our met… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

  4. arXiv:2408.12065  [pdf, ps, other

    cs.AI

    Transformers As Approximations of Solomonoff Induction

    Authors: Nathan Young, Michael Witbrock

    Abstract: Solomonoff Induction is an optimal-in-the-limit unbounded algorithm for sequence prediction, representing a Bayesian mixture of every computable probability distribution and performing close to optimally in predicting any computable sequence. Being an optimal form of computational sequence prediction, it seems plausible that it may be used as a model against which other methods of sequence predi… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  5. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (69 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 11 October, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  6. arXiv:2305.12599  [pdf, other

    cs.CL cs.AI

    Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning

    Authors: Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data… ▽ More

    Submitted 17 April, 2025; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, the Findings of ACL 2024

  7. Classification via Two-Way Comparisons

    Authors: Marek Chrobak, Neal E. Young

    Abstract: Given a weighted, ordered query set $Q$ and a partition of $Q$ into classes, we study the problem of computing a minimum-cost decision tree that, given any query $q$ in $Q$, uses equality tests and less-than comparisons to determine the class to which $q$ belongs. Such a tree can be much smaller than a lookup table, and much faster and smaller than a conventional search tree. We give the first pol… ▽ More

    Submitted 25 January, 2025; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: Appears in WADS 2023 and TALG 2024

    MSC Class: 68P10; 68P30; 68W25; 94A45 ACM Class: E.4; G.1.6; G.2.2; H.3.1; I.4.2

    Journal ref: ACM Transactions on Algorithms (2024)

  8. arXiv:2302.04106  [pdf

    cs.DB cs.SE

    Detecting Data Type Inconsistencies in a Property Graph Database

    Authors: Joshua R. Porter, Michael N. Young, Aleks Y. M. Ontman

    Abstract: Some property graph databases do not have a fixed schema, which can result in data type inconsistencies for properties on nodes and relationships, especially when importing data into a running database. Here we present a tool which can rapidly produce a detailed report on every property in the graph. When executed on a large knowledge graph, it allowed us to debug a complex ETL process and enforce… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 5 pages, 3 figures, general approach applied to production databases

    ACM Class: E.0

  9. Online Paging with Heterogeneous Cache Slots

    Authors: Marek Chrobak, Samuel Haney, Mehraneh Liaee, Debmalya Panigrahi, Rajmohan Rajaraman, Ravi Sundaram, Neal E. Young

    Abstract: It is natural to generalize the online $k$-Server problem by allowing each request to specify not only a point $p$, but also a subset $S$ of servers that may serve it. For uniform metrics, the problem is equivalent to a generalization of Paging in which each request specifies not only a page $p$, but also a subset $S$ of cache slots, and is satisfied by having a copy of $p$ in some slot in $S$. We… ▽ More

    Submitted 19 October, 2024; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: conference and journal versions appear in STACS 2023 and Algorithmica (2004)

    ACM Class: F.2.0; F.1.2; C.0

    Journal ref: Algorithmica (2004)

  10. arXiv:2203.12186  [pdf, other

    cs.CL

    AbductionRules: Training Transformers to Explain Unexpected Inputs

    Authors: Nathan Young, Qiming Bao, Joshua Bensemann, Michael Witbrock

    Abstract: Transformers have recently been shown to be capable of reliably performing logical reasoning over facts and rules expressed in natural language, but abductive reasoning - inference to the best explanation of an unexpected observation - has been underexplored despite significant applications to scientific discovery, common-sense reasoning, and model interpretability. We present AbductionRules, a… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  11. High Precision Control of Tracked Field Robots in the Presence of Unknown Traction Coefficients

    Authors: Erkan Kayacan, Sierra N. Young, Joshua M. Peschel, Girish Chowdhary

    Abstract: Accurate steering through crop rows that avoids crop damage is one of the most important tasks for agricultural robots utilized in various field operations, such as monitoring, mechanical weeding, or spraying. In practice, varying soil conditions can result in off-track navigation due to unknown traction coefficients so that it can cause crop damage. To address this problem, this paper presents th… ▽ More

    Submitted 20 March, 2021; originally announced March 2021.

    Journal ref: Journal of Field Robotics, vol. 35, pp. 1050-1062, 2018

  12. A Simple Algorithm for Optimal Search Trees with Two-Way Comparisons

    Authors: Marek Chrobak, Mordecai Golin, J. Ian Munro, Neal E. Young

    Abstract: We present a simple $O(n^4)$-time algorithm for computing optimal search trees with two-way comparisons. The only previous solution to this problem, by Anderson et al., has the same running time, but is significantly more complicated and is restricted to the variant where only successful queries are allowed. Our algorithm extends directly to solve the standard full variant of the problem, which al… ▽ More

    Submitted 4 October, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: v3 adds Appendix B, with a stronger alternative to Theorem 1

    MSC Class: 68P10; 68P30; 68W25; 94A45 ACM Class: E.4; G.1.6; G.2.2; H.3.1; I.4.2

    Journal ref: ACM Transactions on Algorithms 18(1) (2022) 1-11

  13. On the Cost of Unsuccessful Searches in Search Trees with Two-way Comparisons

    Authors: Marek Chrobak, Mordecai Golin, J. Ian Munro, Neal E. Young

    Abstract: Search trees are commonly used to implement access operations to a set of stored keys. If this set is static and the probabilities of membership queries are known in advance, then one can precompute an optimal search tree, namely one that minimizes the expected access cost. For a non-key query, a search tree can determine its approximate location by returning the inter-key interval containing the… ▽ More

    Submitted 9 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: v2 has updated bibliography

    MSC Class: 68P10; 68P30; 68W25; 94A45 ACM Class: E.4; G.1.6; G.2.2; H.3.1; I.4.2

    Journal ref: Information and Computation 281 (2021)

  14. arXiv:2011.02615  [pdf, other

    cs.DS cs.DB

    Competitive Data-Structure Dynamization

    Authors: Claire Mathieu, Rajmohan Rajaraman, Neal E. Young, Arman Yousefi

    Abstract: Data-structure dynamization is a general approach for making static data structures dynamic. It is used extensively in geometric settings and in the guise of so-called merge (or compaction) policies in big-data databases such as Google Bigtable and LevelDB (our focus). Previous theoretical work is based on worst-case analyses for uniform inputs -- insertions of one item at a time and constant read… ▽ More

    Submitted 23 July, 2024; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Conference version in SODA (2021). Journal version in ACM TALG (accepted June 2024)

    MSC Class: 68W27; 68P15; 68R05 ACM Class: F.1.2; H.2.4

    Journal ref: ACM Trans. Algorithms. June 2024

  15. Algorithmic approaches to selecting control clones in DNA array hybridization experiments

    Authors: Qi Fu, Elizabeth Bent, James Borneman, Marek Chrobak, Neal E. Young

    Abstract: We study the problem of selecting control clones in DNA array hybridization experiments. The problem arises in the OFRG method for analyzing microbial communities. The OFRG method performs classification of rRNA gene clones using binary fingerprints created from a series of hybridization experiments, where each experiment consists of hybridizing a collection of arrayed clones with a single oligonu… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Journal ref: Journal of Bioinformatics and Computational Biology 5(4) 937-961, 2007

  16. Distributed algorithms for covering, packing and maximum weighted matching

    Authors: Christos Koufogiannakis, Neal E. Young

    Abstract: This paper gives poly-logarithmic-round, distributed D-approximation algorithms for covering problems with submodular cost and monotone covering constraints (Submodular-cost Covering). The approximation ratio D is the maximum number of variables in any constraint. Special cases include Covering Mixed Integer Linear Programs (CMIP), and Weighted Vertex Cover (with D=2). Via duality, the paper also… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    MSC Class: 90C26; 68W15 ACM Class: C.2.4; G.1.6

    Journal ref: Distributed Computing 24, 45--63 (2011)

  17. On Huang and Wong's Algorithm for Generalized Binary Split Trees

    Authors: Marek Chrobak, Mordecai Golin, J. Ian Munro, Neal E. Young

    Abstract: Huang and Wong [1984] proposed a polynomial-time dynamic-programming algorithm for computing optimal generalized binary split trees. We show that their algorithm is incorrect. Thus, it remains open whether such trees can be computed in polynomial time. Spuler [1994] proposed modifying Huang and Wong's algorithm to obtain an algorithm for a different problem: computing optimal two-way-comparison se… ▽ More

    Submitted 14 February, 2022; v1 submitted 11 January, 2019; originally announced January 2019.

    MSC Class: 68P10; 68P30; 68W25; 94A45 ACM Class: E.4; G.1.6; G.2.2; H.3.1; I.4.2

    Journal ref: Acta Informatica (2022)

  18. arXiv:1803.11119  [pdf, other

    cs.RO cs.CY

    Decentralized Control Systems Laboratory Using Human Centered Robotic Actuators

    Authors: Binghan He, Kunye Chen, Rachel Schlossman, Neal Ormsbee, Mara Altman, Nathan Young, Matt Mangum, Luis Sentis

    Abstract: University laboratories deliver unique hands-on experimentation for STEM students but often lack state-of-the-art equipment and provide limited access to their equipment. The University of Texas Cloud Laboratory provides remote access to a cutting-edge series elastic actuators for student experimentation regarding human-centered robotics, dynamical systems, and controls. Through a browser-based in… ▽ More

    Submitted 8 April, 2019; v1 submitted 29 March, 2018; originally announced March 2018.

  19. arXiv:1710.03358  [pdf, other

    cs.DS

    Balanced power diagrams for redistricting

    Authors: Vincent Cohen-Addad, Philip N. Klein, Neal E. Young

    Abstract: We propose a method for redistricting, decomposing a geographical area into subareas, called districts, so that the populations of the districts are as close as possible and the districts are compact and contiguous. Each district is the intersection of a polygon with the geographical area. The polygons are convex and the average number of sides per polygon is less than six. The polygons tend to be… ▽ More

    Submitted 7 January, 2018; v1 submitted 9 October, 2017; originally announced October 2017.

  20. arXiv:1709.10180  [pdf, other

    cs.CV

    Possibilistic Fuzzy Local Information C-Means for Sonar Image Segmentation

    Authors: Alina Zare, Nicholas Young, Daniel Suen, Thomas Nabelek, Aquila Galusha, James Keller

    Abstract: Side-look synthetic aperture sonar (SAS) can produce very high quality images of the sea-floor. When viewing this imagery, a human observer can often easily identify various sea-floor textures such as sand ripple, hard-packed sand, sea grass and rock. In this paper, we present the Possibilistic Fuzzy Local Information C-Means (PFLICM) approach to segment SAS imagery into sea-floor regions that exh… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: 8 pages, 11 figures, to appear in the 2017 IEEE Symposium Series on Computational Intelligence (SSCI) Proceedings

  21. Optimal Search Trees with 2-Way Comparisons

    Authors: Marek Chrobak, Mordecai Golin, J. Ian Munro, Neal E. Young

    Abstract: In 1971, Knuth gave an $O(n^2)$-time algorithm for the classic problem of finding an optimal binary search tree. Knuth's algorithm works only for search trees based on 3-way comparisons, while most modern computers support only 2-way comparisons (e.g., $<, \le, =, \ge$, and $>$). Until this paper, the problem of finding an optimal search tree using 2-way comparisons remained open -- poly-time algo… ▽ More

    Submitted 9 March, 2021; v1 submitted 2 May, 2015; originally announced May 2015.

    Comments: ERRATUM: The proof of Theorem 3 of the ISAAC'15 paper (v4 here) is incorrect. Version v5 here contains: a full erratum, proofs of the other results, and pointers to journal versions expanding those results

    MSC Class: 68P10; 68P30; 68W25; 94A45; ACM Class: E.4; G.1.6; G.2.2; H.3.1; I.4.2

    Journal ref: Optimal Search Trees with 2-Way Comparisons. In: Elbassioni K., Makino K. (eds) Algorithms and Computation. ISAAC 2015. Lecture Notes in Computer Science, vol 9472 (2105). Springer, Berlin, Heidelberg

  22. arXiv:1407.3015  [pdf, ps, other

    cs.DS

    Nearly Linear-Work Algorithms for Mixed Packing/Covering and Facility-Location Linear Programs

    Authors: Neal E. Young

    Abstract: We describe the first nearly linear-time approximation algorithms for explicitly given mixed packing/covering linear programs, and for (non-metric) fractional facility location. We also describe the first parallel algorithms requiring only near-linear total work and finishing in polylog time. The algorithms compute $(1+ε)$-approximate solutions in time (and work) $O^*(N/ε^2)$, where $N$ is the num… ▽ More

    Submitted 5 November, 2014; v1 submitted 10 July, 2014; originally announced July 2014.

    MSC Class: 90-08; 90C05; 49M29; 65K05 ACM Class: F.2.1; G.1.6

  23. arXiv:1407.3008  [pdf, other

    cs.DS

    Bigtable Merge Compaction

    Authors: Claire Mathieu, Carl Staelin, Neal E. Young, Arman Yousefi

    Abstract: NoSQL databases are widely used for massive data storage and real-time web applications. Yet important aspects of these data structures are not well understood. For example, NoSQL databases write most of their data to a collection of files on disk, meanwhile periodically compacting subsets of these files. A compaction policy must choose which files to compact, and when to compact them, without kno… ▽ More

    Submitted 9 July, 2015; v1 submitted 10 July, 2014; originally announced July 2014.

    MSC Class: 68W27; 68P15; 68R05 ACM Class: F.1.2; H.2.4

    Journal ref: SUPERSEDED BY https://arxiv.boxedpaper.com/abs/2011.02615

  24. arXiv:1307.5296  [pdf, ps, other

    cs.DS cs.IT

    First-Come-First-Served for Online Slot Allocation and Huffman Coding

    Authors: Monik Khare, Claire Mathieu, Neal E. Young

    Abstract: Can one choose a good Huffman code on the fly, without knowing the underlying distribution? Online Slot Allocation (OSA) models this and similar problems: There are n slots, each with a known cost. There are n items. Requests for items are drawn i.i.d. from a fixed but hidden probability distribution p. After each request, if the item, i, was not previously requested, then the algorithm (knowing t… ▽ More

    Submitted 7 October, 2013; v1 submitted 19 July, 2013; originally announced July 2013.

    Comments: ACM-SIAM Symposium on Discrete Algorithms (SODA) 2014

    MSC Class: 68W40; 68Q87 ACM Class: F.1.2; F.2.0; H.1.1

  25. arXiv:1303.2920  [pdf, ps, other

    cs.DS

    Approximating 1-dimensional TSP Requires Omega(n log n) Comparisons

    Authors: Neal E. Young

    Abstract: We give a short proof that any comparison-based n^(1-epsilon)-approximation algorithm for the 1-dimensional Traveling Salesman Problem (TSP) requires Omega(n log n) comparisons.

    Submitted 26 March, 2013; v1 submitted 12 March, 2013; originally announced March 2013.

    Comments: Superseded by "On the complexity of approximating Euclidean traveling salesman tours and minimum spanning trees", by Das et al; Algorithmica 19:447-460 (1997)

    MSC Class: 68W25 ACM Class: F.2.2; G.2.2

  26. Approximation Algorithms for the Joint Replenishment Problem with Deadlines

    Authors: Marcin Bienkowski, Jaroslaw Byrka, Marek Chrobak, Neil Dobbs, Tomasz Nowicki, Maxim Sviridenko, Grzegorz Swirszcz, Neal E. Young

    Abstract: The Joint Replenishment Problem (JRP) is a fundamental optimization problem in supply-chain management, concerned with optimizing the flow of goods from a supplier to retailers. Over time, in response to demands at the retailers, the supplier ships orders, via a warehouse, to the retailers. The objective is to schedule these orders to minimize the sum of ordering costs and retailers' waiting costs… ▽ More

    Submitted 2 December, 2015; v1 submitted 13 December, 2012; originally announced December 2012.

    MSC Class: 68W25; 90C05 ACM Class: G.1.6

    Journal ref: J. Scheduling 18(6): 545-560 (2015)

  27. arXiv:1208.2724  [pdf, ps, other

    cs.DS

    Caching with rental cost and zapping

    Authors: Monik Khare, Neal E. Young

    Abstract: The \emph{file caching} problem is defined as follows. Given a cache of size $k$ (a positive integer), the goal is to minimize the total retrieval cost for the given sequence of requests to files. A file $f$ has size $size(f)$ (a positive integer) and retrieval cost $cost(f)$ (a non-negative number) for bringing the file into the cache. A \emph{miss} or \emph{fault} occurs when the requested file… ▽ More

    Submitted 18 October, 2012; v1 submitted 13 August, 2012; originally announced August 2012.

    Comments: Caching with rental cost, caching with zapping

  28. Hamming Approximation of NP Witnesses

    Authors: Daniel Sheldon, Neal E. Young

    Abstract: Given a satisfiable 3-SAT formula, how hard is it to find an assignment to the variables that has Hamming distance at most n/2 to a satisfying assignment? More generally, consider any polynomial-time verifier for any NP-complete language. A d(n)-Hamming-approximation algorithm for the verifier is one that, given any member x of the language, outputs in polynomial time a string a with Hamming dista… ▽ More

    Submitted 19 July, 2013; v1 submitted 1 August, 2012; originally announced August 2012.

    MSC Class: 03D15; 68Q25; 90C59 ACM Class: F.1.3; F.2.2

    Journal ref: Theory of Computing 9(22), 2013, pp. 685-702

  29. On a Linear Program for Minimum-Weight Triangulation

    Authors: Arman Yousefi, Neal E. Young

    Abstract: Minimum-weight triangulation (MWT) is NP-hard. It has a polynomial-time constant-factor approximation algorithm, and a variety of effective polynomial- time heuristics that, for many instances, can find the exact MWT. Linear programs (LPs) for MWT are well-studied, but previously no connection was known between any LP and any approximation algorithm or heuristic for MWT. Here we show the first suc… ▽ More

    Submitted 4 October, 2013; v1 submitted 22 November, 2011; originally announced November 2011.

    Comments: To appear in SICOMP. Extended abstract appeared in SODA 2012

    MSC Class: 68W25; 90C05 ACM Class: G.1.6; I.3.5

    Journal ref: SIAM Journal on Computing 43(1):25-51(2014)

  30. arXiv:1007.0217  [pdf, ps, other

    cs.DS

    A Bound on the Sum of Weighted Pairwise Distances of Points Constrained to Balls

    Authors: Neal E. Young

    Abstract: We consider the problem of choosing Euclidean points to maximize the sum of their weighted pairwise distances, when each point is constrained to a ball centered at the origin. We derive a dual minimization problem and show strong duality holds (i.e., the resulting upper bound is tight) when some locally optimal configuration of points is affinely independent. We sketch a polynomial time algorithm… ▽ More

    Submitted 1 July, 2010; originally announced July 2010.

    Comments: Cornell ORIE Tech Report

    Report number: 1103 MSC Class: 90C27 (Primary) 90C22; 52A40 (Secondary) ACM Class: G.1.6

  31. Greedy D-Approximation Algorithm for Covering with Arbitrary Constraints and Submodular Cost

    Authors: Christos Koufogiannakis, Neal E. Young

    Abstract: This paper describes a simple greedy D-approximation algorithm for any covering problem whose objective function is submodular and non-decreasing, and whose feasible region can be expressed as the intersection of arbitrary (closed upwards) covering constraints, each of which constrains at most D variables of the problem. (A simple example is Vertex Cover, with D = 2.) The algorithm generalizes pre… ▽ More

    Submitted 30 December, 2011; v1 submitted 4 July, 2008; originally announced July 2008.

    MSC Class: 68W25 ACM Class: G.1.6

    Journal ref: Algorithmica 66(1):113-152 (2013)

  32. A Nearly Linear-Time PTAS for Explicit Fractional Packing and Covering Linear Programs

    Authors: Christos Koufogiannakis, Neal E. Young

    Abstract: We give an approximation algorithm for packing and covering linear programs (linear programs with non-negative coefficients). Given a constraint matrix with n non-zeros, r rows, and c columns, the algorithm computes feasible primal and dual solutions whose costs are within a factor of 1+eps of the optimal cost in time O((r+c)log(n)/eps^2 + n).

    Submitted 13 March, 2013; v1 submitted 13 January, 2008; originally announced January 2008.

    Comments: corrected version of FOCS 2007 paper: 10.1109/FOCS.2007.62. Accepted to Algorithmica, 2013

    MSC Class: 68W25 ACM Class: G.1.6

    Journal ref: Algorithmica 70(4):648-674(2014)

  33. The reverse greedy algorithm for the metric k-median problem

    Authors: Marek Chrobak, Claire Kenyon, Neal E. Young

    Abstract: The Reverse Greedy algorithm (RGreedy) for the k-median problem works as follows. It starts by placing facilities on all nodes. At each step, it removes a facility to minimize the resulting total distance from the customers to the remaining facilities. It stops when k facilities remain. We prove that, if the distance function is metric, then the approximation ratio of RGreedy is between ?(log n/… ▽ More

    Submitted 27 September, 2005; v1 submitted 27 April, 2005; originally announced April 2005.

    Comments: to appear in IPL. preliminary version in COCOON '05

    ACM Class: G.1.6; G.2.2; F.2.2

    Journal ref: Information Processing Letters 97:68-72(2006)

  34. Incremental Medians via Online Bidding

    Authors: Marek Chrobak, Claire Kenyon, John Noga, Neal E. Young

    Abstract: In the k-median problem we are given sets of facilities and customers, and distances between them. For a given set F of facilities, the cost of serving a customer u is the minimum distance between u and a facility in F. The goal is to find a set F of k facilities that minimizes the sum, over all customers, of their service costs. Following Mettu and Plaxton, we study the incremental medians prob… ▽ More

    Submitted 28 May, 2020; v1 submitted 26 April, 2005; originally announced April 2005.

    Comments: conference version appeared in LATIN 2006 as "Oblivious Medians via Online Bidding"

    ACM Class: G.1.6; G.2.2; F.2.2

    Journal ref: Algorithmica 50(4):455-478(2008)

  35. A New Operation on Sequences: the Boustrouphedon Transform

    Authors: Jessica Millar, N. J. A. Sloane, Neal E. Young

    Abstract: A generalization of the Seidel-Entringer-Arnold method for calculating the alternating permutation numbers (or secant-tangent numbers) leads to a new operation on integer sequences, the Boustrophedon transform.

    Submitted 24 June, 2002; v1 submitted 20 May, 2002; originally announced May 2002.

    Comments: very minor change: corrected typo in author list. June 24 2002: correction to a proof; additional references

    MSC Class: 05A15

    Journal ref: J. Combinatorial Theory, Series A 76(1):44-54 (1996)

  36. Designing Multi-Commodity Flow Trees

    Authors: Samir Khuller, Balaji Raghavachari, Neal E. Young

    Abstract: The traditional multi-commodity flow problem assumes a given flow network in which multiple commodities are to be maximally routed in response to given demands. This paper considers the multi-commodity flow network-design problem: given a set of multi-commodity flow demands, find a network subject to certain constraints such that the commodities can be maximally routed. This paper focuses on t… ▽ More

    Submitted 30 May, 2002; originally announced May 2002.

    Comments: Conference version in WADS'93

    ACM Class: F.2.2; G.2.2

    Journal ref: Information Processing Letters 50:49-55 (1994)

  37. Rounding Algorithms for a Geometric Embedding of Minimum Multiway Cut

    Authors: David Karger, Phil Klein, Cliff Stein, Mikkel Thorup, Neal E. Young

    Abstract: The multiway-cut problem is, given a weighted graph and k >= 2 terminal nodes, to find a minimum-weight set of edges whose removal separates all the terminals. The problem is NP-hard, and even NP-hard to approximate within 1+delta for some small delta > 0. Calinescu, Karloff, and Rabani (1998) gave an algorithm with performance guarantee 3/2-1/k, based on a geometric relaxation of the problem.… ▽ More

    Submitted 15 September, 2003; v1 submitted 19 May, 2002; originally announced May 2002.

    Comments: Conference version in ACM Symposium on Theory of Computing (1999). To appear in Mathematics of Operations Research

    ACM Class: F.2.0; G.1.6; G.2.2

    Journal ref: Mathematics of Operations Research 29(3):436-461(2004)

  38. A Network-Flow Technique for Finding Low-Weight Bounded-Degree Spanning Trees

    Authors: S. Fekete, S. Khuller, M. Klemmstein, B. Raghavachari, Neal E. Young

    Abstract: The problem considered is the following. Given a graph with edge weights satisfying the triangle inequality, and a degree bound for each vertex, compute a low-weight spanning tree such that the degree of each vertex is at most its specified bound. The problem is NP-hard (it generalizes Traveling Salesman (TSP)). This paper describes a network-flow heuristic for modifying a given tree T to meet t… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.2; G.2.2

    Journal ref: Journal of Algorithms 24(2):310-324 (1997)

  39. Prefix Codes: Equiprobable Words, Unequal Letter Costs

    Authors: Mordecai Golin, Neal E. Young

    Abstract: Describes a near-linear-time algorithm for a variant of Huffman coding, in which the letters may have non-uniform lengths (as in Morse code), but with the restriction that each word to be encoded has equal probability. [See also ``Huffman Coding with Unequal Letter Costs'' (2002).]

    Submitted 18 May, 2002; originally announced May 2002.

    Comments: proceedings version in ICALP (1994)

    ACM Class: F.2.0; E.4; I.4.2

    Journal ref: SIAM J. Computing 25(6):1281-1304 (1996)

  40. Huffman Coding with Letter Costs: A Linear-Time Approximation Scheme

    Authors: Mordecai Golin, Claire Mathieu, Neal E. Young

    Abstract: We give a polynomial-time approximation scheme for the generalization of Huffman Coding in which codeword letters have non-uniform costs (as in Morse code, where the dash is twice as long as the dot). The algorithm computes a (1+epsilon)-approximate solution in time O(n + f(epsilon) log^3 n), where n is the input size.

    Submitted 23 April, 2012; v1 submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.0; E.4; I.4.2

    Journal ref: SIAM Journal on Computing 41(3):684-713(2012)

  41. arXiv:cs/0205047  [pdf, ps, other

    cs.DS cs.DM

    K-Medians, Facility Location, and the Chernoff-Wald Bound

    Authors: Neal E. Young

    Abstract: The paper gives approximation algorithms for the k-medians and facility-location problems (both NP-hard). For k-medians, the algorithm returns a solution using at most ln(n+n/epsilon)k medians and having cost at most (1+epsilon) times the cost of the best solution that uses at most k medians. Here epsilon > 0 is an input to the algorithm. In comparison, the best previous algorithm (Jyh-Han Lin a… ▽ More

    Submitted 8 April, 2005; v1 submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.1; G.1.6; G.2.2; G.3

    Journal ref: ACM-SIAM Symposium on Discrete Algorithms (2000)

  42. On the Number of Iterations for Dantzig-Wolfe Optimization and Packing-Covering Approximation Algorithms

    Authors: Phil Klein, Neal E. Young

    Abstract: We give a lower bound on the iteration complexity of a natural class of Lagrangean-relaxation algorithms for approximately solving packing/covering linear programs. We show that, given an input with $m$ random 0/1-constraints on $n$ variables, with high probability, any such algorithm requires $Ω(ρ\log(m)/ε^2)$ iterations to compute a $(1+ε)$-approximate solution, where $ρ$ is the width of the inp… ▽ More

    Submitted 19 November, 2015; v1 submitted 19 May, 2002; originally announced May 2002.

    ACM Class: F.2.1; G.1.6

    Journal ref: LNCS 1610 (IPCO): 320-327 (1999); SIAM Journal on Computing 44(4):1154-1172(2015)

  43. Balancing Minimum Spanning and Shortest Path Trees

    Authors: Samir Khuller, Balaji Raghavachari, Neal E. Young

    Abstract: This paper give a simple linear-time algorithm that, given a weighted digraph, finds a spanning tree that simultaneously approximates a shortest-path tree and a minimum spanning tree. The algorithm provides a continuous trade-off: given the two trees and epsilon > 0, the algorithm returns a spanning tree in which the distance between any vertex and the root of the shortest-path tree is at most 1… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    Comments: conference version: ACM-SIAM Symposium on Discrete Algorithms (1993)

    ACM Class: F.2.2; G.2.2

    Journal ref: Algorithmica 14(4):305-322 (1995)

  44. The K-Server Dual and Loose Competitiveness for Paging

    Authors: Neal E. Young

    Abstract: This paper has two results. The first is based on the surprising observation that the well-known ``least-recently-used'' paging algorithm and the ``balance'' algorithm for weighted caching are linear-programming primal-dual algorithms. This observation leads to a strategy (called ``Greedy-Dual'') that generalizes them both and has an optimal performance guarantee for weighted caching. For the… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    Comments: conference version: "On-Line Caching as Cache Size Varies", SODA (1991)

    ACM Class: F.1.2; F.2.0; C.2.0

    Journal ref: Algorithmica 11(6):525-541 (1994)

  45. Low-Degree Spanning Trees of Small Weight

    Authors: Samir Khuller, Balaji Raghavachari, Neal E. Young

    Abstract: The degree-d spanning tree problem asks for a minimum-weight spanning tree in which the degree of each vertex is at most d. When d=2 the problem is TSP, and in this case, the well-known Christofides algorithm provides a 1.5-approximation algorithm (assuming the edge weights satisfy the triangle inequality). In 1984, Christos Papadimitriou and Umesh Vazirani posed the challenge of finding an al… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    Comments: conference version in Symposium on Theory of Computing (1994)

    ACM Class: F.2.2; G.2.2

    Journal ref: SIAM J. Computing 25(2):355-368 (1996)

  46. Orienting Graphs to Optimize Reachability

    Authors: S. L. Hakimi, E. Schmeichel, Neal E. Young

    Abstract: The paper focuses on two problems: (i) how to orient the edges of an undirected graph in order to maximize the number of ordered vertex pairs (x,y) such that there is a directed path from x to y, and (ii) how to orient the edges so as to minimize the number of such pairs. The paper describes a quadratic-time algorithm for the first problem, and a proof that the second problem is NP-hard to appro… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.2; G.2.2

    Journal ref: Information Processing Letters 63:229-235 (1997)

  47. Faster Parametric Shortest Path and Minimum Balance Algorithms

    Authors: Neal Young, Robert Tarjan, James Orlin

    Abstract: The parametric shortest path problem is to find the shortest paths in graph where the edge costs are of the form w_ij+lambda where each w_ij is constant and lambda is a parameter that varies. The problem is to find shortest path trees for every possible value of lambda. The minimum-balance problem is to find a ``weighting'' of the vertices so that adjusting the edge costs by the vertex weights… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.2; G.2.2; G.1.6

    Journal ref: Networks 21(2):205-221 (1991)

  48. Approximating the Minimum Equivalent Digraph

    Authors: Samir Khuller, Balaji Raghavachari, Neal E. Young

    Abstract: The MEG (minimum equivalent graph) problem is, given a directed graph, to find a small subset of the edges that maintains all reachability relations between nodes. The problem is NP-hard. This paper gives an approximation algorithm with performance guarantee of pi^2/6 ~ 1.64. The algorithm and its analysis are based on the simple idea of contracting long cycles. (This result is strengthened slig… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    Comments: conference version in ACM-SIAM Symposium on Discrete Algorithms (1994)

    ACM Class: F.2.2; G.2.2

    Journal ref: SIAM J. Computing 24(4):859-872 (1995)

  49. Sequential and Parallel Algorithms for Mixed Packing and Covering

    Authors: Neal E. Young

    Abstract: Mixed packing and covering problems are problems that can be formulated as linear programs using only non-negative coefficients. Examples include multicommodity network flow, the Held-Karp lower bound on TSP, fractional relaxations of set cover, bin-packing, knapsack, scheduling problems, minimum-weight triangulation, etc. This paper gives approximation algorithms for the general class of proble… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.1, G.1.6

  50. Competitive Paging Algorithms

    Authors: Amos Fiat, Richard Karp, Mike Luby, Lyle McGeoch, Daniel Sleator, Neal E. Young

    Abstract: The paging problem is that of deciding which pages to keep in a memory of k pages in order to minimize the number of page faults. This paper introduces the marking algorithm, a simple randomized on-line algorithm for the paging problem, and gives a proof that its performance guarantee (competitive ratio) is O(log k). In contrast, no deterministic on-line algorithm can have a performance guarante… ▽ More

    Submitted 18 May, 2002; originally announced May 2002.

    ACM Class: F.2.0; F.1.2; C.0

    Journal ref: Journal of Algorithms 12:685-699 (1991)