Showing 1–22 of 22 results for author: Singer, G

  1. arXiv:2505.13225  [pdf, other]

    cs.CV

    Automatic Complementary Separation Pruning Toward Lightweight CNNs

    Authors: David Levin, Gonen Singer

    Abstract: In this paper, we present Automatic Complementary Separation Pruning (ACSP), a novel and fully automated pruning method for convolutional neural networks. ACSP integrates the strengths of both structured pruning and activation-based pruning, enabling the efficient removal of entire components such as neurons and channels while leveraging activations to identify and retain the most relevant compone…

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2501.16535  [pdf, ps, other]

    cs.DS

    Latency Guarantees for Caching with Delayed Hits

    Authors: Keerthana Gurushankar, Noah G. Singer, Bernardo Subercaseaux

    Abstract: In the classical caching problem, when a requested page is not present in the cache (i.e., a "miss"), it is assumed to travel from the backing store into the cache "before" the next request arrives. However, in many real-life applications, such as content delivery networks, this assumption is unrealistic. The "delayed-hits" model for caching, introduced by Atre, Sherry, Wang, and Berger, account…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: Accepted at INFOCOM2025

  3. arXiv:2411.18829  [pdf, ps, other]

    cs.DS

    Streaming Algorithms via Local Algorithms for Maximum Directed Cut

    Authors: Raghuvansh R. Saxena, Noah G. Singer, Madhu Sudan, Santhoshini Velusamy

    Abstract: We explore the use of local algorithms in the design of streaming algorithms for the Maximum Directed Cut problem. Specifically, building on the local algorithm of Buchbinder et al. (FOCS'12) and Censor-Hillel et al. (ALGOSENSORS'17), we develop streaming algorithms for both adversarially and randomly ordered streams that approximate the value of maximum directed cut in bounded-degree graphs. In…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 45 pages, to appear in SODA 2025

  4. arXiv:2411.12976  [pdf, ps, other]

    cs.DS

    Oblivious Algorithms for Maximum Directed Cut: New Upper and Lower Bounds

    Authors: Samuel Hwang, Noah G. Singer, Santhoshini Velusamy

    Abstract: In the maximum directed cut problem, the input is a directed graph $G=(V,E)$, and the goal is to pick a partition $V = S \cup (V \setminus S)$ of the vertices such that as many edges as possible go from $S$ to $V\setminus S$. Oblivious algorithms, introduced by Feige and Jozeph (Algorithmica'17), are a simple class of algorithms for this problem. These algorithms independently and randomly assign…

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 17 pages, 7 figures

  5. arXiv:2411.05916  [pdf, ps, other]

    math.GR cs.DM

    Coboundary expansion inside Chevalley coset complex HDXs

    Authors: Ryan O'Donnell, Noah G. Singer

    Abstract: Recent major results in property testing~\cite{BLM24,DDL24} and PCPs~\cite{BMV24} were unlocked by moving to high-dimensional expanders (HDXs) constructed from $\widetilde{C}_d$-type buildings, rather than the long-known $\widetilde{A}_d$-type ones. At the same time, these building quotient HDXs are not as easy to understand as the more elementary (and more symmetric/explicit) \emph{coset complex}…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 130 pages

  6. arXiv:2309.02272  [pdf, other]

    cs.LG

    Graph-Based Automatic Feature Selection for Multi-Class Classification via Mean Simplified Silhouette

    Authors: David Levin, Gonen Singer

    Abstract: This paper introduces a novel graph-based filter method for automatic feature selection (abbreviated as GB-AFS) for multi-class classification tasks. The method determines the minimum combination of features required to sustain prediction performance while maintaining complementary discriminating abilities between different classes. It does not require any user-defined parameters such as the numbe…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 figures

  7. arXiv:2305.04978  [pdf, other]

    cs.CL

    NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge

    Authors: Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi, Swabha Swayamdipta

    Abstract: Comparative knowledge (e.g., steel is stronger and heavier than styrofoam) is an essential component of our world knowledge, yet understudied in prior literature. In this paper, we harvest the dramatic improvements in knowledge capabilities of language models into a large-scale comparative knowledge base. While the ease of acquisition of such comparative knowledge is much higher from extreme-scale…

    Submitted 5 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to NAACL 2024 Findings

  8. arXiv:2305.04438  [pdf, ps, other]

    cs.DS

    Oblivious algorithms for the Max-$k$AND Problem

    Authors: Noah G. Singer

    Abstract: Motivated by recent works on streaming algorithms for constraint satisfaction problems (CSPs), we define and analyze oblivious algorithms for the Max-$k$AND problem. This generalizes the definition by Feige and Jozeph (Algorithmica '15) of oblivious algorithms for Max-DICUT, a special case of Max-$2$AND. Oblivious algorithms round each variable with probability depending only on a quantity called…

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: 29 pages, 1 table. In submission

  9. arXiv:2304.06664  [pdf, other]

    cs.DS

    On streaming approximation algorithms for constraint satisfaction problems

    Authors: Noah G. Singer

    Abstract: In this thesis, we explore streaming algorithms for approximating constraint satisfaction problems (CSPs). The setup is roughly the following: A computer has limited memory space, sees a long "stream" of local constraints on a set of variables, and tries to estimate how many of the constraints may be simultaneously satisfied. The past ten years have seen a number of works in this area, and this th…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Harvard College senior thesis; 119 pages plus references; abstract shortened for arXiv; formatted with Dissertate template (feel free to copy!); exposits papers arXiv:2105.01782 (APPROX 2021) and arXiv:2112.06319 (APPROX 2022)

  10. Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding

    Authors: Gadi Singer, Joscha Bach, Tetiana Grinberg, Nagib Hakim, Phillip Howard, Vasudev Lal, Zev Rivlin

    Abstract: While end-to-end learning systems are rapidly gaining capabilities and popularity, the increasing computational demands for deploying such systems, along with a lack of flexibility, adaptability, explainability, reasoning and verification capabilities, require new types of architectures. Here we introduce a classification of hybrid systems which, based on an analysis of human knowledge and intelli…

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: Artificial General Intelligence: 15th International Conference, AGI 2022, Seattle, WA, USA, August 2022, Proceedings

    Journal ref: Springer Lecture Notes in Computer Science, vol 13539, 2023

  11. arXiv:2303.01792  [pdf, other]

    cs.LG

    Graph-based Extreme Feature Selection for Multi-class Classification Tasks

    Authors: Shir Friedman, Gonen Singer, Neta Rabin

    Abstract: When processing high-dimensional datasets, a common pre-processing step is feature selection. Filter-based feature selection algorithms are not tailored to a specific classification method, but rather rank the relevance of each feature with respect to the target and the task. This work focuses on a graph-based, filter feature selection method that is suited for multi-class classifications tasks. W…

    Submitted 3 March, 2023; originally announced March 2023.

  12. arXiv:2211.03916  [pdf, ps, other]

    cs.DS

    Improved Streaming Algorithms for Maximum Directed Cut via Smoothed Snapshots

    Authors: Raghuvansh R. Saxena, Noah G. Singer, Madhu Sudan, Santhoshini Velusamy

    Abstract: We give an $\widetilde{O}(\sqrt{n})$-space single-pass $0.483$-approximation streaming algorithm for estimating the maximum directed cut size (Max-DICUT) in a directed graph on $n$ vertices. This improves over an $O(\log n)$-space $4/9 < 0.45$ approximation algorithm due to Chou, Golovnev, and Velusamy (FOCS 2020), which was known to be optimal for $o(\sqrt{n})$-space algorithms. Max-DICUT is a sp…

    Submitted 9 May, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 53 pages, 2 figures; substantial revisions; in submission; abstract shortened to fit requirements

  13. arXiv:2210.12365  [pdf, other]

    cs.CL

    NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation

    Authors: Phillip Howard, Gadi Singer, Vasudev Lal, Yejin Choi, Swabha Swayamdipta

    Abstract: While counterfactual data augmentation offers a promising step towards robust generalization in natural language processing, producing a set of counterfactuals that offer valuable inductive bias for models remains a challenge. Most existing approaches for producing counterfactuals, manual or automated, rely on small perturbations via minimal edits, resulting in simplistic changes. We introduce Neu…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  14. Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs

    Authors: Phillip Howard, Arden Ma, Vasudev Lal, Ana Paula Simoes, Daniel Korat, Oren Pereg, Moshe Wasserblat, Gadi Singer

    Abstract: The extraction of aspect terms is a critical step in fine-grained sentiment analysis of text. Existing approaches for this task have yielded impressive results when the training and testing data are from the same domain. However, these methods show a drastic decrease in performance when applied to cross-domain settings where the domain of the testing data differs from that of the training data. To…

    Submitted 18 October, 2022; originally announced October 2022.

    ACM Class: I.2.7

    Journal ref: Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022). Association for Computing Machinery, New York, NY, USA, 780-790

  15. Adaptive Learning for the Resource-Constrained Classification Problem

    Authors: Danit Shifman Abukasis, Izack Cohen, Xiaochen Xian, Kejun Huang, Gonen Singer

    Abstract: Resource-constrained classification tasks are common in real-world applications such as allocating tests for disease diagnosis, hiring decisions when filling a limited number of positions, and defect detection in manufacturing settings under a limited inspection budget. Typical classification algorithms treat the learning process and the resource constraints as two separate and sequential tasks. H…

    Submitted 19 July, 2022; originally announced July 2022.

    Journal ref: Engineering Applications of Artificial Intelligence, 119, 105741 (2023)

  16. arXiv:2111.07382  [pdf, ps, other]

    cs.LG stat.ML

    Adaptive Cost-Sensitive Learning in Neural Networks for Misclassification Cost Problems

    Authors: Ohad Volk, Gonen Singer

    Abstract: We design a new adaptive learning algorithm for misclassification cost problems that attempt to reduce the cost of misclassified instances derived from the consequences of various errors. Our algorithm (adaptive cost sensitive learning - AdaCSL) adaptively adjusts the loss function such that the classifier bridges the difference between the class distributions between subgroups of samples in the t…

    Submitted 14 November, 2021; originally announced November 2021.

  17. Streaming approximation resistance of every ordering CSP

    Authors: Noah G. Singer, Madhu Sudan, Santhoshini Velusamy

    Abstract: An ordering constraint satisfaction problem (OCSP) is defined by a family $\mathcal{F}$ of predicates mapping permutations on $\{1,\ldots,k\}$ to $\{0,1\}$. An instance of Max-OCSP($\mathcal{F}$) on $n$ variables consists of a list of constraints, each consisting of a predicate from $\mathcal{F}$ applied on $k$ distinct variables. The goal is to find an ordering of the $n$ variables that maximizes…

    Submitted 1 August, 2024; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: 21 pages, 1 figure. Abstract abridged. Appeared in APPROX'21 and Computational Complexity

  18. The relationship between internet user type and user performance when carrying out simple vs. complex search tasks

    Authors: Georg Singer, Pille Pruulmann-Vengerfeldt, Ulrich Norbisrath, Dirk Lewandowski

    Abstract: It is widely known that people become better at an activity if they perform this activity long and often. Yet the question of whether being active in related areas like communicating online, writing blog articles or commenting on community forums has an impact on a person's ability to perform Web searches is still unanswered. Web searching has become a key task conducted online; in this paper we…

    Submitted 18 November, 2015; originally announced November 2015.

    Comments: http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/3960/3245

  19. arXiv:1206.2528  [pdf, other]

    cs.IR

    Ordinary Search Engine Users assessing Difficulty, Effort, and Outcome for Simple and Complex Search Tasks

    Authors: Georg Singer, Ulrich Norbisrath, Dirk Lewandowski

    Abstract: Search engines are the preferred tools for finding information on the Web. They are advancing to be the common helpers to answer any of our search needs. We use them to carry out simple look-up tasks and also to work on rather time consuming and more complex search tasks. Yet, we do not know very much about the user performance while carrying out those tasks -- especially not for ordinary users.…

    Submitted 12 June, 2012; originally announced June 2012.

    Comments: 10 pages

  20. arXiv:1206.2465  [pdf]

    cs.IR cs.DL

    Search Strategies of Library Search Experts

    Authors: Kristiina Singer, Georg Singer, Krista Lepik, Ulrich Norbisrath, Pille Pruulmann-Vengerfeldt

    Abstract: Search engines like Google, Yahoo or Bing are an excellent support for finding documents, but this strength also imposes a limitation. As they are optimized for document retrieval tasks, they perform less well when it comes to more complex search needs. Complex search tasks are usually described as open-ended, abstract and poorly defined information needs with a multifaceted character. In this pap…

    Submitted 26 June, 2012; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: 6 pages

  21. Impact of Gender and Age on performing Search Tasks Online

    Authors: Georg Singer, Ulrich Norbisrath, Dirk Lewandowski

    Abstract: More and more people use the Internet to work on duties of their daily work routine. To find the right information online, Web search engines are the tools of their choice. Apart from finding facts, people use Web search engines to also execute rather complex and time consuming search tasks. So far search engines follow the one-for-all approach to serve their users and little is known about the impa…

    Submitted 7 June, 2012; originally announced June 2012.

    Comments: 10 pages

  22. arXiv:1206.1492  [pdf]

    cs.IR

    Ordinary Search Engine Users Carrying Out Complex Search Tasks

    Authors: Georg Singer, Ulrich Norbisrath, Dirk Lewandowski

    Abstract: Web search engines have become the dominant tools for finding information on the Internet. Due to their popularity, users apply them to a wide range of search needs, from simple look-ups to rather complex information tasks. This paper presents the results of a study to investigate the characteristics of these complex information needs in the context of Web search engines. The aim of the study is t…

    Submitted 4 July, 2012; v1 submitted 7 June, 2012; originally announced June 2012.

    Comments: 60 pages