Skip to main content

Showing 1–21 of 21 results for author: Ullman, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.14887  [pdf

    cs.RO cs.HC

    Measuring Human-Robot Trust with the MDMT (Multi-Dimensional Measure of Trust)

    Authors: Bertram F. Malle, Daniel Ullman

    Abstract: We describe the steps of developing the MDMT (Multi-Dimensional Measure of Trust), an intuitive self-report measure of perceived trustworthiness of various agents (human, robot, animal). We summarize the evidence that led to the original four-dimensional form (v1) and to the most recent five-dimensional form (v2). We examine the measure's strengths and limitations and point to further necessary va… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: In SCRITA 2023 Workshop Proceedings (arXiv:2311.05401) held in conjunction with 32nd IEEE International Conference on Robot & Human Interactive Communication, 08/28-31 2023, Busan (Korea)

    Report number: SCRITA/2023/05

  2. arXiv:2310.17639  [pdf, other

    cs.AI cs.CL cs.LG

    In-Context Learning Dynamics with Random Binary Sequences

    Authors: Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tomer D. Ullman

    Abstract: Large language models (LLMs) trained on huge corpora of text datasets demonstrate intriguing capabilities, achieving state-of-the-art performance on tasks they were not explicitly trained for. The precise nature of LLM capabilities is often mysterious, and different prompts can elicit different capabilities through in-context learning. We propose a framework that enables us to analyze in-context l… ▽ More

    Submitted 15 April, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  3. arXiv:2307.01415  [pdf, other

    cs.DS

    Matrix Multiplication Using Only Addition

    Authors: Daniel Cussen, Jeffrey D. Ullman

    Abstract: Matrix multiplication consumes a large fraction of the time taken in many machine-learning algorithms. Thus, accelerator chips that perform matrix multiplication faster than conventional processors or even GPU's are of increasing interest. In this paper, we demonstrate a method of performing matrix multiplication without a scalar multiplier circuit. In many cases of practical interest, only a sing… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 9 pages, 2 figures

    ACM Class: F.2.1

  4. arXiv:2106.05891  [pdf, other

    cs.LG cs.AI stat.ML

    Temporal and Object Quantification Networks

    Authors: Jiayuan Mao, Zhezheng Luo, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu, Leslie Pack Kaelbling, Tomer D. Ullman

    Abstract: We present Temporal and Object Quantification Networks (TOQ-Nets), a new class of neuro-symbolic networks with a structural bias that enables them to learn to recognize complex relational-temporal events. This is done by including reasoning layers that implement finite-domain quantification over objects and time. The structure allows them to generalize directly to input instances with varying numb… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: IJCAI 2021. First two authors contributed equally. Project page: http://toqnet.csail.mit.edu/

  5. arXiv:2102.12321  [pdf, other

    cs.AI cs.CV cs.LG

    AGENT: A Benchmark for Core Psychological Reasoning

    Authors: Tianmin Shu, Abhishek Bhandwaldar, Chuang Gan, Kevin A. Smith, Shari Liu, Dan Gutfreund, Elizabeth Spelke, Joshua B. Tenenbaum, Tomer D. Ullman

    Abstract: For machine agents to successfully interact with humans in real-world settings, they will need to develop an understanding of human mental life. Intuitive psychology, the ability to reason about hidden mental variables that drive observable actions, comes naturally to people: even pre-verbal infants can tell agents from objects, expecting agents to act efficiently to achieve goals given constraint… ▽ More

    Submitted 25 July, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: ICML 2021, 12 pages, 7 figures

  6. arXiv:2005.06154  [pdf, other

    cs.DB cs.CR cs.DC cs.IR

    Panda: Partitioned Data Security on Outsourced Sensitive and Non-sensitive Data

    Authors: Sharad Mehrotra, Shantanu Sharma, Jeffrey D. Ullman, Dhrubajyoti Ghosh, Peeyush Gupta

    Abstract: Despite extensive research on cryptography, secure and efficient query processing over outsourced data remains an open challenge. This paper continues along with the emerging trend in secure data processing that recognizes that the entire dataset may not be sensitive, and hence, non-sensitivity of data can be exploited to overcome limitations of existing encryption-based approaches. We, first, pro… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: This version has been accepted in ACM Transactions on Management Information Systems. The final published version of this paper may differ from this accepted version. A preliminary version of this paper [arXiv:1812.09233] was accepted and presented in IEEE ICDE 2019

  7. arXiv:1905.13376  [pdf, other

    cs.DB cs.DC

    Efficient Multiway Hash Join on Reconfigurable Hardware

    Authors: Kunle Olukotun, Raghu Prabhakar, Rekha Singhal, Jeffrey D. Ullman, Yaqi Zhang

    Abstract: We propose the algorithms for performing multiway joins using a new type of coarse grain reconfigurable hardware accelerator~-- ``Plasticine''~-- that, compared with other accelerators, emphasizes high compute capability and high on-chip communication bandwidth. Joining three or more relations in a single step, i.e. multiway join, is efficient when the join of any two relations yields too large an… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 20 pages

  8. arXiv:1812.09233  [pdf, other

    cs.DB cs.CR cs.DC cs.IR

    Partitioned Data Security on Outsourced Sensitive and Non-sensitive Data

    Authors: Sharad Mehrotra, Shantanu Sharma, Jeffrey D. Ullman, Anurag Mishra

    Abstract: Despite extensive research on cryptography, secure and efficient query processing over outsourced data remains an open challenge. This paper continues along the emerging trend in secure data processing that recognizes that the entire dataset may not be sensitive, and hence, non-sensitivity of data can be exploited to overcome limitations of existing encryption-based approaches. We propose a new se… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Accepted in IEEE International Conference on Data Engineering (ICDE), 2019. arXiv admin note: text overlap with arXiv:1812.01741

  9. arXiv:1605.06143  [pdf, other

    cs.DB

    Efficient and Private Approximations of Distributed Databases Calculations

    Authors: Philip Derbeko, Shlomi Dolev, Ehud Gudes, Jeffrey D. Ullman

    Abstract: In recent years, an increasing amount of data is collected in different and often, not cooperative, databases. The problem of privacy-preserving, distributed calculations over separated databases and, a relative to it, issue of private data release were intensively investigated. However, despite a considerable progress, computational complexity, due to an increasing size of data, remains a limitin… ▽ More

    Submitted 19 May, 2016; originally announced May 2016.

  10. arXiv:1604.00289  [pdf, other

    cs.AI cs.CV cs.LG cs.NE stat.ML

    Building Machines That Learn and Think Like People

    Authors: Brenden M. Lake, Tomer D. Ullman, Joshua B. Tenenbaum, Samuel J. Gershman

    Abstract: Recent progress in artificial intelligence (AI) has renewed interest in building systems that learn and think like people. Many advances have come from using deep neural networks trained end-to-end in tasks such as object recognition, video games, and board games, achieving performance that equals or even beats humans in some respects. Despite their biological inspiration and performance achieveme… ▽ More

    Submitted 2 November, 2016; v1 submitted 1 April, 2016; originally announced April 2016.

    Comments: In press at Behavioral and Brain Sciences. Open call for commentary proposals (until Nov. 22, 2016). https://www.cambridge.org/core/journals/behavioral-and-brain-sciences/information/calls-for-commentary/open-calls-for-commentary

  11. arXiv:1602.01443  [pdf, other

    cs.DB

    Some Pairs Problems

    Authors: Jeffrey D. Ullman, Jonathan Ullman

    Abstract: A common form of MapReduce application involves discovering relationships between certain pairs of inputs. Similarity joins serve as a good example of this type of problem, which we call a "some-pairs" problem. In the framework of Afrati et al. (VLDB 2013), algorithms are measured by the tradeoff between reducer size (maximum number of inputs a reducer can handle) and the replication rate (average… ▽ More

    Submitted 3 February, 2016; originally announced February 2016.

  12. SharesSkew: An Algorithm to Handle Skew for Joins in MapReduce

    Authors: Foto Afrati, Nikos Stasinopoulos, Jeffrey D. Ullman, Angelos Vassilakopoulos

    Abstract: In this paper, we investigate the problem of computing a multiway join in one round of MapReduce when the data may be skewed. We optimize on communication cost, i.e., the amount of data that is transferred from the mappers to the reducers. We identify join attributes values that appear very frequently, Heavy Hitters (HH). We distribute HH valued records to reducers avoiding skew by using an adapta… ▽ More

    Submitted 12 December, 2015; originally announced December 2015.

  13. arXiv:1509.08855  [pdf, ps, other

    cs.DB

    Computing Marginals Using MapReduce

    Authors: Foto Afrati, Shantanu Sharma, Jeffrey D. Ullman, Jonathan R. Ullman

    Abstract: We consider the problem of computing the data-cube marginals of a fixed order $k$ (i.e., all marginals that aggregate over $k$ dimensions), using a single round of MapReduce. The focus is on the relationship between the reducer size (number of inputs allowed at a single reducer) and the replication rate (number of reducers to which an input is sent). We show that the replication rate is minimized… ▽ More

    Submitted 29 September, 2015; originally announced September 2015.

  14. arXiv:1508.01171  [pdf, other

    cs.DB cs.DC

    Meta-MapReduce: A Technique for Reducing Communication in MapReduce Computations

    Authors: Foto Afrati, Shlomi Dolev, Shantanu Sharma, Jeffrey D. Ullman

    Abstract: MapReduce has proven to be one of the most useful paradigms in the revolution of distributed computing, where cloud services and cluster computing become the standard venue for computing. The federation of cloud and big data activities is the next challenge where MapReduce should be modified to avoid (big) data migration across remote (cloud) sites. This is exactly our scope of research, where onl… ▽ More

    Submitted 28 July, 2016; v1 submitted 5 August, 2015; originally announced August 2015.

  15. arXiv:1507.04461  [pdf, other

    cs.DB cs.CC cs.DC

    Assignment Problems of Different-Sized Inputs in MapReduce

    Authors: Foto Afrati, Shlomi Dolev, Ephraim Korach, Shantanu Sharma, Jeffrey D. Ullman

    Abstract: A MapReduce algorithm can be described by a mapping schema, which assigns inputs to a set of reducers, such that for each required output there exists a reducer that receives all the inputs that participate in the computation of this output. Reducers have a capacity, which limits the sets of inputs that they can be assigned. However, individual inputs may vary in terms of size. We consider, for th… ▽ More

    Submitted 20 October, 2016; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: This paper is accepted in ACM Transactions on Knowledge Discovery from Data (TKDD), August 2016. Preliminary versions of this paper have appeared in the proceeding of DISC 2014 and BeyondMR 2015

  16. arXiv:1504.03247  [pdf, other

    cs.DB

    Handling Skew in Multiway Joins in Parallel Processing

    Authors: Foto N. Afrati, Jeffrey D. Ullman, Angelos Vasilakopoulos

    Abstract: Handling skew is one of the major challenges in query processing. In distributed computational environments such as MapReduce, uneven distribution of the data to the servers is not desired. One of the dominant measures that we want to optimize in distributed environments is communication cost. In a MapReduce job this is the amount of data that is transferred from the mappers to the reducers. In th… ▽ More

    Submitted 13 April, 2015; originally announced April 2015.

    Comments: 4 pages

  17. arXiv:1501.06758  [pdf, ps, other

    cs.DB

    Assignment of Different-Sized Inputs in MapReduce

    Authors: Foto Afrati, Shlomi Dolev, Ephraim Korach, Shantanu Sharma, Jeffrey D. Ullman

    Abstract: A MapReduce algorithm can be described by a mapping schema, which assigns inputs to a set of reducers, such that for each required output there exists a reducer that receives all the inputs that participate in the computation of this output. Reducers have a capacity, which limits the sets of inputs that they can be assigned. However, individual inputs may vary in terms of size. We consider, for th… ▽ More

    Submitted 27 January, 2015; originally announced January 2015.

    Comments: Brief announcement in International Symposium on Distributed Computing (DISC), 2014

  18. arXiv:1410.4156  [pdf, other

    cs.DB

    GYM: A Multiround Join Algorithm In MapReduce

    Authors: Foto Afrati, Manas Joglekar, Christopher RĂ©, Semih Salihoglu, Jeffrey D. Ullman

    Abstract: Multiround algorithms are now commonly used in distributed data processing systems, yet the extent to which algorithms can benefit from running more rounds is not well understood. This paper answers this question for a spectrum of rounds for the problem of computing the equijoin of $n$ relations. Specifically, given any query $Q$ with width $\w$, {\em intersection width} $\iw$, input size… ▽ More

    Submitted 25 January, 2017; v1 submitted 15 October, 2014; originally announced October 2014.

  19. arXiv:1208.0615  [pdf, ps, other

    cs.DC

    Enumerating Subgraph Instances Using Map-Reduce

    Authors: Foto N. Afrati, Dimitris Fotakis, Jeffrey D. Ullman

    Abstract: The theme of this paper is how to find all instances of a given "sample" graph in a larger "data graph," using a single round of map-reduce. For the simplest sample graph, the triangle, we improve upon the best known such algorithm. We then examine the general case, considering both the communication cost between mappers and reducers and the total computation cost at the reducers. To minimize comm… ▽ More

    Submitted 21 November, 2012; v1 submitted 2 August, 2012; originally announced August 2012.

    Comments: 37 pages

  20. arXiv:1206.4377  [pdf, other

    cs.DC cs.DS

    Upper and Lower Bounds on the Cost of a Map-Reduce Computation

    Authors: Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey D. Ullman

    Abstract: In this paper we study the tradeoff between parallelism and communication cost in a map-reduce computation. For any problem that is not "embarrassingly parallel," the finer we partition the work of the reducers so that more parallelism can be extracted, the greater will be the total communication between mappers and reducers. We introduce a model of problems that can be solved in a single round of… ▽ More

    Submitted 19 June, 2012; originally announced June 2012.

    Comments: 14 pages

  21. arXiv:1204.1754  [pdf, other

    cs.DB cs.DC

    Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation

    Authors: Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey D. Ullman

    Abstract: A significant amount of recent research work has addressed the problem of solving various data management problems in the cloud. The major algorithmic challenges in map-reduce computations involve balancing a multitude of factors such as the number of machines available for mappers/reducers, their memory requirements, and communication cost (total amount of data sent from mappers to reducers). Mos… ▽ More

    Submitted 8 April, 2012; originally announced April 2012.

    Comments: 5 pages