Skip to main content

Showing 1–6 of 6 results for author: Sarwat, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.05717  [pdf, other

    cs.OH

    Towards Mobility Data Science (Vision Paper)

    Authors: Mohamed Mokbel, Mahmoud Sakr, Li Xiong, Andreas Züfle, Jussara Almeida, Taylor Anderson, Walid Aref, Gennady Andrienko, Natalia Andrienko, Yang Cao, Sanjay Chawla, Reynold Cheng, Panos Chrysanthis, Xiqi Fei, Gabriel Ghinita, Anita Graser, Dimitrios Gunopulos, Christian Jensen, Joon-Seok Kim, Kyoung-Sook Kim, Peer Kröger, John Krumm, Johannes Lauer, Amr Magdy, Mario Nascimento , et al. (23 additional authors not shown)

    Abstract: Mobility data captures the locations of moving objects such as humans, animals, and cars. With the availability of GPS-equipped mobile devices and other inexpensive location-tracking technologies, mobility data is collected ubiquitously. In recent years, the use of mobility data has demonstrated significant impact in various domains including traffic management, urban planning, and health sciences… ▽ More

    Submitted 7 March, 2024; v1 submitted 21 June, 2023; originally announced July 2023.

    Comments: Updated to reflect the major revision for ACM Transactions on Spatial Algorithms and Systems (TSAS). This version reflects the final version accepted by ACM TSAS

  2. A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching

    Authors: Venkata Vamsikrishna Meduri, Lucian Popa, Prithviraj Sen, Mohamed Sarwat

    Abstract: Entity Matching (EM) is a core data cleaning task, aiming to identify different mentions of the same real-world entity. Active learning is one way to address the challenge of scarce labeled data in practice, by dynamically collecting the necessary examples to be labeled by an Oracle and refining the learned model (classifier) upon them. In this paper, we build a unified active learning benchmark f… ▽ More

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: accepted for publication in ACM-SIGMOD 2020, 15 pages

    ACM Class: H.2

  3. arXiv:1604.03234  [pdf, ps, other

    cs.DB

    Hippo: A Fast, yet Scalable, Database Indexing Approach

    Authors: Jia Yu, Mohamed Sarwat

    Abstract: Even though existing database indexes (e.g., B+-Tree) speed up the query execution, they suffer from two main drawbacks: (1) A database index usually yields 5% to 15% additional storage overhead which results in non-ignorable dollar cost in big data scenarios especially when deployed on modern storage devices like Solid State Disk (SSD) or Non-Volatile Memory (NVM). (2) Maintaining a database inde… ▽ More

    Submitted 11 April, 2016; originally announced April 2016.

    Comments: 12 pages, 10 figures, conference

    ACM Class: H.2.2

  4. arXiv:1603.05355  [pdf, ps, other

    cs.DB cs.SI

    GeoReach: An Efficient Approach for Evaluating Graph Reachability Queries with Spatial Range Predicates

    Authors: Yuhan Sun, Mohamed Sarwat

    Abstract: Graphs are widely used to model data in many application domains. Thanks to the wide spread use of GPS-enabled devices, many applications assign a spatial attribute to graph vertices (e.g., geo-tagged social media). Users may issue a Reachability Query with Spatial Range Predicate (abbr. RangeReach). RangeReach finds whether an input vertex can reach any spatial vertex that lies within an input sp… ▽ More

    Submitted 17 March, 2016; originally announced March 2016.

  5. arXiv:1408.0325  [pdf, other

    cs.SI cs.IR cs.LG

    Matrix Factorization with Explicit Trust and Distrust Relationships

    Authors: Rana Forsati, Mehrdad Mahdavi, Mehrnoush Shamsfard, Mohamed Sarwat

    Abstract: With the advent of online social networks, recommender systems have became crucial for the success of many online applications/services due to their significance role in tailoring these applications to user-specific needs or preferences. Despite their increasing popularity, in general recommender systems suffer from the data sparsity and the cold-start problems. To alleviate these issues, in recen… ▽ More

    Submitted 1 August, 2014; originally announced August 2014.

    Comments: ACM Transactions on Information Systems

  6. arXiv:1302.5871  [pdf, ps, other

    cs.DS

    The Budgeted Transportation Problem

    Authors: S. Kapoor, M. Sarwat

    Abstract: Consider a transportation problem with sets of sources and sinks. There are profits and prices on the edges. The goal is to maximize the profit while meeting the following constraints; the total flow going out of a source must not exceed its capacity and the total price of the incoming flow on a sink must not exceed its budget. This problem is closely related to the generalized flow problem. We… ▽ More

    Submitted 24 February, 2013; originally announced February 2013.