Showing 1–15 of 15 results for author: Doraiswamy, H

Searching in archive cs.
  1. arXiv:2502.12918  [pdf, ps, other]

    cs.DB

    Query Rewriting via LLMs

    Authors: Sriram Dharwada, Himanshu Devrani, Jayant Haritsa, Harish Doraiswamy

    Abstract: When complex SQL queries suffer slow executions despite query optimization, DBAs typically invoke automated query rewriting tools to recommend "lean" equivalents that are conducive to faster execution. The rewritings are usually achieved via transformation rules, but these rules are limited in scope and difficult to update in a production system. Recently, LLM-based techniques have also been sug…

    Submitted 10 June, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2409.07257  [pdf, other]

    cs.GR cs.CG cs.CV cs.HC cs.LG

    TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees

    Authors: Vitoria Guardieiro, Felipe Inagaki de Oliveira, Harish Doraiswamy, Luis Gustavo Nonato, Claudio Silva

    Abstract: High-dimensional data, characterized by many features, can be difficult to visualize effectively. Dimensionality reduction techniques, such as PCA, UMAP, and t-SNE, address this challenge by projecting the data into a lower-dimensional space while preserving important relationships. TopoMap is another technique that excels at preserving the underlying structure of the data, leading to interpretabl…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: This is the author's version of the article that has been accepted for publication in IEEE Transactions on Visualization and Computer Graphics (TVCG)

  3. arXiv:2203.14362  [pdf, other]

    cs.DB

    GPU-Powered Spatial Database Engine for Commodity Hardware: Extended Version

    Authors: Harish Doraiswamy, Juliana Freire

    Abstract: Given the massive growth in the volume of spatial data, there is a great need for systems that can efficiently evaluate spatial queries over large data sets. These queries are notoriously expensive using traditional database solutions. While faster response times can be attained through powerful clusters or servers with large main-memory, these options, due to cost and complexity, are out of reach…

    Submitted 27 March, 2022; originally announced March 2022.

  4. arXiv:2201.02155  [pdf, other]

    cs.LG

    Topological Representations of Local Explanations

    Authors: Peter Xenopoulos, Gromit Chan, Harish Doraiswamy, Luis Gustavo Nonato, Brian Barr, Claudio Silva

    Abstract: Local explainability methods -- those which seek to generate an explanation for each prediction -- are becoming increasingly prevalent due to the need for practitioners to rationalize their model outputs. However, comparing local explainability methods is difficult since they each generate outputs in various scales and dimensions. Furthermore, due to the stochastic nature of some explainability me…

    Submitted 6 January, 2022; originally announced January 2022.

  5. UrbanRama: Navigating Cities in Virtual Reality

    Authors: Shaoyu Chen, Fabio Miranda, Nivan Ferreira, Marcos Lage, Harish Doraiswamy, Corinne Brenner, Connor Defanti, Michael Koutsoubis, Luc Wilson, Ken Perlin, Claudio Silva

    Abstract: Exploring large virtual environments, such as cities, is a central task in several domains, such as gaming and urban planning. VR systems can greatly help this task by providing an immersive experience; however, a common issue with viewing and navigating a city in the traditional sense is that users can either obtain a local or a global view, but not both at the same time, requiring them to contin…

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: Video: https://www.youtube.com/watch?v=M8BFZnxq-Qg

  6. arXiv:2011.01324  [pdf, other]

    cs.AI cs.LG stat.ML

    Valuing Player Actions in Counter-Strike: Global Offensive

    Authors: Peter Xenopoulos, Harish Doraiswamy, Claudio Silva

    Abstract: Esports, despite its expanding interest, lacks fundamental sports analytics resources such as accessible data or proven and reproducible analytical frameworks. Even Counter-Strike: Global Offensive (CSGO), the second most popular esport, suffers from these problems. Thus, quantitative evaluation of CSGO players, a task important to teams, media, bettors and fans, is difficult. To address this, we…

    Submitted 4 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: to be published in 2020 IEEE International Conference on Big Data

  7. arXiv:2010.12548  [pdf, other]

    cs.DB

    The Case for Distance-Bounded Spatial Approximations

    Authors: Eleni Tzirita Zacharatou, Andreas Kipf, Ibrahim Sabek, Varun Pandey, Harish Doraiswamy, Volker Markl

    Abstract: Spatial approximations have been traditionally used in spatial databases to accelerate the processing of complex geometric operations. However, approximations are typically only used in a first filtering step to determine a set of candidate spatial objects that may fulfill the query condition. To provide accurate results, the exact geometries of the candidate objects are tested against the query c…

    Submitted 21 January, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 11th Annual Conference on Innovative Data Systems Research (CIDR'21)

  8. arXiv:2009.01512  [pdf, other]

    cs.GR cs.CG cs.CV cs.HC cs.LG

    TopoMap: A 0-dimensional Homology Preserving Projection of High-Dimensional Data

    Authors: Harish Doraiswamy, Julien Tierny, Paulo J. S. Silva, Luis Gustavo Nonato, Claudio Silva

    Abstract: Multidimensional Projection is a fundamental tool for high-dimensional data analytics and visualization. With very few exceptions, projection techniques are designed to map data from a high-dimensional space to a visual space so as to preserve some dissimilarity (similarity) measure, such as the Euclidean distance for example. In fact, although adopting distinct mathematical formulations designed…

    Submitted 3 September, 2020; originally announced September 2020.

  9. Urban Mosaic: Visual Exploration of Streetscapes Using Large-Scale Image Data

    Authors: Fabio Miranda, Maryam Hosseini, Marcos Lage, Harish Doraiswamy, Graham Dove, Claudio T. Silva

    Abstract: Urban planning is increasingly data driven, yet the challenge of designing with data at a city scale and remaining sensitive to the impact at a human scale is as important today as it was for Jane Jacobs. We address this challenge with Urban Mosaic, a tool for exploring the urban fabric through a spatially and temporally dense data set of 7.7 million street-level images from New York City, captured…

    Submitted 30 August, 2020; originally announced August 2020.

    Comments: Video: https://www.youtube.com/watch?v=Nrhk7lb3GUo

    Journal ref: CHI '20: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems

  10. arXiv:2004.03630  [pdf, other]

    cs.DB

    A GPU-friendly Geometric Data Model and Algebra for Spatial Queries: Extended Version

    Authors: Harish Doraiswamy, Juliana Freire

    Abstract: The availability of low cost sensors has led to an unprecedented growth in the volume of spatial data. However, the time required to evaluate even simple spatial queries over large data sets greatly hampers our ability to interactively explore these data sets and extract actionable insights. Graphics Processing Units (GPUs) are increasingly being used to speed up spatial queries. However, existing…

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: This is the extended version of the paper published in SIGMOD 2020

    ACM Class: H.2.1; H.2.8

  11. Shadow Accrual Maps: Efficient Accumulation of City-Scale Shadows Over Time

    Authors: Fabio Miranda, Harish Doraiswamy, Marcos Lage, Luc Wilson, Mondrian Hsieh, Claudio T. Silva

    Abstract: Large scale shadows from buildings in a city play an important role in determining the environmental quality of public spaces. They can be both beneficial, such as for pedestrians during summer, and detrimental, by impacting vegetation and by blocking direct sunlight. Determining the effects of shadows requires the accumulation of shadows over time across different periods in a year. In this paper…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: Video: https://www.youtube.com/watch?v=LsZv23d1LyM, Data: https://github.com/ViDA-NYU/shadow-accrual-maps

    Journal ref: IEEE Transactions on Visualization and Computer Graphics (Volume: 25, Issue: 3, Mar. 2019)

  12. Unwind: Interactive Fish Straightening

    Authors: Francis Williams, Alexander Bock, Harish Doraiswamy, Cassandra Donatelli, Kayla Hall, Adam Summers, Daniele Panozzo, Cláudio T. Silva

    Abstract: The ScanAllFish project is a large-scale effort to scan all the world's 33,100 known species of fishes. It has already generated thousands of volumetric CT scans of fish species which are available on open access platforms such as the Open Science Framework. To achieve a scanning rate required for a project of this magnitude, many specimens are grouped together into a single tube and scanned all a…

    Submitted 5 February, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

  13. arXiv:1805.00889  [pdf, other]

    cs.SD cs.CY cs.HC eess.AS

    SONYC: A System for the Monitoring, Analysis and Mitigation of Urban Noise Pollution

    Authors: Juan Pablo Bello, Claudio Silva, Oded Nov, R. Luke DuBois, Anish Arora, Justin Salamon, Charles Mydlarz, Harish Doraiswamy

    Abstract: We present the Sounds of New York City (SONYC) project, a smart cities initiative focused on developing a cyber-physical system for the monitoring, analysis and mitigation of urban noise pollution. Noise pollution is one of the topmost quality of life issues for urban residents in the U.S. with proven effects on health, education, the economy, and the environment. Yet, most cities lack the resourc…

    Submitted 18 May, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

    Comments: Accepted May 2018, Communications of the ACM. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record will be published in Communications of the ACM

  14. Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets

    Authors: Fernando Chirigati, Harish Doraiswamy, Theodoros Damoulas, Juliana Freire

    Abstract: The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these…

    Submitted 21 October, 2016; originally announced October 2016.

    Journal ref: Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16), pp. 1011-1025

  15. arXiv:1608.06949  [pdf, other]

    cs.CY cs.GR cs.SI physics.data-an physics.soc-ph

    Urban Pulse: Capturing the Rhythm of Cities

    Authors: Fabio Miranda, Harish Doraiswamy, Marcos Lage, Kai Zhao, Bruno Gonçalves, Luc Wilson, Mondrian Hsieh, Cláudio T. Silva

    Abstract: Cities are inherently dynamic. Interesting patterns of behavior typically manifest at several key areas of a city over multiple temporal resolutions. Studying these patterns can greatly help a variety of experts ranging from city planners and architects to human behavioral experts. Recent technological innovations have enabled the collection of enormous amounts of data that can help in these studi…

    Submitted 29 December, 2017; v1 submitted 24 August, 2016; originally announced August 2016.

    Comments: 10 pages, 10 figures, 1 table. Demo video: https://www.youtube.com/watch?v=J70-Ns0cFnQ . Github project: https://github.com/ViDA-NYU/urban-pulse ; Added github link

    Journal ref: IEEE Transactions on Visualization and Computer Graphics (Volume: 23, Issue: 1, Jan. 2017)