Skip to main content

Showing 1–9 of 9 results for author: Nieto-Santisteban, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:cs/0701172  [pdf

    cs.DB cs.CE

    Cross-Matching Multiple Spatial Observations and Dealing with Missing Data

    Authors: Jim Gray, Alex Szalay, Tamas Budavari, Robert Lupton, Maria Nieto-Santisteban, Ani Thakar

    Abstract: Cross-match spatially clusters and organizes several astronomical point-source measurements from one or more surveys. Ideally, each object would be found in each survey. Unfortunately, the observation conditions and the objects themselves change continually. Even some stationary objects are missing in some observations; sometimes objects have a variable light flux and sometimes the seeing is wor… ▽ More

    Submitted 26 January, 2007; originally announced January 2007.

    Report number: MSR TR 2006-175

  2. arXiv:cs/0701171  [pdf

    cs.DB cs.DS

    The Zones Algorithm for Finding Points-Near-a-Point or Cross-Matching Spatial Datasets

    Authors: Jim Gray, Maria A. Nieto-Santisteban, Alexander S. Szalay

    Abstract: Zones index an N-dimensional Euclidian or metric space to efficiently support points-near-a-point queries either within a dataset or between two datasets. The approach uses relational algebra and the B-Tree mechanism found in almost all relational database systems. Hence, the Zones Algorithm gives a portable-relational implementation of points-near-point, spatial cross-match, and self-match quer… ▽ More

    Submitted 26 January, 2007; originally announced January 2007.

    Report number: MSR TR 2006 52

  3. arXiv:cs/0701167  [pdf

    cs.DB cs.CE

    Large-Scale Query and XMatch, Entering the Parallel Zone

    Authors: Maria A. Nieto-Santisteban, Aniruddha R. Thakar, Alexander S. Szalay, Jim Gray

    Abstract: Current and future astronomical surveys are producing catalogs with millions and billions of objects. On-line access to such big datasets for data mining and cross-correlation is usually as highly desired as unfeasible. Providing these capabilities is becoming critical for the Virtual Observatory framework. In this paper we present various performance tests that show how using Relational Databas… ▽ More

    Submitted 25 January, 2007; originally announced January 2007.

    Comments: Astronomical Data Analysis Software and Systems XV in San Lorenzo de El Escorial, Madrid, Spain, October 2005, to appear in the ASP Conference Series

    Report number: MSR-TR-2005- 169

  4. arXiv:cs/0604112  [pdf

    cs.DB cs.DL

    Designing a Multi-petabyte Database for LSST

    Authors: Jacek Becla, Andrew Hanushevsky, Sergei Nikolaev, Ghaleb Abdulla, Alex Szalay, Maria Nieto-Santisteban, Ani Thakar, Jim Gray

    Abstract: The 3.2 giga-pixel LSST camera will produce approximately half a petabyte of archive images every month. These data need to be reduced in under a minute to produce real-time transient alerts, and then added to the cumulative catalog for further analysis. The catalog is expected to grow about three hundred terabytes per year. The data volume, the real-time transient alerting requirements of the L… ▽ More

    Submitted 27 April, 2006; originally announced April 2006.

    Comments: 8 pages. to appear in SPIE

  5. arXiv:cs/0502072  [pdf

    cs.DC cs.DB

    Batch is back: CasJobs, serving multi-TB data on the Web

    Authors: William OMullane, Nolan Li, Maria Nieto-Santisteban, Alex Szalay, Ani Thakar, Jim Gray

    Abstract: The Sloan Digital Sky Survey (SDSS) science database describes over 140 million objects and is over 1.5 TB in size. The SDSS Catalog Archive Server (CAS) provides several levels of query interface to the SDSS data via the SkyServer website. Most queries execute in seconds or minutes. However, some queries can take hours or days, either because they require non-index scans of the largest tables,… ▽ More

    Submitted 16 February, 2005; originally announced February 2005.

    Report number: Microsoft Technical Report MSR TR 2005 19

  6. arXiv:cs/0502018  [pdf

    cs.DC

    When Database Systems Meet the Grid

    Authors: Maria A. Nieto-Santisteban, Alexander S. Szalay, Aniruddha R. Thakar, William J. O'Mullane, Jim Gray, James Annis

    Abstract: We illustrate the benefits of combining database systems and Grid technologies for data-intensive applications. Using a cluster of SQL servers, we reimplemented an existing Grid application that finds galaxy clusters in a large astronomical database. The SQL implementation runs an order of magnitude faster than the earlier Tcl-C-file-based implementation. We discuss why and how Grid applications… ▽ More

    Submitted 3 February, 2005; originally announced February 2005.

    Comments: better version of the paper (better graphics exceed ArXiv 1MB limit) at http://www-db.cs.wisc.edu/cidr/papers/P13.pdf

    Report number: Microsoft Technical Report MSR-TR-2004-81

    Journal ref: Proceedings of CIDR 2005 Conference, Asilomar, CA. Jan. 2005, pp 154-161

  7. arXiv:cs/0502008  [pdf

    cs.DB cs.CE

    Scientific Data Management in the Coming Decade

    Authors: Jim Gray, David T. Liu, Maria Nieto-Santisteban, Alexander S. Szalay, David DeWitt, Gerd Heber

    Abstract: This is a thought piece on data-intensive science requirements for databases and science centers. It argues that peta-scale datasets will be housed by science centers that provide substantial storage and processing for scientists who access the data via smart notebooks. Next-generation science instruments and simulations will generate these peta-scale datasets. The need to publish and share data… ▽ More

    Submitted 1 February, 2005; originally announced February 2005.

    Report number: Microsoft Technical Report MSR-TR-2005-10

  8. arXiv:cs/0408031  [pdf

    cs.DB

    There Goes the Neighborhood: Relational Algebra for Spatial Data Search

    Authors: Jim Gray, Alexander S. Szalay, Aniruddha R. Thakar, Gyorgy Fekete, William O'Mullane, Maria A. Nieto-Santisteban, Gerd Heber, Arnold H. Rots

    Abstract: We explored ways of doing spatial search within a relational database: (1) hierarchical triangular mesh (a tessellation of the sphere), (2) a zoned bucketing system, and (3) representing areas as disjunctive-normal form constraints. Each of these approaches has merits. They all allow efficient point-in-region queries. A relational representation for regions allows Boolean operations among them a… ▽ More

    Submitted 13 August, 2004; originally announced August 2004.

    Comments: Original at http://research.microsoft.com/research/pubs/view.aspx?tr_id=736

    Report number: MSR-TR-2004-32 ACM Class: C.4

  9. arXiv:cs/0403017  [pdf

    cs.DB

    Extending the SDSS Batch Query System to the National Virtual Observatory Grid

    Authors: Maria A. Nieto-Santisteban, William O'Mullane, Jim Gray, Nolan Li, Tamas Budavari, Alexander S. Szalay, Aniruddha R. Thakar

    Abstract: The Sloan Digital Sky Survey science database is approaching 2TB. While the vast majority of queries normally execute in seconds or minutes, this interactive execution time can be disproportionately increased by a small fraction of queries that take hours or days to run; either because they require non-index scans of the largest tables or because they request very large result sets. In response… ▽ More

    Submitted 12 March, 2004; originally announced March 2004.

    Comments: original available at http://research.microsoft.com/research/pubs/view.aspx?tr_id=714

    Report number: MSR-TR-2004-12 ACM Class: H.2.4