-
The XENONnT Dark Matter Experiment
Authors:
XENON Collaboration,
E. Aprile,
J. Aalbers,
K. Abe,
S. Ahmed Maouloud,
L. Althueser,
B. Andrieu,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
M. Balata,
L. Baudis,
A. L. Baxter,
M. Bazyk,
L. Bellagamba,
R. Biondi,
A. Bismark,
E. J. Brookes,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
T. K. Bui
, et al. (170 additional authors not shown)
Abstract:
The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in…
▽ More
The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in cryostat). The experiment is expected to extend the sensitivity to WIMP dark matter by more than an order of magnitude compared to XENON1T, thanks to the larger active mass and the significantly reduced background, improved by novel systems such as a radon removal plant and a neutron veto. This article describes the XENONnT experiment and its sub-systems in detail and reports on the detector performance during the first science run.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Data Integrity Error Localization in Networked Systems with Missing Data
Authors:
Yufeng Xin,
Shih-Wen Fu,
Anirban Mandal,
Ryan Tanaka,
Mats Rynge,
Karan Vahi,
Ewa Deelman
Abstract:
Most recent network failure diagnosis systems focused on data center networks where complex measurement systems can be deployed to derive routing information and ensure network coverage in order to achieve accurate and fast fault localization. In this paper, we target wide-area networks that support data-intensive distributed applications. We first present a new multi-output prediction model that…
▽ More
Most recent network failure diagnosis systems focused on data center networks where complex measurement systems can be deployed to derive routing information and ensure network coverage in order to achieve accurate and fast fault localization. In this paper, we target wide-area networks that support data-intensive distributed applications. We first present a new multi-output prediction model that directly maps the application level observations to localize the system component failures. In reality, this application-centric approach may face the missing data challenge as some input (feature) data to the inference models may be missing due to incomplete or lost measurements in wide area networks. We show that the presented prediction model naturally allows the {\it multivariate} imputation to recover the missing data. We evaluate multiple imputation algorithms and show that the prediction performance can be improved significantly in a large-scale network. As far as we know, this is the first study on the missing data issue and applying imputation techniques in network failure localization.
△ Less
Submitted 5 July, 2022;
originally announced July 2022.
-
Blueprint: Cyberinfrastructure Center of Excellence
Authors:
Ewa Deelman,
Anirban Mandal,
Angela P. Murillo,
Jarek Nabrzyski,
Valerio Pascucci,
Robert Ricci,
Ilya Baldin,
Susan Sons,
Laura Christopherson,
Charles Vardeman,
Rafael Ferreira da Silva,
Jane Wyngaard,
Steve Petruzza,
Mats Rynge,
Karan Vahi,
Wendy R. Whitcup,
Josh Drake,
Erik Scott
Abstract:
In 2018, NSF funded an effort to pilot a Cyberinfrastructure Center of Excellence (CI CoE or Center) that would serve the cyberinfrastructure (CI) needs of the NSF Major Facilities (MFs) and large projects with advanced CI architectures. The goal of the CI CoE Pilot project (Pilot) effort was to develop a model and a blueprint for such a CoE by engaging with the MFs, understanding their CI needs,…
▽ More
In 2018, NSF funded an effort to pilot a Cyberinfrastructure Center of Excellence (CI CoE or Center) that would serve the cyberinfrastructure (CI) needs of the NSF Major Facilities (MFs) and large projects with advanced CI architectures. The goal of the CI CoE Pilot project (Pilot) effort was to develop a model and a blueprint for such a CoE by engaging with the MFs, understanding their CI needs, understanding the contributions the MFs are making to the CI community, and exploring opportunities for building a broader CI community. This document summarizes the results of community engagements conducted during the first two years of the project and describes the identified CI needs of the MFs. To better understand MFs' CI, the Pilot has developed and validated a model of the MF data lifecycle that follows the data generation and management within a facility and gained an understanding of how this model captures the fundamental stages that the facilities' data passes through from the scientific instruments to the principal investigators and their teams, to the broader collaborations and the public. The Pilot also aimed to understand what CI workforce development challenges the MFs face while designing, constructing, and operating their CI and what solutions they are exploring and adopting within their projects. Based on the needs of the MFs in the data lifecycle and workforce development areas, this document outlines a blueprint for a CI CoE that will learn about and share the CI solutions designed, developed, and/or adopted by the MFs, provide expertise to the largest NSF projects with advanced and complex CI architectures, and foster a community of CI practitioners and researchers.
△ Less
Submitted 6 March, 2021;
originally announced March 2021.
-
Serverless Containers -- rising viable approach to Scientific Workflows
Authors:
Krzysztof Burkat,
Maciej Pawlik,
Bartosz Balis,
Maciej Malawski,
Karan Vahi,
Mats Rynge,
Rafael Ferreira da Silva,
Ewa Deelman
Abstract:
Increasing popularity of the serverless computing approach has led to the emergence of new cloud infrastructures working in Container-as-a-Service (CaaS) model like AWS Fargate, Google Cloud Run, or Azure Container Instances. They introduce an innovative approach to running cloud containers where developers are freed from managing underlying resources. In this paper, we focus on evaluating capabil…
▽ More
Increasing popularity of the serverless computing approach has led to the emergence of new cloud infrastructures working in Container-as-a-Service (CaaS) model like AWS Fargate, Google Cloud Run, or Azure Container Instances. They introduce an innovative approach to running cloud containers where developers are freed from managing underlying resources. In this paper, we focus on evaluating capabilities of elastic containers and their usefulness for scientific computing in the scientific workflow paradigm using AWS Fargate and Google Cloud Run infrastructures. For experimental evaluation of our approach, we extended HyperFlow engine to support these CaaS platform, together with adapting four real-world scientific workflows composed of several dozen to over a hundred of tasks organized into a dependency graph. We used these workflows to create cost-performance benchmarks and flow execution plots, measuring delays, elasticity, and scalability. The experiments proved that serverless containers can be successfully applied for scientific workflows. Also, the results allow us to gain insights on specific advantages and limits of such platforms.
△ Less
Submitted 21 October, 2020;
originally announced October 2020.
-
Creating a content delivery network for general science on the internet backbone using XCaches
Authors:
Edgar Fajardo,
Marian Zvada,
Derek Weitzel,
Mats Rynge,
John Hicks,
Mat Selmeci,
Brian Lin,
Pascal Paschos,
Brian Bockelman,
Igor Sfiligoi,
Andrew Hanushevsky,
Frank Würthwein
Abstract:
A general problem faced by computing on the grid for opportunistic users is that delivering cycles is simpler than delivering data to those cycles. In this project we show how we integrated XRootD caches placed on the internet backbone to implement a content delivery network for general science workflows. We will show that for some workflows on different science domains like high energy physics, g…
▽ More
A general problem faced by computing on the grid for opportunistic users is that delivering cycles is simpler than delivering data to those cycles. In this project we show how we integrated XRootD caches placed on the internet backbone to implement a content delivery network for general science workflows. We will show that for some workflows on different science domains like high energy physics, gravitational waves, and others the combination of data reuse from the workflows together with the use of caches increases CPU efficiency while decreasing network bandwidth use.
△ Less
Submitted 28 September, 2020; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Custom Execution Environments with Containers in Pegasus-enabled Scientific Workflows
Authors:
Karan Vahi,
Mats Rynge,
George Papadimitriou,
Duncan A. Brown,
Rajiv Mayani,
Rafael Ferreira da Silva,
Ewa Deelman,
Anirban Mandal,
Eric Lyons,
Michael Zink
Abstract:
Science reproducibility is a cornerstone feature in scientific workflows. In most cases, this has been implemented as a way to exactly reproduce the computational steps taken to reach the final results. While these steps are often completely described, including the input parameters, datasets, and codes, the environment in which these steps are executed is only described at a higher level with end…
▽ More
Science reproducibility is a cornerstone feature in scientific workflows. In most cases, this has been implemented as a way to exactly reproduce the computational steps taken to reach the final results. While these steps are often completely described, including the input parameters, datasets, and codes, the environment in which these steps are executed is only described at a higher level with endpoints and operating system name and versions. Though this may be sufficient for reproducibility in the short term, systems evolve and are replaced over time, breaking the underlying workflow reproducibility. A natural solution to this problem is containers, as they are well defined, have a lifetime independent of the underlying system, and can be user-controlled so that they can provide custom environments if needed. This paper highlights some unique challenges that may arise when using containers in distributed scientific workflows. Further, this paper explores how the Pegasus Workflow Management System implements container support to address such challenges.
△ Less
Submitted 20 May, 2019;
originally announced May 2019.
-
StashCache: A Distributed Caching Federation for the Open Science Grid
Authors:
Derek Weitzel,
Marian Zvada,
Ilija Vukotic,
Rob Gardner,
Brian Bockelman,
Mats Rynge,
Edgar Fajardo Hernandez,
Brian Lin,
Matyas Selmeci
Abstract:
Data distribution for opportunistic users is challenging as they neither own the computing resources they are using or any nearby storage. Users are motivated to use opportunistic computing to expand their data processing capacity, but they require storage and fast networking to distribute data to that processing. Since it requires significant management overhead, it is rare for resource providers…
▽ More
Data distribution for opportunistic users is challenging as they neither own the computing resources they are using or any nearby storage. Users are motivated to use opportunistic computing to expand their data processing capacity, but they require storage and fast networking to distribute data to that processing. Since it requires significant management overhead, it is rare for resource providers to allow opportunistic access to storage. Additionally, in order to use opportunistic storage at several distributed sites, users assume the responsibility to maintain their data. In this paper we present StashCache, a distributed caching federation that enables opportunistic users to utilize nearby opportunistic storage. StashCache is comprised of four components: data origins, redirectors, caches, and clients. StashCache has been deployed in the Open Science Grid for several years and has been used by many projects. Caches are deployed in geographically distributed locations across the U.S. and Europe. We will present the architecture of StashCache, as well as utilization information of the infrastructure. We will also present performance analysis comparing distributed HTTP Proxies vs StashCache.
△ Less
Submitted 16 May, 2019;
originally announced May 2019.
-
First low-frequency Einstein@Home all-sky search for continuous gravitational waves in Advanced LIGO data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
B. Allen,
G. Allen,
A. Allocca,
P. A. Altin
, et al. (1017 additional authors not shown)
Abstract:
We report results of a deep all-sky search for periodic gravitational waves from isolated neutron stars in data from the first Advanced LIGO observing run. This search investigates the low frequency range of Advanced LIGO data, between 20 and 100 Hz, much of which was not explored in initial LIGO. The search was made possible by the computing power provided by the volunteers of the Einstein@Home p…
▽ More
We report results of a deep all-sky search for periodic gravitational waves from isolated neutron stars in data from the first Advanced LIGO observing run. This search investigates the low frequency range of Advanced LIGO data, between 20 and 100 Hz, much of which was not explored in initial LIGO. The search was made possible by the computing power provided by the volunteers of the Einstein@Home project. We find no significant signal candidate and set the most stringent upper limits to date on the amplitude of gravitational wave signals from the target population, corresponding to a sensitivity depth of 48.7 [1/$\sqrt{\textrm{Hz}}$]. At the frequency of best strain sensitivity, near 100 Hz, we set 90% confidence upper limits of $1.8 \times 10^{-25}$. At the low end of our frequency range, 20 Hz, we achieve upper limits of $3.9 \times 10^{-24}$. At 55 Hz we can exclude sources with ellipticities greater than $10^{-5}$ within 100 pc of Earth with fiducial value of the principal moment of inertia of $10^{38} \textrm{kg m}^2$.
△ Less
Submitted 14 July, 2017; v1 submitted 9 July, 2017;
originally announced July 2017.
-
All-sky Search for Periodic Gravitational Waves in the O1 LIGO Data
Authors:
LIGO Scientific Collaboration,
Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
B. Allen,
G. Allen,
A. Allocca,
P. A. Altin
, et al. (1020 additional authors not shown)
Abstract:
We report on an all-sky search for periodic gravitational waves in the frequency band 20-475 Hz and with a frequency time derivative in the range of [-1.0, +0.1]e-8 Hz/s. Such a signal could be produced by a nearby spinning and slightly non-axisymmetric isolated neutron star in our galaxy. This search uses the data from Advanced LIGO's first observational run, O1. No periodic gravitational wave si…
▽ More
We report on an all-sky search for periodic gravitational waves in the frequency band 20-475 Hz and with a frequency time derivative in the range of [-1.0, +0.1]e-8 Hz/s. Such a signal could be produced by a nearby spinning and slightly non-axisymmetric isolated neutron star in our galaxy. This search uses the data from Advanced LIGO's first observational run, O1. No periodic gravitational wave signals were observed, and upper limits were placed on their strengths. The lowest upper limits on worst-case (linearly polarized) strain amplitude h0 are 4e-25 near 170 Hz. For a circularly polarized source (most favorable orientation), the smallest upper limits obtained are 1.5e-25. These upper limits refer to all sky locations and the entire range of frequency derivative values. For a population-averaged ensemble of sky locations and stellar orientations, the lowest upper limits obtained for the strain amplitude are 2.5e-25.
△ Less
Submitted 15 July, 2017; v1 submitted 9 July, 2017;
originally announced July 2017.
-
Upper Limits on Gravitational Waves from Scorpius X-1 from a Model-Based Cross-Correlation Search in Advanced LIGO Data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
B. Allen,
G. Allen,
A. Allocca
, et al. (1024 additional authors not shown)
Abstract:
We present the results of a semicoherent search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1, using data from the first Advanced LIGO observing run. The search method uses details of the modelled, parametrized continuous signal to combine coherently data separated by less than a specified coherence time, which can be adjusted to trade off sensitivity against compu…
▽ More
We present the results of a semicoherent search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1, using data from the first Advanced LIGO observing run. The search method uses details of the modelled, parametrized continuous signal to combine coherently data separated by less than a specified coherence time, which can be adjusted to trade off sensitivity against computational cost. A search was conducted over the frequency range from 25 Hz to 2000 Hz, spanning the current observationally-constrained range of the binary orbital parameters. No significant detection candidates were found, and frequency-dependent upper limits were set using a combination of sensitivity estimates and simulated signal injections. The most stringent upper limit was set at 175 Hz, with comparable limits set across the most sensitive frequency range from 100 Hz to 200 Hz. At this frequency, the 95 pct upper limit on signal amplitude h0 is 2.3e-25 marginalized over the unknown inclination angle of the neutron star's spin, and 8.03e-26 assuming the best orientation (which results in circularly polarized gravitational waves). These limits are a factor of 3-4 stronger than those set by other analyses of the same data, and a factor of about 7 stronger than the best upper limits set using initial LIGO data. In the vicinity of 100 Hz, the limits are a factor of between 1.2 and 3.5 above the predictions of the torque balance model, depending on inclination angle, if the most likely inclination angle of 44 degrees is assumed, they are within a factor of 1.7.
△ Less
Submitted 16 November, 2019; v1 submitted 9 June, 2017;
originally announced June 2017.
-
GW170104: Observation of a 50-Solar-Mass Binary Black Hole Coalescence at Redshift 0.2
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
B. Allen,
G. Allen,
A. Allocca
, et al. (1026 additional authors not shown)
Abstract:
We describe the observation of GW170104, a gravitational-wave signal produced by the coalescence of a pair of stellar-mass black holes. The signal was measured on January 4, 2017 at 10:11:58.6 UTC by the twin advanced detectors of the Laser Interferometer Gravitational-Wave Observatory during their second observing run, with a network signal-to-noise ratio of 13 and a false alarm rate less than 1…
▽ More
We describe the observation of GW170104, a gravitational-wave signal produced by the coalescence of a pair of stellar-mass black holes. The signal was measured on January 4, 2017 at 10:11:58.6 UTC by the twin advanced detectors of the Laser Interferometer Gravitational-Wave Observatory during their second observing run, with a network signal-to-noise ratio of 13 and a false alarm rate less than 1 in 70,000 years. The inferred component black hole masses are $31.2^{+8.4}_{-6.0}\,M_\odot$ and $19.4^{+5.3}_{-5.9}\,M_\odot$ (at the 90% credible level). The black hole spins are best constrained through measurement of the effective inspiral spin parameter, a mass-weighted combination of the spin components perpendicular to the orbital plane, $χ_\mathrm{eff} = -0.12^{+0.21}_{-0.30}.$ This result implies that spin configurations with both component spins positively aligned with the orbital angular momentum are disfavored. The source luminosity distance is $880^{+450}_{-390}~\mathrm{Mpc}$ corresponding to a redshift of $z = 0.18^{+0.08}_{-0.07}$. We constrain the magnitude of modifications to the gravitational-wave dispersion relation and perform null tests of general relativity. Assuming that gravitons are dispersed in vacuum like massive particles, we bound the graviton mass to $m_g \le 7.7 \times 10^{-23}~\mathrm{eV}/c^2$. In all cases, we find that GW170104 is consistent with general relativity.
△ Less
Submitted 23 October, 2018; v1 submitted 6 June, 2017;
originally announced June 2017.
-
Search for intermediate mass black hole binaries in the first observing run of Advanced LIGO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
B. Allen,
G. Allen,
A. Allocca,
H. Almoubayyed,
P. A. Altin
, et al. (1018 additional authors not shown)
Abstract:
During their first observational run, the two Advanced LIGO detectors attained an unprecedented sensitivity, resulting in the first direct detections of gravitational-wave signals and GW151226, produced by stellar-mass binary black hole systems. This paper reports on an all-sky search for gravitational waves (GWs) from merging intermediate mass black hole binaries (IMBHBs). The combined results fr…
▽ More
During their first observational run, the two Advanced LIGO detectors attained an unprecedented sensitivity, resulting in the first direct detections of gravitational-wave signals and GW151226, produced by stellar-mass binary black hole systems. This paper reports on an all-sky search for gravitational waves (GWs) from merging intermediate mass black hole binaries (IMBHBs). The combined results from two independent search techniques were used in this study: the first employs a matched-filter algorithm that uses a bank of filters covering the GW signal parameter space, while the second is a generic search for GW transients (bursts). No GWs from IMBHBs were detected, therefore, we constrain the rate of several classes of IMBHB mergers. The most stringent limit is obtained for black holes of individual mass $100\,M_\odot$, with spins aligned with the binary orbital angular momentum. For such systems, the merger rate is constrained to be less than $0.93~\mathrm{Gpc^{-3}\,yr}^{-1}$ in comoving units at the $90\%$ confidence level, an improvement of nearly 2 orders of magnitude over previous upper limits.
△ Less
Submitted 25 September, 2017; v1 submitted 15 April, 2017;
originally announced April 2017.
-
Search for gravitational waves from Scorpius X-1 in the first Advanced LIGO observing run with a hidden Markov model
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
T. Adams,
P. Addesso,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
M. Afrough,
B. Agarwal,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
B. Allen,
G. Allen,
A. Allocca,
H. Almoubayyed
, et al. (1021 additional authors not shown)
Abstract:
Results are presented from a semi-coherent search for continuous gravitational waves from the brightest low-mass X-ray binary, Scorpius X-1, using data collected during the first Advanced LIGO observing run (O1). The search combines a frequency domain matched filter (Bessel-weighted $\mathcal{F}$-statistic) with a hidden Markov model to track wandering of the neutron star spin frequency. No eviden…
▽ More
Results are presented from a semi-coherent search for continuous gravitational waves from the brightest low-mass X-ray binary, Scorpius X-1, using data collected during the first Advanced LIGO observing run (O1). The search combines a frequency domain matched filter (Bessel-weighted $\mathcal{F}$-statistic) with a hidden Markov model to track wandering of the neutron star spin frequency. No evidence of gravitational waves is found in the frequency range 60-650 Hz. Frequentist 95% confidence strain upper limits, $h_0^{95\%} = 4.0\times10^{-25}$, $8.3\times10^{-25}$, and $3.0\times10^{-25}$ for electromagnetically restricted source orientation, unknown polarization, and circular polarization, respectively, are reported at 106 Hz. They are $\leq 10$ times higher than the theoretical torque-balance limit at 106 Hz.
△ Less
Submitted 31 May, 2017; v1 submitted 12 April, 2017;
originally announced April 2017.
-
Standing Together for Reproducibility in Large-Scale Computing: Report on reproducibility@XSEDE
Authors:
Doug James,
Nancy Wilkins-Diehr,
Victoria Stodden,
Dirk Colbry,
Carlos Rosales,
Mark Fahey,
Justin Shi,
Rafael F. Silva,
Kyo Lee,
Ralph Roskies,
Laurence Loewe,
Susan Lindsey,
Rob Kooper,
Lorena Barba,
David Bailey,
Jonathan Borwein,
Oscar Corcho,
Ewa Deelman,
Michael Dietze,
Benjamin Gilbert,
Jan Harkes,
Seth Keele,
Praveen Kumar,
Jong Lee,
Erika Linke
, et al. (30 additional authors not shown)
Abstract:
This is the final report on reproducibility@xsede, a one-day workshop held in conjunction with XSEDE14, the annual conference of the Extreme Science and Engineering Discovery Environment (XSEDE). The workshop's discussion-oriented agenda focused on reproducibility in large-scale computational research. Two important themes capture the spirit of the workshop submissions and discussions: (1) organiz…
▽ More
This is the final report on reproducibility@xsede, a one-day workshop held in conjunction with XSEDE14, the annual conference of the Extreme Science and Engineering Discovery Environment (XSEDE). The workshop's discussion-oriented agenda focused on reproducibility in large-scale computational research. Two important themes capture the spirit of the workshop submissions and discussions: (1) organizational stakeholders, especially supercomputer centers, are in a unique position to promote, enable, and support reproducible research; and (2) individual researchers should conduct each experiment as though someone will replicate that experiment. Participants documented numerous issues, questions, technologies, practices, and potentially promising initiatives emerging from the discussion, but also highlighted four areas of particular interest to XSEDE: (1) documentation and training that promotes reproducible research; (2) system-level tools that provide build- and run-time information at the level of the individual job; (3) the need to model best practices in research collaborations involving XSEDE staff; and (4) continued work on gateways and related technologies. In addition, an intriguing question emerged from the day's interactions: would there be value in establishing an annual award for excellence in reproducible research?
△ Less
Submitted 2 January, 2015; v1 submitted 17 December, 2014;
originally announced December 2014.
-
Creating A Galactic Plane Atlas With Amazon Web Services
Authors:
G. Bruce Berriman,
Ewa Deelman,
John Good,
Gideon Juve,
Jamie Kinney,
Ann Merrihew,
Mats Rynge
Abstract:
This paper describes by example how astronomers can use cloud-computing resources offered by Amazon Web Services (AWS) to create new datasets at scale. We have created from existing surveys an atlas of the Galactic Plane at 16 wavelengths from 1 μm to 24 μm with pixels co-registered at spatial sampling of 1 arcsec. We explain how open source tools support management and operation of a virtual clus…
▽ More
This paper describes by example how astronomers can use cloud-computing resources offered by Amazon Web Services (AWS) to create new datasets at scale. We have created from existing surveys an atlas of the Galactic Plane at 16 wavelengths from 1 μm to 24 μm with pixels co-registered at spatial sampling of 1 arcsec. We explain how open source tools support management and operation of a virtual cluster on AWS platforms to process data at scale, and describe the technical issues that users will need to consider, such as optimization of resources, resource costs, and management of virtual machine instances.
△ Less
Submitted 23 December, 2013;
originally announced December 2013.
-
Snowmass Energy Frontier Simulations using the Open Science Grid (A Snowmass 2013 whitepaper)
Authors:
A. Avetisyan,
S. Bhattacharya,
M. Narain,
S. Padhi,
J. Hirschauer,
T. Levshina,
P. McBride,
C. Sehgal,
M. Slyz,
M. Rynge,
S. Malik,
J. Stupak III
Abstract:
Snowmass is a US long-term planning study for the high-energy community by the American Physical Society's Division of Particles and Fields. For its simulation studies, opportunistic resources are harnessed using the Open Science Grid infrastructure. Late binding grid technology, GlideinWMS, was used for distributed scheduling of the simulation jobs across many sites mainly in the US. The pilot in…
▽ More
Snowmass is a US long-term planning study for the high-energy community by the American Physical Society's Division of Particles and Fields. For its simulation studies, opportunistic resources are harnessed using the Open Science Grid infrastructure. Late binding grid technology, GlideinWMS, was used for distributed scheduling of the simulation jobs across many sites mainly in the US. The pilot infrastructure also uses the Parrot mechanism to dynamically access CvmFS in order to ascertain a homogeneous environment across the nodes. This report presents the resource usage and the storage model used for simulating large statistics Standard Model backgrounds needed for Snowmass Energy Frontier studies.
△ Less
Submitted 1 October, 2013; v1 submitted 4 August, 2013;
originally announced August 2013.
-
A Tale Of 160 Scientists, Three Applications, A Workshop and A Cloud
Authors:
G. Bruce Berriman,
Carolyn Brinkworth,
Dawn Gelino,
Dennis K. Wittman,
Ewa Deelman,
Gideon Juve,
Mats Rynge,
Jamie Kinney
Abstract:
The NASA Exoplanet Science Institute (NExScI) hosts the annual Sagan Workshops, thematic meetings aimed at introducing researchers to the latest tools and methodologies in exoplanet research. The theme of the Summer 2012 workshop, held from July 23 to July 27 at Caltech, was to explore the use of exoplanet light curves to study planetary system architectures and atmospheres. A major part of the wo…
▽ More
The NASA Exoplanet Science Institute (NExScI) hosts the annual Sagan Workshops, thematic meetings aimed at introducing researchers to the latest tools and methodologies in exoplanet research. The theme of the Summer 2012 workshop, held from July 23 to July 27 at Caltech, was to explore the use of exoplanet light curves to study planetary system architectures and atmospheres. A major part of the workshop was to use hands-on sessions to instruct attendees in the use of three open source tools for the analysis of light curves, especially from the Kepler mission. Each hands-on session involved the 160 attendees using their laptops to follow step-by-step tutorials given by experts. We describe how we used the Amazon Elastic Cloud 2 to run these applications.
△ Less
Submitted 16 November, 2012;
originally announced November 2012.