
Showing 1–50 of 110 results for author: Kurtz, M

  1. arXiv:2411.02355

    cs.LG cs.AI

    "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

    Authors: Eldar Kurtic, Alexandre Marques, Shubhra Pandit, Mark Kurtz, Dan Alistarh

    Abstract: Quantization is a powerful tool for accelerating large language model (LLM) inference, but the accuracy-performance trade-offs across different formats remain unclear. In this paper, we conduct the most comprehensive empirical study to date, evaluating FP8, INT8, and INT4 quantization across academic benchmarks and real-world tasks on the entire Llama-3.1 model family. Through over 500,000 evaluat…

    Submitted 30 May, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted to ACL 2025

  2. arXiv:2405.03594

    cs.CL cs.AI

    Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

    Authors: Abhinav Agarwalla, Abhay Gupta, Alexandre Marques, Shubhra Pandit, Michael Goin, Eldar Kurtic, Kevin Leong, Tuan Nguyen, Mahmoud Salem, Dan Alistarh, Sean Lie, Mark Kurtz

    Abstract: Large language models (LLMs) have revolutionized Natural Language Processing (NLP), but their size creates computational bottlenecks. We introduce a novel approach to create accurate, sparse foundational versions of performant LLMs that achieve full accuracy recovery for fine-tuning tasks at up to 70% sparsity. We achieve this for the LLaMA-2 7B model by combining the SparseGPT one-shot pruning me…

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2404.17438

    cs.RO cs.AI cs.MA

    Real-World Deployment of a Hierarchical Uncertainty-Aware Collaborative Multiagent Planning System

    Authors: Martina Stadler Kurtz, Samuel Prentice, Yasmin Veys, Long Quang, Carlos Nieto-Granda, Michael Novitzky, Ethan Stump, Nicholas Roy

    Abstract: We would like to enable a collaborative multiagent team to navigate at long length scales and under uncertainty in real-world environments. In practice, planning complexity scales with the number of agents in the team, with the length scale of the environment, and with environmental uncertainty. Enabling tractable planning requires developing abstract models that can represent complex, high-qualit…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  4. arXiv:2312.14211

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33rd annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  5. arXiv:2312.08579

    cs.CL astro-ph.IM cs.LG

    Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

    Authors: Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

    Abstract: The automatic identification of planetary feature names in astronomy publications presents numerous challenges. These features include craters, defined as roughly circular depressions resulting from impact or volcanic activity; dorsas, which are elongate raised structures or wrinkle ridges; and lacus, small irregular patches of dark, smooth material on the Moon, referred to as "lake" (Planetary Na…

    Submitted 17 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  6. arXiv:2303.17612

    cs.CL cs.AI cs.LG

    oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes

    Authors: Daniel Campos, Alexandre Marques, Mark Kurtz, ChengXiang Zhai

    Abstract: In this paper, we introduce the range of oBERTa language models, an easy-to-use set of language models which allows Natural Language Processing (NLP) practitioners to obtain between 3.8 and 24.3 times faster models without expertise in model compression. Specifically, oBERTa extends existing work on pruning, knowledge distillation, and quantization and leverages frozen embeddings improves distilla…

    Submitted 6 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: SustaiNLP2023 @ ACL 2023, 9 pages, 2 figures, 45 tables

  7. arXiv:2212.00744

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first…

    Submitted 29 November, 2022; originally announced December 2022.

  8. arXiv:2205.12452

    cs.CL cs.AI

    Sparse*BERT: Sparse Models Generalize To New tasks and Domains

    Authors: Daniel Campos, Alexandre Marques, Tuan Nguyen, Mark Kurtz, ChengXiang Zhai

    Abstract: Large Language Models have become the core architecture upon which most modern natural language processing (NLP) systems build. These models can consistently deliver impressive accuracy and robustness across tasks and domains, but their high computational overhead can make inference difficult and expensive. To make using these models less costly, recent work has explored leveraging structured and…

    Submitted 5 April, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Presented at Sparsity in Neural Networks Workshop at ICML 2022, 6 pages, 2 figures, 4 tables

  9. arXiv:2203.07259

    cs.CL cs.LG

    The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

    Authors: Eldar Kurtic, Daniel Campos, Tuan Nguyen, Elias Frantar, Mark Kurtz, Benjamin Fineran, Michael Goin, Dan Alistarh

    Abstract: Transformer-based language models have become a key building block for natural language processing. While these models are extremely accurate, they can be too large and computationally intensive to run on standard deployments. A variety of compression methods, including distillation, quantization, structured and unstructured pruning are known to decrease model size and increase inference speed, wi…

    Submitted 17 October, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to EMNLP 2022

  10. arXiv:2202.00777

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  11. arXiv:2112.00590

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and…

    Submitted 1 December, 2021; originally announced December 2021.

  12. arXiv:2111.13445

    cs.CV cs.AI cs.LG

    How Well Do Sparse Imagenet Models Transfer?

    Authors: Eugenia Iofinova, Alexandra Peste, Mark Kurtz, Dan Alistarh

    Abstract: Transfer learning is a classic paradigm by which models pretrained on large "upstream" datasets are adapted to yield good results on "downstream" specialized datasets. Generally, more accurate models on the "upstream" dataset tend to provide better transfer accuracy "downstream". In this work, we perform an in-depth investigation of this phenomenon in the context of convolutional neural networks (…

    Submitted 21 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR'22. This version: 25 pages, 9 figures (including appendix). **Includes extended upstream training results, which are not present in the CVPR version.**

  13. arXiv:2012.03470

    astro-ph.IM astro-ph.GA astro-ph.SR

    Center for Astrophysics Optical Infrared Science Archive. I. FAST Spectrograph

    Authors: Jessica Mink, Warren R. Brown, Igor V. Chilingarian, Daniel Fabricant, Michael J. Kurtz, Sean Moran, Jaehyon Rhee, Susan Tokarz, William F. Wyatt

    Abstract: We announce the public release of 141,531 moderate-dispersion optical spectra of 72,247 objects acquired over the past 25 years with the FAST Spectrograph on the Fred L. Whipple Observatory 1.5-meter Tillinghast telescope. We describe the data acquisition and processing so that scientists can understand the spectra. We highlight some of the largest FAST survey programs, and make recommendations fo…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 17 pages, 18 figures, 8 tables

  14. arXiv:2010.01418

    cs.DL astro-ph.IM physics.soc-ph

    Second Order Operators in the NASA Astrophysics Data System

    Authors: Michael J. Kurtz, Roman Chyla

    Abstract: Second Order Operators (SOOs) are database functions which form secondary queries based on attributes of the objects returned in an initial query; they can provide powerful methods to investigate complex, multipartite information graphs. The NASA Astrophysics Data System (ADS) has implemented four SOOs, reviews, useful, trending, and similar which use the citations, references, downloads, and abst…

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: ADS Bibcode:2020BAAS...52b0207K, author's version

    Journal ref: Bulletin of the American Astronomical Society, Vol. 52, No. 2, id. 0207 2020

  15. arXiv:2009.14323

    astro-ph.IM cs.DL

    Enabling Synergy: Improving the Information Infrastructure for Planetary Science

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: In this whitepaper we advocate that the Planetary Science (PS) community build a discipline-specific digital library, in collaboration with the existing astronomy digital library, ADS. We suggest that the PS data archives increase their level of curation to allow for direct linking between the archival data and the derived journal articles. And we suggest that a new component of the PS information…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 8 pages, submitted to the Planetary Science and Astrobiology Decadal Survey 2023-2032

  16. arXiv:2009.05048

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  17. arXiv:1903.00297

    astro-ph.IM

    From Dark Energy to Exolife: Improving the Digital Information Infrastructure for Astrophysics

    Authors: Michael J. Kurtz, Alberto Accomazzi

    Abstract: Some of the most exciting and promising areas of Astronomy research today are found at the boundaries of the discipline: the search for Exoplanets and Multi-Messenger Astronomy. In order to achieve breakthroughs in these research fields over the next decade, innovation and expansion of the digital information infrastructure which supports this research is required. Astronomy has been well-served b…

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: 6 pages, whitepaper submitted to Astro2020, the Astronomy and Astrophysics Decadal Survey

  18. arXiv:1901.05463

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a service-oriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  19. arXiv:1803.03598

    astro-ph.IM cs.DL physics.soc-ph

    Merging the Astrophysics and Planetary Science Information Systems

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: Conceptually exoplanet research has one foot in the discipline of Astrophysics and the other foot in Planetary Science. Research strategies for exoplanets will require efficient access to data and information from both realms. Astrophysics has a sophisticated, well integrated, distributed information system with archives and data centers which are interlinked with the technical literature via the…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: Whitepaper submitted to the Committee on an Exoplanet Science Strategy

  20. arXiv:1801.00815

    cs.AI astro-ph.IM physics.soc-ph

    Advice from the Oracle: Really Intelligent Information Retrieval

    Authors: Michael J. Kurtz

    Abstract: What is "intelligent" information retrieval? Essentially this is asking what is intelligence, in this article I will attempt to show some of the aspects of human intelligence, as related to information retrieval. I will do this by the device of a semi-imaginary Oracle. Every Observatory has an oracle, someone who is a distinguished scientist, has great administrative responsibilities, acts as ment…

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Author copy; published 25 years ago at the beginning of the Astrophysics Data System; 2018 keywords added

    Journal ref: In: Heck A., Murtagh F. (eds) Intelligent Information Retrieval: The Case of Astronomy and Related Space Sciences. Astrophysics and Space Science Library, vol 182. Springer, Dordrecht (1993)

  21. arXiv:1712.06704

    stat.ML cs.CL cs.IR

    Multilingual Topic Models

    Authors: Kriste Krstovski, Michael J. Kurtz, David A. Smith, Alberto Accomazzi

    Abstract: Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document…

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: 18 pages, 9 figures

  22. New ADS Functionality for the Curator

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Steven McDonald, Taylor J. Shaulis, Sergi Blanco-Cuaresma, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton

    Abstract: In this paper we provide an update concerning the operations of the NASA Astrophysics Data System (ADS), its services and user interface, and the content currently indexed in its database. As the primary information system used by researchers in Astronomy, the ADS aims to provide a comprehensive index of all scholarly resources appearing in the literature. With the current effort in our community…

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: Submitted to the Proceedings of Library and Information Services in Astronomy VIII, Strasbourg, France

  23. arXiv:1707.09955

    physics.soc-ph astro-ph.IM cs.DL

    Comparing People with Bibliometrics

    Authors: Michael J. Kurtz

    Abstract: Bibliometric indicators, citation counts and/or download counts are increasingly being used to inform personnel decisions such as hiring or promotions. These statistics are very often misused. Here we provide a guide to the factors which should be considered when using these so-called quantitative measures to evaluate people. Rules of thumb are given for when begin to use bibliometric measures whe…

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: to appear in Proceedings of Library and Information Science in Astronomy VIII (LISA-8)

  24. arXiv:1706.02153

    cs.DL astro-ph.IM cs.CY cs.IR physics.soc-ph

    Usage Bibliometrics as a Tool to Measure Research Activity

    Authors: Edwin A. Henneken, Michael J. Kurtz

    Abstract: Measures for research activity and impact have become an integral ingredient in the assessment of a wide range of entities (individual researchers, organizations, instruments, regions, disciplines). Traditional bibliometric indicators, like publication and citation based indicators, provide an essential part of this picture, but cannot describe the complete picture. Since reading scholarly publica…

    Submitted 7 June, 2017; originally announced June 2017.

    Comments: 25 pages, 11 figures, accepted for publication in Handbook of Quantitative Science and Technology Research, Springer

  25. arXiv:1603.06885

    astro-ph.GA astro-ph.CO

    SHELS: Complete Redshift Surveys of Two Widely Separated Fields

    Authors: Margaret J. Geller, Ho Seong Hwang, Ian P. Dell'Antonio, Harus Jabran Zahid, Michael J. Kurtz, Daniel G. Fabricant

    Abstract: The SHELS (Smithsonian Hectospec Lensing Survey) is a complete redshift survey covering two well-separated fields (F1 and F2) of the Deep Lens Survey. Both fields are more than 94% complete to a Galactic extinction corrected R0 = 20.2. Here we describe the redshift survey of the F1 field centered at R.A. = 00h53m25.3s and Decl = 12d33m55s; like F2, the F1 field covers 4 sq deg. The redshift survey…

    Submitted 22 March, 2016; originally announced March 2016.

    Comments: 24 pages, 6 tables, 13 figures; ApJS, accepted; full data tables available in journal upon publication

  26. HectoMAP and Horizon Run 4: Dense Structures and Voids in the Real and Simulated Universe

    Authors: Ho Seong Hwang, Margaret J. Geller, Changbom Park, Daniel G. Fabricant, Michael J. Kurtz, Kenneth J. Rines, Juhan Kim, Antonaldo Diaferio, H. Jabran Zahid, Perry Berlind, Michael Calkins, Susan Tokarz, Sean Moran

    Abstract: HectoMAP is a dense redshift survey of red galaxies covering a 53 $deg^{2}$ strip of the northern sky. HectoMAP is 97\% complete for galaxies with $r<20.5$, $(g-r)>1.0$, and $(r-i)>0.5$. The survey enables tests of the physical properties of large-scale structure at intermediate redshift against cosmological models. We use the Horizon Run 4, one of the densest and largest cosmological simulations…

    Submitted 19 February, 2016; originally announced February 2016.

    Comments: 20 pages, 16 figures, 1 table. Published in ApJ (818:106, 2016). Paper with high resolution figures is available at https://astro.kias.re.kr/~hshwang/Hwang_etal16_LSS_HectoMAP_HorizonRun4_high.pdf

    Journal ref: 2016, ApJ, 818, 173

  27. arXiv:1601.07858

    astro-ph.IM cs.DL

    Aggregation and Linking of Observational Metadata in the ADS

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Alexandra Holachek, Jonathan Elliott

    Abstract: We discuss current efforts behind the curation of observing proposals, archive bibliographies, and data links in the NASA Astrophysics Data System (ADS). The primary data in the ADS is the bibliographic content from scholarly articles in Astronomy and Physics, which ADS aggregates from publishers, arXiv and conference proceeding sites. This core bibliographic information is then further enriched b…

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 4 pages, Proceedings of the ADASS XXV conference

  28. arXiv:1601.01611

    cs.IR

    Automatic Construction of Evaluation Sets and Evaluation of Document Similarity Models in Large Scholarly Retrieval Systems

    Authors: Kriste Krstovski, David A. Smith, Michael J. Kurtz

    Abstract: Retrieval systems for scholarly literature offer the ability for the scientific community to search, explore and download scholarly articles across various scientific disciplines. Mostly used by the experts in the particular field, these systems contain user community logs including information on user specific downloaded articles. In this paper we present a novel approach for automatically evalua…

    Submitted 7 January, 2016; originally announced January 2016.

  29. arXiv:1510.09099

    physics.soc-ph astro-ph.IM cs.DL

    Measuring Metrics - A forty year longitudinal cross-validation of citations, downloads, and peer review in Astrophysics

    Authors: Michael J. Kurtz, Edwin A. Henneken

    Abstract: Citation measures, and newer altmetric measures such as downloads are now commonly used to inform personnel decisions. How well do or can these measures measure or predict the past, current or future scholarly performance of an individual? Using data from the Smithsonian/NASA Astrophysics Data System we analyze the publication, citation, download, and distinction histories of a cohort of 922 indiv…

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: Author's version of manuscript accepted for publication in the Journal of the Association for Information Science and Technology (JASIST); 35 pages 16 figures

  30. arXiv:1503.05881

    cs.DL

    ADS 2.0: new architecture, API and services

    Authors: Roman Chyla, Alberto Accomazzi, Alexandra Holachek, Carolyn S. Grant, Jonathan Elliott, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray, Vladimir Sudilovsky

    Abstract: The ADS platform is undergoing the biggest rewrite of its 20-year history. While several components have been added to its architecture over the past couple of years, this talk will concentrate on the underpinnings of ADS's search layer and its API. To illustrate the design of the components in the new system, we will show how the new ADS user interface is built exclusively on top of the API using…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: ADASS Conference 2014

  31. arXiv:1503.04194

    astro-ph.IM cs.DL

    ADS: The Next Generation Search Platform

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Roman Chyla, James Luker, Carolyn S. Grant, Donna M. Thompson, Alexandra Holachek, Rahul Dave, Stephen S. Murray

    Abstract: Four years after the last LISA meeting, the NASA Astrophysics Data System (ADS) finds itself in the middle of major changes to the infrastructure and contents of its database. In this paper we highlight a number of features of great importance to librarians and discuss the additional functionality that we are currently developing. Starting in 2011, the ADS started to systematically collect, parse…

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: Submitted to Library and Information Services in Astronomy VII, Naples, Italy

  32. arXiv:1406.4542

    cs.DL astro-ph.IM

    Computing and Using Metrics in the ADS

    Authors: Edwin A. Henneken, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Donna Thompson, Jay Luker, Roman Chyla, Alexandra Holachek, Stephen S. Murray

    Abstract: Finding measures for research impact, be it for individuals, institutions, instruments or projects, has gained a lot of popularity. More papers than ever are being written on new impact measures, and problems with existing measures are being pointed out on a regular basis. Funding agencies require impact statistics in their reports, job candidates incorporate them in their resumes, and publication…

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: to appear in proceedings of LISA VII conference, Naples, Italy

  33. arXiv:1405.7704

    astro-ph.GA astro-ph.CO

    SHELS: A Complete Galaxy Redshift Survey with R$\leq$20.6

    Authors: Margaret J. Geller, Ho Seong Hwang, Daniel G. Fabricant, Michael J. Kurtz, Ian P. Dell'Antonio, Harus Jabran Zahid

    Abstract: The SHELS (Smithsonian Hectospec Lensing Survey) is a complete redshift survey covering two well-separated fields (F1 and F2) of the Deep Lens Survey to a limiting R = 20.6. Here we describe the redshift survey of the F2 field (R.A.$_{2000}$ = 09$^h$19$^m$32.4$^s$ and Decl.$_{2000}$ = +30$^{\circ}$00$^{\prime}$00$^{\prime\prime}$). The survey includes 16,294 new redshifts measured with the Hectosp…

    Submitted 29 May, 2014; originally announced May 2014.

    Comments: 45 pages, 16 figures, 7 tables. Data will be available only when the paper is published in Astrophysical Journal Supplements (now submitted). Movie and full resolution figures are available at https://www.cfa.harvard.edu/~mjg/f6movie.mp4 and https://www.cfa.harvard.edu/~mjg/SHELS.pdf

  34. A Redshift Survey of the Strong Lensing Cluster Abell 383

    Authors: Margaret J. Geller, Ho Seong Hwang, Antonaldo Diaferio, Michael J. Kurtz, Dan Coe, Kenneth J. Rines

    Abstract: Abell 383 is a famous rich cluster (z = 0.1887) imaged extensively as a basis for intensive strong and weak lensing studies. Nonetheless there are few spectroscopic observations. We enable dynamical analyses by measuring 2360 new redshifts for galaxies with r$_{petro} \leq 20.5$ and within 50$^\prime$ of the BCG (Brightest Cluster Galaxy: R.A.$_{2000} = 42.014125^\circ$, Decl…

    Submitted 7 January, 2014; originally announced January 2014.

    Comments: 29 pages, 9 figures, ApJ accepted

  35. arXiv:1308.4442

    astro-ph.GA

    Measuring Galaxy Velocity Dispersions with Hectospec

    Authors: Daniel Fabricant, Igor Chilingarian, Ho Seong Hwang, Michael J. Kurtz, Margaret J. Geller

    Abstract: We describe a robust technique based on the ULySS IDL code for measuring velocity dispersions of galaxies observed with the MMT's fiber-fed spectrograph, Hectospec. This procedure is applicable to all Hectospec spectra having a signal-to-noise >5 and weak emission lines. We estimate the internal error in the Hectospec velocity dispersion measurements by comparing duplicate measurements of 171 gala…

    Submitted 20 August, 2013; originally announced August 2013.

    Comments: 8 figures

  36. arXiv:1304.4656

    astro-ph.IM astro-ph.CO

    Reducing Systematic Error in Cluster Scale Weak Lensing

    Authors: Yousuke Utsumi, Satoshi Miyazaki, Margaret J. Geller, Ian P. Dell'Antonio, Masamune Oguri, Michael J. Kurtz, Takashi Hamana, Daniel G. Fabricant

    Abstract: Weak lensing provides an important route toward collecting samples of clusters of galaxies selected by mass. Subtle systematic errors in image reduction can compromise the power of this technique. We use the B-mode signal to quantify this systematic error and to test methods for reducing this error. We show that two procedures are efficient in suppressing systematic error in the B-mode: (1) refine…

    Submitted 16 April, 2013; originally announced April 2013.

    Comments: 30 pages, Submitted to Astrophysical Journal

  37. The Chemical Evolution of Star-Forming Galaxies Over the Last 11 Billion Years

    Authors: H. Jabran Zahid, Margaret Geller, Lisa Kewley, Ho Seong Hwang, Daniel Fabricant, Michael Kurtz

    Abstract: We calculate the stellar mass-metallicity relation at five epochs ranging to z~2.3. We quantify evolution in the shape of the mass-metallicity relation as a function of redshift; the mass-metallicity relation flattens at late times. There is an empirical upper limit to the gas-phase oxygen abundance in star-forming galaxies that is independent of redshift. From examination of the mass-metallicity…

    Submitted 6 May, 2013; v1 submitted 24 March, 2013; originally announced March 2013.

    Comments: 6 pages, 2 figures. Accepted for publication in ApJ Letters

  38. Measuring the Ultimate Mass of Galaxy Clusters: Redshifts and Mass Profiles from the Hectospec Cluster Survey (HeCS)

    Authors: Kenneth Rines, Margaret J. Geller, Antonaldo Diaferio, Michael J. Kurtz

    Abstract: The infall regions of galaxy clusters represent the largest gravitationally bound structures in a $Λ$CDM universe. Measuring cluster mass profiles into the infall regions provides an estimate of the ultimate mass of these haloes. We use the caustic technique to measure cluster mass profiles from galaxy redshifts obtained with the Hectospec Cluster Survey (HeCS), an extensive spectroscopic survey o…

    Submitted 25 April, 2013; v1 submitted 17 September, 2012; originally announced September 2012.

    Comments: revised to match version published in ApJ

  39. arXiv:1209.2124  [pdf, other]

    astro-ph.IM cs.DL physics.soc-ph

    A measure of total research impact independent of time and discipline

    Authors: Alberto Pepe, Michael J. Kurtz

    Abstract: Authorship and citation practices evolve with time and differ by academic discipline. As such, indicators of research productivity based on citation records are naturally subject to historical and disciplinary effects. We observe these effects on a corpus of astronomer career data constructed from a database of refereed publications. We employ a simple mechanism to measure research output using au…

    Submitted 10 September, 2012; originally announced September 2012.

    Comments: 14 pages, 5 figures. PLoS ONE, in press

  40. arXiv:1209.1318  [pdf]

    cs.IR astro-ph.IM cs.DL physics.soc-ph

    Finding and Recommending Scholarly Articles

    Authors: Michael J. Kurtz, Edwin A. Henneken

    Abstract: The rate at which scholarly literature is being produced has been increasing at approximately 3.5 percent per year for decades. This means that during a typical 40 year career the amount of new literature produced each year increases by a factor of four. The methods scholars use to discover relevant literature must change. Just like everybody else involved in information discovery, scholars are co…

    Submitted 6 September, 2012; originally announced September 2012.

    Comments: 14 pages, part of the forthcoming MIT book "Bibliometrics and Beyond: Metrics-Based Evaluation of Scholarly Research" edited by Blaise Cronin and Cassidy R. Sugimoto

  41. arXiv:1209.0125  [pdf, other]

    cs.DL cs.LG stat.ML

    A History of Cluster Analysis Using the Classification Society's Bibliography Over Four Decades

    Authors: Fionn Murtagh, Michael J. Kurtz

    Abstract: The Classification Literature Automated Search Service, an annual bibliography based on citation of one or more of a set of around 80 book or journal publications, ran from 1972 to 2012. We analyze here the years 1994 to 2011. The Classification Society's Service, as it was termed, has been produced by the Classification Society. In earlier decades it was distributed as a diskette or CD with the J…

    Submitted 16 August, 2013; v1 submitted 1 September, 2012; originally announced September 2012.

    Comments: 23 pages, 9 figures

    MSC Class: 62H30 ACM Class: I.5.3; H.3.3

  42. SHELS: Optical Spectral Properties of WISE 22 μm-selected Galaxies

    Authors: Ho Seong Hwang, Margaret J. Geller, Michael J. Kurtz, Ian P. Dell'Antonio, Daniel G. Fabricant

    Abstract: We use a dense, complete redshift survey, the Smithsonian Hectospec Lensing Survey (SHELS), covering a 4 square degree region of a deep imaging survey, the Deep Lens Survey (DLS), to study the optical spectral properties of Wide-field Infrared Survey Explorer (WISE) 22 μm-selected galaxies. Among 507 WISE 22 μm-selected sources with $(S/N)_{22μm}>3$ ($\sim S_{22μm}>2.5$ mJy), we identify the optical co…

    Submitted 15 August, 2012; originally announced August 2012.

    Comments: 18 pages, 17 figures. To appear in ApJ

  43. CLASH: Precise New Constraints on the Mass Profile of Abell 2261

    Authors: Dan Coe, Keiichi Umetsu, Adi Zitrin, Megan Donahue, Elinor Medezinski, Marc Postman, Mauricio Carrasco, Timo Anguita, Margaret J. Geller, Kenneth J. Rines, Antonaldo Diaferio, Michael J. Kurtz, Larry Bradley, Anton Koekemoer, Wei Zheng, Mario Nonino, Alberto Molino, Andisheh Mahdavi, Doron Lemze, Leopoldo Infante, Sara Ogaz, Peter Melchior, Ole Host, Holland Ford, Claudio Grillo, et al. (21 additional authors not shown)

    Abstract: We precisely constrain the inner mass profile of Abell 2261 (z=0.225) for the first time and determine this cluster is not "over-concentrated" as found previously, implying a formation time in agreement with ΛCDM expectations. These results are based on strong lensing analyses of new 16-band HST imaging obtained as part of the Cluster Lensing and Supernova survey with Hubble (CLASH). Combining thi…

    Submitted 8 January, 2012; originally announced January 2012.

    Comments: Submitted to the Astrophysical Journal. 19 pages, 14 figures

  44. Mapping the Universe: The 2010 Russell Lecture

    Authors: Margaret J. Geller, Antonaldo Diaferio, Michael J. Kurtz

    Abstract: Redshift surveys are a powerful tool of modern cosmology. We discuss two aspects of their power to map the distribution of mass and light in the universe: (1) measuring the mass distribution extending into the infall regions of rich clusters and (2) applying deep redshift surveys to the selection of clusters of galaxies and to the identification of very large structures (Great Walls). We preview t…

    Submitted 6 October, 2011; originally announced October 2011.

    Comments: 19 pages, 5 figures (2 videos available in the on-line journal article)

    Journal ref: Astronomical Journal 2011, Vol. 142, id133

  45. The Faint End of the Luminosity Function and Low Surface Brightness Galaxies

    Authors: Margaret J. Geller, Antonaldo Diaferio, Michael J. Kurtz, Ian P. Dell'Antonio, Daniel G. Fabricant

    Abstract: SHELS (Smithsonian Hectospec Lensing Survey) is a dense redshift survey covering a 4 square degree region to a limiting R = 20.6. In the construction of the galaxy catalog and in the acquisition of spectroscopic targets, we paid careful attention to the survey completeness for lower surface brightness dwarf galaxies. Thus, although the survey covers a small area, it is a robust basis for computati…

    Submitted 9 March, 2012; v1 submitted 14 July, 2011; originally announced July 2011.

    Comments: 34 pages, 13 figures, 3 tables, Astronomical Journal, in press (updated based on review)

  46. arXiv:1106.5644  [pdf, ps, other]

    astro-ph.IM cs.DL

    The ADS in the Information Age - Impact on Discovery

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi

    Abstract: The SAO/NASA Astrophysics Data System (ADS) grew up with and has been riding the waves of the Information Age, closely monitoring and anticipating the needs of its end-users. By now, all professional astronomers are using the ADS on a daily basis, and a substantial fraction have been using it for their entire professional career. In addition to being an indispensable tool for professional scientis…

    Submitted 28 June, 2011; originally announced June 2011.

    Comments: 10 pages, 5 figures, to appear in "Organizations, People and Strategies in Astronomy (OPSA)", volume 8

  47. arXiv:1102.5743  [pdf, ps, other]

    astro-ph.CO astro-ph.IM

    Testing Weak Lensing Maps With Redshift Surveys: A Subaru Field

    Authors: Michael J. Kurtz, Margaret J. Geller, Yousuke Utsumi, Satoshi Miyazaki, Ian P. Dell'Antonio, Daniel G. Fabricant

    Abstract: We use a dense redshift survey in the foreground of the Subaru GTO2deg$^2$ weak lensing field (centered at $α_{2000} = 16^h04^m44^s$; $δ_{2000} = 43^\circ11^{\prime}24^{\prime\prime}$) to assess the completeness and comment on the purity of massive halo identification in the weak lensing map. The redshift survey (published here) includes 4541 galaxies; 4405 are new redshifts measured with the Hecto…

    Submitted 2 April, 2012; v1 submitted 28 February, 2011; originally announced February 2011.

    Comments: Astrophysical Journal accepted version

  48. arXiv:1102.2891  [pdf]

    cs.DL astro-ph.IM cs.IR physics.soc-ph

    Usage Bibliometrics

    Authors: Michael J. Kurtz, Johan Bollen

    Abstract: Scholarly usage data provides unique opportunities to address the known shortcomings of citation analysis. However, the collection, processing and analysis of usage data remains an area of active research. This article provides a review of the state of the art in usage-based informetrics, i.e. the use of usage data to study the scholarly process.

    Submitted 14 February, 2011; originally announced February 2011.

    Comments: Publisher's PDF (by permission). Publisher web site: books.infotoday.com/asist/arist44.shtml

    Journal ref: Annual Review of Information Science and Technology, vol 44, p. 3-64 (2010)

  49. arXiv:1008.0826  [pdf, ps, other]

    physics.soc-ph astro-ph.IM cs.DL cs.IR

    The Emerging Scholarly Brain

    Authors: Michael J. Kurtz

    Abstract: It is now a commonplace observation that human society is becoming a coherent super-organism, and that the information infrastructure forms its emerging brain. Perhaps, as the underlying technologies are likely to become billions of times more powerful than those we have today, we could say that we are now building the lizard brain for the future organism.

    Submitted 4 August, 2010; originally announced August 2010.

    Comments: to appear in Future Professional Communication in Astronomy-II (FPCA-II) editors A. Heck and A. Accomazzi

  50. arXiv:1006.2823  [pdf, ps, other]

    astro-ph.CO

    Empirical optical k-Corrections for redshifts <= 0.7

    Authors: Eduard Westra, Margaret J. Geller, Michael J. Kurtz, Daniel G. Fabricant, Ian Dell'Antonio

    Abstract: The Smithsonian Hectospec Lensing Survey (SHELS) is a magnitude limited spectroscopically complete survey for R<=21.0 covering 4 square degrees. SHELS provides a large sample (15,513) of flux calibrated spectra. The wavelength range covered by the spectra allows empirical determination of k-corrections for the g- and r-band from z=0 to ~0.68 and 0.33, respectively, based on large samples of spectr…

    Submitted 28 December, 2010; v1 submitted 14 June, 2010; originally announced June 2010.

    Comments: 48 pages in total (includes 19 figures, 25 tables). Published in PASP. Version with high resolution figures available at http://www.cfa.harvard.edu/~ewestra/publications/. Online calculator at http://tdc-www.cfa.harvard.edu/instruments/hectospec/progs/EOK/. Tables with coefficients differ slightly from first astro-ph version, results barely changed