Showing 1–12 of 12 results for author: Wallin, J

Searching in archive cs.
  1. arXiv:2501.03821  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    The Choice of Normalization Influences Shrinkage in Regularized Regression

    Authors: Johan Larsson, Jonas Wallin

    Abstract: Regularized models are often sensitive to the scales of the features in the data and it has therefore become standard practice to normalize (center and scale) the features before fitting the model. But there are many different ways to normalize the features and the choice may have dramatic effects on the resulting model. In spite of this, there has so far been no research on this topic. In this pa…

    Submitted 3 July, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

    Comments: 39 pages, 18 figures

    MSC Class: 62J07 (Primary); 68T09 (Secondary) ACM Class: G.3; G.4; I.6

  2. Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri

    Authors: Graham West, Matthew I. Swindall, Ben Keener, Timothy Player, Alex C. Williams, James H. Brusuelas, John F. Wallin

    Abstract: Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the problem on such datasets are class imbalance and ground-truth uncertainty in labeling. The AL-ALL and AL-PUB datasets - consisting of tightly cropped, individual characters from images of ancient Greek papyri - are strongly affected by both issues…

    Submitted 26 January, 2024; v1 submitted 28 October, 2022; originally announced October 2022.

    Journal ref: Journal of Data Mining & Digital Humanities, Historical Documents and automatic text recognition, Digital humanities in languages (February 7, 2024) jdmdh:10297

  3. arXiv:2104.13026  [pdf, other]

    stat.ML cs.LG stat.CO

    The Hessian Screening Rule

    Authors: Johan Larsson, Jonas Wallin

    Abstract: Predictor screening rules, which discard predictors before fitting a model, have had considerable impact on the speed with which sparse regression problems, such as the lasso, can be solved. In this paper we present a new screening rule for solving the lasso path: the Hessian Screening Rule. The rule uses second-order information from the model to provide both effective screening, particularly in…

    Submitted 4 October, 2022; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: 25 pages, 14 figures

    MSC Class: 62J07 ACM Class: G.3; G.4

    Journal ref: Advances in neural information processing systems 35 (eds. Koyejo, S. et al.) vol. 35 15823-15835 (Curran Associates, Inc., New Orleans, USA, 2022)

  4. arXiv:2005.03730  [pdf, other]

    stat.ML cs.LG stat.CO

    The Strong Screening Rule for SLOPE

    Authors: Johan Larsson, Małgorzata Bogdan, Jonas Wallin

    Abstract: Extracting relevant features from data sets where the number of observations ($n$) is much smaller than the number of predictors ($p$) is a major challenge in modern statistics. Sorted L-One Penalized Estimation (SLOPE), a generalization of the lasso, is a promising method within this setting. Current numerical procedures for SLOPE, however, lack the efficiency that respective tools for the lasso…

    Submitted 22 April, 2022; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: 15 pages, 5 figures

    MSC Class: 62J07 ACM Class: G.3; G.4

    Journal ref: Advances in neural information processing systems 33 (eds. Larochelle, H. et al.) vol. 33 14592-14603 (Curran Associates, Inc., Virtual, 2020)

  5. Bestow and Atomic: Concurrent Programming using Isolation, Delegation and Grouping

    Authors: Elias Castegren, Joel Wallin, Tobias Wrigstad

    Abstract: Any non-trivial concurrent system warrants synchronisation, regardless of the concurrency model. Actor-based concurrency serialises all computations in an actor through asynchronous message passing. In contrast, lock-based concurrency serialises some computations by following a lock--unlock protocol for accessing certain data. Both systems require sound reasoning about pointers and aliasing to exc…

    Submitted 26 July, 2018; originally announced July 2018.

    Journal ref: Journal of Logical and Algebraic Methods in Programming, Vol. 100 (2018), 130-151

  6. arXiv:1512.07919  [pdf, ps, other]

    cs.DL astro-ph.IM

    Improving Software Citation and Credit

    Authors: Alice Allen, G. Bruce Berriman, Kimberly DuPrie, Jessica Mink, Robert Nemiroff, Thomas Robitaille, Lior Shamir, Keith Shortridge, Mark Taylor, Peter Teuben, John Wallin

    Abstract: The past year has seen movement on several fronts for improving software citation, including the Center for Open Science's Transparency and Openness Promotion (TOP) Guidelines, the Software Publishing Special Interest Group that was started at January's AAS meeting in Seattle at the request of that organization's Working Group on Astronomical Software, a Sloan-sponsored meeting at GitHub in San Fr…

    Submitted 24 December, 2015; originally announced December 2015.

    Comments: Birds of a Feather session organized by the Astrophysics Source Code Library (ASCL, http://ascl.net/ ); to be published in Proceedings of ADASS XXV (Sydney, Australia; October, 2015). 4 pages

  7. arXiv:1411.2031  [pdf, ps, other]

    astro-ph.IM cs.DL

    Astrophysics Source Code Library Enhancements

    Authors: Robert J. Hanisch, Alice Allen, G. Bruce Berriman, Kimberly DuPrie, Jessica Mink, Robert J. Nemiroff, Judy Schmidt, Lior Shamir, Keith Shortridge, Mark Taylor, Peter J. Teuben, John Wallin

    Abstract: The Astrophysics Source Code Library (ASCL; ascl.net) is a free online registry of codes used in astronomy research; it currently contains over 900 codes and is indexed by ADS. The ASCL has recently moved a new infrastructure into production. The new site provides a true database for the code entries and integrates the WordPress news and information pages and the discussion forum into one site. Pr…

    Submitted 7 November, 2014; originally announced November 2014.

    Comments: 4 pages; to be published in ADASS XXIV Proceedings. ASCL can be accessed at http://ascl.net/

  8. arXiv:1409.7935  [pdf]

    astro-ph.IM astro-ph.GA cs.CV cs.LG

    Combining human and machine learning for morphological analysis of galaxy images

    Authors: Evan Kuminski, Joe George, John Wallin, Lior Shamir

    Abstract: The increasing importance of digital sky surveys collecting many millions of galaxy images has reinforced the need for robust methods that can perform morphological analysis of large galaxy image databases. Citizen science initiatives such as Galaxy Zoo showed that large datasets of galaxy images can be analyzed effectively by non-scientist volunteers, but since databases generated by robotic tele…

    Submitted 28 September, 2014; originally announced September 2014.

    Comments: PASP, accepted

  9. arXiv:1312.7352  [pdf, ps, other]

    astro-ph.IM cs.DL

    Ideas for Advancing Code Sharing (A Different Kind of Hack Day)

    Authors: Peter Teuben, Alice Allen, Bruce Berriman, Kimberly DuPrie, Robert J. Hanisch, Jessica Mink, Robert Nemiroff, Lior Shamir, Keith Shortridge, Mark Taylor, John Wallin

    Abstract: How do we as a community encourage the reuse of software for telescope operations, data processing, and calibration? How can we support making codes used in research available for others to examine? Continuing the discussion from last year's "Bring out your codes!" BoF session, participants separated into groups to brainstorm ideas to mitigate factors which inhibit code sharing and nurture those which…

    Submitted 27 December, 2013; originally announced December 2013.

    Comments: To be published in Proceedings of ADASS XXIII. Links to notes from brainstorming sessions are available here: http://asterisk.apod.com/wp/?p=543

  10. arXiv:1312.6693  [pdf, ps, other]

    astro-ph.IM cs.DL

    Astrophysics Source Code Library: Incite to Cite!

    Authors: Kimberly DuPrie, Alice Allen, Bruce Berriman, Robert J. Hanisch, Jessica Mink, Robert J. Nemiroff, Lior Shamir, Keith Shortridge, Mark B. Taylor, Peter Teuben, John F. Wallin

    Abstract: The Astrophysics Source Code Library (ASCL, http://ascl.net/) is an online registry of over 700 source codes that are of interest to astrophysicists, with more being added regularly. The ASCL actively seeks out codes as well as accepting submissions from the code authors, and all entries are citable and indexed by ADS. All codes have been used to generate results published in or submitted to a ref…

    Submitted 23 December, 2013; originally announced December 2013.

    Comments: Four pages, two figures. To be published in the Proceedings of ADASS XXIII, which took place September 29-October 3, 2013. The Astrophysics Source Code Library can be accessed at http://www.ascl.net; a concise directory of codes can be accessed at http://asterisk.apod.com/wp/?page_id=12

  11. arXiv:1304.6780  [pdf]

    astro-ph.IM cs.DL

    Practices in source code sharing in astrophysics

    Authors: Lior Shamir, John F. Wallin, Alice Allen, Bruce Berriman, Peter Teuben, Robert J. Nemiroff, Jessica Mink, Robert J. Hanisch, Kimberly DuPrie

    Abstract: While software and algorithms have become increasingly important in astronomy, the majority of authors who publish computational astronomy research do not share the source code they develop, making it difficult to replicate and reuse the work. In this paper we discuss the importance of sharing scientific source code with the entire astrophysics community, and propose that journals require authors…

    Submitted 24 April, 2013; originally announced April 2013.

    Comments: Accepted by Astronomy and Computing. 10 pages

  12. arXiv:0909.3895  [pdf, other]

    astro-ph.IM cs.DB cs.DL cs.IR physics.ed-ph

    The Revolution in Astronomy Education: Data Science for the Masses

    Authors: Kirk D. Borne, Suzanne Jacoby, K. Carney, A. Connolly, T. Eastman, M. J. Raddick, J. A. Tyson, J. Wallin

    Abstract: As our capacity to study ever-expanding domains of our science has increased (including the time domain, non-electromagnetic phenomena, magnetized plasmas, and numerous sky surveys in multiple wavebands with broad spatial coverage and unprecedented depths), so have the horizons of our understanding of the Universe been similarly expanding. This expansion is coupled to the exponential data deluge…

    Submitted 21 September, 2009; originally announced September 2009.

    Comments: 12 pages total: 1 cover page, 1 page of co-signers, plus 10 pages, State of the Profession Position Paper submitted to the Astro2010 Decadal Survey (March 2009)