-
tidychangepoint: a unified framework for analyzing changepoint detection in univariate time series
Authors:
Benjamin S. Baumer,
Biviana Marcela Suarez Sierra
Abstract:
We present tidychangepoint, a new R package for changepoint detection analysis. Most R packages for segmenting univariate time series focus on providing one or two algorithms for changepoint detection that work with a small set of models and penalized objective functions, and all of them return a custom, nonstandard object type. This makes comparing results across various algorithms, models, and p…
▽ More
We present tidychangepoint, a new R package for changepoint detection analysis. Most R packages for segmenting univariate time series focus on providing one or two algorithms for changepoint detection that work with a small set of models and penalized objective functions, and all of them return a custom, nonstandard object type. This makes comparing results across various algorithms, models, and penalized objective functions unnecessarily difficult. tidychangepoint solves this problem by wrapping functions from a variety of existing packages and storing the results in a common S3 class called tidycpt. The package then provides functionality for easily extracting comparable numeric or graphical information from a tidycpt object, all in a tidyverse-compliant framework. tidychangepoint is versatile: it supports both deterministic algorithms like PELT (from changepoint), and also flexible, randomized, genetic algorithms (via GA) that -- via new functionality built into tidychangepoint -- can be used with any compliant model-fitting function and any penalized objective function. By bringing all of these disparate tools together in a cohesive fashion, tidychangepoint facilitates comparative analysis of changepoint detection algorithms and models.
△ Less
Submitted 2 February, 2025; v1 submitted 19 July, 2024;
originally announced July 2024.
-
Multivariate Representations of Univariate Marked Hawkes Processes
Authors:
Louis Davis,
Conor Kresin,
Boris Baeumer,
Ting Wang
Abstract:
Univariate marked Hawkes processes are used to model a range of real-world phenomena including earthquake aftershock sequences, contagious disease spread, content diffusion on social media platforms, and order book dynamics. This paper illustrates a fundamental connection between univariate marked Hawkes processes and multivariate Hawkes processes. Exploiting this connection renders a framework th…
▽ More
Univariate marked Hawkes processes are used to model a range of real-world phenomena including earthquake aftershock sequences, contagious disease spread, content diffusion on social media platforms, and order book dynamics. This paper illustrates a fundamental connection between univariate marked Hawkes processes and multivariate Hawkes processes. Exploiting this connection renders a framework that can be built upon for expressive and flexible inference on diverse data. Specifically, multivariate unmarked Hawkes representations are introduced as a tool to parameterize univariate marked Hawkes processes. We show that such multivariate representations can asymptotically approximate a large class of univariate marked Hawkes processes, are stationary given the approximated process is stationary, and that resultant conditional intensity parameters are identifiable. A simulation study demonstrates the efficacy of this approach, and provides heuristic bounds for error induced by the relatively larger parameter space of multivariate Hawkes processes.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
A Multidimensional Fractional Hawkes Process for Multiple Earthquake Mainshock Aftershock Sequences
Authors:
Louis Davis,
Boris Baeumer,
Ting Wang
Abstract:
Most point process models for earthquakes currently in the literature assume the magnitude distribution is i.i.d. potentially hindering the ability of the model to describe the main features of data sets containing multiple earthquake mainshock aftershock sequences in succession. This study presents a novel multidimensional fractional Hawkes process model designed to capture magnitude dependent tr…
▽ More
Most point process models for earthquakes currently in the literature assume the magnitude distribution is i.i.d. potentially hindering the ability of the model to describe the main features of data sets containing multiple earthquake mainshock aftershock sequences in succession. This study presents a novel multidimensional fractional Hawkes process model designed to capture magnitude dependent triggering behaviour by incorporating history dependence into the magnitude distribution. This is done by discretising the magnitude range into disjoint intervals and modelling events with magnitude in these ranges as the subprocesses of a mutually exciting Hawkes process using the Mittag-Leffler density as the kernel function. We demonstrate this model's use by applying it to two data sets, Japan and the Middle America Trench, both containing multiple mainshock aftershock sequences and compare it to the existing ETAS model by using information criteria, residual diagnostics and retrospective prediction performance. We find that for both data sets all metrics indicate that the multidimensional fractional Hawkes process performs favourably against the ETAS model. Furthermore, using the multidimensional fractional Hawkes process we are able to infer characteristics of the data sets that are consistent with results currently in the literature and that cannot be found by using the ETAS model.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
A Fractional Model for Earthquakes
Authors:
Louis Davis,
Boris Baeumer,
Ting Wang
Abstract:
This paper extends the existing fractional Hawkes process to better model mainshock-aftershock sequences of earthquakes. The fractional Hawkes process is a self-exciting point process model with temporal decay kernel being a Mittag-Leffler function. A maximum likelihood estimation scheme is developed and its consistency is checked. It is then compared to the ETAS model on three earthquake sequence…
▽ More
This paper extends the existing fractional Hawkes process to better model mainshock-aftershock sequences of earthquakes. The fractional Hawkes process is a self-exciting point process model with temporal decay kernel being a Mittag-Leffler function. A maximum likelihood estimation scheme is developed and its consistency is checked. It is then compared to the ETAS model on three earthquake sequences in Southern California. The fractional Hawkes process performs favourably against the ETAS model. Additionally, two parameters in the fractional Hawkes process may have a fixed geophysical meaning dependent on the study zone and the stage of the seismic cycle the zone is in.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
Big Ideas in Sports Analytics and Statistical Tools for their Investigation
Authors:
Benjamin S. Baumer,
Gregory J. Matthews,
Quang Nguyen
Abstract:
Sports analytics -- broadly defined as the pursuit of improvement in athletic performance through the analysis of data -- has expanded its footprint both in the professional sports industry and in academia over the past 30 years. In this paper, we connect four big ideas that are common across multiple sports: the expected value of a game state, win probability, measures of team strength, and the u…
▽ More
Sports analytics -- broadly defined as the pursuit of improvement in athletic performance through the analysis of data -- has expanded its footprint both in the professional sports industry and in academia over the past 30 years. In this paper, we connect four big ideas that are common across multiple sports: the expected value of a game state, win probability, measures of team strength, and the use of sports betting market data. For each, we explore both the shared similarities and individual idiosyncrasies of analytical approaches in each sport. While our focus is on the concepts underlying each type of analysis, any implementation necessarily involves statistical methodologies, computational tools, and data sources. Where appropriate, we outline how data, models, tools, and knowledge of the sport combine to generate actionable insights. We also describe opportunities to share analytical work, but omit an in-depth discussion of individual player evaluation as beyond our scope. This paper should serve as a useful overview for anyone becoming interested in the study of sports analytics.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
Data science transfer pathways from associate's to bachelor's programs
Authors:
Benjamin S. Baumer,
Nicholas J. Horton
Abstract:
A substantial fraction of students who complete their college education at a public university in the United States begin their journey at one of the 935 public two-year colleges. While the number of four-year colleges offering bachelor's degrees in data science continues to increase, data science instruction at many two-year colleges lags behind. A major impediment is the relative paucity of intr…
▽ More
A substantial fraction of students who complete their college education at a public university in the United States begin their journey at one of the 935 public two-year colleges. While the number of four-year colleges offering bachelor's degrees in data science continues to increase, data science instruction at many two-year colleges lags behind. A major impediment is the relative paucity of introductory data science courses that serve multiple student audiences and can easily transfer. In addition, the lack of pre-defined transfer pathways (or articulation agreements) for data science creates a growing disconnect that leaves students who want to study data science at a disadvantage. We describe opportunities and barriers to data science transfer pathways. Five points of curricular friction merit attention: 1) a first course in data science, 2) a second course in data science, 3) a course in scientific computing, data science workflow, and/or reproducible computing, 4) lab sciences, and 5) navigating communication, ethics, and application domain requirements in the context of general education and liberal arts course mappings. We catalog existing transfer pathways, efforts to align curricula across institutions, obstacles to overcome with minimally-disruptive solutions, and approaches to foster these pathways. Improvements in these areas are critically important to ensure that a broad and diverse set of students are able to engage and succeed in undergraduate data science programs.
△ Less
Submitted 6 January, 2023; v1 submitted 22 October, 2022;
originally announced October 2022.
-
A Higher Order Resolvent-positive Finite Difference Approximation for Fractional Derivatives
Authors:
Boris Baeumer,
Mihály Kovács,
Matthew Parry
Abstract:
We develop a finite difference approximation of order $α$ for the $α$-fractional derivative. The weights of the approximation scheme have the same rate-matrix type properties as the popular Grünwald scheme. In particular, approximate solutions to fractional diffusion equations preserve positivity. Furthermore, for the approximation of the solution to the skewed fractional heat equation on a bounde…
▽ More
We develop a finite difference approximation of order $α$ for the $α$-fractional derivative. The weights of the approximation scheme have the same rate-matrix type properties as the popular Grünwald scheme. In particular, approximate solutions to fractional diffusion equations preserve positivity. Furthermore, for the approximation of the solution to the skewed fractional heat equation on a bounded domain the new approximation scheme keeps its order $α$ whereas the order of the Grünwald scheme reduces to order $α-1$, contradicting the convergence rate results by Meerschaert and Tadjeran.
△ Less
Submitted 17 December, 2021; v1 submitted 15 December, 2021;
originally announced December 2021.
-
An educator's perspective of the tidyverse
Authors:
Mine Çetinkaya-Rundel,
Johanna Hardin,
Benjamin S. Baumer,
Amelia McNamara,
Nicholas J. Horton,
Colin Rundel
Abstract:
Computing makes up a large and growing component of data science and statistics courses. Many of those courses, especially when taught by faculty who are statisticians by training, teach R as the programming language. A number of instructors have opted to build much of their teaching around use of the tidyverse. The tidyverse, in the words of its developers, "is a collection of R packages that sha…
▽ More
Computing makes up a large and growing component of data science and statistics courses. Many of those courses, especially when taught by faculty who are statisticians by training, teach R as the programming language. A number of instructors have opted to build much of their teaching around use of the tidyverse. The tidyverse, in the words of its developers, "is a collection of R packages that share a high-level design philosophy and low-level grammar and data structures, so that learning one package makes it easier to learn the next". These shared principles have led to the widespread adoption of the tidyverse ecosystem. A large part of this usage is because the tidyverse tools have been intentionally designed to ease the learning process and make it easier for users to learn new functions as they engage with additional pieces of the larger ecosystem. Moreover, the functionality offered by the packages within the tidyverse spans the entire data science cycle, which includes data import, visualisation, wrangling, modeling, and communication. We believe the tidyverse provides an effective and efficient pathway for undergraduate students at all levels and majors to gain computational skills and thinking needed throughout the data science cycle. In this paper, we introduce the tidyverse from an educator's perspective. We provide a brief introduction to the tidyverse, demonstrate how foundational statistics and data science tasks are accomplished with the tidyverse, and discuss the strengths of the tidyverse, particularly in the context of teaching and learning.
△ Less
Submitted 22 April, 2022; v1 submitted 7 August, 2021;
originally announced August 2021.
-
Facilitating team-based data science: lessons learned from the DSC-WAV project
Authors:
Chelsey Legacy,
Andrew Zieffler,
Benjamin S. Baumer,
Valerie Barr,
Nicholas J. Horton
Abstract:
While coursework provides undergraduate data science students with some relevant analytic skills, many are not given the rich experiences with data and computing they need to be successful in the workplace. Additionally, students often have limited exposure to team-based data science and the principles and tools of collaboration that are encountered outside of school. In this paper, we describe th…
▽ More
While coursework provides undergraduate data science students with some relevant analytic skills, many are not given the rich experiences with data and computing they need to be successful in the workplace. Additionally, students often have limited exposure to team-based data science and the principles and tools of collaboration that are encountered outside of school. In this paper, we describe the DSC-WAV program, an NSF-funded data science workforce development project in which teams of undergraduate sophomores and juniors work with a local non-profit organization on a data-focused problem. To help students develop a sense of agency and improve confidence in their technical and non-technical data science skills, the project promoted a team-based approach to data science, adopting several processes and tools intended to facilitate this collaboration. Evidence from the project evaluation, including participant survey and interview data, is presented to document the degree to which the project was successful in engaging students in team-based data science, and how the project changed the students' perceptions of their technical and non-technical skills. We also examine opportunities for improvement and offer insight to other data science educators who may want to implement a similar team-based approach to data science projects at their own institutions.
△ Less
Submitted 21 October, 2021; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Boundary conditions for nonlocal one-sided pseudo-differential operators and the associated stochastic processes II
Authors:
Boris Baeumer,
Mihály Kovács,
Lorenzo Toniazzi
Abstract:
We connect boundary conditions for one-sided pseudo-differential operators with the generators of modified one-sided Lévy processes. On one hand this allows modellers to use appropriate boundary conditions with confidence when restricting the modelling domain. On the other hand it allows for numerical techniques based on differential equation solvers to obtain fast approximations of densities or o…
▽ More
We connect boundary conditions for one-sided pseudo-differential operators with the generators of modified one-sided Lévy processes. On one hand this allows modellers to use appropriate boundary conditions with confidence when restricting the modelling domain. On the other hand it allows for numerical techniques based on differential equation solvers to obtain fast approximations of densities or other statistical properties of restricted one-sided Lévy processes encountered, for example, in finance. In particular we identify a new nonlocal mass conserving boundary condition by showing it corresponds to fast-forwarding, i.e. removing the time the process spends outside the domain. We treat all combinations of killing, reflecting and fast-forwarding boundary conditions.
In Part I we show wellposedness of the backward and forward Cauchy problems with a one-sided pseudo-differential operator with boundary conditions as generator. We do so by showing convergence of Feller semigroups based on grid point approximations of the modified Lévy process.
In Part II we show that the limiting Feller semigroup is indeed the semigroup associated with the modified Lévy process by showing continuity of the modifications with respect to the Skorokhod topology.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Boundary conditions for nonlocal one-sided pseudo-differential operators and the associated stochastic processes I
Authors:
Boris Baeumer,
Mihály Kovács,
Lorenzo Toniazzi
Abstract:
We connect boundary conditions for one-sided pseudo-differential operators with the generators of modified one-sided Lévy processes. On one hand this allows modellers to use appropriate boundary conditions with confidence when restricting the modelling domain. On the other hand it allows for numerical techniques based on differential equation solvers to obtain fast approximations of densities or o…
▽ More
We connect boundary conditions for one-sided pseudo-differential operators with the generators of modified one-sided Lévy processes. On one hand this allows modellers to use appropriate boundary conditions with confidence when restricting the modelling domain. On the other hand it allows for numerical techniques based on differential equation solvers to obtain fast approximations of densities or other statistical properties of restricted one-sided Lévy processes encountered, for example, in finance. In particular we identify a new nonlocal mass conserving boundary condition by showing it corresponds to fast-forwarding, i.e. removing the time the process spends outside the domain. We treat all combinations of killing, reflecting and fast-forwarding boundary conditions.
In Part I we show wellposedness of the backward and forward Cauchy problems with a one-sided pseudo-differential operator with boundary conditions as generator. We do so by showing convergence of Feller semigroups based on grid point approximations of the modified Lévy process.
In Part II we show that the limiting Feller semigroup is indeed the semigroup associated with the modified Lévy process by showing continuity of the modifications with respect to the Skorokhod topology.
△ Less
Submitted 20 December, 2020;
originally announced December 2020.
-
Creating optimal conditions for reproducible data analysis in R with 'fertile'
Authors:
Audrey M. Bertin,
Benjamin S. Baumer
Abstract:
The advancement of scientific knowledge increasingly depends on ensuring that data-driven research is reproducible: that two people with the same data obtain the same results. However, while the necessity of reproducibility is clear, there are significant behavioral and technical challenges that impede its widespread implementation, and no clear consensus on standards of what constitutes reproduci…
▽ More
The advancement of scientific knowledge increasingly depends on ensuring that data-driven research is reproducible: that two people with the same data obtain the same results. However, while the necessity of reproducibility is clear, there are significant behavioral and technical challenges that impede its widespread implementation, and no clear consensus on standards of what constitutes reproducibility in published research. We present fertile, an R package that focuses on a series of common mistakes programmers make while conducting data science projects in R, primarily through the RStudio integrated development environment. fertile operates in two modes: proactively (to prevent reproducibility mistakes from happening in the first place), and retroactively (analyzing code that is already written for potential problems). Furthermore, fertile is designed to educate users on why their mistakes are problematic and how to fix them.
△ Less
Submitted 18 August, 2020;
originally announced August 2020.
-
Color filter arrays based on dielectric metasurface elements
Authors:
Jonas Berzins,
Fabrizio Silvestri,
Giampiero Gerini,
Frank Setzpfandt,
Thomas Pertsch,
Stefan M. B. Bäumer
Abstract:
Digital imaging has been steadily improving over the past decades and we are moving towards a wide use of multi- and hyperspectral cameras. A key component of such imaging systems are color filter arrays, which define the spectrum of light detected by each camera pixel. Hence, it is essential to develop a variable, robust and scalable way for controlling the transmission of light. Nanostructured s…
▽ More
Digital imaging has been steadily improving over the past decades and we are moving towards a wide use of multi- and hyperspectral cameras. A key component of such imaging systems are color filter arrays, which define the spectrum of light detected by each camera pixel. Hence, it is essential to develop a variable, robust and scalable way for controlling the transmission of light. Nanostructured surfaces, also known as metasurfaces, offer a promising solution as their transmission spectra can be controlled by shaping the wavelength-dependent scattering properties of their constituting elements. Here we present, metasurfaces based on silicon nanodisks, which provide filter functions with amplitudes reaching 70-90% of transmission, and well suitable for RGB and CMY color filter arrays, the initial stage towards the further development of hyperspectral filters. We suggest and discuss possible ways to expand the color gamut and improve the color values of such optical filters.
△ Less
Submitted 14 April, 2020;
originally announced April 2020.
-
Nanostructure-modulated planar high spectral resolution spectro-polarimeter
Authors:
L. Pjotr Stoevelaar,
Jonas Berzinš,
Fabrizio Silvestri,
Stefan Fasold,
Khosro Zangeneh Kamali,
Heiko Knopf,
Falk Eilenberger,
Frank Setzpfandt,
Stefan M. B. Bäumer,
Giampiero Gerini
Abstract:
We present a planar spectro-polarimeter based on Fabry-P{é}rot cavities with embedded polarization-sensitive high-index nanostructures. A $7~μ$m-thick spectro-polarimetric system for 3 spectral bands and 2 linear polarization states is experimentally demonstrated. Furthermore, an optimal design is theoretically proposed, estimating that a system with a bandwidth of 127~nm and a spectral resolution…
▽ More
We present a planar spectro-polarimeter based on Fabry-P{é}rot cavities with embedded polarization-sensitive high-index nanostructures. A $7~μ$m-thick spectro-polarimetric system for 3 spectral bands and 2 linear polarization states is experimentally demonstrated. Furthermore, an optimal design is theoretically proposed, estimating that a system with a bandwidth of 127~nm and a spectral resolution of 1~nm is able to reconstruct the first three Stokes parameters \textcolor{black}{with a signal-to-noise ratio of -13.14~dB with respect to the the shot noise limited SNR}. The pixelated spectro-polarimetric system can be directly integrated on a sensor, thus enabling applicability in a variety of miniaturized optical devices, including but not limited to satellites for Earth observation.
△ Less
Submitted 25 May, 2020; v1 submitted 9 March, 2020;
originally announced March 2020.
-
Direct and High-Throughput Fabrication of Mie-Resonant Metasurfaces via Single-Pulse Laser Interference
Authors:
Jonas Berzinš,
Simonas Indrišiūnas,
Koen van Erve,
Arvind Nagarajan,
Stefan Fasold,
Michael Steinert,
Giampiero Gerini,
Paulius Gečys,
Thomas Pertsch,
Stefan M. B. Bäumer,
Frank Setzpfandt
Abstract:
High-index dielectric metasurfaces featuring Mie-type electric and magnetic resonances have been of a great interest in a variety of applications such as imaging, sensing, photovoltaics and others, which led to the necessity of an efficient large-scale fabrication technique. To address this, here we demonstrate the use of single-pulse laser interference for direct patterning of an amorphous silico…
▽ More
High-index dielectric metasurfaces featuring Mie-type electric and magnetic resonances have been of a great interest in a variety of applications such as imaging, sensing, photovoltaics and others, which led to the necessity of an efficient large-scale fabrication technique. To address this, here we demonstrate the use of single-pulse laser interference for direct patterning of an amorphous silicon film into an array of Mie resonators. The proposed technique is based on laser-interference-induced dewetting. A precise control of the laser pulse energy enables the fabrication of ordered dielectric metasurfaces in areas spanning tens of micrometers and consisting of thousands of hemispherical nanoparticles with a single laser shot. The fabricated nanoparticles exhibit a wavelength-dependent optical response with a strong electric dipole signature. Variation of the pre-deposited silicon film thickness allows tailoring of the resonances in the targeted visible and infrared spectral ranges. Such direct and high-throughput fabrication paves the way towards a simple realization of spatially invariant metasurface-based devices.
△ Less
Submitted 6 March, 2020;
originally announced March 2020.
-
Integrating data science ethics into an undergraduate major: A case study
Authors:
Benjamin S. Baumer,
Randi L. Garcia,
Albert Y. Kim,
Katherine M. Kinnaird,
Miles Q. Ott
Abstract:
We present a programmatic approach to incorporating ethics into an undergraduate major in statistical and data sciences. We discuss departmental-level initiatives designed to meet the National Academy of Sciences recommendation for integrating ethics into the curriculum from top-to-bottom as our majors progress from our introductory courses to our senior capstone course, as well as from side-to-si…
▽ More
We present a programmatic approach to incorporating ethics into an undergraduate major in statistical and data sciences. We discuss departmental-level initiatives designed to meet the National Academy of Sciences recommendation for integrating ethics into the curriculum from top-to-bottom as our majors progress from our introductory courses to our senior capstone course, as well as from side-to-side through co-curricular programming. We also provide six examples of data science ethics modules used in five different courses at our liberal arts college, each focusing on a different ethical consideration. The modules are designed to be portable such that they can be flexibly incorporated into existing courses at different levels of instruction with minimal disruption to syllabi. We connect our efforts to a growing body of literature on the teaching of data science ethics, present assessments of our effectiveness, and conclude with next steps and final thoughts.
△ Less
Submitted 31 January, 2022; v1 submitted 21 January, 2020;
originally announced January 2020.
-
Laser-Induced Spatially-Selective Tailoring of High-Index Dielectric Metasurfaces
Authors:
Jonas Berzinš,
Simonas Indrišiūnas,
Stefan Fasold,
Michael Steinert,
Olga Žukovskaja,
Dana Cialla-May,
Paulius Gečys,
Stefan M. B. Bäumer,
Thomas Pertsch,
Frank Setzpfandt
Abstract:
Optically resonant high-index dielectric metasurfaces featuring Mie-type electric and magnetic resonances are usually fabricated by means of planar technologies, which limit the degrees of freedom in tunability and scalability of the fabricated systems. Therefore, we propose a complimentary post-processing technique based on ultrashort ($\leq$ 10 ps) laser pulses. The process involves thermal effe…
▽ More
Optically resonant high-index dielectric metasurfaces featuring Mie-type electric and magnetic resonances are usually fabricated by means of planar technologies, which limit the degrees of freedom in tunability and scalability of the fabricated systems. Therefore, we propose a complimentary post-processing technique based on ultrashort ($\leq$ 10 ps) laser pulses. The process involves thermal effects: crystallization and reshaping, while the heat is localized by a high-precision positioning of the focused laser beam. Moreover, for the first time, the resonant behavior of dielectric metasurface elements is exploited to engineer a specific absorption profile, which leads to a spatially-selective heating and a customized modification. Such technique has a potential to reduce the complexity in the fabrication of non-uniform metasurface-based optical elements. Two distinct cases, a spatial pixelation of a large-scale metasurface and a height modification of metasurface elements, are explicitly demonstrated.
△ Less
Submitted 11 October, 2019; v1 submitted 11 July, 2019;
originally announced July 2019.
-
Reflection confocal nanoscopy using a super-oscillatory lens
Authors:
Arvind Nagarajan,
L. Pjotr Stoevelaar,
Fabrizio Silvestri,
Marijn Siemons,
Venu Gopal Achanta,
Stefan M. B. Bäumer,
Giampiero Gerini
Abstract:
A Superoscillatory lens (SOL) is known to produce a sub-diffraction hotspot which is useful for high-resolution imaging. However, high-energy rings called sidelobes coexist with the central hotspot. Additionally, SOLs have not yet been directly used to image reflective objects due to low efficiency and poor imaging properties. We propose a novel reflection confocal nanoscope which mitigates these…
▽ More
A Superoscillatory lens (SOL) is known to produce a sub-diffraction hotspot which is useful for high-resolution imaging. However, high-energy rings called sidelobes coexist with the central hotspot. Additionally, SOLs have not yet been directly used to image reflective objects due to low efficiency and poor imaging properties. We propose a novel reflection confocal nanoscope which mitigates these issues by relaying the SOL intensity pattern onto the object and use conventional optics for detection. We experimentally demonstrate super-resolution by imaging double bars with 330 nm separation using a 632.8 nm excitation and a 0.95 NA objective. We also discuss the enhanced contrast properties of the SOL nanoscope against a laser confocal microscope, and the degradation of performance while imaging large objects.
△ Less
Submitted 21 March, 2019;
originally announced March 2019.
-
Sub-micrometer Nanostructure-based RGB Filters for CMOS Image Sensors
Authors:
Jonas Berzinš,
Stefan Fasold,
Thomas Pertsch,
Stefan M. B. Bäumer,
Frank Setzpfandt
Abstract:
Digital color imaging relies on spectral filters on top of a pixelated sensor, such as a CMOS image sensor. An important parameter of imaging devices is their resolution, which depends on the size of the pixels. For many applications, a high resolution is desirable, consequently requiring small spectral filters. Dielectric nanostructures, due to their resonant behavior and its tunability, offer th…
▽ More
Digital color imaging relies on spectral filters on top of a pixelated sensor, such as a CMOS image sensor. An important parameter of imaging devices is their resolution, which depends on the size of the pixels. For many applications, a high resolution is desirable, consequently requiring small spectral filters. Dielectric nanostructures, due to their resonant behavior and its tunability, offer the possibility to be assembled into flexible and miniature spectral filters, which could potentially replace conventional pigmented and dye-based color filters. In this paper, we demonstrate the generation of transmissive structural colors based on uniform-height amorphous silicon nanostructures. We optimize the structures for the primary RGB colors and report the construction of sub-micrometer RGB filter arrays for a pixel size down to 0.5 μm.
△ Less
Submitted 4 December, 2018;
originally announced December 2018.
-
Curriculum Guidelines for Undergraduate Programs in Data Science
Authors:
Richard De Veaux,
Mahesh Agarwal,
Maia Averett,
Benjamin Baumer,
Andrew Bray,
Thomas Bressoud,
Lance Bryant,
Lei Cheng,
Amanda Francis,
Robert Gould,
Albert Y. Kim,
Matt Kretchmar,
Qin Lu,
Ann Moskol,
Deborah Nolan,
Roberto Pelayo,
Sean Raleigh,
Ricky J. Sethi,
Mutiara Sondjaja,
Neelesh Tiruviluamala,
Paul Uhlig,
Talitha Washington,
Curtis Wesley,
David White,
Ping Ye
Abstract:
The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for insti…
▽ More
The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for institutions planning for or revising a major in Data Science.
△ Less
Submitted 21 January, 2018;
originally announced January 2018.
-
Greater data science at baccalaureate institutions
Authors:
Amelia McNamara,
Nicholas J. Horton,
Benjamin S. Baumer
Abstract:
Donoho's JCGS (in press) paper is a spirited call to action for statisticians, who he points out are losing ground in the field of data science by refusing to accept that data science is its own domain. (Or, at least, a domain that is becoming distinctly defined.) He calls on writings by John Tukey, Bill Cleveland, and Leo Breiman, among others, to remind us that statisticians have been dealing wi…
▽ More
Donoho's JCGS (in press) paper is a spirited call to action for statisticians, who he points out are losing ground in the field of data science by refusing to accept that data science is its own domain. (Or, at least, a domain that is becoming distinctly defined.) He calls on writings by John Tukey, Bill Cleveland, and Leo Breiman, among others, to remind us that statisticians have been dealing with data science for years, and encourages acceptance of the direction of the field while also ensuring that statistics is tightly integrated.
As faculty at baccalaureate institutions (where the growth of undergraduate statistics programs has been dramatic), we are keen to ensure statistics has a place in data science and data science education. In his paper, Donoho is primarily focused on graduate education. At our undergraduate institutions, we are considering many of the same questions.
△ Less
Submitted 24 October, 2017;
originally announced October 2017.
-
A Grammar for Reproducible and Painless Extract-Transform-Load Operations on Medium Data
Authors:
Benjamin S. Baumer
Abstract:
Many interesting data sets available on the Internet are of a medium size---too big to fit into a personal computer's memory, but not so large that they won't fit comfortably on its hard disk. In the coming years, data sets of this magnitude will inform vital research in a wide array of application domains. However, due to a variety of constraints they are cumbersome to ingest, wrangle, analyze, a…
▽ More
Many interesting data sets available on the Internet are of a medium size---too big to fit into a personal computer's memory, but not so large that they won't fit comfortably on its hard disk. In the coming years, data sets of this magnitude will inform vital research in a wide array of application domains. However, due to a variety of constraints they are cumbersome to ingest, wrangle, analyze, and share in a reproducible fashion. These obstructions hamper thorough peer-review and thus disrupt the forward progress of science. We propose a predictable and pipeable framework for R (the state-of-the-art statistical computing environment) that leverages SQL (the venerable database architecture and query language) to make reproducible research on medium data a painless reality.
△ Less
Submitted 23 May, 2018; v1 submitted 23 August, 2017;
originally announced August 2017.
-
Boundary Conditions for Fractional Diffusion
Authors:
Boris Baeumer,
Mihály Kovács,
Mark M. Meerschaert,
Harish Sankaranarayanan
Abstract:
This paper derives physically meaningful boundary conditions for fractional diffusion equations, using a mass balance approach. Numerical solutions are presented, and theoretical properties are reviewed, including well-posedness and steady state solutions. Absorbing and reflecting boundary conditions are considered, and illustrated through several examples. Reflecting boundary conditions involve f…
▽ More
This paper derives physically meaningful boundary conditions for fractional diffusion equations, using a mass balance approach. Numerical solutions are presented, and theoretical properties are reviewed, including well-posedness and steady state solutions. Absorbing and reflecting boundary conditions are considered, and illustrated through several examples. Reflecting boundary conditions involve fractional derivatives. The Caputo fractional derivative is shown to be unsuitable for modeling fractional diffusion, since the resulting boundary value problem is not positivity preserving.
△ Less
Submitted 24 June, 2017;
originally announced June 2017.
-
Fractional Partial Differential Equations with Boundary Conditions
Authors:
Boris Baeumer,
Mihály Kovács,
Harish Sankaranarayanan
Abstract:
We identify the stochastic processes associated with one-sided fractional partial differential equations on a bounded domain with various boundary conditions. This is essential for modelling using spatial fractional derivatives. We show well-posedness of the associated Cauchy problems in $C_0(Ω)$ and $L_1(Ω)$. In order to do so we develop a new method of embedding finite state Markov processes int…
▽ More
We identify the stochastic processes associated with one-sided fractional partial differential equations on a bounded domain with various boundary conditions. This is essential for modelling using spatial fractional derivatives. We show well-posedness of the associated Cauchy problems in $C_0(Ω)$ and $L_1(Ω)$. In order to do so we develop a new method of embedding finite state Markov processes into Feller processes and then show convergence of the respective Feller processes. This also gives a numerical approximation of the solution. The proof of well-posedness closes a gap in many numerical algorithm articles approximating solutions to fractional differential equations that use the Lax-Richtmyer Equivalence Theorem to prove convergence without checking well-posedness.
△ Less
Submitted 22 June, 2017;
originally announced June 2017.
-
How often does the best team win? A unified approach to understanding randomness in North American sport
Authors:
Michael J. Lopez,
Gregory J. Matthews,
Benjamin S. Baumer
Abstract:
Statistical applications in sports have long centered on how to best separate signal (e.g. team talent) from random noise. However, most of this work has concentrated on a single sport, and the development of meaningful cross-sport comparisons has been impeded by the difficulty of translating luck from one sport to another. In this manuscript, we develop Bayesian state-space models using betting m…
▽ More
Statistical applications in sports have long centered on how to best separate signal (e.g. team talent) from random noise. However, most of this work has concentrated on a single sport, and the development of meaningful cross-sport comparisons has been impeded by the difficulty of translating luck from one sport to another. In this manuscript, we develop Bayesian state-space models using betting market data that can be uniformly applied across sporting organizations to better understand the role of randomness in game outcomes. These models can be used to extract estimates of team strength, the between-season, within-season, and game-to-game variability of team strengths, as well each team's home advantage. We implement our approach across a decade of play in each of the National Football League (NFL), National Hockey League (NHL), National Basketball Association (NBA), and Major League Baseball (MLB), finding that the NBA demonstrates both the largest dispersion in talent and the largest home advantage, while the NHL and MLB stand out for their relative randomness in game outcomes. We conclude by proposing new metrics for judging competitiveness across sports leagues, both within the regular season and using traditional postseason tournament formats. Although we focus on sports, we discuss a number of other situations in which our generalizable models might be usefully applied.
△ Less
Submitted 22 November, 2017; v1 submitted 20 January, 2017;
originally announced January 2017.
-
Space-time fractional Dirichlet problems
Authors:
Boris Baeumer,
Tomasz Luks,
Mark M. Meerschaert
Abstract:
This paper establishes explicit solutions for fractional diffusion problems on bounded domains. It also gives stochastic solutions, in terms of Markov processes time-changed by an inverse stable subordinator whose index equals the order of the fractional time derivative. Some applications are given, to demonstrate how to specify a well-posed Dirichlet problem for space-time fractional diffusions i…
▽ More
This paper establishes explicit solutions for fractional diffusion problems on bounded domains. It also gives stochastic solutions, in terms of Markov processes time-changed by an inverse stable subordinator whose index equals the order of the fractional time derivative. Some applications are given, to demonstrate how to specify a well-posed Dirichlet problem for space-time fractional diffusions in one or several variables. This solves an open problem in numerical analysis.
△ Less
Submitted 19 April, 2016;
originally announced April 2016.
-
We Found the Smallest Non-Autograph
Authors:
Ben S. Baumer,
Yijin Wei,
Gary S. Bloom
Abstract:
Suppose that $G$ is a simple, vertex-labeled graph and that $S$ is a multiset. Then if there exists a one-to-one mapping between the elements of $S$ and the vertices of $G$, such that edges in $G$ exist if and only if the absolute difference of the corresponding vertex labels exist in $S$, then $G$ is an \emph{autograph}, and $S$ is a \emph{signature} for $G$. While it is known that many common fa…
▽ More
Suppose that $G$ is a simple, vertex-labeled graph and that $S$ is a multiset. Then if there exists a one-to-one mapping between the elements of $S$ and the vertices of $G$, such that edges in $G$ exist if and only if the absolute difference of the corresponding vertex labels exist in $S$, then $G$ is an \emph{autograph}, and $S$ is a \emph{signature} for $G$. While it is known that many common families are graphs are autographs, and that infinitely many graphs are not autographs, a non-autograph has never been exhibited. In this paper, we identify the smallest non-autograph: a graph with 6 vertices and 11 edges. Furthermore, we demonstrate that the infinite family of graphs on $n$ vertices consisting of the complement of two non-intersecting cycles contains only non-autographs for $n \geq 8$.
△ Less
Submitted 12 November, 2015;
originally announced November 2015.
-
A Data Science Course for Undergraduates: Thinking with Data
Authors:
Ben Baumer
Abstract:
Data science is an emerging interdisciplinary field that combines elements of mathematics, statistics, computer science, and knowledge in a particular application domain for the purpose of extracting meaningful information from the increasingly sophisticated array of data available in many settings. These data tend to be non-traditional, in the sense that they are often live, large, complex, and/o…
▽ More
Data science is an emerging interdisciplinary field that combines elements of mathematics, statistics, computer science, and knowledge in a particular application domain for the purpose of extracting meaningful information from the increasingly sophisticated array of data available in many settings. These data tend to be non-traditional, in the sense that they are often live, large, complex, and/or messy. A first course in statistics at the undergraduate level typically introduces students with a variety of techniques to analyze small, neat, and clean data sets. However, whether they pursue more formal training in statistics or not, many of these students will end up working with data that is considerably more complex, and will need facility with statistical computing techniques. More importantly, these students require a framework for thinking structurally about data. We describe an undergraduate course in a liberal arts environment that provides students with the tools necessary to apply data science. The course emphasizes modern, practical, and useful skills that cover the full data analysis spectrum, from asking an interesting question to acquiring, managing, manipulating, processing, querying, analyzing, and visualizing data, as well communicating findings in written, graphical, and oral forms.
△ Less
Submitted 18 March, 2015;
originally announced March 2015.
-
Setting the stage for data science: integration of data management skills in introductory and second courses in statistics
Authors:
Nicholas J. Horton,
Benjamin S. Baumer,
Hadley Wickham
Abstract:
Many have argued that statistics students need additional facility to express statistical computations. By introducing students to commonplace tools for data management, visualization, and reproducible analysis in data science and applying these to real-world scenarios, we prepare them to think statistically. In an era of increasingly big data, it is imperative that students develop data-related c…
▽ More
Many have argued that statistics students need additional facility to express statistical computations. By introducing students to commonplace tools for data management, visualization, and reproducible analysis in data science and applying these to real-world scenarios, we prepare them to think statistically. In an era of increasingly big data, it is imperative that students develop data-related capacities, beginning with the introductory course. We believe that the integration of these precursors to data science into our curricula-early and often-will help statisticians be part of the dialogue regarding "Big Data" and "Big Questions".
△ Less
Submitted 1 February, 2015;
originally announced February 2015.
-
R Markdown
Authors:
Dana Udwin,
Ben Baumer
Abstract:
Reproducibility is increasingly important to statistical research, but many details are often omitted from the published version of complex statistical analyses. A reader's comprehension is limited to what the author concludes, without exposure to the computational process. Often, the industrious reader cannot expand upon or validate the author's results. Even the author may struggle to reproduce…
▽ More
Reproducibility is increasingly important to statistical research, but many details are often omitted from the published version of complex statistical analyses. A reader's comprehension is limited to what the author concludes, without exposure to the computational process. Often, the industrious reader cannot expand upon or validate the author's results. Even the author may struggle to reproduce their own results upon revisiting them. R Markdown is an authoring syntax that combines the ease of Markdown with the statistical programming language R. An R Markdown document or presentation interweaves computation, output and written analysis to the effect of transparency, clarity and an inherent invitation to reproduce (especially as sharing data is now as easy as the click of a button). It is an open-source tool that can be used either on its own or through the RStudio integrated development environment (IDE). In addition to facilitating reproducible research, R Markdown is a boon to collaboratively-minded data analysts, whose workflow can be streamlined by sharing only one master document that contains both code and content. Statistics educators may also find that R Markdown is helpful as a homework template, for both ease-of-use and in discouraging students from copy-and-pasting results from classmates. Training students in R Markdown will introduce to the workforce a new class of data analysts with an ingrained, foundational inclination toward reproducible research.
△ Less
Submitted 7 January, 2015;
originally announced January 2015.
-
Fokker--Planck and Kolmogorov Backward Equations for Continuous Time Random Walk scaling limits
Authors:
Boris Baeumer,
Peter Straka
Abstract:
It is proved that the distributions of scaling limits of Continuous Time Random Walks (CTRWs) solve integro-differential equations akin to Fokker-Planck Equations for diffusion processes. In contrast to previous such results, it is not assumed that the underlying process has absolutely continuous laws. Moreover, governing equations in the backward variables are derived. Three examples of anomalous…
▽ More
It is proved that the distributions of scaling limits of Continuous Time Random Walks (CTRWs) solve integro-differential equations akin to Fokker-Planck Equations for diffusion processes. In contrast to previous such results, it is not assumed that the underlying process has absolutely continuous laws. Moreover, governing equations in the backward variables are derived. Three examples of anomalous diffusion processes illustrate the theory.
△ Less
Submitted 19 July, 2016; v1 submitted 3 January, 2015;
originally announced January 2015.
-
Existence, uniqueness and regularity for a class of semilinear stochastic Volterra equations with multiplicative noise
Authors:
Boris Baeumer,
Matthias Geissert,
Mihaly Kovacs
Abstract:
We consider a class of semilinear Volterra type stochastic evolution equation driven by multiplicative Gaussian noise. The memory kernel, not necessarily analytic, is such that the deterministic linear equation exhibits a parabolic character. Under appropriate Lipschitz-type and linear growth assumptions on the nonlinear terms we show that the unique mild solution is mean-$p$ Hölder continuous wit…
▽ More
We consider a class of semilinear Volterra type stochastic evolution equation driven by multiplicative Gaussian noise. The memory kernel, not necessarily analytic, is such that the deterministic linear equation exhibits a parabolic character. Under appropriate Lipschitz-type and linear growth assumptions on the nonlinear terms we show that the unique mild solution is mean-$p$ Hölder continuous with values in an appropriate Sobolev space depending on the kernel and the data. In particular, we obtain pathwise space-time (Sobolev-Hölder) regularity of the solution together with a maximal type bound on the spatial Sobolev norm. As one of the main technical tools we establish a smoothing property of the derivative of the deterministic evolution operator family.
△ Less
Submitted 14 May, 2014; v1 submitted 15 April, 2014;
originally announced April 2014.
-
R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics
Authors:
Ben Baumer,
Mine Cetinkaya-Rundel,
Andrew Bray,
Linda Loi,
Nicholas J. Horton
Abstract:
Nolan and Temple Lang argue that "the ability to express statistical computations is an essential skill." A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as…
▽ More
Nolan and Temple Lang argue that "the ability to express statistical computations is an essential skill." A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as data become increasingly complex and statistical methods become increasingly sophisticated. R Markdown is a new technology that makes creating fully-reproducible statistical analysis simple and painless. It provides a solution suitable not only for cutting edge research, but also for use in an introductory statistics course. We present evidence that R Markdown can be used effectively in introductory statistics courses, and discuss its role in the rapidly-changing world of statistical computation.
△ Less
Submitted 8 February, 2014;
originally announced February 2014.
-
Teaching precursors to data science in introductory and second courses in statistics
Authors:
Nicholas J Horton,
Benjamin S Baumer,
Hadley Wickham
Abstract:
Statistics students need to develop the capacity to make sense of the staggering amount of information collected in our increasingly data-centered world. Data science is an important part of modern statistics, but our introductory and second statistics courses often neglect this fact. This paper discusses ways to provide a practical foundation for students to learn to "compute with data" as define…
▽ More
Statistics students need to develop the capacity to make sense of the staggering amount of information collected in our increasingly data-centered world. Data science is an important part of modern statistics, but our introductory and second statistics courses often neglect this fact. This paper discusses ways to provide a practical foundation for students to learn to "compute with data" as defined by Nolan and Temple Lang (2010), as well as develop "data habits of mind" (Finzer, 2013). We describe how introductory and second courses can integrate two key precursors to data science: the use of reproducible analysis tools and access to large databases. By introducing students to commonplace tools for data management, visualization, and reproducible analysis in data science and applying these to real-world scenarios, we prepare them to think statistically in the era of big data.
△ Less
Submitted 14 January, 2014;
originally announced January 2014.
-
openWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball
Authors:
Benjamin S. Baumer,
Shane T. Jensen,
Gregory J. Matthews
Abstract:
Within baseball analytics, there is substantial interest in comprehensive statistics intended to capture overall player performance. One such measure is Wins Above Replacement (WAR), which aggregates the contributions of a player in each facet of the game: hitting, pitching, baserunning, and fielding. However, current versions of WAR depend upon proprietary data, ad hoc methodology, and opaque cal…
▽ More
Within baseball analytics, there is substantial interest in comprehensive statistics intended to capture overall player performance. One such measure is Wins Above Replacement (WAR), which aggregates the contributions of a player in each facet of the game: hitting, pitching, baserunning, and fielding. However, current versions of WAR depend upon proprietary data, ad hoc methodology, and opaque calculations. We propose a competitive aggregate measure, openWAR, that is based upon public data and methodology with greater rigor and transparency. We discuss a principled standard for the nebulous concept of a "replacement" player. Finally, we use simulation-based techniques to provide interval estimates for our openWAR measure.
△ Less
Submitted 24 March, 2015; v1 submitted 26 December, 2013;
originally announced December 2013.
-
Incorporating the influence of sub-grid heterogeneity in regional-scale contaminant transport models
Authors:
Boris Baeumer,
Yong Zhang,
Rina Schumer
Abstract:
Numerical transport models based on the advection-dispersion equation (ADE) are built on the assumption that sub-grid cell transport is Fickian such that dispersive spreading around the average velocity is symmetric and without significant tailing on the front edge of a solute plume. However, anomalous diffusion in the form of super-diffusion due to preferential pathways in an aquifer has been obs…
▽ More
Numerical transport models based on the advection-dispersion equation (ADE) are built on the assumption that sub-grid cell transport is Fickian such that dispersive spreading around the average velocity is symmetric and without significant tailing on the front edge of a solute plume. However, anomalous diffusion in the form of super-diffusion due to preferential pathways in an aquifer has been observed in field data, challenging the assumption of Fickian dispersion at the local scale. This study develops a fully Lagrangian method to simulate sub-grid super-diffusion in a multi-dimensional regional-scale transport. The underlying concept is based on previous observations that solutions to space-fractional ADEs, which can describe super-diffusive dispersion, can be obtained by transforming solutions of classical ADEs. The transformations are equivalent to randomizing particle travel time or relative velocity for each model time step. Here, the time randomizing procedure known as subordination is applied to flow field output from MODFLOW simulations. Numerical tests check the applicability of the novel method in mapping regional-scale super-diffusive transport conditioned on local properties of multi-dimensional heterogeneous media.
△ Less
Submitted 11 July, 2013;
originally announced July 2013.
-
Reflected Spectrally Negative Stable Processes and their Governing Equations
Authors:
Boris Baeumer,
Mihály Kovács,
Mark M. Meerschaert,
René L. Schilling,
Peter Straka
Abstract:
This paper explicitly computes the transition densities of a spectrally negative stable process with index greater than one, reflected at its infimum. First we derive the forward equation using the theory of sun-dual semigroups. The resulting forward equation is a boundary value problem on the positive half-line that involves a negative Riemann-Liouville fractional derivative in space, and a fract…
▽ More
This paper explicitly computes the transition densities of a spectrally negative stable process with index greater than one, reflected at its infimum. First we derive the forward equation using the theory of sun-dual semigroups. The resulting forward equation is a boundary value problem on the positive half-line that involves a negative Riemann-Liouville fractional derivative in space, and a fractional reflecting boundary condition at the origin. Then we apply numerical methods to explicitly compute the transition density of this space-inhomogeneous Markov process, for any starting point, to any desired degree of accuracy. Finally, we discuss an application to fractional Cauchy problems, which involve a positive Caputo fractional derivative in time.
△ Less
Submitted 24 November, 2016; v1 submitted 23 January, 2013;
originally announced January 2013.
-
Higher order Grünwald approximations of fractional derivatives and fractional powers of operators
Authors:
Boris Baeumer,
Mihály Kovács,
Harish Sankaranarayanan
Abstract:
We give stability and consistency results for higher order Grünwald-type formulae used in the approximation of solutions to fractional-in-space partial differential equations. We use a new Carlson-type inequality for periodic Fourier multipliers to gain regularity and stability results. We then generalise the theory to the case where the first derivative operator is replaced by the generator of a…
▽ More
We give stability and consistency results for higher order Grünwald-type formulae used in the approximation of solutions to fractional-in-space partial differential equations. We use a new Carlson-type inequality for periodic Fourier multipliers to gain regularity and stability results. We then generalise the theory to the case where the first derivative operator is replaced by the generator of a bounded group on an arbitrary Banach space.
△ Less
Submitted 9 October, 2012;
originally announced October 2012.
-
Set It and Forget It: Approximating the Set Once Strip Cover Problem
Authors:
Amotz Bar-Noy,
Ben Baumer,
Dror Rawitz
Abstract:
We consider the Set Once Strip Cover problem, in which n wireless sensors are deployed over a one-dimensional region. Each sensor has a fixed battery that drains in inverse proportion to a radius that can be set just once, but activated at any time. The problem is to find an assignment of radii and activation times that maximizes the length of time during which the entire region is covered. We sho…
▽ More
We consider the Set Once Strip Cover problem, in which n wireless sensors are deployed over a one-dimensional region. Each sensor has a fixed battery that drains in inverse proportion to a radius that can be set just once, but activated at any time. The problem is to find an assignment of radii and activation times that maximizes the length of time during which the entire region is covered. We show that this problem is NP-hard. Second, we show that RoundRobin, the algorithm in which the sensors simply take turns covering the entire region, has a tight approximation guarantee of 3/2 in both Set Once Strip Cover and the more general Strip Cover problem, in which each radius may be set finitely-many times. Moreover, we show that the more general class of duty cycle algorithms, in which groups of sensors take turns covering the entire region, can do no better. Finally, we give an optimal O(n^2 log n)-time algorithm for the related Set Radius Strip Cover problem, in which all sensors must be activated immediately.
△ Less
Submitted 16 August, 2013; v1 submitted 4 April, 2012;
originally announced April 2012.
-
Space-time duality for fractional diffusion
Authors:
Boris Baeumer,
Mark M. Meerschaert,
Erkan Nane
Abstract:
Zolotarev proved a duality result that relates stable densities with different indices. In this paper, we show how Zolotarev duality leads to some interesting results on fractional diffusion. Fractional diffusion equations employ fractional derivatives in place of the usual integer order derivatives. They govern scaling limits of random walk models, with power law jumps leading to fractional der…
▽ More
Zolotarev proved a duality result that relates stable densities with different indices. In this paper, we show how Zolotarev duality leads to some interesting results on fractional diffusion. Fractional diffusion equations employ fractional derivatives in place of the usual integer order derivatives. They govern scaling limits of random walk models, with power law jumps leading to fractional derivatives in space, and power law waiting times between the jumps leading to fractional derivatives in time. The limit process is a stable Lévy motion that models the jumps, subordinated to an inverse stable process that models the waiting times. Using duality, we relate the density of a spectrally negative stable process with index $1<α<2$ to the density of the hitting time of a stable subordinator with index $1/α$, and thereby unify some recent results in the literature. These results also provide a concrete interpretation of Zolotarev duality in terms of the fractional diffusion model.
△ Less
Submitted 7 April, 2009;
originally announced April 2009.
-
Predicting the Drug Release Kinetics of Matrix Tablets
Authors:
Boris Baeumer,
Lipika Chatterjee,
Peter Hinow,
Thomas Rades,
Ami Radunskaya,
Ian Tucker
Abstract:
In this paper we develop two mathematical models to predict the release kinetics of a water soluble drug from a polymer/excipient matrix tablet. The first of our models consists of a random walk on a weighted graph, where the vertices of the graph represent particles of drug, excipient and polymer, respectively. The graph itself is the contact graph of a multidisperse random sphere packing. The…
▽ More
In this paper we develop two mathematical models to predict the release kinetics of a water soluble drug from a polymer/excipient matrix tablet. The first of our models consists of a random walk on a weighted graph, where the vertices of the graph represent particles of drug, excipient and polymer, respectively. The graph itself is the contact graph of a multidisperse random sphere packing. The second model describes the dissolution and the subsequent diffusion of the active drug out of a porous matrix using a system of partial differential equations. The predictions of both models show good qualitative agreement with experimental release curves. The models will provide tools for designing better controlled release devices.
△ Less
Submitted 6 April, 2009; v1 submitted 29 October, 2008;
originally announced October 2008.
-
Brownian subordinators and fractional Cauchy problems
Authors:
Boris Baeumer,
Mark M. Meerschaert,
Erkan Nane
Abstract:
A Brownian time process is a Markov process subordinated to the absolute value of an independent one-dimensional Brownian motion. Its transition densities solve an initial value problem involving the square of the generator of the original Markov process. An apparently unrelated class of processes, emerging as the scaling limits of continuous time random walks, involve subordination to the inver…
▽ More
A Brownian time process is a Markov process subordinated to the absolute value of an independent one-dimensional Brownian motion. Its transition densities solve an initial value problem involving the square of the generator of the original Markov process. An apparently unrelated class of processes, emerging as the scaling limits of continuous time random walks, involve subordination to the inverse or hitting time process of a classical stable subordinator. The resulting densities solve fractional Cauchy problems, an extension that involves fractional derivatives in time. In this paper, we will show a close and unexpected connection between these two classes of processes, and consequently, an equivalence between these two families of partial differential equations.
△ Less
Submitted 9 May, 2007; v1 submitted 1 May, 2007;
originally announced May 2007.