-
Journey-Based Transit Equity Analysis: A Case Study in the Greater Boston Area
Authors:
Daniela Shuman,
Xiaotong Guo,
Nicholas S. Caros
Abstract:
In this paper, a new methodology, journey-based equity analysis, is presented for measuring the equity of transit convenience between income groups. Two data sources are combined in the proposed transit equity analysis: on-board ridership surveys and passenger origin-destination data. The spatial unit of our proposed transit equity analysis is census blocks, which are relatively stable over time a…
▽ More
In this paper, a new methodology, journey-based equity analysis, is presented for measuring the equity of transit convenience between income groups. Two data sources are combined in the proposed transit equity analysis: on-board ridership surveys and passenger origin-destination data. The spatial unit of our proposed transit equity analysis is census blocks, which are relatively stable over time and allows an exploration of the data that is granular enough to make conclusions about the service convenience various communities are facing. A case study in the Greater Boston area using real data from the Massachusetts Bay Transportation Authority (MBTA) bus network demonstrates a significant difference in transit service convenience, measured by number of transfers per unit distance, transfer wait time and travel time per unit distance, between low-income riders and high income riders. Implications of analysis results to transit agencies are also discussed.
△ Less
Submitted 3 August, 2024;
originally announced August 2024.
-
Inferring Mobility of Care Travel Behavior From Transit Origin-Destination Data
Authors:
Awad Abdelhalim,
Daniela Shuman,
Anson F Stewart,
Kayleigh B Campbell,
Mira Patel,
Ines Sanchez de Madariaga,
Jinhua Zhao
Abstract:
There are substantial differences in travel behavior by gender on public transit. Studies have concluded that these differences are largely attributable to household responsibilities typically falling disproportionately on women, leading to women being more likely to utilize transit for purposes referred to by the umbrella concept of "mobility of care". In contrast to past studies that have quanti…
▽ More
There are substantial differences in travel behavior by gender on public transit. Studies have concluded that these differences are largely attributable to household responsibilities typically falling disproportionately on women, leading to women being more likely to utilize transit for purposes referred to by the umbrella concept of "mobility of care". In contrast to past studies that have quantified the impact of gender using survey and qualitative data, we propose a novel data-driven workflow utilizing a combination of previously developed origin, destination, and transfer inference (ODX) based on individual transit fare card transactions, name-based gender inference, and geospatial analysis as a framework to identify mobility of care trip making. We apply this framework to data from the Washington Metropolitan Area Transit Authority (WMATA). Analyzing data from millions of journeys conducted in the first quarter of 2019, the results of this study show that our proposed workflow can identify mobility of care travel behavior, detecting times and places of interest where the share of women travelers in an equally-sampled subset (on basis of inferred gender) of transit users is 10% - 15% higher than that of men. The workflow presented in this study provides a blueprint for combining transit origin-destination data, inferred customer demographics, and geospatial analyses enabling public transit agencies to assess, at the fare card level, the gendered impacts of different policy and operational decisions.
△ Less
Submitted 2 September, 2024; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Signal Processing on the Permutahedron: Tight Spectral Frames for Ranked Data Analysis
Authors:
Yilin Chen,
Jennifer DeJong,
Tom Halverson,
David I Shuman
Abstract:
Ranked data sets, where m judges/voters specify a preference ranking of n objects/candidates, are increasingly prevalent in contexts such as political elections, computer vision, recommender systems, and bioinformatics. The vote counts for each ranking can be viewed as an n! data vector lying on the permutahedron, which is a Cayley graph of the symmetric group with vertices labeled by permutations…
▽ More
Ranked data sets, where m judges/voters specify a preference ranking of n objects/candidates, are increasingly prevalent in contexts such as political elections, computer vision, recommender systems, and bioinformatics. The vote counts for each ranking can be viewed as an n! data vector lying on the permutahedron, which is a Cayley graph of the symmetric group with vertices labeled by permutations and an edge when two permutations differ by an adjacent transposition. Leveraging combinatorial representation theory and recent progress in signal processing on graphs, we investigate a novel, scalable transform method to interpret and exploit structure in ranked data. We represent data on the permutahedron using an overcomplete dictionary of atoms, each of which captures both smoothness information about the data (typically the focus of spectral graph decomposition methods in graph signal processing) and structural information about the data (typically the focus of symmetry decomposition methods from representation theory). These atoms have a more naturally interpretable structure than any known basis for signals on the permutahedron, and they form a Parseval frame, ensuring beneficial numerical properties such as energy preservation. We develop specialized algorithms and open software that take advantage of the symmetry and structure of the permutahedron to improve the scalability of the proposed method, making it more applicable to the high-dimensional ranked data found in applications.
△ Less
Submitted 15 August, 2021; v1 submitted 6 March, 2021;
originally announced March 2021.
-
Global and Local Uncertainty Principles for Signals on Graphs
Authors:
Nathanael Perraudin,
Benjamin Ricaud,
David Shuman,
Pierre Vandergheynst
Abstract:
Uncertainty principles such as Heisenberg's provide limits on the time-frequency concentration of a signal, and constitute an important theoretical tool for designing and evaluating linear signal transforms. Generalizations of such principles to the graph setting can inform dictionary design for graph signals, lead to algorithms for reconstructing missing information from graph signals via sparse…
▽ More
Uncertainty principles such as Heisenberg's provide limits on the time-frequency concentration of a signal, and constitute an important theoretical tool for designing and evaluating linear signal transforms. Generalizations of such principles to the graph setting can inform dictionary design for graph signals, lead to algorithms for reconstructing missing information from graph signals via sparse representations, and yield new graph analysis tools. While previous work has focused on generalizing notions of spreads of a graph signal in the vertex and graph spectral domains, our approach is to generalize the methods of Lieb in order to develop uncertainty principles that provide limits on the concentration of the analysis coefficients of any graph signal under a dictionary transform whose atoms are jointly localized in the vertex and graph spectral domains. One challenge we highlight is that due to the inhomogeneity of the underlying graph data domain, the local structure in a single small region of the graph can drastically affect the uncertainty bounds for signals concentrated in different regions of the graph, limiting the information provided by global uncertainty principles. Accordingly, we suggest a new way to incorporate a notion of locality, and develop local uncertainty principles that bound the concentration of the analysis coefficients of each atom of a localized graph spectral filter frame in terms of quantities that depend on the local structure of the graph around the center vertex of the given atom. Finally, we demonstrate how our proposed local uncertainty measures can improve the random sampling of graph signals.
△ Less
Submitted 9 March, 2016;
originally announced March 2016.
-
UNLocBoX: A MATLAB convex optimization toolbox for proximal-splitting methods
Authors:
Nathanael Perraudin,
Vassilis Kalofolias,
David Shuman,
Pierre Vandergheynst
Abstract:
Convex optimization is an essential tool for machine learning, as many of its problems can be formulated as minimization problems of specific objective functions. While there is a large variety of algorithms available to solve convex problems, we can argue that it becomes more and more important to focus on efficient, scalable methods that can deal with big data. When the objective function can be…
▽ More
Convex optimization is an essential tool for machine learning, as many of its problems can be formulated as minimization problems of specific objective functions. While there is a large variety of algorithms available to solve convex problems, we can argue that it becomes more and more important to focus on efficient, scalable methods that can deal with big data. When the objective function can be written as a sum of "simple" terms, proximal splitting methods are a good choice. UNLocBoX is a MATLAB library that implements many of these methods, designed to solve convex optimization problems of the form $\min_{x \in \mathbb{R}^N} \sum_{n=1}^K f_n(x).$ It contains the most recent solvers such as FISTA, Douglas-Rachford, SDMM as well a primal dual techniques such as Chambolle-Pock and forward-backward-forward. It also includes an extensive list of common proximal operators that can be combined, allowing for a quick implementation of a large variety of convex problems.
△ Less
Submitted 27 December, 2016; v1 submitted 4 February, 2014;
originally announced February 2014.
-
Learning parametric dictionaries for graph signals
Authors:
Dorina Thanou,
David I Shuman,
Pascal Frossard
Abstract:
In sparse signal representation, the choice of a dictionary often involves a tradeoff between two desirable properties -- the ability to adapt to specific signal data and a fast implementation of the dictionary. To sparsely represent signals residing on weighted graphs, an additional design challenge is to incorporate the intrinsic geometric structure of the irregular data domain into the atoms of…
▽ More
In sparse signal representation, the choice of a dictionary often involves a tradeoff between two desirable properties -- the ability to adapt to specific signal data and a fast implementation of the dictionary. To sparsely represent signals residing on weighted graphs, an additional design challenge is to incorporate the intrinsic geometric structure of the irregular data domain into the atoms of the dictionary. In this work, we propose a parametric dictionary learning algorithm to design data-adapted, structured dictionaries that sparsely represent graph signals. In particular, we model graph signals as combinations of overlapping local patterns. We impose the constraint that each dictionary is a concatenation of subdictionaries, with each subdictionary being a polynomial of the graph Laplacian matrix, representing a single pattern translated to different areas of the graph. The learning algorithm adapts the patterns to a training set of graph signals. Experimental results on both synthetic and real datasets demonstrate that the dictionaries learned by the proposed algorithm are competitive with and often better than unstructured dictionaries learned by state-of-the-art numerical learning algorithms in terms of sparse approximation of graph signals. In contrast to the unstructured dictionaries, however, the dictionaries learned by the proposed algorithm feature localized atoms and can be implemented in a computationally efficient manner in signal processing tasks such as compression, denoising, and classification.
△ Less
Submitted 5 January, 2014;
originally announced January 2014.