Skip to main content

Showing 1–37 of 37 results for author: De-Arteaga, M

.
  1. arXiv:2504.08954  [pdf, other

    cs.CY cs.HC

    Should you use LLMs to simulate opinions? Quality checks for early-stage deliberation

    Authors: Terrence Neumann, Maria De-Arteaga, Sina Fazelpour

    Abstract: The emergent capabilities of large language models (LLMs) have sparked interest in assessing their ability to simulate human opinions in a variety of contexts, potentially serving as surrogates for human subjects in opinion surveys. However, previous evaluations of this capability have depended heavily on costly, domain-specific human survey data, and mixed empirical results about LLM effectivenes… ▽ More

    Submitted 1 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

  2. arXiv:2504.04243  [pdf, other

    cs.LG cs.AI cs.HC stat.ME

    Perils of Label Indeterminacy: A Case Study on Prediction of Neurological Recovery After Cardiac Arrest

    Authors: Jakob Schoeffer, Maria De-Arteaga, Jonathan Elmer

    Abstract: The design of AI systems to assist human decision-making typically requires the availability of labels to train and evaluate supervised models. Frequently, however, these labels are unknown, and different ways of estimating them involve unverifiable assumptions or arbitrary choices. In this work, we introduce the concept of label indeterminacy and derive important implications in high-stakes AI-as… ▽ More

    Submitted 7 May, 2025; v1 submitted 5 April, 2025; originally announced April 2025.

    Comments: The 2025 ACM Conference on Fairness, Accountability, and Transparency (FAccT '25)

  3. arXiv:2503.00333  [pdf, other

    cs.CL cs.AI

    More of the Same: Persistent Representational Harms Under Increased Representation

    Authors: Jennifer Mickel, Maria De-Arteaga, Leqi Liu, Kevin Tian

    Abstract: To recognize and mitigate the harms of generative AI systems, it is crucial to consider who is represented in the outputs of generative AI systems and how people are represented. A critical gap emerges when naively improving who is represented, as this does not imply bias mitigation efforts have been applied to address how people are represented. We critically examined this by investigating gender… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: 26 pages, 7 figures, 6 tables, pre-print

  4. arXiv:2411.18122  [pdf, other

    cs.LG

    Using Machine Bias To Measure Human Bias

    Authors: Wanxue Dong, Maria De-Arteaga, Maytal Saar-Tsechansky

    Abstract: Biased human decisions have consequential impacts across various domains, yielding unfair treatment of individuals and resulting in suboptimal outcomes for organizations and society. In recognition of this fact, organizations regularly design and deploy interventions aimed at mitigating these biases. However, measuring human decision biases remains an important but elusive task. Organizations are… ▽ More

    Submitted 10 December, 2024; v1 submitted 27 November, 2024; originally announced November 2024.

  5. arXiv:2407.11933  [pdf, other

    cs.LG

    Fairly Accurate: Optimizing Accuracy Parity in Fair Target-Group Detection

    Authors: Soumyajit Gupta, Venelin Kovatchev, Maria De-Arteaga, Matthew Lease

    Abstract: In algorithmic toxicity detection pipelines, it is important to identify which demographic group(s) are the subject of a post, a task commonly known as \textit{target (group) detection}. While accurate detection is clearly important, we further advocate a fairness objective: to provide equal protection to all groups who may be targeted. To this end, we adopt \textit{Accuracy Parity} (AP) -- balanc… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  6. arXiv:2401.16558  [pdf, other

    cs.CY cs.CL

    Diverse, but Divisive: LLMs Can Exaggerate Gender Differences in Opinion Related to Harms of Misinformation

    Authors: Terrence Neumann, Sooyong Lee, Maria De-Arteaga, Sina Fazelpour, Matthew Lease

    Abstract: The pervasive spread of misinformation and disinformation poses a significant threat to society. Professional fact-checkers play a key role in addressing this threat, but the vast scale of the problem forces them to prioritize their limited resources. This prioritization may consider a range of factors, such as varying risks of harm posed to specific groups of people. In this work, we investigate… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Under Review

  7. A Critical Survey on Fairness Benefits of Explainable AI

    Authors: Luca Deck, Jakob Schoeffer, Maria De-Arteaga, Niklas Kühl

    Abstract: In this critical survey, we analyze typical claims on the relationship between explainable AI (XAI) and fairness to disentangle the multidimensional relationship between these two concepts. Based on a systematic literature review and a subsequent qualitative content analysis, we identify seven archetypal claims from 175 scientific articles on the alleged fairness benefits of XAI. We present crucia… ▽ More

    Submitted 7 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT '24)

  8. arXiv:2307.08945  [pdf, other

    cs.LG cs.CL

    Mitigating Label Bias via Decoupled Confident Learning

    Authors: Yunyi Li, Maria De-Arteaga, Maytal Saar-Tsechansky

    Abstract: Growing concerns regarding algorithmic fairness have led to a surge in methodologies to mitigate algorithmic bias. However, such methodologies largely assume that observed labels in training data are correct. This is problematic because bias in labels is pervasive across important domains, including healthcare, hiring, and content moderation. In particular, human-generated labels are prone to enco… ▽ More

    Submitted 29 September, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: AI & HCI Workshop at the 40th International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA. 2023

  9. Human-Centered Responsible Artificial Intelligence: Current & Future Trends

    Authors: Mohammad Tahaei, Marios Constantinides, Daniele Quercia, Sean Kennedy, Michael Muller, Simone Stumpf, Q. Vera Liao, Ricardo Baeza-Yates, Lora Aroyo, Jess Holbrook, Ewa Luger, Michael Madaio, Ilana Golbin Blumenfeld, Maria De-Arteaga, Jessica Vitak, Alexandra Olteanu

    Abstract: In recent years, the CHI community has seen significant growth in research on Human-Centered Responsible Artificial Intelligence. While different research communities may use different terminology to discuss similar topics, all of this work is ultimately aimed at developing AI that benefits humanity while being grounded in human rights and ethics, and reducing the potential harms of AI. In this sp… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: To appear in Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems

  10. arXiv:2302.07372  [pdf, other

    cs.LG stat.ML

    Same Same, But Different: Conditional Multi-Task Learning for Demographic-Specific Toxicity Detection

    Authors: Soumyajit Gupta, Sooyong Lee, Maria De-Arteaga, Matthew Lease

    Abstract: Algorithmic bias often arises as a result of differential subgroup validity, in which predictive relationships vary across groups. For example, in toxic language detection, comments targeting different demographic groups can vary markedly across groups. In such settings, trained models can be dominated by the relationships that best fit the majority group, leading to disparate performance. We prop… ▽ More

    Submitted 6 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the Web Conference, WWW 2023

  11. arXiv:2302.02944  [pdf, other

    cs.AI cs.HC

    Learning Complementary Policies for Human-AI Teams

    Authors: Ruijiang Gao, Maytal Saar-Tsechansky, Maria De-Arteaga, Ligong Han, Wei Sun, Min Kyung Lee, Matthew Lease

    Abstract: Human-AI complementarity is important when neither the algorithm nor the human yields dominant performance across all instances in a given context. Recent work that explored human-AI collaboration has considered decisions that correspond to classification tasks. However, in many important contexts where humans can benefit from AI complementarity, humans undertake course of action. In this paper, w… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Previous name: Robust Human-AI Collaboration with Bandit Feedback; Best student paper award at Conference on Information Systems and Technology (CIST), 2022

  12. Explanations, Fairness, and Appropriate Reliance in Human-AI Decision-Making

    Authors: Jakob Schoeffer, Maria De-Arteaga, Niklas Kuehl

    Abstract: In this work, we study the effects of feature-based explanations on distributive fairness of AI-assisted decisions, specifically focusing on the task of predicting occupations from short textual bios. We also investigate how any effects are mediated by humans' fairness perceptions and their reliance on AI recommendations. Our findings show that explanations influence fairness perceptions, which, i… ▽ More

    Submitted 18 March, 2024; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: ACM CHI Conference on Human Factors in Computing Systems (CHI '24)

  13. arXiv:2208.06648  [pdf, other

    cs.AI cs.LG

    Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness

    Authors: Vincent Jeanselme, Maria De-Arteaga, Zhe Zhang, Jessica Barrett, Brian Tom

    Abstract: Machine learning risks reinforcing biases present in data and, as we argue in this work, in what is absent from data. In healthcare, societal and decision biases shape patterns in missing data, yet the algorithmic fairness implications of group-specific missingness are poorly understood. The way we address missingness in healthcare can have detrimental impacts on downstream algorithmic fairness. O… ▽ More

    Submitted 17 March, 2025; v1 submitted 13 August, 2022; originally announced August 2022.

    Comments: Full Journal Version under review; Presented at the conference Machine Learning for Health (ML4H) 2022 Published in the Proceedings of Machine Learning Research (193)

  14. arXiv:2207.13834  [pdf, ps, other

    cs.HC cs.AI

    Toward Supporting Perceptual Complementarity in Human-AI Collaboration via Reflection on Unobservables

    Authors: Kenneth Holstein, Maria De-Arteaga, Lakshmi Tumati, Yanghuidi Cheng

    Abstract: In many real world contexts, successful human-AI collaboration requires humans to productively integrate complementary sources of information into AI-informed decisions. However, in practice human decision-makers often lack understanding of what information an AI model has access to in relation to themselves. There are few available guidelines regarding how to effectively communicate about unobser… ▽ More

    Submitted 26 January, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: CSCW 2023

  15. arXiv:2207.10991  [pdf, other

    cs.AI

    Algorithmic Fairness in Business Analytics: Directions for Research and Practice

    Authors: Maria De-Arteaga, Stefan Feuerriegel, Maytal Saar-Tsechansky

    Abstract: The extensive adoption of business analytics (BA) has brought financial gains and increased efficiencies. However, these advances have simultaneously drawn attention to rising legal and ethical challenges when BA inform decisions with fairness implications. As a response to these concerns, the emerging study of algorithmic fairness deals with algorithmic outputs that may result in disparate outcom… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  16. arXiv:2207.07723  [pdf, other

    cs.LG cs.CY

    More Data Can Lead Us Astray: Active Data Acquisition in the Presence of Label Bias

    Authors: Yunyi Li, Maria De-Arteaga, Maytal Saar-Tsechansky

    Abstract: An increased awareness concerning risks of algorithmic bias has driven a surge of efforts around bias mitigation strategies. A vast majority of the proposed approaches fall under one of two categories: (1) imposing algorithmic fairness constraints on predictive models, and (2) collecting additional training samples. Most recently and at the intersection of these two categories, methods that propos… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Report number: https://ojs.aaai.org/index.php/HCOMP/article/view/21994/21770

    Journal ref: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing 2022 Oct 14 (Vol. 10, pp. 133-146)

  17. arXiv:2205.00072  [pdf, other

    cs.LG cs.CY cs.HC

    Doubting AI Predictions: Influence-Driven Second Opinion Recommendation

    Authors: Maria De-Arteaga, Alexandra Chouldechova, Artur Dubrawski

    Abstract: Effective human-AI collaboration requires a system design that provides humans with meaningful ways to make sense of and critically evaluate algorithmic recommendations. In this paper, we propose a way to augment human-AI collaboration by building on a common organizational practice: identifying experts who are likely to provide complementary opinions. When machine learning algorithms are trained… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: ACM CHI 2022 Workshop on Trust and Reliance in AI-Human Teams (TRAIT)

  18. Justice in Misinformation Detection Systems: An Analysis of Algorithms, Stakeholders, and Potential Harms

    Authors: Terrence Neumann, Maria De-Arteaga, Sina Fazelpour

    Abstract: Faced with the scale and surge of misinformation on social media, many platforms and fact-checking organizations have turned to algorithms for automating key parts of misinformation detection pipelines. While offering a promising solution to the challenge of scale, the ethical and societal risks associated with algorithmic misinformation detection are not well-understood. In this paper, we employ… ▽ More

    Submitted 29 April, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at ACM Conference on Fairness, Accountability, and Transparenct (FAccT), 2022

  19. arXiv:2204.13156  [pdf, other

    cs.HC cs.AI

    On the Relationship Between Explanations, Fairness Perceptions, and Decisions

    Authors: Jakob Schoeffer, Maria De-Arteaga, Niklas Kuehl

    Abstract: It is known that recommendations of AI-based systems can be incorrect or unfair. Hence, it is often proposed that a human be the final decision-maker. Prior work has argued that explanations are an essential pathway to help human decision-makers enhance decision quality and mitigate bias, i.e., facilitate human-AI complementarity. For these benefits to materialize, explanations should enable human… ▽ More

    Submitted 6 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: ACM CHI 2022 Workshop on Human-Centered Explainable AI (HCXAI), May 12--13, 2022, New Orleans, LA, USA

  20. Finding Pareto Trade-offs in Fair and Accurate Detection of Toxic Speech

    Authors: Soumyajit Gupta, Venelin Kovatchev, Anubrata Das, Maria De-Arteaga, Matthew Lease

    Abstract: Optimizing NLP models for fairness poses many challenges. Lack of differentiable fairness measures prevents gradient-based loss training or requires surrogate losses that diverge from the true metric of interest. In addition, competing objectives (e.g., accuracy vs. fairness) often require making trade-offs based on stakeholder preferences, but stakeholders may not know their preferences before se… ▽ More

    Submitted 9 April, 2025; v1 submitted 15 April, 2022; originally announced April 2022.

    Journal ref: Published in Information Research, vol. 30, iConf, pp. 123--141, 2025

  21. Social Norm Bias: Residual Harms of Fairness-Aware Algorithms

    Authors: Myra Cheng, Maria De-Arteaga, Lester Mackey, Adam Tauman Kalai

    Abstract: Many modern machine learning algorithms mitigate bias by enforcing fairness constraints across coarsely-defined groups related to a sensitive attribute like gender or race. However, these algorithms seldom account for within-group heterogeneity and biases that may disproportionately affect some members of a group. In this work, we characterize Social Norm Bias (SNoB), a subtle but consequential ty… ▽ More

    Submitted 10 August, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: Spotlighted at the 2021 ICML Machine Learning for Data Workshop and presented at the 2021 ICML Socially Responsible Machine Learning Workshop

    Report number: Data Min Knowl Disc (2023)

  22. arXiv:2107.09163  [pdf, other

    cs.CY cs.HC

    Diversity in Sociotechnical Machine Learning Systems

    Authors: Sina Fazelpour, Maria De-Arteaga

    Abstract: There has been a surge of recent interest in sociocultural diversity in machine learning (ML) research, with researchers (i) examining the benefits of diversity as an organizational solution for alleviating problems with algorithmic bias, and (ii) proposing measures and methods for implementing diversity as a design desideratum in the construction of predictive algorithms. Currently, however, ther… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  23. arXiv:2105.10614  [pdf, other

    cs.HC

    Human-AI Collaboration with Bandit Feedback

    Authors: Ruijiang Gao, Maytal Saar-Tsechansky, Maria De-Arteaga, Ligong Han, Min Kyung Lee, Matthew Lease

    Abstract: Human-machine complementarity is important when neither the algorithm nor the human yield dominant performance across all instances in a given domain. Most research on algorithmic decision-making solely centers on the algorithm's performance, while recent work that explores human-machine collaboration has framed the decision-making problems as classification tasks. In this paper, we first propose… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: Accepted at IJCAI 2021

    Journal ref: In Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI), pages 1722--1728, 2021

  24. The effect of differential victim crime reporting on predictive policing systems

    Authors: Nil-Jana Akpinar, Maria De-Arteaga, Alexandra Chouldechova

    Abstract: Police departments around the world have been experimenting with forms of place-based data-driven proactive policing for over two decades. Modern incarnations of such systems are commonly known as hot spot predictive policing. These systems predict where future crime is likely to concentrate such that police can allocate patrols to these areas and deter crime before it occurs. Previous research on… ▽ More

    Submitted 4 February, 2021; v1 submitted 29 January, 2021; originally announced February 2021.

    Comments: Conference on Fairness, Accountability, and Transparency (FAccT 2021)

  25. arXiv:2101.09648  [pdf, other

    cs.LG cs.HC

    Leveraging Expert Consistency to Improve Algorithmic Decision Support

    Authors: Maria De-Arteaga, Vincent Jeanselme, Artur Dubrawski, Alexandra Chouldechova

    Abstract: Machine learning (ML) is increasingly being used to support high-stakes decisions. However, there is frequently a construct gap: a gap between the construct of interest to the decision-making task and what is captured in proxies used as labels to train ML models. As a result, ML models may fail to capture important dimensions of decision criteria, hampering their utility for decision support. Thus… ▽ More

    Submitted 3 June, 2024; v1 submitted 24 January, 2021; originally announced January 2021.

    Comments: Best Paper Runner-Up Award, Workshop on Information Technologies and Systems (WITS), 2021

  26. A Case for Humans-in-the-Loop: Decisions in the Presence of Erroneous Algorithmic Scores

    Authors: Maria De-Arteaga, Riccardo Fogliato, Alexandra Chouldechova

    Abstract: The increased use of algorithmic predictions in sensitive domains has been accompanied by both enthusiasm and concern. To understand the opportunities and risks of these technologies, it is key to study how experts alter their decisions when using such tools. In this paper, we study the adoption of an algorithmic tool used to assist child maltreatment hotline screening decisions. We focus on the q… ▽ More

    Submitted 20 February, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted at ACM Conference on Human Factors in Computing Systems (ACM CHI), 2020

  27. arXiv:2001.00249   

    cs.CY

    Proceedings of NeurIPS 2019 Workshop on Machine Learning for the Developing World: Challenges and Risks of ML4D

    Authors: Maria De-Arteaga, Tejumade Afonja, Amanda Coston

    Abstract: This is the proceedings of the 3rd ML4D workshop which was help in Vancouver, Canada on December 13, 2019 as part of the Neural Information Processing Systems conference.

    Submitted 10 April, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

  28. arXiv:1906.08206  [pdf, other

    stat.AP cs.CY

    Killings of social leaders in the Colombian post-conflict: Data analysis for investigative journalism

    Authors: Maria De-Arteaga, Benedikt Boecking

    Abstract: After the peace agreement of 2016 with FARC, the killings of social leaders have emerged as an important post-conflict challenge for Colombia. We present a data analysis based on official records obtained from the Colombian General Attorney's Office spanning the time period from 2012 to 2017. The results of the analysis show a drastic increase in the officially recorded number of killings of democ… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  29. arXiv:1904.05233  [pdf, other

    cs.LG cs.CL stat.ML

    What's in a Name? Reducing Bias in Bios without Access to Protected Attributes

    Authors: Alexey Romanov, Maria De-Arteaga, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Anna Rumshisky, Adam Tauman Kalai

    Abstract: There is a growing body of work that proposes methods for mitigating bias in machine learning systems. These methods typically rely on access to protected attributes such as race, gender, or age. However, this raises two significant challenges: (1) protected attributes may not be available or it may not be legal to use them, and (2) it is often desirable to simultaneously consider multiple protect… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL 2019; Best Thematic Paper

  30. arXiv:1901.09451  [pdf, other

    cs.IR cs.LG stat.ML

    Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting

    Authors: Maria De-Arteaga, Alexey Romanov, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Adam Tauman Kalai

    Abstract: We present a large-scale study of gender bias in occupation classification, a task where the use of machine learning may lead to negative outcomes on peoples' lives. We analyze the potential allocation harms that can result from semantic representation bias. To do so, we study the impact on occupation classification of including explicit gender indicators---such as first names and pronouns---in di… ▽ More

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: Accepted at ACM Conference on Fairness, Accountability, and Transparency (ACM FAT*), 2019

  31. arXiv:1812.10398   

    cs.CY cs.AI cs.LG stat.ML

    Proceedings of NeurIPS 2018 Workshop on Machine Learning for the Developing World: Achieving Sustainable Impact

    Authors: Maria De-Arteaga, Amanda Coston, William Herlands

    Abstract: This is the Proceedings of NeurIPS 2018 Workshop on Machine Learning for the Developing World: Achieving Sustainable Impact, held in Montreal, Canada on December 8, 2018

    Submitted 18 February, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: 18 papers in the proceedings. 10 additional papers were presented at the workshop but not included in the proceedings

  32. arXiv:1812.08769  [pdf, other

    cs.CL cs.LG

    What are the biases in my word embedding?

    Authors: Nathaniel Swinger, Maria De-Arteaga, Neil Thomas Heffernan IV, Mark DM Leiserson, Adam Tauman Kalai

    Abstract: This paper presents an algorithm for enumerating biases in word embeddings. The algorithm exposes a large number of offensive associations related to sensitive features such as race and gender on publicly available embeddings, including a supposedly "debiased" embedding. These biases are concerning in light of the widespread use of word embeddings. The associations are identified by geometric patt… ▽ More

    Submitted 19 June, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: At AIES 2019: the AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society

  33. arXiv:1807.00905  [pdf, other

    cs.LG stat.ML

    Learning under selective labels in the presence of expert consistency

    Authors: Maria De-Arteaga, Artur Dubrawski, Alexandra Chouldechova

    Abstract: We explore the problem of learning under selective labels in the context of algorithm-assisted decision making. Selective labels is a pervasive selection bias problem that arises when historical decision making blinds us to the true outcome for certain instances. Examples of this are common in many applications, ranging from predicting recidivism using pre-trial release data to diagnosing patients… ▽ More

    Submitted 4 July, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: Presented at the 2018 Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2018)

  34. arXiv:1711.09522   

    stat.ML

    Proceedings of NIPS 2017 Workshop on Machine Learning for the Developing World

    Authors: Maria De-Arteaga, William Herlands

    Abstract: This is the Proceedings of NIPS 2017 Workshop on Machine Learning for the Developing World, held in Long Beach, California, USA on December 8, 2017

    Submitted 12 December, 2017; v1 submitted 26 November, 2017; originally announced November 2017.

    Comments: 15 papers

  35. Discovery of Complex Anomalous Patterns of Sexual Violence in El Salvador

    Authors: Maria De-Arteaga, Artur Dubrawski

    Abstract: When sexual violence is a product of organized crime or social imaginary, the links between sexual violence episodes can be understood as a latent structure. With this assumption in place, we can use data science to uncover complex patterns. In this paper we focus on the use of data mining techniques to unveil complex anomalous spatiotemporal patterns of sexual violence. We illustrate their use by… ▽ More

    Submitted 17 November, 2017; originally announced November 2017.

    Comments: Conference paper at Data for Policy 2016 - Frontiers of Data Science for Government: Ideas, Practices and Projections (Data for Policy)

  36. arXiv:1511.06419  [pdf, other

    stat.ML cs.LG

    Canonical Autocorrelation Analysis

    Authors: Maria De-Arteaga, Artur Dubrawski, Peter Huggins

    Abstract: We present an extension of sparse Canonical Correlation Analysis (CCA) designed for finding multiple-to-multiple linear correlations within a single set of variables. Unlike CCA, which finds correlations between two sets of data where the rows are matched exactly but the columns represent separate sets of variables, the method proposed here, Canonical Autocorrelation Analysis (CAA), finds multivar… ▽ More

    Submitted 19 November, 2015; originally announced November 2015.

    Comments: 6 pages, 5 figures

  37. arXiv:1511.04402  [pdf, other

    stat.ML

    Lass-0: sparse non-convex regression by local search

    Authors: William Herlands, Maria De-Arteaga, Daniel Neill, Artur Dubrawski

    Abstract: We compute approximate solutions to L0 regularized linear regression using L1 regularization, also known as the Lasso, as an initialization step. Our algorithm, the Lass-0 ("Lass-zero"), uses a computationally efficient stepwise search to determine a locally optimal L0 solution given any L1 regularization solution. We present theoretical results of consistency under orthogonality and appropriate h… ▽ More

    Submitted 17 February, 2016; v1 submitted 13 November, 2015; originally announced November 2015.

    Comments: 8 pages, 1 figure. NIPS 2015 Workshop of Optimization (OPT2015)