Showing 1–38 of 38 results for author: Kıcıman, E

  1. arXiv:2506.13351  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks

    Authors: Yifei Xu, Tusher Chakraborty, Srinagesh Sharma, Leonardo Nunes, Emre Kıcıman, Songwu Lu, Ranveer Chandra

    Abstract: Recent advances in Large Language Models (LLMs) have showcased impressive reasoning abilities in structured tasks like mathematics and programming, largely driven by Reinforcement Learning with Verifiable Rewards (RLVR), which uses outcome-based signals that are scalable, effective, and robust against reward hacking. However, applying similar techniques to open-ended long-form reasoning tasks rema…

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2504.14150  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations

    Authors: Katie Matton, Robert Osazuwa Ness, John Guttag, Emre Kıcıman

    Abstract: Large language models (LLMs) are capable of generating plausible explanations of how they arrived at an answer to a question. However, these explanations can misrepresent the model's "reasoning" process, i.e., they can be unfaithful. This, in turn, can lead to over-trust and misuse. We introduce a new approach for measuring the faithfulness of LLM explanations. First, we provide a rigorous definit…

    Submitted 20 May, 2025; v1 submitted 18 April, 2025; originally announced April 2025.

    Comments: 66 pages, 14 figures, 40 tables; ICLR 2025 (spotlight) camera ready

  3. arXiv:2502.13417  [pdf, other]

    cs.CL cs.AI cs.LG

    RLTHF: Targeted Human Feedback for LLM Alignment

    Authors: Yifei Xu, Tusher Chakraborty, Emre Kıcıman, Bibek Aryal, Eduardo Rodrigues, Srinagesh Sharma, Roberto Estevao, Maria Angels de Luis Balaguer, Jessica Wolk, Rafael Padilha, Leonardo Nunes, Shobana Balakrishnan, Songwu Lu, Ranveer Chandra

    Abstract: Fine-tuning large language models (LLMs) to align with user preferences is challenging due to the high cost of quality human annotations in Reinforcement Learning from Human Feedback (RLHF) and the generalizability limitations of AI Feedback. To address these challenges, we propose RLTHF, a human-AI hybrid framework that combines LLM-based initial alignment with selective human annotations to achi…

    Submitted 20 February, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  4. arXiv:2411.16872  [pdf, other]

    cs.IR cs.AI cs.ET

    Enabling Adoption of Regenerative Agriculture through Soil Carbon Copilots

    Authors: Margaret Capetz, Swati Sharma, Rafael Padilha, Peder Olsen, Jessica Wolk, Emre Kiciman, Ranveer Chandra

    Abstract: Mitigating climate change requires transforming agriculture to minimize environmental impact and build climate resilience. Regenerative agricultural practices enhance soil organic carbon (SOC) levels, thus improving soil health and sequestering carbon. A challenge to increasing regenerative agriculture practices is cheaply measuring SOC over time and understanding how SOC is affected by regenerat…

    Submitted 27 November, 2024; v1 submitted 25 November, 2024; originally announced November 2024.

  5. arXiv:2407.19118  [pdf, other]

    cs.AI

    Large Language Models as Co-Pilots for Causal Inference in Medical Studies

    Authors: Ahmed Alaa, Rachael V. Phillips, Emre Kıcıman, Laura B. Balzer, Mark van der Laan, Maya Petersen

    Abstract: The validity of medical studies based on real-world clinical data, such as observational studies, depends on critical assumptions necessary for drawing causal conclusions about medical interventions. Many published studies are flawed because they violate these assumptions and entail biases such as residual confounding, selection bias, and misalignment between treatment and measurement times. Altho…

    Submitted 26 July, 2024; originally announced July 2024.

  6. arXiv:2403.14720  [pdf, other]

    cs.CR cs.CL cs.LG

    Defending Against Indirect Prompt Injection Attacks With Spotlighting

    Authors: Keegan Hines, Gary Lopez, Matthew Hall, Federico Zarfati, Yonatan Zunger, Emre Kiciman

    Abstract: Large Language Models (LLMs), while powerful, are built and trained to process a single text input. In common applications, multiple inputs can be processed by concatenating them together into a single stream of text. However, the LLM is unable to distinguish which sections of prompt belong to various input sources. Indirect prompt injection attacks take advantage of this vulnerability by embeddin…

    Submitted 20 March, 2024; originally announced March 2024.

  7. arXiv:2401.07175  [pdf, other]

    cs.LG

    Domain Adaptation for Sustainable Soil Management using Causal and Contrastive Constraint Minimization

    Authors: Somya Sharma, Swati Sharma, Rafael Padilha, Emre Kiciman, Ranveer Chandra

    Abstract: Monitoring organic matter is pivotal for maintaining soil health and can help inform sustainable soil management practices. While sensor-based soil information offers higher-fidelity and reliable insights into organic matter changes, sampling and measuring sensor data is cost-prohibitive. We propose a multi-modal, scalable framework that can estimate organic matter from remote sensing data, a more…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: NeurIPS workshop on Tackling Climate Change 2023

  8. Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

    Authors: Jingwei Yi, Yueqi Xie, Bin Zhu, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu

    Abstract: The integration of large language models with external content has enabled applications such as Microsoft Copilot but also introduced vulnerabilities to indirect prompt injection attacks. In these attacks, malicious instructions embedded within external content can manipulate LLM outputs, causing deviations from user expectations. To address this critical yet under-explored issue, we introduce the…

    Submitted 27 January, 2025; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by KDD 2025

  9. arXiv:2312.06820  [pdf, other]

    cs.AI cs.CL cs.LG stat.ME

    Extracting Self-Consistent Causal Insights from Users Feedback with LLMs and In-context Learning

    Authors: Sara Abdali, Anjali Parikh, Steve Lim, Emre Kiciman

    Abstract: Microsoft Windows Feedback Hub is designed to receive customer feedback on a wide variety of subjects including critical topics such as power and battery. Feedback is one of the most effective ways to have a grasp of users' experience with Windows and its ecosystem. However, the sheer volume of feedback received by Feedback Hub makes it immensely challenging to diagnose the actual cause of reporte…

    Submitted 11 December, 2023; originally announced December 2023.

  10. arXiv:2312.02073  [pdf, other]

    cs.CL cs.AI cs.LG

    A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia

    Authors: Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kıcıman, Hamid Palangi, Barun Patra, Robert West

    Abstract: Large language models (LLMs) have an impressive ability to draw on novel information supplied in their context. Yet the mechanisms underlying this contextual grounding remain unknown, especially in situations where contextual information contradicts factual knowledge stored in the parameters, which LLMs also excel at recalling. Favoring the contextual information is critical for retrieval-augmente…

    Submitted 10 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted at ACL 2024 (main conference)

  11. arXiv:2311.01301  [pdf, other]

    cs.LG cs.AI stat.ME

    TRIALSCOPE: A Unifying Causal Framework for Scaling Real-World Evidence Generation with Biomedical Language Models

    Authors: Javier González, Cliff Wong, Zelalem Gero, Jass Bagga, Risa Ueno, Isabel Chien, Eduard Oravkin, Emre Kiciman, Aditya Nori, Roshanthi Weerasinghe, Rom S. Leidner, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: The rapid digitization of real-world data offers an unprecedented opportunity for optimizing healthcare delivery and accelerating biomedical discovery. In practice, however, such data is most abundantly available in unstructured forms, such as clinical notes in electronic medical records (EMRs), and it is generally plagued by confounders. In this paper, we present TRIALSCOPE, a unifying framework…

    Submitted 6 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 Figures, 22 Pages, 3 Tables

  12. arXiv:2308.16095  [pdf, other]

    cs.CY cs.SI

    Food Choice Mimicry on a Large University Campus

    Authors: Kristina Gligoric, Arnaud Chiolero, Emre Kıcıman, Ryen W. White, Eric Horvitz, Robert West

    Abstract: Social influence is a strong determinant of food consumption, which in turn influences health. Although consistent observations have been made on the role of social factors in driving similarities in food consumption, much less is known about the precise governing mechanisms. We study social influence on food choice through carefully designed causal analyses, leveraging the sequential nature of sh…

    Submitted 30 August, 2023; originally announced August 2023.

  13. arXiv:2306.09302  [pdf, other]

    cs.LG cs.AI

    Knowledge Guided Representation Learning and Causal Structure Learning in Soil Science

    Authors: Somya Sharma, Swati Sharma, Licheng Liu, Rishabh Tushir, Andy Neal, Robert Ness, John Crawford, Emre Kiciman, Ranveer Chandra

    Abstract: An improved understanding of soil can enable more sustainable land-use practices. Nevertheless, soil is called a complex, living medium due to the complex interaction of different soil processes that limit our understanding of soil. Process-based models and analyzing observed data provide two avenues for improving our understanding of soil processes. Collecting observed data is cost-prohibitive bu…

    Submitted 15 June, 2023; originally announced June 2023.

  14. arXiv:2305.00050  [pdf, other]

    cs.AI cs.CL cs.CY cs.HC cs.LG stat.ME

    Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

    Authors: Emre Kıcıman, Robert Ness, Amit Sharma, Chenhao Tan

    Abstract: The causal capabilities of large language models (LLMs) are a matter of significant debate, with critical implications for the use of LLMs in societally impactful domains such as medicine, science, law, and policy. We conduct a "behavioral" study of LLMs to benchmark their capability in generating causal arguments. Across a wide range of tasks, we find that LLMs can generate text corresponding to…

    Submitted 20 August, 2024; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: Added three novel datasets. To be published in TMLR. Authors listed alphabetically

  15. arXiv:2301.09031  [pdf, ps, other]

    stat.ML cs.LG

    Counterfactual (Non-)identifiability of Learned Structural Causal Models

    Authors: Arash Nasr-Esfahany, Emre Kiciman

    Abstract: Recent advances in probabilistic generative modeling have motivated learning Structural Causal Models (SCM) from observational datasets using deep conditional generative models, also known as Deep Structural Causal Models (DSCM). If successful, DSCMs can be utilized for causal estimation tasks, e.g., for answering counterfactual queries. In this work, we warn practitioners about non-identifiabilit…

    Submitted 21 January, 2023; originally announced January 2023.

  16. arXiv:2211.05675  [pdf, other]

    cs.LG cs.CY

    Causal Modeling of Soil Processes for Improved Generalization

    Authors: Somya Sharma, Swati Sharma, Andy Neal, Sara Malvar, Eduardo Rodrigues, John Crawford, Emre Kiciman, Ranveer Chandra

    Abstract: Measuring and monitoring soil organic carbon is critical for agricultural productivity and for addressing critical environmental problems. Soil organic carbon not only enriches nutrition in soil, but also has a gamut of co-benefits such as improving water storage and limiting physical erosion. Despite a litany of work in soil organic carbon estimation, current approaches do not generalize well acr…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 Workshop Tackling Climate Change with Machine Learning

  17. arXiv:2210.10636  [pdf, other]

    cs.IR cs.LG

    Using Interventions to Improve Out-of-Distribution Generalization of Text-Matching Recommendation Systems

    Authors: Parikshit Bansal, Yashoteja Prabhu, Emre Kiciman, Amit Sharma

    Abstract: Given a user's input text, text-matching recommender systems output relevant items by comparing the input text to available items' description, such as product-to-product recommendation on e-commerce platforms. As users' interests and item inventory are expected to change, it is important for a text-matching system to generalize to data shifts, a task known as out-of-distribution (OOD) generalizat…

    Submitted 14 June, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 CML4Impact Workshop, NeurIPS 2022 DistShift Workshop

  18. arXiv:2210.07228  [pdf, other]

    cs.CL cs.LG

    Language Model Decoding as Likelihood-Utility Alignment

    Authors: Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kıcıman, Boi Faltings, Robert West

    Abstract: A critical component of a successful language generation pipeline is the decoding algorithm. However, the general principles that should guide the choice of a decoding algorithm remain unclear. Previous works only compare decoding algorithms in narrow scenarios, and their findings do not generalize across tasks. We argue that the misalignment between the model's likelihood and the task-specific no…

    Submitted 16 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at EACL (Findings) 2023

  19. arXiv:2206.07837  [pdf, other]

    cs.LG cs.AI

    Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization

    Authors: Jivat Neet Kaur, Emre Kiciman, Amit Sharma

    Abstract: Recent empirical studies on domain generalization (DG) have shown that DG algorithms that perform well on some distribution shifts fail on others, and no state-of-the-art DG algorithm performs consistently well on all shifts. Moreover, real-world data often has multiple distribution shifts over different attributes; hence we introduce multi-attribute distribution shift datasets and find that the a…

    Submitted 17 May, 2024; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Published at ICLR 2023

  20. arXiv:2202.11812  [pdf, other]

    cs.HC cs.AI

    Investigations of Performance and Bias in Human-AI Teamwork in Hiring

    Authors: Andi Peng, Besmira Nushi, Emre Kiciman, Kori Inkpen, Ece Kamar

    Abstract: In AI-assisted decision-making, effective hybrid (human-AI) teamwork is not solely dependent on AI performance alone, but also on its impact on human decision-making. While prior work studies the effects of model accuracy on humans, we endeavour here to investigate the complex dynamics of how both a model's predictive performance and bias may transfer to humans in a recommendation-aided decision t…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted at AAAI 2022

  21. arXiv:2202.02195  [pdf, other]

    stat.ML cs.LG

    Deep End-to-end Causal Inference

    Authors: Tomas Geffner, Javier Antoran, Adam Foster, Wenbo Gong, Chao Ma, Emre Kiciman, Amit Sharma, Angus Lamb, Martin Kukla, Nick Pawlowski, Miltiadis Allamanis, Cheng Zhang

    Abstract: Causal inference is essential for data-driven decision making across domains such as business engagement, medical treatment and policy making. However, research on causal discovery has evolved separately from inference methods, preventing straight-forward combination of methods from both fields. In this work, we develop Deep End-to-end Causal Inference (DECI), a single flow-based non-linear additi…

    Submitted 20 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

  22. arXiv:2110.08413  [pdf, other]

    cs.CL cs.LG

    Invariant Language Modeling

    Authors: Maxime Peyrard, Sarvjeet Singh Ghotra, Martin Josifoski, Vidhan Agarwal, Barun Patra, Dean Carignan, Emre Kiciman, Robert West

    Abstract: Large pretrained language models are critical components of modern NLP pipelines. Yet, they suffer from spurious correlations, poor out-of-domain generalization, and biases. Inspired by recent progress in causal machine learning, in particular the invariant risk minimization (IRM) paradigm, we propose invariant language modeling, a framework for learning invariant representations that generalize b…

    Submitted 14 November, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at EMNLP 2022

  23. Population-scale dietary interests during the COVID-19 pandemic

    Authors: Kristina Gligoric, Arnaud Chiolero, Emre Kıcıman, Ryen W. White, Robert West

    Abstract: The SARS-CoV-2 virus has altered people's lives around the world. Here we document population-wide shifts in dietary interests in 18 countries in 2020, as revealed through time series of Google search volumes. We find that during the first wave of the COVID-19 pandemic there was an overall surge in food interest, larger and longer-lasting than the surge during typical end-of-year holidays in Weste…

    Submitted 25 February, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Nature Communications (2022)

  24. arXiv:2108.13518  [pdf, other]

    cs.LG cs.AI

    DoWhy: Addressing Challenges in Expressing and Validating Causal Assumptions

    Authors: Amit Sharma, Vasilis Syrgkanis, Cheng Zhang, Emre Kıcıman

    Abstract: Estimation of causal effects involves crucial assumptions about the data-generating process, such as directionality of effect, presence of instrumental variables or mediators, and whether all relevant confounders are observed. Violation of any of these assumptions leads to significant error in the effect estimate. However, unlike cross-validation for predictive models, there is no global validator…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Presented at ICML 2021 Workshop on the Neglected Assumptions in Causal Inference (NACI)

  25. Formation of Social Ties Influences Food Choice: A Campus-Wide Longitudinal Study

    Authors: Kristina Gligorić, Ryen W. White, Emre Kıcıman, Eric Horvitz, Arnaud Chiolero, Robert West

    Abstract: Nutrition is a key determinant of long-term health, and social influence has long been theorized to be a key determinant of nutrition. It has been difficult to quantify the postulated role of social influence on nutrition using traditional methods such as surveys, due to the typically small scale and short duration of studies. To overcome these limitations, we leverage a novel source of data: logs…

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Proc. ACM Hum.-Comput. Interact. 5, CSCW1, Article 184 (April 2021)

  26. arXiv:2101.07732  [pdf, other]

    cs.LG cs.AI

    Out-of-distribution Prediction with Invariant Risk Minimization: The Limitation and An Effective Fix

    Authors: Ruocheng Guo, Pengchuan Zhang, Hao Liu, Emre Kiciman

    Abstract: This work considers the out-of-distribution (OOD) prediction problem where (1) the training data are from multiple domains and (2) the test domain is unseen in the training. DNNs fail in OOD prediction because they are prone to pick up spurious correlations. Recently, Invariant Risk Minimization (IRM) is proposed to address this issue. Its effectiveness has been demonstrated in the colored MNIST e…

    Submitted 22 February, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: 22 pages

  27. arXiv:2011.05877  [pdf, other]

    stat.ME cs.LG

    Split-Treatment Analysis to Rank Heterogeneous Causal Effects for Prospective Interventions

    Authors: Yanbo Xu, Divyat Mahajan, Liz Manrao, Amit Sharma, Emre Kiciman

    Abstract: For many kinds of interventions, such as a new advertisement, marketing intervention, or feature recommendation, it is important to target a specific subset of people for maximizing its benefits at minimum cost or potential harm. However, a key challenge is that no data is available about the effect of such a prospective intervention since it has not been deployed yet. In this work, we propose a s…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: To be published in WSDM

  28. arXiv:2011.04216  [pdf, other]

    stat.ME cs.AI cs.MS econ.EM

    DoWhy: An End-to-End Library for Causal Inference

    Authors: Amit Sharma, Emre Kiciman

    Abstract: In addition to efficient statistical estimators of a treatment's effect, successful application of causal inference requires specifying assumptions about the mechanisms underlying observed data and testing whether they are valid, and to what extent. However, most libraries for causal inference focus only on the task of providing powerful statistical estimators. We describe DoWhy, an open-source Py…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 5 pages

  29. arXiv:2010.08710  [pdf, other]

    cs.LG stat.ML

    Causal Transfer Random Forest: Combining Logged Data and Randomized Experiments for Robust Prediction

    Authors: Shuxi Zeng, Murat Ali Bayir, Joesph J. Pfeiffer III, Denis Charles, Emre Kiciman

    Abstract: It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish the stable causal relationships from the unstable spurious correlations across shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized expe…

    Submitted 14 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 9 pages, 7 figures, 2 tables, accepted to WSDM 2021

  30. Causal Inference in the Presence of Interference in Sponsored Search Advertising

    Authors: Razieh Nabi, Joel Pfeiffer, Murat Ali Bayir, Denis Charles, Emre Kıcıman

    Abstract: In classical causal inference, inferring cause-effect relations from data relies on the assumption that units are independent and identically distributed. This assumption is violated in settings where units are related through a network of dependencies. An example of such a setting is ad placement in sponsored search advertising, where the clickability of a particular ad is potentially influenced…

    Submitted 14 October, 2020; originally announced October 2020.

    Journal ref: Special issue on Causal Inference and Machine Learning with Network Data, Frontiers in Big Data, 2022

  31. arXiv:2007.07205  [pdf, ps, other]

    cs.CR cs.LG stat.ML

    Security and Machine Learning in the Real World

    Authors: Ivan Evtimov, Weidong Cui, Ece Kamar, Emre Kiciman, Tadayoshi Kohno, Jerry Li

    Abstract: Machine learning (ML) models deployed in many safety- and business-critical systems are vulnerable to exploitation through adversarial examples. A large body of academic research has thoroughly explored the causes of these blind spots, developed sophisticated algorithms for finding them, and proposed a few promising defenses. A vast majority of these works, however, study standalone neural network…

    Submitted 13 July, 2020; originally announced July 2020.

  32. arXiv:2006.14796  [pdf, other]

    cs.AI cs.LG cs.RO

    AvE: Assistance via Empowerment

    Authors: Yuqing Du, Stas Tiomkin, Emre Kiciman, Daniel Polani, Pieter Abbeel, Anca Dragan

    Abstract: One difficulty in using artificial agents for human-assistive applications lies in the challenge of accurately assisting with a person's goal(s). Existing methods tend to rely on inferring the human's goal, which is challenging when there are many potential goals or when the set of candidate goals is difficult to identify. We propose a new paradigm for assistance by instead increasing the human's…

    Submitted 7 January, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: Final version from NeurIPS 2020 Conference Proceedings

  33. arXiv:2005.11406  [pdf, other]

    cs.CV

    Novel Human-Object Interaction Detection via Adversarial Domain Generalization

    Authors: Yuhang Song, Wenbo Li, Lei Zhang, Jianwei Yang, Emre Kiciman, Hamid Palangi, Jianfeng Gao, C. -C. Jay Kuo, Pengchuan Zhang

    Abstract: We study in this paper the problem of novel human-object interaction (HOI) detection, aiming at improving the generalization ability of the model to unseen scenarios. The challenge mainly stems from the large compositional space of objects and predicates, which leads to the lack of sufficient training data for all the object-predicate combinations. As a result, most existing HOI methods heavily re…

    Submitted 22 May, 2020; originally announced May 2020.

  34. arXiv:1909.03567  [pdf, other]

    cs.HC cs.AI cs.CY

    What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring

    Authors: Andi Peng, Besmira Nushi, Emre Kiciman, Kori Inkpen, Siddharth Suri, Ece Kamar

    Abstract: Although systematic biases in decision-making are widely documented, the ways in which they emerge from different sources is less understood. We present a controlled experimental platform to study gender bias in hiring by decoupling the effect of world distribution (the gender breakdown of candidates in a specific profession) from bias in human decision-making. We explore the effectiveness of \tex…

    Submitted 8 September, 2019; originally announced September 2019.

    Comments: This paper has been accepted for publication at HCOMP 2019

  35. arXiv:1710.08880  [pdf, other]

    cs.CY

    Wildbook: Crowdsourcing, computer vision, and data science for conservation

    Authors: Tanya Y. Berger-Wolf, Daniel I. Rubenstein, Charles V. Stewart, Jason A. Holmberg, Jason Parham, Sreejith Menon, Jonathan Crall, Jon Van Oast, Emre Kiciman, Lucas Joppa

    Abstract: Photographs, taken by field scientists, tourists, automated cameras, and incidental photographers, are the most abundant source of data on wildlife today. Wildbook is an autonomous computational system that starts from massive collections of images and, by detecting various species of animals and identifying individuals, combined with sophisticated data management, turns them into high resolution…

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: Presented at the Data For Good Exchange 2017

  36. Smart Societies: From Citizens as Sensors to Collective Action

    Authors: Andrés Monroy-Hernández, Shelly Farnham, Emre Kıcıman, Scott Counts, Munmun De Choudhury

    Abstract: Social media has become globally ubiquitous, transforming how people are networked and mobilized. This forum explores research and applications of these new networked publics at individual, organizational, and societal levels.

    Submitted 28 May, 2016; originally announced May 2016.

    Journal ref: interactions 20, 4 (July 2013)

  37. arXiv:1507.01291  [pdf]

    cs.CY cs.HC cs.SI

    The New War Correspondents: the Rise of Civic Media Curation in Urban Warfare

    Authors: Andrés Monroy-Hernández, danah boyd, Emre Kiciman, Munmun De Choudhury, Scott Counts

    Abstract: In this paper we examine the information sharing practices of people living in cities amid armed conflict. We describe the volume and frequency of microblogging activity on Twitter from four cities afflicted by the Mexican Drug War, showing how citizens use social media to alert one another and to comment on the violence that plagues their communities. We then investigate the emergence of civic me…

    Submitted 5 July, 2015; originally announced July 2015.

    Comments: In Proceedings of the 2013 conference on Computer supported cooperative work (CSCW 2013). ACM, New York, NY, USA, 1443-1452

  38. arXiv:1507.01290  [pdf]

    cs.CY cs.SI

    Narcotweets: Social Media in Wartime

    Authors: Andrés Monroy-Hernández, Emre Kiciman, Danah Boyd, Scott Counts

    Abstract: This paper describes how people living in armed conflict environments use social media as a participatory news platform, in lieu of damaged state and media apparatuses. We investigate this by analyzing the microblogging practices of Mexican citizens whose everyday life is affected by the Drug War. We provide a descriptive analysis of the phenomenon, combining content and quantitative Twitter data…

    Submitted 5 July, 2015; originally announced July 2015.

    Comments: In Proceedings of the 2012 International AAAI Conference on Weblogs and Social Media