Skip to main content

Showing 1–10 of 10 results for author: Drosos, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23253  [pdf, ps, other

    cs.HC

    Vibe coding: programming through conversation with artificial intelligence

    Authors: Advait Sarkar, Ian Drosos

    Abstract: We examine "vibe coding": an emergent programming paradigm where developers primarily write code by interacting with code-generating large language models rather than writing code directly. We analysed a curated set of videos depicting extended vibe coding sessions with rich think-aloud reflections. Using framework analysis, we investigated programmers' goals, workflows, prompting techniques, debu… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  2. arXiv:2501.17247  [pdf, other

    cs.HC

    "It makes you think": Provocations Help Restore Critical Thinking to AI-Assisted Knowledge Work

    Authors: Ian Drosos, Advait Sarkar, Xiaotong, Xu, Neil Toronto

    Abstract: Recent research suggests that the use of Generative AI tools may result in diminished critical thinking during knowledge work. We study the effect on knowledge work of provocations: brief textual prompts that offer critiques for and propose alternatives to AI suggestions. We conduct a between-subjects study (n=24) in which participants completed AI-assisted shortlisting tasks with and without prov… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  3. arXiv:2412.15030  [pdf, other

    cs.HC

    When Copilot Becomes Autopilot: Generative AI's Critical Risk to Knowledge Work and a Critical Solution

    Authors: Advait Sarkar, Xiaotong, Xu, Neil Toronto, Ian Drosos, Christian Poelitz

    Abstract: Generative AI, with its tendency to "hallucinate" incorrect results, may pose a risk to knowledge work by introducing errors. On the other hand, it may also provide unprecedented opportunities for users, particularly non-experts, to learn and apply advanced software features and greatly increase the scope and complexity of tasks they can successfully achieve. As an example of a complex knowledge… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Journal ref: Proceedings of the EuSpRIG 2024 Conference "Spreadsheet Productivity & Risks" ISBN : 978-1-905404-59-9

  4. arXiv:2412.02357  [pdf, other

    cs.HC cs.AI

    Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks

    Authors: Ian Drosos, Jack Williams, Advait Sarkar, Nicholas Wilson

    Abstract: Effective prompting of generative AI is challenging for many users, particularly in expressing context for comprehension tasks such as explaining spreadsheet formulas, Python code, and text passages. Prompt middleware aims to address this barrier by assisting in prompt construction, but barriers remain for users in expressing adequate control so that they can receive AI-responses that match their… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  5. arXiv:2408.08781  [pdf, other

    cs.AI cs.CL

    Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

    Authors: Bhuvanashree Murugadoss, Christian Poelitz, Ian Drosos, Vu Le, Nick McKenna, Carina Suzana Negreanu, Chris Parnin, Advait Sarkar

    Abstract: LLMs-as-a-judge is a recently popularized method which replaces human judgements in task evaluation (Zheng et al. 2024) with automatic evaluation using LLMs. Due to widespread use of RLHF (Reinforcement Learning from Human Feedback), state-of-the-art LLMs like GPT4 and Llama3 are expected to have strong alignment with human preferences when prompted for a quality judgement, such as the coherence o… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  6. "It's like a rubber duck that talks back": Understanding Generative AI-Assisted Data Analysis Workflows through a Participatory Prompting Study

    Authors: Ian Drosos, Advait Sarkar, Xiaotong Xu, Carina Negreanu, Sean Rintel, Lev Tankelevitch

    Abstract: Generative AI tools can help users with many tasks. One such task is data analysis, which is notoriously challenging for non-expert end-users due to its expertise requirements, and where AI holds much potential, such as finding relevant data sources, proposing analysis strategies, and writing analysis code. To understand how data analysis workflows can be assisted or impaired by generative AI, we… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Ian Drosos, Advait Sarkar, Xiaotong Xu, Carina Negreanu, Sean Rintel, and Lev Tankelevitch. 2024. "It's like a rubber duck that talks back": Understanding Generative AI-Assisted Data Analysis Workflows through a Participatory Prompting Study. In Proceedings of the 3rd Annual Meeting of the Symposium on Human-Computer Interaction for Work (CHIWORK 2024)

    Journal ref: Proceedings of the 3rd Annual Meeting of the Symposium on Human-Computer Interaction for Work (CHIWORK 2024)

  7. Improving Steering and Verification in AI-Assisted Data Analysis with Interactive Task Decomposition

    Authors: Majeed Kazemitabaar, Jack Williams, Ian Drosos, Tovi Grossman, Austin Henley, Carina Negreanu, Advait Sarkar

    Abstract: LLM-powered tools like ChatGPT Data Analysis, have the potential to help users tackle the challenging task of data analysis programming, which requires expertise in data processing, programming, and statistics. However, our formative study (n=15) uncovered serious challenges in verifying AI-generated results and steering the AI (i.e., guiding the AI system to produce the desired output). We develo… ▽ More

    Submitted 1 August, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Published at UIST 2024; 19 pages, 9 figures, and 2 tables

    Journal ref: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology (UIST 2024)

  8. arXiv:2404.07114  [pdf, other

    cs.HC

    "My toxic trait is thinking I'll remember this": gaps in the learner experience of video tutorials for feature-rich software

    Authors: Ian Drosos, Advait Sarkar, Andrew D. Gordon

    Abstract: Video tutorials are a popular medium for informal and formal learning. However, when learners attempt to view and follow along with these tutorials, they encounter what we call gaps, that is, issues that can prevent learning. We examine the gaps encountered by users of video tutorials for feature-rich software, such as spreadsheets. We develop a theory and taxonomy of such gaps, identifying how th… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  9. arXiv:2312.16633  [pdf, ps, other

    cs.HC

    Participatory prompting: a user-centric research method for eliciting AI assistance opportunities in knowledge workflows

    Authors: Advait Sarkar, Ian Drosos, Rob Deline, Andrew D. Gordon, Carina Negreanu, Sean Rintel, Jack Williams, Benjamin Zorn

    Abstract: Generative AI, such as image generation models and large language models, stands to provide tremendous value to end-user programmers in creative and knowledge workflows. Current research methods struggle to engage end-users in a realistic conversation that balances the actually existing capabilities of generative AI with the open-ended nature of user workflows and the many opportunities for the ap… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Proceedings of the 34th Annual Conference of the Psychology of Programming Interest Group (PPIG 2023)

    Journal ref: Proceedings of the 34th Annual Conference of the Psychology of Programming Interest Group (PPIG 2023)

  10. arXiv:2310.01297  [pdf, other

    cs.HC cs.AI cs.CL cs.PL

    Co-audit: tools to help humans double-check AI-generated content

    Authors: Andrew D. Gordon, Carina Negreanu, José Cambronero, Rasika Chakravarthy, Ian Drosos, Hao Fang, Bhaskar Mitra, Hannah Richardson, Advait Sarkar, Stephanie Simmons, Jack Williams, Ben Zorn

    Abstract: Users are increasingly being warned to check AI-generated content for correctness. Still, as LLMs (and other generative models) generate more complex output, such as summaries, tables, or code, it becomes harder for the user to audit or evaluate the output for quality or correctness. Hence, we are seeing the emergence of tool-assisted experiences to help the user double-check a piece of AI-generat… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.