Skip to main content

Showing 1–28 of 28 results for author: Fernandez, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.01903  [pdf, other

    cs.LG cs.AI

    LookAlike: Consistent Distractor Generation in Math MCQs

    Authors: Nisarg Parikh, Nigel Fernandez, Alexander Scarlatos, Simon Woodhead, Andrew Lan

    Abstract: Large language models (LLMs) are increasingly used to generate distractors for multiple-choice questions (MCQs), especially in domains like math education. However, existing approaches are limited in ensuring that the generated distractors are consistent with common student errors. We propose LookAlike, a method that improves error-distractor consistency via preference optimization. Our two main i… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

  2. arXiv:2502.18632  [pdf, other

    cs.AI cs.CL cs.CY cs.LG cs.SE

    Automated Knowledge Component Generation and Knowledge Tracing for Coding Problems

    Authors: Zhangqi Duan, Nigel Fernandez, Sri Kanakadandi, Bita Akram, Andrew Lan

    Abstract: Knowledge components (KCs) mapped to problems help model student learning, tracking their mastery levels on fine-grained skills thereby facilitating personalized learning and feedback in online learning platforms. However, crafting and tagging KCs to problems, traditionally performed by human domain experts, is highly labor-intensive. We present a fully automated, LLM-based pipeline for KC generat… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  3. arXiv:2502.08041  [pdf, other

    cs.LG cs.IT

    The Art of Misclassification: Too Many Classes, Not Enough Points

    Authors: Mario Franco, Gerardo Febres, Nelson Fernández, Carlos Gershenson

    Abstract: Classification is a ubiquitous and fundamental problem in artificial intelligence and machine learning, with extensive efforts dedicated to developing more powerful classifiers and larger datasets. However, the classification task is ultimately constrained by the intrinsic properties of datasets, independently of computational power or model complexity. In this work, we introduce a formal entropy-… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  4. arXiv:2501.13832  [pdf, other

    cs.SE

    Software Bills of Materials in Maven Central

    Authors: Yogya Gamage, Nadia Gonzalez Fernandez, Martin Monperrus, Benoit Baudry

    Abstract: Software Bills of Materials (SBOMs) are essential to ensure the transparency and integrity of the software supply chain. There is a growing body of work that investigates the accuracy of SBOM generation tools and the challenges for producing complete SBOMs. Yet, there is little knowledge about how developers distribute SBOMs. In this work, we mine SBOMs from Maven Central to assess the extent to w… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Journal ref: Proceedings of the International Conference on Mining Software Repositories, 2025

  5. arXiv:2410.10829  [pdf, other

    cs.CY cs.CL cs.LG

    Test Case-Informed Knowledge Tracing for Open-ended Coding Tasks

    Authors: Zhangqi Duan, Nigel Fernandez, Alexander Hicks, Andrew Lan

    Abstract: Open-ended coding tasks, which ask students to construct programs according to certain specifications, are common in computer science education. Student modeling can be challenging since their open-ended nature means that student code can be diverse. Traditional knowledge tracing (KT) models that only analyze response correctness may not fully capture nuances in student knowledge from student code… ▽ More

    Submitted 20 December, 2024; v1 submitted 27 September, 2024; originally announced October 2024.

    Comments: Published in LAK 2025: The 15th International Learning Analytics and Knowledge Conference

  6. arXiv:2406.19356  [pdf, other

    cs.CL cs.CY cs.LG

    DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions

    Authors: Nigel Fernandez, Alexander Scarlatos, Wanyong Feng, Simon Woodhead, Andrew Lan

    Abstract: High-quality distractors are crucial to both the assessment and pedagogical value of multiple-choice questions (MCQs), where manually crafting ones that anticipate knowledge deficiencies or misconceptions among real students is difficult. Meanwhile, automated distractor generation, even with the help of large language models (LLMs), remains challenging for subjects like math. It is crucial to not… ▽ More

    Submitted 7 October, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024: The 2024 Conference on Empirical Methods in Natural Language Processing

  7. arXiv:2405.08213  [pdf, other

    cs.CL cs.CY cs.LG

    Interpreting Latent Student Knowledge Representations in Programming Assignments

    Authors: Nigel Fernandez, Andrew Lan

    Abstract: Recent advances in artificial intelligence for education leverage generative large language models, including using them to predict open-ended student responses rather than their correctness only. However, the black-box nature of these models limits the interpretability of the learned student knowledge representations. In this paper, we conduct a first exploration into interpreting latent student… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: EDM 2024: 17th International Conference on Educational Data Mining

  8. arXiv:2405.01391  [pdf, other

    cs.SE

    The Sustainability Assessment Framework Toolkit: A Decade of Modeling Experience

    Authors: Patricia Lago, Nelly Condori Fernandez, Iffat Fatima, Markus Funke, Ivano Malavolta

    Abstract: Software intensive systems play a crucial role in most, if not all, aspects of modern society. As such, both their sustainability and their role in supporting sustainable processes, must be realized by design. To this aim, the architecture of software intensive systems should be designed to support sustainability goals; and measured to understand how effectively they do so. In this paper, we prese… ▽ More

    Submitted 19 October, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  9. arXiv:2403.14666  [pdf, other

    cs.CY cs.CL cs.IR cs.LG

    SyllabusQA: A Course Logistics Question Answering Dataset

    Authors: Nigel Fernandez, Alexander Scarlatos, Andrew Lan

    Abstract: Automated teaching assistants and chatbots have significant potential to reduce the workload of human instructors, especially for logistics-related question answering, which is important to students yet repetitive for instructors. However, due to privacy concerns, there is a lack of publicly available datasets. We introduce SyllabusQA, an open-source dataset with 63 real course syllabi covering 36… ▽ More

    Submitted 22 July, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: ACL 2024: The 62nd Annual Meeting of the Association for Computational Linguistics

  10. arXiv:2401.11174  [pdf, other

    cs.CV cs.AI cs.LG

    Pixel-Wise Recognition for Holistic Surgical Scene Understanding

    Authors: Nicolás Ayobi, Santiago Rodríguez, Alejandra Pérez, Isabela Hernández, Nicolás Aparicio, Eugénie Dessevres, Sebastián Peña, Jessica Santander, Juan Ignacio Caicedo, Nicolás Fernández, Pablo Arbeláez

    Abstract: This paper presents the Holistic and Multi-Granular Surgical Scene Understanding of Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene understanding as a hierarchy of complementary tasks with varying levels of granularity. Our approach enables a multi-level comprehension of surgical activities, encompassing long-term tasks such as surgical phases and steps recognition… ▽ More

    Submitted 25 January, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: Preprint submitted to Medical Image Analysis. Official extension of previous MICCAI 2022 (https://link.springer.com/chapter/10.1007/978-3-031-16449-1_42) and ISBI 2023 (https://ieeexplore.ieee.org/document/10230819) orals. Data and codes are available at https://github.com/BCV-Uniandes/GraSP

  11. 3HAN: A Deep Neural Network for Fake News Detection

    Authors: Sneha Singhania, Nigel Fernandez, Shrisha Rao

    Abstract: The rapid spread of fake news is a serious problem calling for AI solutions. We employ a deep learning based automated detector through a three level hierarchical attention network (3HAN) for fast, accurate detection of fake news. 3HAN has three levels, one each for words, sentences, and the headline, and constructs a news vector: an effective representation of an input news article, by processing… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at ICONIP 2017

  12. arXiv:2306.08847  [pdf, other

    cs.CL cs.CY cs.LG

    Improving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rank

    Authors: Nischal Ashok Kumar, Nigel Fernandez, Zichao Wang, Andrew Lan

    Abstract: Reading comprehension is a crucial skill in many aspects of education, including language learning, cognitive development, and fostering early literacy skills in children. Automated answer-aware reading comprehension question generation has significant potential to scale up learner support in educational activities. One key technical challenge in this setting is that there can be multiple question… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Oral presentation at ACL BEA workshop 2023. Code available at: https://github.com/umass-ml4ed/question-gen-aug-ranking

  13. arXiv:2305.14267  [pdf, other

    cs.LG cs.CV math.NA

    SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models

    Authors: Martin Gonzalez, Nelson Fernandez, Thuy Tran, Elies Gherbi, Hatem Hajri, Nader Masmoudi

    Abstract: A potent class of generative models known as Diffusion Probabilistic Models (DPMs) has become prominent. A forward diffusion process adds gradually noise to data, while a model learns to gradually denoise. Sampling from pre-trained DPMs is obtained by solving differential equations (DE) defined by the learnt model, a process which has shown to be prohibitively slow. Numerous efforts on speeding-up… ▽ More

    Submitted 26 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 60 pages. Camera-Ready version for the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

    MSC Class: I.2.6

  14. Towards Holistic Surgical Scene Understanding

    Authors: Natalia Valderrama, Paola Ruiz Puentes, Isabela Hernández, Nicolás Ayobi, Mathilde Verlyk, Jessica Santander, Juan Caicedo, Nicolás Fernández, Pablo Arbeláez

    Abstract: Most benchmarks for studying surgical interventions focus on a specific challenge instead of leveraging the intrinsic complementarity among different tasks. In this work, we present a new experimental framework towards holistic surgical scene understanding. First, we introduce the Phase, Step, Instrument, and Atomic Visual Action recognition (PSI-AVA) Dataset. PSI-AVA includes annotations for both… ▽ More

    Submitted 25 January, 2024; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: MICCAI 2022 Oral. Official extension published at arXiv:2401.11174 . Data and codes available at https://github.com/BCV-Uniandes/TAPIR

    Journal ref: Medical Image Computing and Computer Assisted Intervention 2022,

  15. arXiv:2207.08017  [pdf

    cs.SI physics.soc-ph

    Is Soccer a lie or simply a complex system?

    Authors: Nelson Fernandez, Ricardo Bernal

    Abstract: Understanding soccer as a complex system we base on nature and the collective behavior of many organisms that "do calculations," seeking to generate solutions in a bioinspired way. When soccer mysteries appear, complex systems science emerges as a means to provide explanations. However, given the variety of interpretations that complexity and its associated properties can have and the understandin… ▽ More

    Submitted 20 July, 2022; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: 15 pages, in Spanish language, 6 Figures

  16. arXiv:2205.09864  [pdf, other

    cs.LG cs.AI cs.CY

    Automated Scoring for Reading Comprehension via In-context BERT Tuning

    Authors: Nigel Fernandez, Aritra Ghosh, Naiming Liu, Zichao Wang, Benoît Choffin, Richard Baraniuk, Andrew Lan

    Abstract: Automated scoring of open-ended student responses has the potential to significantly reduce human grader effort. Recent advances in automated scoring often leverage textual representations based on pre-trained language models such as BERT and GPT as input to scoring models. Most existing approaches train a separate model for each item/question, which is suitable for scenarios such as essay scoring… ▽ More

    Submitted 15 June, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Published as a conference paper at AIED 2022. A grand prize-winner for the NAEP AS Challenge. Code available at: https://github.com/ni9elf/automated-scoring

  17. arXiv:2007.14432  [pdf

    cs.CV

    A Convolutional Neural Network for gaze preference detection: A potential tool for diagnostics of autism spectrum disorder in children

    Authors: Dennis Núñez Fernández, Franklin Barrientos Porras, Robert H. Gilman, Macarena Vittet Mondonedo, Patricia Sheen, Mirko Zimic

    Abstract: Early diagnosis of autism spectrum disorder (ASD) is known to improve the quality of life of affected individuals. However, diagnosis is often delayed even in wealthier countries including the US, largely due to the fact that gold standard diagnostic tools such as the Autism Diagnostic Observation Schedule (ADOS) and the Autism Diagnostic Interview-Revised (ADI-R) are time consuming and require ex… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Pre-printed version for submission in a journal

  18. arXiv:2006.16913  [pdf, other

    cs.CY cs.AI cs.LG

    Synthesizing Tasks for Block-based Programming

    Authors: Umair Z. Ahmed, Maria Christakis, Aleksandr Efremov, Nigel Fernandez, Ahana Ghosh, Abhik Roychoudhury, Adish Singla

    Abstract: Block-based visual programming environments play a critical role in introducing computing concepts to K-12 students. One of the key pedagogical challenges in these environments is in designing new practice tasks for a student that match a desired level of difficulty and exercise specific programming concepts. In this paper, we formalize the problem of synthesizing visual programming tasks. In part… ▽ More

    Submitted 4 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  19. arXiv:1910.11100  [pdf, other

    cs.CV

    Development of a hand pose recognition system on an embedded computer using CNNs

    Authors: Dennis Núñez Fernández

    Abstract: Demand of hand pose recognition systems are growing in the last years in technologies like human-machine interfaces. This work suggests an approach for hand pose recognition in embedded computers using hand tracking and CNNs. Results show a fast time response with an accuracy of 94.50% and low power consumption.

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: LatinX in AI Research at NeurIPS 2019

  20. arXiv:1811.05785  [pdf, other

    cs.LG cs.AI

    Two-stream convolutional networks for end-to-end learning of self-driving cars

    Authors: Nelson Fernandez

    Abstract: We propose a methodology to extend the concept of Two-Stream Convolutional Networks to perform end-to-end learning for self-driving cars with temporal cues. The system has the ability to learn spatiotemporal features by simultaneously mapping raw images and pre-calculated optical flows directly to steering commands. Although optical flows encode temporal-rich information, we found that 2D-CNNs are… ▽ More

    Submitted 17 December, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

    Journal ref: NeurIPS 2018 Workshop on modeling and decision-making in the spatiotemporal domain, Montreal, Canada

  21. arXiv:1611.09928  [pdf, other

    cs.GT

    Proportional Justified Representation

    Authors: Luis Sánchez-Fernández, Edith Elkind, Martin Lackner, Norberto Fernández, Jesús A. Fisteus, Pablo Basanta Val, Piotr Skowron

    Abstract: The goal of multi-winner elections is to choose a fixed-size committee based on voters' preferences. An important concern in this setting is representation: large groups of voters with cohesive preferences should be adequately represented by the election winners. Recently, Aziz et al. (2015a;2017) proposed two axioms that aim to capture this idea: justified representation (JR) and its strengthenin… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: Accepted at the 31st AAAI Conference on Artificial Intelligence (AAAI-17)

  22. Architecting Time-Critical Big-Data Systems

    Authors: Pablo Basanta-Val, Neil Audsley, Andy Wellings, Ian Gray, Norberto Fernandez

    Abstract: - Current infrastructures for developing big-data applications are able to process --via big-data analytics-huge amounts of data, using clusters of machines that collaborate to perform parallel computations. However, current infrastructures were not designed to work with the requirements of time-critical applications; they are more focused on general-purpose applications rather than time-critical… ▽ More

    Submitted 3 November, 2016; originally announced November 2016.

    Comments: in IEEE Transactions on Big Data, 2016

  23. arXiv:1609.05370  [pdf, ps, other

    cs.GT

    The Maximin Support Method: An Extension of the D'Hondt Method to Approval-Based Multiwinner Elections

    Authors: Luis Sánchez-Fernández, Norberto Fernández, Jesús A. Fisteus, Markus Brill

    Abstract: We propose the maximin support method, a novel extension of the D'Hondt apportionment method to approval-based multiwinner elections. The maximin support method is based on maximizing the support of the least supported elected candidate. It can be computed efficiently and satisfies (adjusted versions of) the main properties of the original D'Hondt method: house monotonicity, population monotonicit… ▽ More

    Submitted 5 September, 2018; v1 submitted 17 September, 2016; originally announced September 2016.

    ACM Class: I.2.11

  24. arXiv:1606.00799  [pdf

    cs.MA nlin.AO

    Multi-Agent Modeling of Dynamical Systems: A Self-organized, Emergent, Homeostatic and Autopoietic Approach

    Authors: Nelson Fernandez

    Abstract: This thesis presents the theoretical, conceptual and methodological aspects that support the modeling of dynamical systems (DS) by using several agents. The modeling approach permits the assessment of properties representing order, change, equilibrium, adaptability, and autonomy, in DS. The modeling processes were supported by a conceptual corpus regarding systems dynamics, multi-agent systems, gr… ▽ More

    Submitted 2 October, 2015; originally announced June 2016.

    Comments: in Spanish

  25. arXiv:1511.00529  [pdf, other

    nlin.AO cond-mat.stat-mech cs.CC

    Measuring the Complexity of Continuous Distributions

    Authors: Guillermo Santamaría-Bonfil, Nelson Fernández, Carlos Gershenson

    Abstract: We extend previously proposed measures of complexity, emergence, and self-organization to continuous distributions using differential entropy. This allows us to calculate the complexity of phenomena for which distributions are known. We find that a broad range of common parameters found in Gaussian and scale-free distributions present high complexity values. We also explore the relationship betwee… ▽ More

    Submitted 2 November, 2015; originally announced November 2015.

    Comments: 21 pages, 5 Tables, 4 Figures

    Journal ref: Entropy, 18(3):72. 2016

  26. arXiv:1402.0197  [pdf, other

    nlin.AO cs.IT eess.SY nlin.CG physics.soc-ph

    Measuring the Complexity of Self-organizing Traffic Lights

    Authors: Dario Zubillaga, Geovany Cruz, Luis Daniel Aguilar, Jorge Zapotecatl, Nelson Fernandez, Jose Aguilar, David A. Rosenblueth, Carlos Gershenson

    Abstract: We apply measures of complexity, emergence and self-organization to an abstract city traffic model for comparing a traditional traffic coordination method with a self-organizing method in two scenarios: cyclic boundaries and non-orientable boundaries. We show that the measures are useful to identify and characterize different dynamical phases. It becomes clear that different operation regimes are… ▽ More

    Submitted 2 February, 2014; originally announced February 2014.

    Comments: 18 pages, 11 figures

    ACM Class: F.1.1; D.2.8; F.1.3; J.2; H.1.1

    Journal ref: Entropy, 16(5):2384-2407. 2014

  27. arXiv:1304.1842  [pdf, other

    nlin.AO cs.IT q-bio.OT

    Information Measures of Complexity, Emergence, Self-organization, Homeostasis, and Autopoiesis

    Authors: Nelson Fernandez, Carlos Maldonado, Carlos Gershenson

    Abstract: This chapter reviews measures of emergence, self-organization, complexity, homeostasis, and autopoiesis based on information theory. These measures are derived from proposed axioms and tested in two case studies: random Boolean networks and an Arctic lake ecosystem. Emergence is defined as the information a system or process produces. Self-organization is defined as the opposite of emergence, wh… ▽ More

    Submitted 31 July, 2013; v1 submitted 5 April, 2013; originally announced April 2013.

    Comments: 35 pages, 12 figures, to be published in Prokopenko, M., editor, Guided Self-Organization: Inception. Springer. In Press

    Report number: C3 Report 2013.02 ACM Class: H.1.1; F.1.3; J.3

  28. arXiv:1205.2026  [pdf, other

    cs.IT nlin.AO nlin.CG

    Complexity and Information: Measuring Emergence, Self-organization, and Homeostasis at Multiple Scales

    Authors: Carlos Gershenson, Nelson Fernandez

    Abstract: Concepts used in the scientific study of complex systems have become so widespread that their use and abuse has led to ambiguity and confusion in their meaning. In this paper we use information theory to provide abstract and concise measures of complexity, emergence, self-organization, and homeostasis. The purpose is to clarify the meaning of these concepts with the aid of the proposed formal meas… ▽ More

    Submitted 10 August, 2012; v1 submitted 9 May, 2012; originally announced May 2012.

    Comments: 42 pages, 11 figures, 2 tables

    Report number: C3 2012.03 MSC Class: 94A15 (Primary) 94A17; 68Q15; 68Q80 (Secondary) ACM Class: H.1.1; F.1.3; F.1.1

    Journal ref: Complexity 18(2):29-44. 2012