Skip to main content

Showing 1–20 of 20 results for author: Siebert, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.11147  [pdf, other

    cs.CL

    Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning

    Authors: Yukang Lin, Bingchen Zhong, Shuoran Jiang, Joanna Siebert, Qingcai Chen

    Abstract: Large language models (LLMs) have exhibited remarkable few-shot learning capabilities and unified the paradigm of NLP tasks through the in-context learning (ICL) technique. Despite the success of ICL, the quality of the exemplar demonstrations can significantly influence the LLM's performance. Existing exemplar selection methods mainly focus on the semantic similarity between queries and candidate… ▽ More

    Submitted 12 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  2. arXiv:2401.12714  [pdf, other

    cs.SE cs.AI

    Evaluation of large language models for assessing code maintainability

    Authors: Marc Dillmann, Julien Siebert, Adam Trendowicz

    Abstract: Increased availability of open-source software repositories and recent advances in code analysis using large language models (LLMs) has triggered a wave of new work to automate software engineering tasks that were previously very difficult to automate. In this paper, we investigate a recent line of work that hypothesises that comparing the probability of code generated by LLMs with the probability… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 14 pages, 4 figures, 8 tables

    MSC Class: 68 ACM Class: D.2.7

  3. arXiv:2309.10506  [pdf, other

    cs.CL

    Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval

    Authors: Nengzheng Jin, Dongfang Li, Junying Chen, Joanna Siebert, Qingcai Chen

    Abstract: Open-domain table question answering aims to provide answers to a question by retrieving and extracting information from a large collection of tables. Existing studies of open-domain table QA either directly adopt text retrieval methods or consider the table structure only in the encoding layer for table retrieval, which may cause syntactical and structural information loss during table scoring. T… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: IJCNLP-AACL 2023

  4. arXiv:2307.04468  [pdf, other

    cs.LG

    Badgers: generating data quality deficits with Python

    Authors: Julien Siebert, Daniel Seifert, Patricia Kelbert, Michael Kläs, Adam Trendowicz

    Abstract: Generating context specific data quality deficits is necessary to experimentally assess data quality of data-driven (artificial intelligence (AI) or machine learning (ML)) applications. In this paper we present badgers, an extensible open-source Python library to generate data quality deficits (outliers, imbalanced data, drift, etc.) for different modalities (tabular data, time-series, text, etc.)… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 17 pages, 16 figures

    MSC Class: 68 ACM Class: D.m

  5. Applications of statistical causal inference in software engineering

    Authors: Julien Siebert

    Abstract: This paper reviews existing work in software engineering that applies statistical causal inference methods. These methods aim at estimating causal effects from observational data. The review covers 32 papers published between 2010 and 2022. Our results show that the application of statistical causal inference methods is relatively recent and that the corresponding research community remains relati… ▽ More

    Submitted 23 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 38 pages, 12 tables, 9 figures, submitted to Information and Software Technology

    ACM Class: D.2

  6. arXiv:2211.08890  [pdf, other

    cs.SE

    Challenges related to system-of-systems for greening and climate adaptation in smart cities

    Authors: Sarah Brandt, Julien Siebert

    Abstract: This paper presents the results of interviews conducted as part of the DYNASOS project. The objective was to collect challenges related to the design, implementation and management of system-of-systems (SoS) in the context of climate adaptation and greening of smart cities. 23 individuals from cities, academia, and industry were interviewed between March and May 2022 and 57 distinct challenges wer… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 15 pages, 2 figures, 10 tables. submitted at https://www.mdpi.com/journal/sustainability/special_issues/Smart_Cities_Biodiversity_Infrastructure

    ACM Class: D.2

  7. arXiv:2207.05270  [pdf, ps, other

    cs.CL cs.AI

    A Survey on Table Question Answering: Recent Advances

    Authors: Nengzheng Jin, Joanna Siebert, Dongfang Li, Qingcai Chen

    Abstract: Table Question Answering (Table QA) refers to providing precise answers from tables to answer a user's question. In recent years, there have been a lot of works on table QA, but there is a lack of comprehensive surveys on this research topic. Hence, we aim to provide an overview of available datasets and representative methods in table QA. We classify existing methods for table QA into five catego… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 13 pages

  8. arXiv:2206.06008  [pdf

    cs.SE

    Towards a Roadmap for Trustworthy Dynamic Systems-of-Systems

    Authors: Rasmus Adler, Frank Elberzhager, Julien Siebert

    Abstract: This paper gives insights into the DynaSoS project, which aims to propose a cross-domain roadmap for systems engineering research into trustworthy dynamic systems-of-systems with autonomous components. The project follows a methodology that combines interviews and workshops with experts from various domains and a literature review. In this paper, we motivate the project, discuss five application a… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 6 pages, 0 figure, 6 references, preprint, paper accepted to the folowing conference: SERP'22 - The 20th Int'l Conf on Software Engineering Research and Practice (part of the 2022 World Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE'22)) https://american-cse.org/csce2022

    MSC Class: 68N99; ACM Class: D.2

  9. arXiv:2110.11338  [pdf, other

    cs.CV cs.CL cs.IR

    VLDeformer: Vision-Language Decomposed Transformer for Fast Cross-Modal Retrieval

    Authors: Lisai Zhang, Hongfa Wu, Qingcai Chen, Yimeng Deng, Zhonghua Li, Dejiang Kong, Zhao Cao, Joanna Siebert, Yunpeng Han

    Abstract: Cross-model retrieval has emerged as one of the most important upgrades for text-only search engines (SE). Recently, with powerful representation for pairwise text-image inputs via early interaction, the accuracy of vision-language (VL) transformers has outperformed existing methods for text-image retrieval. However, when the same paradigm is used for inference, the efficiency of the VL transforme… ▽ More

    Submitted 24 November, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

  10. arXiv:2105.01984  [pdf, other

    cs.SE cs.AI cs.LG

    Software Engineering for AI-Based Systems: A Survey

    Authors: Silverio Martínez-Fernández, Justus Bogner, Xavier Franch, Marc Oriol, Julien Siebert, Adam Trendowicz, Anna Maria Vollmer, Stefan Wagner

    Abstract: AI-based systems are software systems with functionalities enabled by at least one AI component (e.g., for image- and speech-recognition, and autonomous driving). AI-based systems are becoming pervasive in society due to advances in AI. However, there is limited synthesized knowledge on Software Engineering (SE) approaches for building, operating, and maintaining AI-based systems. To collect and a… ▽ More

    Submitted 2 September, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: Accepted in ACM Transactions on Software Engineering and Methodology (TOSEM). For its published version refer to the Journal of ACM TOSEM

    ACM Class: D.2; I.2

    Journal ref: ACM Trans. Softw. Eng. Methodol. 31, 2, Article 37e (March 2022), 59 pages

  11. arXiv:2104.07406  [pdf, other

    cs.MS

    A systematic review of Python packages for time series analysis

    Authors: Julien Siebert, Janek Groß, Christof Schroth

    Abstract: This paper presents a systematic review of Python packages with a focus on time series analysis. The objective is to provide (1) an overview of the different time series analysis tasks and preprocessing methods implemented, and (2) an overview of the development characteristics of the packages (e.g., documentation, dependencies, and community size). This review is based on a search of literature d… ▽ More

    Submitted 22 June, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 12 pages, 3 figures, 4 tables, accepted to ITISE2021

    MSC Class: 68-04 ACM Class: I.5.5

  12. Towards Guidelines for Assessing Qualities of Machine Learning Systems

    Authors: Julien Siebert, Lisa Joeckel, Jens Heidrich, Koji Nakamichi, Kyoko Ohashi, Isao Namba, Rieko Yamamoto, Mikio Aoyama

    Abstract: Nowadays, systems containing components based on machine learning (ML) methods are becoming more widespread. In order to ensure the intended behavior of a software system, there are standards that define necessary quality aspects of the system and its components (such as ISO/IEC 25010). Due to the different nature of ML, we have to adjust quality aspects or add additional ones (such as trustworthi… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Has been accepted at the 13th International Conference on the Quality of Information and Communications Technology QUATIC2020 (https://2020.quatic.org/). QUATIC 2020 proceedings will be included in a volume of Springer CCIS Series (Communications in Computer and Information Science)

    Journal ref: Proceedings of the 13th International Conference on the Quality of Information and Communications Technology QUATIC2020 (https://2020.quatic.org/). Springer CCIS Series (Communications in Computer and Information Science)

  13. arXiv:2004.13844  [pdf, other

    cs.CL

    Decomposing Word Embedding with the Capsule Network

    Authors: Xin Liu, Qingcai Chen, Yan Liu, Joanna Siebert, Baotian Hu, Xiangping Wu, Buzhou Tang

    Abstract: Word sense disambiguation tries to learn the appropriate sense of an ambiguous word in a given context. The existing pre-trained language methods and the methods based on multi-embeddings of word did not explore the power of the unsupervised word embedding sufficiently. In this paper, we discuss a capsule network-based approach, taking advantage of capsule's potential for recognizing highly over… ▽ More

    Submitted 29 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  14. arXiv:2003.01383  [pdf

    cs.CV cs.LG

    Fully Convolutional Networks for Automatically Generating Image Masks to Train Mask R-CNN

    Authors: Hao Wu, Jan Paul Siebert, Xiangrong Xu

    Abstract: This paper proposes a novel automatically generating image masks method for the state-of-the-art Mask R-CNN deep learning method. The Mask R-CNN method achieves the best results in object detection until now, however, it is very time-consuming and laborious to get the object Masks for training, the proposed method is composed by a two-stage design, to automatically generating image masks, the firs… ▽ More

    Submitted 20 May, 2021; v1 submitted 3 March, 2020; originally announced March 2020.

  15. arXiv:1809.01633  [pdf, other

    cs.CV

    Efficient Egocentric Visual Perception Combining Eye-tracking, a Software Retina and Deep Learning

    Authors: Nina Hristozova, Piotr Ozimek, Jan Paul Siebert

    Abstract: We present ongoing work to harness biological approaches to achieving highly efficient egocentric perception by combining the space-variant imaging architecture of the mammalian retina with Deep Learning methods. By pre-processing images collected by means of eye-tracking glasses to control the fixation locations of a software retina model, we demonstrate that we can reduce the input to a DCNN by… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: Accepted for: EPIC Workshop at the European Conference on Computer Vision, ECCV2018

  16. arXiv:1707.07157  [pdf, other

    cs.RO cs.CV

    Single-Shot Clothing Category Recognition in Free-Configurations with Application to Autonomous Clothes Sorting

    Authors: Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, Rustam Stolkin, J. Paul Siebert

    Abstract: This paper proposes a single-shot approach for recognising clothing categories from 2.5D features. We propose two visual features, BSP (B-Spline Patch) and TSD (Topology Spatial Distances) for this task. The local BSP features are encoded by LLC (Locality-constrained Linear Coding) and fused with three different global features. Our visual feature is robust to deformable shapes and our approach is… ▽ More

    Submitted 22 July, 2017; originally announced July 2017.

    Comments: 9 pages, accepted by IROS2017

  17. arXiv:1610.05824  [pdf, other

    cs.RO cs.CV

    Robot Vision Architecture for Autonomous Clothes Manipulation

    Authors: Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, J. Paul Siebert

    Abstract: This paper presents a novel robot vision architecture for perceiving generic 3D clothes configurations. Our architecture is hierarchically structured, starting from low-level curvatures, across mid-level geometric shapes \& topology descriptions; and finally approaching high-level semantic surface structure descriptions. We demonstrate our robot vision architecture in a customised dual-arm industr… ▽ More

    Submitted 18 October, 2016; originally announced October 2016.

    Comments: 14 pages, under review

  18. arXiv:1407.8004  [pdf, other

    cs.HC cs.CY

    An Investigation into the use of Images as Password Cues

    Authors: Tony McBryan, Karen Renaud, J. Paul Siebert

    Abstract: Computer users are generally authenticated by means of a password. Unfortunately passwords are often forgotten and replacement is expensive and inconvenient. Some people write their passwords down but these records can easily be lost or stolen. The option we explore is to find a way to cue passwords securely. The specific cueing technique we report on in this paper employs images as cues. The idea… ▽ More

    Submitted 9 August, 2014; v1 submitted 30 July, 2014; originally announced July 2014.

    ACM Class: H.1.2

  19. arXiv:1311.7295  [pdf, other

    cs.RO cs.CV

    Glasgow's Stereo Image Database of Garments

    Authors: Gerardo Aragon-Camarasa, Susanne B. Oehler, Yuan Liu, Sun Li, Paul Cockshott, J. Paul Siebert

    Abstract: To provide insight into cloth perception and manipulation with an active binocular robotic vision system, we compiled a database of 80 stereo-pair colour images with corresponding horizontal and vertical disparity maps and mask annotations, for 3D garment point cloud rendering has been created and released. The stereo-image garment database is part of research conducted under the EU-FP7 Clothes Pe… ▽ More

    Submitted 28 November, 2013; originally announced November 2013.

    Comments: 7 pages, 6 figure, image database

  20. arXiv:1210.5936  [pdf

    cs.MA

    Modélisation multi-niveaux dans AA4MM

    Authors: Benjamin Camus, Julien Siebert, Christine Bourjot, Vincent Chevrier

    Abstract: In this article, we propose to represent a multi-level phenomenon as a set of interacting models. This perspective makes the levels of representation and their relationships explicit. To deal with coherence, causality and coordination issues between models, we rely on AA4MM, a metamodel dedicated to such a representation. We illustrate our proposal and we show the interest of our approach on a flo… ▽ More

    Submitted 22 October, 2012; originally announced October 2012.

    Journal ref: Journées Francophones sur les Systèmes Multi-Agents (2012) 43-52