
Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning

Authors: Yukang Lin, Bingchen Zhong, Shuoran Jiang, Joanna Siebert, Qingcai Chen

Abstract: Large language models (LLMs) have exhibited remarkable few-shot learning capabilities and unified the paradigm of NLP tasks through the in-context learning (ICL) technique. Despite the success of ICL, the quality of the exemplar demonstrations can significantly influence the LLM's performance. Existing exemplar selection methods mainly focus on the semantic similarity between queries and candidate exemplars. On the other hand, the logical connections between reasoning steps can also help depict the problem-solving process. In this paper, we propose a novel method named Reasoning Graph-enhanced Exemplar Retrieval (RGER). RGER first queries the LLM to generate an initial response, then expresses the intermediate problem-solving steps as a graph structure. After that, it employs a graph kernel to select exemplars with semantic and structural similarity. Extensive experiments demonstrate that the structural relationship helps align queries and candidate exemplars. The efficacy of RGER on math and logical reasoning tasks showcases its superiority over state-of-the-art retrieval-based approaches. Our code is released at https://github.com/Yukang-Lin/RGER.
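
To make the idea of comparing reasoning graphs concrete, here is a minimal sketch of one very simple graph kernel, the vertex-histogram kernel, applied to the step labels of two reasoning graphs. The abstract does not say which kernel RGER uses, so the kernel choice, the function names, and the toy step labels below are all assumptions for illustration only; this variant also ignores edges, which a real structural comparison would take into account.

```python
import math
from collections import Counter


def vertex_histogram_kernel(labels_a, labels_b):
    """Dot product of the node-label histograms of two graphs."""
    hist_a, hist_b = Counter(labels_a), Counter(labels_b)
    # Counter returns 0 for labels missing from hist_b.
    return sum(count * hist_b[label] for label, count in hist_a.items())


def normalized_kernel(labels_a, labels_b):
    """Kernel value scaled to [0, 1] so graphs of different size compare fairly."""
    k_ab = vertex_histogram_kernel(labels_a, labels_b)
    k_aa = vertex_histogram_kernel(labels_a, labels_a)
    k_bb = vertex_histogram_kernel(labels_b, labels_b)
    if k_aa == 0 or k_bb == 0:
        return 0.0
    return k_ab / math.sqrt(k_aa * k_bb)


# Hypothetical node labels: one operation per reasoning step.
query_steps = ["add", "mul", "add"]
similar_steps = ["add", "add", "mul"]
different_steps = ["div", "sub"]

similar_score = normalized_kernel(query_steps, similar_steps)
different_score = normalized_kernel(query_steps, different_steps)
```

A retrieval pipeline in this spirit would presumably combine such a structural score with a semantic similarity score before ranking candidates; how RGER weights the two is not stated in the abstract.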

Submitted 12 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

arXiv:2401.12714 [pdf, other]

Evaluation of large language models for assessing code maintainability

Authors: Marc Dillmann, Julien Siebert, Adam Trendowicz

Abstract: Increased availability of open-source software repositories and recent advances in code analysis using large language models (LLMs) have triggered a wave of new work to automate software engineering tasks that were previously very difficult to automate. In this paper, we investigate a recent line of work that hypothesises that comparing the probability of code generated by LLMs with the probability the current code would have had can indicate potential quality problems. We investigate the association between the cross-entropy of code generated by ten different models (based on GPT2 and Llama2) and the following quality aspects: readability, understandability, complexity, modularisation, and overall maintainability assessed by experts and available in a benchmark dataset. Our results show that, controlling for the number of logical lines of code (LLOC), cross-entropy computed by LLMs is indeed a predictor of maintainability on a class level (the higher the cross-entropy the lower the maintainability). However, this relation is reversed when one does not control for LLOC (e.g., comparing small classes with longer ones). Furthermore, while the complexity of LLMs affects the range of cross-entropy (smaller models tend to have a wider range of cross-entropy), this plays a significant role in predicting maintainability aspects. Our study limits itself to ten different pretrained models (based on GPT2 and Llama2) and to maintainability aspects collected by Schnappinger et al.
When controlling for logical lines of code (LLOC), cross-entropy is a predictor of maintainability. However, while related work has shown the potential usefulness of cross-entropy at the level of tokens or short sequences, at the class level this criterion alone may prove insufficient to predict maintainability and further research is needed to make best use of this information in practice.
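
As a minimal sketch of the quantity this abstract refers to, the cross-entropy of a piece of code under a language model can be computed as the negative mean of the per-token log-probabilities. The function name and the input format (natural-log token probabilities already obtained from some model) are assumptions for illustration, not the authors' implementation.

```python
import math


def mean_cross_entropy(token_logprobs):
    """Average negative log-probability per token, in nats.

    token_logprobs: natural-log probabilities that a language model
    assigned to each token of a code snippet (hypothetical input).
    """
    if not token_logprobs:
        raise ValueError("at least one token log-probability is required")
    return -sum(token_logprobs) / len(token_logprobs)


# Toy example: four tokens, each predicted with probability 0.5.
toy_logprobs = [math.log(0.5)] * 4
toy_entropy = mean_cross_entropy(toy_logprobs)  # equals ln(2)
```

Because this is a per-token average, it says nothing about class size on its own, which is one reason the abstract stresses controlling for LLOC when relating cross-entropy to maintainability.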

Submitted 23 January, 2024; originally announced January 2024.

Comments: 14 pages, 4 figures, 8 tables

MSC Class: 68; ACM Class: D.2.7

arXiv:2309.10506 [pdf, other]

Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval

Authors: Nengzheng Jin, Dongfang Li, Junying Chen, Joanna Siebert, Qingcai Chen

Abstract: Open-domain table question answering aims to provide answers to a question by retrieving and extracting information from a large collection of tables. Existing studies of open-domain table QA either directly adopt text retrieval methods or consider the table structure only in the encoding layer for table retrieval, which may cause syntactical and structural information loss during table scoring. To address this issue, we propose a syntax- and structure-aware retrieval method for the open-domain table QA task. It provides syntactical representations for the question and uses the structural header and value representations for the tables to avoid the loss of fine-grained syntactical and structural information. Then, a syntactical-to-structural aggregator is used to obtain the matching score between the question and a candidate table by mimicking the human retrieval process. Experimental results show that our method achieves state-of-the-art results on the NQ-tables dataset and outperforms strong baselines on a newly curated open-domain Text-to-SQL dataset.

Submitted 19 September, 2023; originally announced September 2023.

Comments: IJCNLP-AACL 2023

arXiv:2307.04468 [pdf, other]

Badgers: generating data quality deficits with Python

Authors: Julien Siebert, Daniel Seifert, Patricia Kelbert, Michael Kläs, Adam Trendowicz

Abstract: Generating context-specific data quality deficits is necessary to experimentally assess data quality of data-driven (artificial intelligence (AI) or machine learning (ML)) applications. In this paper we present badgers, an extensible open-source Python library to generate data quality deficits (outliers, imbalanced data, drift, etc.) for different modalities (tabular data, time-series, text, etc.). The documentation is accessible at https://fraunhofer-iese.github.io/badgers/ and the source code at https://github.com/Fraunhofer-IESE/badgers

Submitted 10 July, 2023; originally announced July 2023.

Comments: 17 pages, 16 figures

MSC Class: 68; ACM Class: D.m

arXiv:2211.11482 [pdf, other]

doi 10.1016/j.infsof.2023.107198

Applications of statistical causal inference in software engineering

Authors: Julien Siebert

Abstract: This paper reviews existing work in software engineering that applies statistical causal inference methods. These methods aim at estimating causal effects from observational data. The review covers 32 papers published between 2010 and 2022. Our results show that the application of statistical causal inference methods is relatively recent and that the corresponding research community remains relatively fragmented.

Submitted 23 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: 38 pages, 12 tables, 9 figures, submitted to Information and Software Technology

ACM Class: D.2

arXiv:2211.08890 [pdf, other]

Challenges related to system-of-systems for greening and climate adaptation in smart cities

Authors: Sarah Brandt, Julien Siebert

Abstract: This paper presents the results of interviews conducted as part of the DYNASOS project. The objective was to collect challenges related to the design, implementation and management of system-of-systems (SoS) in the context of climate adaptation and greening of smart cities. 23 individuals from cities, academia, and industry were interviewed between March and May 2022 and 57 distinct challenges were collected and analyzed. Our results show that while technical issues (such as interoperability or data acquisition) persist, non-technical issues are the main obstacles. Difficulties in information sharing, effective communication, and synchronization between different actors are the most important challenges.

Submitted 16 November, 2022; originally announced November 2022.

Comments: 15 pages, 2 figures, 10 tables. submitted at https://www.mdpi.com/journal/sustainability/special_issues/Smart_Cities_Biodiversity_Infrastructure

ACM Class: D.2

arXiv:2207.05270 [pdf, ps, other]

A Survey on Table Question Answering: Recent Advances

Authors: Nengzheng Jin, Joanna Siebert, Dongfang Li, Qingcai Chen

Abstract: Table Question Answering (Table QA) refers to providing precise answers from tables to answer a user's question. In recent years, there has been a lot of work on table QA, but there is a lack of comprehensive surveys on this research topic. Hence, we aim to provide an overview of available datasets and representative methods in table QA. We classify existing methods for table QA into five categories according to their techniques, which include semantic-parsing-based, generative, extractive, matching-based, and retriever-reader-based methods. Moreover, as table QA is still a challenging task for existing methods, we also identify and outline several key challenges and discuss the potential future directions of table QA.

Submitted 11 July, 2022; originally announced July 2022.

Comments: 13 pages

arXiv:2206.06008 [pdf]

Towards a Roadmap for Trustworthy Dynamic Systems-of-Systems

Authors: Rasmus Adler, Frank Elberzhager, Julien Siebert

Abstract: This paper gives insights into the DynaSoS project, which aims to propose a cross-domain roadmap for systems engineering research into trustworthy dynamic systems-of-systems with autonomous components. The project follows a methodology that combines interviews and workshops with experts from various domains and a literature review. In this paper, we motivate the project, discuss five application areas with their drivers and visions, and draw initial conclusions with respect to challenges that a research roadmap should address. Our goal is to share initial results with the research community about the DynaSoS project and invite them to contribute to the derivation of the research roadmap.

Submitted 13 June, 2022; originally announced June 2022.

Comments: 6 pages, 0 figures, 6 references, preprint, paper accepted to the following conference: SERP'22 - The 20th Int'l Conf on Software Engineering Research and Practice (part of the 2022 World Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE'22)) https://american-cse.org/csce2022

MSC Class: 68N99; ACM Class: D.2

arXiv:2110.11338 [pdf, other]

VLDeformer: Vision-Language Decomposed Transformer for Fast Cross-Modal Retrieval

Authors: Lisai Zhang, Hongfa Wu, Qingcai Chen, Yimeng Deng, Zhonghua Li, Dejiang Kong, Zhao Cao, Joanna Siebert, Yunpeng Han

Abstract: Cross-modal retrieval has emerged as one of the most important upgrades for text-only search engines (SE). Recently, with powerful representation for pairwise text-image inputs via early interaction, the accuracy of vision-language (VL) transformers has outperformed existing methods for text-image retrieval. However, when the same paradigm is used for inference, the efficiency of the VL transformers is still too low to be applied in a real cross-modal SE. Inspired by the mechanism of human learning and using cross-modal knowledge, this paper presents a novel Vision-Language Decomposed Transformer (VLDeformer), which greatly increases the efficiency of VL transformers while maintaining their outstanding accuracy. With the proposed method, cross-modal retrieval is separated into two stages: the VL transformer learning stage, and the VL decomposition stage. The latter stage plays the role of single modal indexing, which is to some extent like the term indexing of a text SE. The model learns cross-modal knowledge from early-interaction pre-training and is then decomposed into an individual encoder. The decomposition requires only small target datasets for supervision and achieves both 1000+ times acceleration and less than 0.6% average recall drop. VLDeformer also outperforms state-of-the-art visual-semantic embedding methods on COCO and Flickr30k.
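
The efficiency argument in this abstract rests on indexing each modality independently, so that only the query has to be encoded at search time. A minimal sketch of that retrieval step follows, assuming embeddings have already been produced by some encoder (not shown) and assuming cosine similarity as the matching score; the function names, the toy vectors, and the choice of cosine similarity are illustrative assumptions, not details taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_by_cosine(query_vec, index):
    """Return item ids from a precomputed index, best match first."""
    scored = [(cosine(query_vec, vec), item_id) for item_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [item_id for _, item_id in scored]


# Hypothetical offline index: image id -> embedding computed ahead of time.
toy_index = {
    "img_a": [1.0, 0.0],
    "img_b": [0.0, 1.0],
    "img_c": [0.7, 0.7],
}

# Hypothetical text-query embedding, computed once at query time.
ranking = rank_by_cosine([0.9, 0.1], toy_index)
```

With early-interaction models, every query-image pair would need a joint forward pass; with a precomputed index like this, the per-item cost at query time is a single vector comparison.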

Submitted 24 November, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2105.01984 [pdf, other]

doi 10.1145/3487043

Software Engineering for AI-Based Systems: A Survey

Authors: Silverio Martínez-Fernández, Justus Bogner, Xavier Franch, Marc Oriol, Julien Siebert, Adam Trendowicz, Anna Maria Vollmer, Stefan Wagner

Abstract: AI-based systems are software systems with functionalities enabled by at least one AI component (e.g., for image- and speech-recognition, and autonomous driving). AI-based systems are becoming pervasive in society due to advances in AI. However, there is limited synthesized knowledge on Software Engineering (SE) approaches for building, operating, and maintaining AI-based systems. To collect and analyze state-of-the-art knowledge about SE for AI-based systems, we conducted a systematic mapping study. We considered 248 studies published between January 2010 and March 2020. SE for AI-based systems is an emerging research area, where more than 2/3 of the studies have been published since 2018. The most studied properties of AI-based systems are dependability and safety. We identified multiple SE approaches for AI-based systems, which we classified according to the SWEBOK areas. Studies related to software testing and software quality are very prevalent, while areas like software maintenance seem neglected. Data-related issues are the most recurrent challenges. Our results are valuable for: researchers, to quickly understand the state of the art and learn which topics need more research; practitioners, to learn about the approaches and challenges that SE entails for AI-based systems; and, educators, to bridge the gap between SE and AI in their curricula.

Submitted 2 September, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

Comments: Accepted in ACM Transactions on Software Engineering and Methodology (TOSEM). For its published version refer to the Journal of ACM TOSEM

ACM Class: D.2; I.2

Journal ref: ACM Trans. Softw. Eng. Methodol. 31, 2, Article 37e (March 2022), 59 pages

arXiv:2104.07406 [pdf, other]

A systematic review of Python packages for time series analysis

Authors: Julien Siebert, Janek Groß, Christof Schroth

Abstract: This paper presents a systematic review of Python packages with a focus on time series analysis. The objective is to provide (1) an overview of the different time series analysis tasks and preprocessing methods implemented, and (2) an overview of the development characteristics of the packages (e.g., documentation, dependencies, and community size). This review is based on a search of literature databases as well as GitHub repositories. Following the filtering process, 40 packages were analyzed. We classified the packages according to the analysis tasks implemented, the methods related to data preparation, and the means for evaluating the results produced (methods and access to evaluation data). We also reviewed documentation aspects, the licenses, the size of the packages' community, and the dependencies used. Among other things, our results show that forecasting is by far the most frequently implemented task, that half of the packages provide access to real datasets or allow generating synthetic data, and that many packages depend on a few libraries (the most used ones being numpy, scipy and pandas). We hope that this review can help practitioners and researchers navigate the space of Python packages dedicated to time series analysis. We will provide an updated list of the reviewed packages online at https://siebert-julien.github.io/time-series-analysis-python/.

Submitted 22 June, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

Comments: 12 pages, 3 figures, 4 tables, accepted to ITISE2021

MSC Class: 68-04; ACM Class: I.5.5

arXiv:2008.11007 [pdf]

doi 10.1007/978-3-030-58793-2_2

Towards Guidelines for Assessing Qualities of Machine Learning Systems

Authors: Julien Siebert, Lisa Joeckel, Jens Heidrich, Koji Nakamichi, Kyoko Ohashi, Isao Namba, Rieko Yamamoto, Mikio Aoyama

Abstract: Nowadays, systems containing components based on machine learning (ML) methods are becoming more widespread. In order to ensure the intended behavior of a software system, there are standards that define necessary quality aspects of the system and its components (such as ISO/IEC 25010). Due to the different nature of ML, we have to adjust quality aspects or add additional ones (such as trustworthiness) and be very precise about which aspect is really relevant for which object of interest (such as completeness of training data), and how to objectively assess adherence to quality requirements. In this article, we present the construction of a quality model (i.e., evaluation objects, quality aspects, and metrics) for an ML system based on an industrial use case. This quality model enables practitioners to specify and assess quality requirements for such kinds of ML systems objectively. In the future, we want to learn how the term quality differs between different types of ML systems and come up with general guidelines for specifying and assessing qualities of ML systems.

Submitted 25 August, 2020; originally announced August 2020.

Comments: Has been accepted at the 13th International Conference on the Quality of Information and Communications Technology QUATIC2020 (https://2020.quatic.org/). QUATIC 2020 proceedings will be included in a volume of Springer CCIS Series (Communications in Computer and Information Science)

Journal ref: Proceedings of the 13th International Conference on the Quality of Information and Communications Technology QUATIC2020 (https://2020.quatic.org/). Springer CCIS Series (Communications in Computer and Information Science)

arXiv:2004.13844 [pdf, other]

Decomposing Word Embedding with the Capsule Network

Authors: Xin Liu, Qingcai Chen, Yan Liu, Joanna Siebert, Baotian Hu, Xiangping Wu, Buzhou Tang

Abstract: Word sense disambiguation tries to learn the appropriate sense of an ambiguous word in a given context. The existing pre-trained language methods and the methods based on multi-embeddings of words did not sufficiently explore the power of the unsupervised word embedding. In this paper, we discuss a capsule network-based approach, taking advantage of the capsule's potential for recognizing highly overlapping features and dealing with segmentation. We propose a Capsule network-based method to Decompose the unsupervised word Embedding of an ambiguous word into context specific Sense embedding, called CapsDecE2S. In this approach, the unsupervised ambiguous embedding is fed into a capsule network to produce its multiple morpheme-like vectors, which are defined as the basic semantic language units of meaning. With attention operations, CapsDecE2S integrates the word context to reconstruct the multiple morpheme-like vectors into the context-specific sense embedding. To train CapsDecE2S, we propose a sense matching training method. In this method, we convert the sense learning into a binary classification that explicitly learns the relation between senses by the label of matching and non-matching. CapsDecE2S was experimentally evaluated on two sense learning tasks, i.e., word in context and word sense disambiguation. Results on two public corpora, Word-in-Context and English all-words Word Sense Disambiguation, show that the CapsDecE2S model achieves the new state-of-the-art for the word in context and word sense disambiguation tasks.

Submitted 29 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

arXiv:2003.01383 [pdf]

Fully Convolutional Networks for Automatically Generating Image Masks to Train Mask R-CNN

Authors: Hao Wu, Jan Paul Siebert, Xiangrong Xu

Abstract: This paper proposes a novel method for automatically generating image masks for the state-of-the-art Mask R-CNN deep learning method. Mask R-CNN achieves the best results in object detection to date; however, obtaining the object masks for training is very time-consuming and laborious. The proposed method uses a two-stage design to generate image masks automatically: the first stage implements a fully convolutional network (FCN) based segmentation network, and the second stage is a Mask R-CNN based object detection network, which is trained on the object image masks from the FCN output, the original input image, and additional label information. Through experimentation, our proposed method can obtain the image masks automatically to train Mask R-CNN, and it can achieve very high classification accuracy, with over 90% mean average precision (mAP) for segmentation.

Submitted 20 May, 2021; v1 submitted 3 March, 2020; originally announced March 2020.

arXiv:1809.01633 [pdf, other]

Efficient Egocentric Visual Perception Combining Eye-tracking, a Software Retina and Deep Learning

Authors: Nina Hristozova, Piotr Ozimek, Jan Paul Siebert

Abstract: We present ongoing work to harness biological approaches to achieving highly efficient egocentric perception by combining the space-variant imaging architecture of the mammalian retina with Deep Learning methods. By pre-processing images collected by means of eye-tracking glasses to control the fixation locations of a software retina model, we demonstrate that we can reduce the input to a DCNN by a factor of 3, reduce the required number of training epochs and obtain over 98% classification rates when training and validating the system on a database of over 26,000 images of 9 object classes.

Submitted 5 September, 2018; originally announced September 2018.

Comments: Accepted for: EPIC Workshop at the European Conference on Computer Vision, ECCV2018

arXiv:1707.07157 [pdf, other]

Single-Shot Clothing Category Recognition in Free-Configurations with Application to Autonomous Clothes Sorting

Authors: Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, Rustam Stolkin, J. Paul Siebert

Abstract: This paper proposes a single-shot approach for recognising clothing categories from 2.5D features. We propose two visual features, BSP (B-Spline Patch) and TSD (Topology Spatial Distances) for this task. The local BSP features are encoded by LLC (Locality-constrained Linear Coding) and fused with three different global features. Our visual feature is robust to deformable shapes and our approach is able to recognise the category of unknown clothing in unconstrained and random configurations. We integrated the category recognition pipeline with a stereo vision system, clothing instance detection, and dual-arm manipulators to achieve an autonomous sorting system. To verify the performance of our proposed method, we build a high-resolution RGBD clothing dataset of 50 clothing items of 5 categories sampled in random configurations (a total of 2,100 clothing samples). Experimental results show that our approach is able to reach 83.2% accuracy while classifying clothing items which were previously unseen during training. This advances beyond the previous state-of-the-art by 36.2%. Finally, we evaluate the proposed approach in an autonomous robot sorting system, in which the robot recognises a clothing item from an unconstrained pile, grasps it, and sorts it into a box according to its category. Our proposed sorting system achieves reasonable sorting success rates with single-shot perception.

Submitted 22 July, 2017; originally announced July 2017.

Comments: 9 pages, accepted by IROS2017

arXiv:1610.05824 [pdf, other]

Robot Vision Architecture for Autonomous Clothes Manipulation

Authors: Li Sun, Gerardo Aragon-Camarasa, Simon Rogers, J. Paul Siebert

Abstract: This paper presents a novel robot vision architecture for perceiving generic 3D clothes configurations. Our architecture is hierarchically structured, starting from low-level curvatures, across mid-level geometric shapes & topology descriptions, and finally approaching high-level semantic surface structure descriptions. We demonstrate our robot vision architecture on a customised dual-arm industrial robot with our self-designed, off-the-shelf stereo vision system, carrying out autonomous grasping and dual-arm flattening. It is worth noting that the proposed dual-arm flattening approach is unique among state-of-the-art autonomous robot systems, which is the major contribution of this paper. The experimental results show that the proposed dual-arm flattening using a stereo vision system remarkably outperforms the single-arm flattening and the widely-cited Kinect-based sensing system for dexterous manipulation tasks. In addition, the proposed grasping approach achieves satisfactory performance on grasping various kinds of garments, verifying the capability of the proposed visual perception architecture to be adapted to more than one clothing manipulation task.

Submitted 18 October, 2016; originally announced October 2016.

Comments: 14 pages, under review

arXiv:1407.8004 [pdf, other]

An Investigation into the use of Images as Password Cues

Authors: Tony McBryan, Karen Renaud, J. Paul Siebert

Abstract: Computer users are generally authenticated by means of a password. Unfortunately, passwords are often forgotten and replacement is expensive and inconvenient. Some people write their passwords down but these records can easily be lost or stolen. The option we explore is to find a way to cue passwords securely. The specific cueing technique we report on in this paper employs images as cues. The idea is to elicit textual descriptions of the images, which can then be used as passwords. We have defined a set of metrics for the kind of image that could function effectively as a password cue. We identified five candidate image types and ran an experiment to identify the image class with the best performance in terms of the defined metrics. The first experiment identified inkblot-type images as being superior. We tested this image, called a cueblot, in a real-life environment. We allowed users to tailor their cueblot until they felt they could describe it, and they then entered a description of the cueblot as their password. The cueblot was displayed at each subsequent authentication attempt to cue the password. Unfortunately, we found that users did not exploit the cueing potential of the cueblot, and while there were a few differences between textual descriptions of cueblots and non-cued passwords, they were not compelling. Hence our attempts to alleviate the difficulties people experience with passwords, by giving them access to a tailored cue, did not have the desired effect.
We have to conclude that the password mechanism might well be unable to benefit from bolstering activities such as this one.

Submitted 9 August, 2014; v1 submitted 30 July, 2014; originally announced July 2014.

ACM Class: H.1.2

arXiv:1311.7295 [pdf, other]

Glasgow's Stereo Image Database of Garments

Authors: Gerardo Aragon-Camarasa, Susanne B. Oehler, Yuan Liu, Sun Li, Paul Cockshott, J. Paul Siebert

Abstract: To provide insight into cloth perception and manipulation with an active binocular robotic vision system, we compiled and released a database of 80 stereo-pair colour images with corresponding horizontal and vertical disparity maps and mask annotations for 3D garment point cloud rendering. The stereo-image garment database is part of research conducted under the EU-FP7 Clothes Perception and Manipulation (CloPeMa) project and belongs to a wider database collection released through CloPeMa (www.clopema.eu). This database is based on 16 different off-the-shelf garments. Each garment has been imaged in five different pose configurations on the project's binocular robot head. A full copy of the database is made available for scientific research only at https://sites.google.com/site/ugstereodatabase/.

Submitted 28 November, 2013; originally announced November 2013.

Comments: 7 pages, 6 figures, image database

arXiv:1210.5936 [pdf]

Modélisation multi-niveaux dans AA4MM

Authors: Benjamin Camus, Julien Siebert, Christine Bourjot, Vincent Chevrier

Abstract: In this article, we propose to represent a multi-level phenomenon as a set of interacting models. This perspective makes the levels of representation and their relationships explicit. To deal with coherence, causality and coordination issues between models, we rely on AA4MM, a metamodel dedicated to such a representation. We illustrate our proposal and demonstrate the value of our approach on a flocking phenomenon.

Submitted 22 October, 2012; originally announced October 2012.

Journal ref: Journées Francophones sur les Systèmes Multi-Agents (2012) 43-52

Showing 1–20 of 20 results for author: Siebert, J