-
Blar-SQL: Faster, Stronger, Smaller NL2SQL
Authors:
José Manuel Domínguez,
Benjamín Errázuriz,
Patricio Daher
Abstract:
Large Language Models (LLMs) have gained considerable notoriety in the field of natural language to SQL tasks (NL2SQL). In this study, we show how task decomposition can greatly benefit LLMs in database understanding and query generation in order to answer human questions with an SQL query.
We fined-tuned open source models, specifically Llama-2 and Code Llama, by combining 2 different models ea…
▽ More
Large Language Models (LLMs) have gained considerable notoriety in the field of natural language to SQL tasks (NL2SQL). In this study, we show how task decomposition can greatly benefit LLMs in database understanding and query generation in order to answer human questions with an SQL query.
We fined-tuned open source models, specifically Llama-2 and Code Llama, by combining 2 different models each designated to focus on one of two tasks in order to leverage each model's core competency to further increase the accuracy of the final SQL query.
We propose a new framework to divide the schema into chunks in order to fit more information into a limited context. Our results are comparable with those obtained by GPT-4 at the same time being 135 times smaller, 90 times faster and more than 100 times cheaper than GPT-4.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Visibility and Separability for a Declarative Linearizability Proof of the Timestamped Stack: Extended Version
Authors:
Jesús Domínguez,
Aleksandar Nanevski
Abstract:
Linearizability is a standard correctness criterion for concurrent algorithms, typically proved by establishing the algorithms' linearization points (LP). However, LPs often hinder abstraction, and for some algorithms such as the timestamped stack, it is unclear how to even identify their LPs. In this paper, we show how to develop declarative proofs of linearizability by foregoing LPs and instead…
▽ More
Linearizability is a standard correctness criterion for concurrent algorithms, typically proved by establishing the algorithms' linearization points (LP). However, LPs often hinder abstraction, and for some algorithms such as the timestamped stack, it is unclear how to even identify their LPs. In this paper, we show how to develop declarative proofs of linearizability by foregoing LPs and instead employing axiomatization of so-called visibility relations. While visibility relations have been considered before for the timestamped stack, our study is the first to show how to derive the axiomatization systematically and intuitively from the sequential specification of the stack. In addition to the visibility relation, a novel separability relation emerges to generalize real-time precedence of procedure invocation. The visibility and separability relations have natural definitions for the timestamped stack, and enable a novel proof that reduces the algorithm to a simplified form where the timestamps are generated atomically.
△ Less
Submitted 7 August, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Declarative Linearizability Proofs for Descriptor-Based Concurrent Helping Algorithms
Authors:
Jesús Domínguez,
Aleksandar Nanevski
Abstract:
Linearizability is a standard correctness criterion for concurrent algorithms, typically proved by establishing the algorithms' linearization points. However, relying on linearization points leads to proofs that are implementation-dependent, and thus hinder abstraction and reuse. In this paper we show that one can develop more declarative proofs by foregoing linearization points and instead relyin…
▽ More
Linearizability is a standard correctness criterion for concurrent algorithms, typically proved by establishing the algorithms' linearization points. However, relying on linearization points leads to proofs that are implementation-dependent, and thus hinder abstraction and reuse. In this paper we show that one can develop more declarative proofs by foregoing linearization points and instead relying on a technique of axiomatization of visibility relations. While visibility relations have been considered before, ours is the first study where the challenge is to formalize the helping nature of the algorithms. In particular, we show that by axiomatizing the properties of separation between events that contain bunches of help requests, we can extract what is common for high-level understanding of several descriptor-based helping algorithms of Harris et al. (RDCSS, MCAS, and optimizations), and produce novel proofs of their linearizability that share significant components.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Quality Characteristics of a Software Platform for Human-AI Teaming in Smart Manufacturing
Authors:
Philipp Haindl,
Thomas Hoch,
Javier Dominguez,
Julen Aperribai,
Nazim Kemal Ure,
Mehmet Tunçel
Abstract:
As AI-enabled software systems become more prevalent in smart manufacturing, their role shifts from a reactive to a proactive one that provides context-specific support to machine operators. In the context of an international research project, we develop an AI-based software platform that shall facilitate the collaboration between human operators and manufacturing machines. We conducted 14 structu…
▽ More
As AI-enabled software systems become more prevalent in smart manufacturing, their role shifts from a reactive to a proactive one that provides context-specific support to machine operators. In the context of an international research project, we develop an AI-based software platform that shall facilitate the collaboration between human operators and manufacturing machines. We conducted 14 structured interviews with stakeholders of the prospective software platform in order to determine the individual relevance of selected quality characteristics for human-AI teaming in smart manufacturing. These characteristics include the ISO 25010:2011 standard for software quality and AI-specific quality characteristics such as trustworthiness, explicability, and auditability. The interviewees rated trustworthiness, functional suitability, reliability, and security as the most important quality characteristics for this context, and portability, compatibility, and maintainability as the least important. Also, we observed agreement regarding the relevance of the quality characteristics among interviewees having the same role. On the other hand, the relevance of each quality characteristics varied depending on the concrete use case of the prospective software platform. The interviewees also were asked about the key success factors related to human-AI teaming in smart manufacturing. They identified improving the production cycle, increasing operator efficiency, reducing scrap, and reducing ergonomic risks as key success criteria. In this paper, we also discuss metrics for measuring the fulfillment of these quality characteristics, which we intend to operationalize and monitor during operation of the prospective software platform.
△ Less
Submitted 31 May, 2022;
originally announced May 2022.
-
Distributing Deep Learning Hyperparameter Tuning for 3D Medical Image Segmentation
Authors:
Josep Lluis Berral,
Oriol Aranda,
Juan Luis Dominguez,
Jordi Torres
Abstract:
Most research on novel techniques for 3D Medical Image Segmentation (MIS) is currently done using Deep Learning with GPU accelerators. The principal challenge of such technique is that a single input can easily cope computing resources, and require prohibitive amounts of time to be processed. Distribution of deep learning and scalability over computing devices is an actual need for progressing on…
▽ More
Most research on novel techniques for 3D Medical Image Segmentation (MIS) is currently done using Deep Learning with GPU accelerators. The principal challenge of such technique is that a single input can easily cope computing resources, and require prohibitive amounts of time to be processed. Distribution of deep learning and scalability over computing devices is an actual need for progressing on such research field. Conventional distribution of neural networks consist in data parallelism, where data is scattered over resources (e.g., GPUs) to parallelize the training of the model. However, experiment parallelism is also an option, where different training processes are parallelized across resources. While the first option is much more common on 3D image segmentation, the second provides a pipeline design with less dependence among parallelized processes, allowing overhead reduction and more potential scalability. In this work we present a design for distributed deep learning training pipelines, focusing on multi-node and multi-GPU environments, where the two different distribution approaches are deployed and benchmarked. We take as proof of concept the 3D U-Net architecture, using the MSD Brain Tumor Segmentation dataset, a state-of-art problem in medical image segmentation with high computing and space requirements. Using the BSC MareNostrum supercomputer as benchmarking environment, we use TensorFlow and Ray as neural network training and experiment distribution platforms. We evaluate the experiment speed-up, showing the potential for scaling out on GPUs and nodes. Also comparing the different parallelism techniques, showing how experiment distribution leverages better such resources through scaling. Finally, we provide the implementation of the design open to the community, and the non-trivial steps and methodology for adapting and deploying a MIS case as the here presented.
△ Less
Submitted 29 October, 2021;
originally announced October 2021.
-
Improving accuracy and speeding up Document Image Classification through parallel systems
Authors:
Javier Ferrando,
Juan Luis Dominguez,
Jordi Torres,
Raul Garcia,
David Garcia,
Daniel Garrido,
Jordi Cortada,
Mateo Valero
Abstract:
This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domai…
▽ More
This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domain dataset such as Tobacco3482. Moreover, we present an ensemble pipeline which is able to boost solely image input by combining image model predictions with the ones generated by BERT model on extracted text by OCR. We also show that the batch size can be effectively increased without hindering its accuracy so that the training process can be sped up by parallelizing throughout multiple GPUs, decreasing the computational time needed. Lastly, we expose the training performance differences between PyTorch and Tensorflow Deep Learning frameworks.
△ Less
Submitted 16 June, 2020;
originally announced June 2020.
-
Relationship between the visibility of political leaders during campaign and the outcome in general elections. A case study for Spain
Authors:
J. Estevez,
JJ. Dominguez,
M. Graña
Abstract:
In this article, the authors find the evidence that media coverage consisting of 13 online newspapers enhanced the electoral results of right wing party in Spain (Vox) during general elections in November 2019. We consider the political parties and leaders mentions in these media during the electoral campaign from 1st to 10th November 2019, and only visibility or prominence dimension is necessary…
▽ More
In this article, the authors find the evidence that media coverage consisting of 13 online newspapers enhanced the electoral results of right wing party in Spain (Vox) during general elections in November 2019. We consider the political parties and leaders mentions in these media during the electoral campaign from 1st to 10th November 2019, and only visibility or prominence dimension is necessary for the evidence.
△ Less
Submitted 18 February, 2020;
originally announced February 2020.
-
From nominal to higher-order rewriting and back again
Authors:
Jesús Domínguez,
Maribel Fernández
Abstract:
We present a translation function from nominal rewriting systems (NRSs) to combinatory reduction systems (CRSs), transforming closed nominal rules and ground nominal terms to CRSs rules and terms, respectively, while preserving the rewriting relation. We also provide a reduction-preserving translation in the other direction, from CRSs to NRSs, improving over a previously defined translation. These…
▽ More
We present a translation function from nominal rewriting systems (NRSs) to combinatory reduction systems (CRSs), transforming closed nominal rules and ground nominal terms to CRSs rules and terms, respectively, while preserving the rewriting relation. We also provide a reduction-preserving translation in the other direction, from CRSs to NRSs, improving over a previously defined translation. These tools, together with existing translations between CRSs and other higher-order rewriting formalisms, open up the path for a transfer of results between higher-order and nominal rewriting. In particular, techniques and properties of the rewriting relation, such as termination, can be exported from one formalism to the other.
△ Less
Submitted 11 December, 2015; v1 submitted 17 September, 2015;
originally announced September 2015.
-
Optimization strategies for parallel CPU and GPU implementations of a meshfree particle method
Authors:
Jose M. Domínguez,
Alejandro J. C. Crespo,
Moncho Gómez-Gesteira
Abstract:
Much of the current focus in high performance computing (HPC) for computational fluid dynamics (CFD) deals with grid based methods. However, parallel implementations for new meshfree particle methods such as Smoothed Particle Hydrodynamics (SPH) are less studied. In this work, we present optimizations for both central processing unit (CPU) and graphics processing unit (GPU) of a SPH method. These…
▽ More
Much of the current focus in high performance computing (HPC) for computational fluid dynamics (CFD) deals with grid based methods. However, parallel implementations for new meshfree particle methods such as Smoothed Particle Hydrodynamics (SPH) are less studied. In this work, we present optimizations for both central processing unit (CPU) and graphics processing unit (GPU) of a SPH method. These optimization strategies can be further applied to many other meshfree methods. The obtained performance for each architecture and a comparison between the most efficient implementations for CPU and GPU are shown.
△ Less
Submitted 18 November, 2011; v1 submitted 17 October, 2011;
originally announced October 2011.
-
Ethane: A Heterogeneous Parallel Search Algorithm for Heterogeneous Platforms
Authors:
Julián Domínguez,
Enrique Alba
Abstract:
In this paper we present Ethane, a parallel search algorithm specifically designed for its execution on heterogeneous hardware environments. With Ethane we propose an algorithm inspired in the structure of the chemical compound of the same name, implementing a heterogeneous island model based in the structure of its chemical bonds. We also propose a schema for describing a family of parallel heter…
▽ More
In this paper we present Ethane, a parallel search algorithm specifically designed for its execution on heterogeneous hardware environments. With Ethane we propose an algorithm inspired in the structure of the chemical compound of the same name, implementing a heterogeneous island model based in the structure of its chemical bonds. We also propose a schema for describing a family of parallel heterogeneous metaheuristics inspired by the structure of hydrocarbons in Nature, HydroCM (HydroCarbon inspired Metaheuristics), establishing a resem- blance between atoms and computers, and between chemical bonds and communication links. Our goal is to gracefully match computers of different power to algorithms of different behavior (GA and SA in this study), all them collaborating to solve the same problem. The analysis will show that Ethane, though simple, can solve search problems in a faster and more robust way than well-known panmitic and distributed algorithms very popular in the literature.
△ Less
Submitted 31 May, 2011; v1 submitted 30 May, 2011;
originally announced May 2011.