Search | arXiv e-print repository

The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

Authors: Javier de la Rosa, Vladislav Mikhailov, Lemei Zhang, Freddy Wetjen, David Samuel, Peng Liu, Rolv-Arild Braaten, Petter Mæhlum, Magnus Breder Birkenes, Andrey Kutuzov, Tita Enstad, Hans Christian Farsethås, Svein Arne Brygfjeld, Jon Atle Gulla, Stephan Oepen, Erik Velldal, Wilfred Østgulen, Liljia Øvrelid, Aslak Sira Myhre

Abstract: The use of copyrighted materials in training language models raises critical legal and ethical questions. This paper presents a framework for and the results of empirically assessing the impact of publisher-controlled copyrighted corpora on the performance of generative large language models (LLMs) for Norwegian. When evaluated on a diverse set of tasks, we found that adding both books and newspap… ▽ More The use of copyrighted materials in training language models raises critical legal and ethical questions. This paper presents a framework for and the results of empirically assessing the impact of publisher-controlled copyrighted corpora on the performance of generative large language models (LLMs) for Norwegian. When evaluated on a diverse set of tasks, we found that adding both books and newspapers to the data mixture of LLMs tend to improve their performance, while the addition of fiction works seems to be detrimental. Our experiments could inform the creation of a compensation scheme for authors whose works contribute to AI development. △ Less

Submitted 24 January, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

Comments: 17 pages, 5 figures, 8 tables. Accepted at NoDaLiDa/Baltic-HLT 2025

arXiv:2402.01917 [pdf, ps, other]

Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges

Authors: Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Rolv-Arild Braaten, Per Erik Solberg

Abstract: This article introduces NB-Whisper, an adaptation of OpenAI's Whisper, specifically fine-tuned for Norwegian language Automatic Speech Recognition (ASR). We highlight its key contributions and summarise the results achieved in converting spoken Norwegian into written forms and translating other languages into Norwegian. We show that we are able to improve the Norwegian Bokmål transcription by Open… ▽ More This article introduces NB-Whisper, an adaptation of OpenAI's Whisper, specifically fine-tuned for Norwegian language Automatic Speech Recognition (ASR). We highlight its key contributions and summarise the results achieved in converting spoken Norwegian into written forms and translating other languages into Norwegian. We show that we are able to improve the Norwegian Bokmål transcription by OpenAI Whisper Large-v3 from a WER of 10.4 to 6.6 on the Fleurs Dataset and from 6.8 to 2.2 on the NST dataset. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2310.18110 [pdf, ps, other]

A Control-Bounded Quadrature Leapfrog ADC

Authors: Hampus Malmberg, Fredrik Feyling, Jose M. de la Rosa

Abstract: In this paper, the design flexibility of the control-bounded analog-to-digital converter principle is demonstrated. A band-pass analog-to-digital converter is considered as an application and case study. We show how a low-pass control-bounded analog-to-digital converter can be translated into a band-pass version where the guaranteed stability, converter bandwidth, and signal-to-noise ratio are pre… ▽ More In this paper, the design flexibility of the control-bounded analog-to-digital converter principle is demonstrated. A band-pass analog-to-digital converter is considered as an application and case study. We show how a low-pass control-bounded analog-to-digital converter can be translated into a band-pass version where the guaranteed stability, converter bandwidth, and signal-to-noise ratio are preserved while the center frequency for conversion can be positioned freely. The proposed converter is validated with behavioral simulations on several filter orders, center frequencies, and oversampling ratios. Additionally, we consider an op-amp circuit realization where the effects of first-order op-amp non-idealities are shown. Finally, robustness against component variations is demonstrated by Monte Carlo simulations. △ Less

Submitted 27 October, 2023; originally announced October 2023.

Comments: 13 pages and 16 figures

arXiv:2307.14784 [pdf]

doi 10.1002/adfm.202308819

New approach to designing functional materials for stealth technology: Radar experiment with bilayer absorbers and optimization of the reflection loss

Authors: Jaume Calvo de la Rosa, Aleix Bou Comas, Joan Manel Hernandez, Pilar Marin, Jose Maria Lopez-Villegas, Javier Tejada, Eugene M. Chudnovsky

Abstract: Microwave power absorption by a two-layer system deposited on a metallic surface has been studied in the experimental setup emulating the response to a radar signal. Layers containing hexaferrite and iron powder in a dried paint of thickness under 1mm have been used. The data is analyzed within a theoretical model derived for a bilayer system from the transmission line theory. A good agreement bet… ▽ More Microwave power absorption by a two-layer system deposited on a metallic surface has been studied in the experimental setup emulating the response to a radar signal. Layers containing hexaferrite and iron powder in a dried paint of thickness under 1mm have been used. The data is analyzed within a theoretical model derived for a bilayer system from the transmission line theory. A good agreement between experimental and theoretical results is found. The advantage of using a bilayer system over a single-layer system has been demonstrated. How the maximum microwave absorption (minimum reflection loss) can be achieved through the optimization of the filling factors and thicknesses of the two layers is shown. △ Less

Submitted 1 October, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

arXiv:2307.01672 [pdf, ps, other]

Boosting Norwegian Automatic Speech Recognition

Authors: Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld

Abstract: In this paper, we present several baselines for automatic speech recognition (ASR) models for the two official written languages in Norway: Bokmål and Nynorsk. We compare the performance of models of varying sizes and pre-training approaches on multiple Norwegian speech datasets. Additionally, we measure the performance of these models against previous state-of-the-art ASR models, as well as on ou… ▽ More In this paper, we present several baselines for automatic speech recognition (ASR) models for the two official written languages in Norway: Bokmål and Nynorsk. We compare the performance of models of varying sizes and pre-training approaches on multiple Norwegian speech datasets. Additionally, we measure the performance of these models against previous state-of-the-art ASR models, as well as on out-of-domain datasets. We improve the state of the art on the Norwegian Parliamentary Speech Corpus (NPSC) from a word error rate (WER) of 17.10\% to 7.60\%, with models achieving 5.81\% for Bokmål and 11.54\% for Nynorsk. We also discuss the challenges and potential solutions for further improving ASR models for Norwegian. △ Less

Submitted 4 July, 2023; originally announced July 2023.

Comments: 10 pages, 10 figures. Published as Proceedings NoDaLiDa 2023, pages 555--564

Journal ref: 2023. Boosting Norwegian Automatic Speech Recognition. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 555--564, Tórshavn, Faroe Islands. University of Tartu Library

arXiv:2307.01387 [pdf, other]

ALBERTI, a Multilingual Domain Specific Language Model for Poetry Analysis

Authors: Javier de la Rosa, Álvaro Pérez Pozo, Salvador Ros, Elena González-Blanco

Abstract: The computational analysis of poetry is limited by the scarcity of tools to automatically analyze and scan poems. In a multilingual settings, the problem is exacerbated as scansion and rhyme systems only exist for individual languages, making comparative studies very challenging and time consuming. In this work, we present \textsc{Alberti}, the first multilingual pre-trained large language model f… ▽ More The computational analysis of poetry is limited by the scarcity of tools to automatically analyze and scan poems. In a multilingual settings, the problem is exacerbated as scansion and rhyme systems only exist for individual languages, making comparative studies very challenging and time consuming. In this work, we present \textsc{Alberti}, the first multilingual pre-trained large language model for poetry. Through domain-specific pre-training (DSP), we further trained multilingual BERT on a corpus of over 12 million verses from 12 languages. We evaluated its performance on two structural poetry tasks: Spanish stanza type classification, and metrical pattern prediction for Spanish, English and German. In both cases, \textsc{Alberti} outperforms multilingual BERT and other transformers-based models of similar sizes, and even achieves state-of-the-art results for German when compared to rule-based systems, demonstrating the feasibility and effectiveness of DSP in the poetry domain. △ Less

Submitted 3 July, 2023; originally announced July 2023.

Comments: Accepted for publication at SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing

arXiv:2303.03915 [pdf, other]

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Authors: Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro Von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Šaško, Quentin Lhoest, Angelina McMillan-Major, Gerard Dupont, Stella Biderman, Anna Rogers, Loubna Ben allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa , et al. (29 additional authors not shown)

Abstract: As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of ethics, harm, and governance in the f… ▽ More As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of ethics, harm, and governance in the foreground. This paper documents the data creation and curation efforts undertaken by BigScience to assemble the Responsible Open-science Open-collaboration Text Sources (ROOTS) corpus, a 1.6TB dataset spanning 59 languages that was used to train the 176-billion-parameter BigScience Large Open-science Open-access Multilingual (BLOOM) language model. We further release a large initial subset of the corpus and analyses thereof, and hope to empower large-scale monolingual and multilingual modeling projects with both the data and the processing tools, as well as stimulate research around this large multilingual corpus. △ Less

Submitted 7 March, 2023; originally announced March 2023.

Comments: NeurIPS 2022, Datasets and Benchmarks Track

ACM Class: I.2.7

arXiv:2211.06745 [pdf, ps, other]

doi 10.1109/MWSCAS57524.2023.10406080

Quadrature Control-Bounded ADCs

Authors: Hampus Malmberg, Fredrik Feyling, Jose M de la Rosa

Abstract: In this paper, the design flexibility of the control-bounded analog-to-digital converter principle is demonstrated by considering band-pass analog-to-digital conversion. We show how a low-pass control-bounded analog-to-digital converter can be translated into a band-pass version where the guaranteed stability, converter bandwidth, and signal-to-noise ratio are preserved while the center frequency… ▽ More In this paper, the design flexibility of the control-bounded analog-to-digital converter principle is demonstrated by considering band-pass analog-to-digital conversion. We show how a low-pass control-bounded analog-to-digital converter can be translated into a band-pass version where the guaranteed stability, converter bandwidth, and signal-to-noise ratio are preserved while the center frequency for conversion can be positioned freely. The proposed converter is validated with behavioral simulations for a variety of filter orders, notch-filter frequencies, and oversampling ratios. Finally, robustness against component variations is demonstrated by Monte Carlo simulations. △ Less

Submitted 19 February, 2024; v1 submitted 12 November, 2022; originally announced November 2022.

Comments: 5 pages, 6 figures, submitted to ISCAS 2023

Journal ref: 2023 IEEE 66th International Midwest Symposium on Circuits and Systems (MWSCAS), Tempe, AZ, USA, 2023, pp. 380-384

arXiv:2211.05100 [pdf, other]

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License. △ Less

Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

arXiv:2207.06814 [pdf, other]

BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling

Authors: Javier de la Rosa, Eduardo G. Ponferrada, Paulo Villegas, Pablo Gonzalez de Prado Salas, Manu Romero, Marıa Grandury

Abstract: The pre-training of large language models usually requires massive amounts of resources, both in terms of computation and data. Frequently used web sources such as Common Crawl might contain enough noise to make this pre-training sub-optimal. In this work, we experiment with different sampling methods from the Spanish version of mC4, and present a novel data-centric technique which we name… ▽ More The pre-training of large language models usually requires massive amounts of resources, both in terms of computation and data. Frequently used web sources such as Common Crawl might contain enough noise to make this pre-training sub-optimal. In this work, we experiment with different sampling methods from the Spanish version of mC4, and present a novel data-centric technique which we name $\textit{perplexity sampling}$ that enables the pre-training of language models in roughly half the amount of steps and using one fifth of the data. The resulting models are comparable to the current state-of-the-art, and even achieve better results for certain tasks. Our work is proof of the versatility of Transformers, and paves the way for small teams to train their models on a limited budget. Our models are available at this $\href{https://huggingface.co/bertin-project}{URL}$. △ Less

Submitted 14 July, 2022; originally announced July 2022.

Comments: Published at Procesamiento del Lenguaje Natural

Journal ref: Procesamiento del Lenguaje Natural, 68 (2022): 13-23

arXiv:2204.05211 [pdf, other]

Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0

Authors: Francesco De Toni, Christopher Akiki, Javier de la Rosa, Clémentine Fourrier, Enrique Manjavacas, Stefan Schweter, Daniel van Strien

Abstract: In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution languages and time periods. Using a historical newspaper corpus in 3 languages as test-bed, we use prompts to extract possible named entities. Our results show that a naive approach for prompt-based zero-shot multilingual Named Entity Recognition… ▽ More In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution languages and time periods. Using a historical newspaper corpus in 3 languages as test-bed, we use prompts to extract possible named entities. Our results show that a naive approach for prompt-based zero-shot multilingual Named Entity Recognition is error-prone, but highlights the potential of such an approach for historical languages lacking labeled datasets. Moreover, we also find that T0-like models can be probed to predict the publication date and language of a document, which could be very relevant for the study of historical texts. △ Less

Submitted 11 April, 2022; originally announced April 2022.

arXiv:2109.11588 [pdf, ps, other]

Relationships between some selection principles and star selection principles

Authors: Javier Casas de la Rosa

Abstract: Motivated by the definition of classical star selection principles, Cruz-Castillo, Ramírez-Páramo and Tenorio defined some selection principles and posed several questions about relationships between these notions and some of the classical star selection principles. The main goal of this paper is to answer the questions posed by these authors. In addition, we show that some of these defined select… ▽ More Motivated by the definition of classical star selection principles, Cruz-Castillo, Ramírez-Páramo and Tenorio defined some selection principles and posed several questions about relationships between these notions and some of the classical star selection principles. The main goal of this paper is to answer the questions posed by these authors. In addition, we show that some of these defined selection principles are equivalent by taking collections of complements; also, some other results are provided involving collections of refinements. △ Less

Submitted 23 September, 2021; originally announced September 2021.

arXiv:2109.08607 [pdf, other]

The futility of STILTs for the classification of lexical borrowings in Spanish

Authors: Javier de la Rosa

Abstract: The first edition of the IberLEF 2021 shared task on automatic detection of borrowings (ADoBo) focused on detecting lexical borrowings that appeared in the Spanish press and that have recently been imported into the Spanish language. In this work, we tested supplementary training on intermediate labeled-data tasks (STILTs) from part of speech (POS), named entity recognition (NER), code-switching,… ▽ More The first edition of the IberLEF 2021 shared task on automatic detection of borrowings (ADoBo) focused on detecting lexical borrowings that appeared in the Spanish press and that have recently been imported into the Spanish language. In this work, we tested supplementary training on intermediate labeled-data tasks (STILTs) from part of speech (POS), named entity recognition (NER), code-switching, and language identification approaches to the classification of borrowings at the token level using existing pre-trained transformer-based language models. Our extensive experimental results suggest that STILTs do not provide any improvement over direct fine-tuning of multilingual models. However, multilingual models trained on small subsets of languages perform reasonably better than multilingual BERT but not as good as multilingual RoBERTa for the given dataset. △ Less

Submitted 17 September, 2021; originally announced September 2021.

Journal ref: ADoBo 2021 Shared Task IberLEFT@SEPLN, CEUR Workshop Proceedings (Vol. 2943, pp. 947-955)

arXiv:2104.09617 [pdf, other]

Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model

Authors: Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Svein Arne Brygfjeld

Abstract: In this work, we show the process of building a large-scale training set from digital and digitized collections at a national library. The resulting Bidirectional Encoder Representations from Transformers (BERT)-based language model for Norwegian outperforms multilingual BERT (mBERT) models in several token and sequence classification tasks for both Norwegian Bokmål and Norwegian Nynorsk. Our mode… ▽ More In this work, we show the process of building a large-scale training set from digital and digitized collections at a national library. The resulting Bidirectional Encoder Representations from Transformers (BERT)-based language model for Norwegian outperforms multilingual BERT (mBERT) models in several token and sequence classification tasks for both Norwegian Bokmål and Norwegian Nynorsk. Our model also improves the mBERT performance for other languages present in the corpus such as English, Swedish, and Danish. For languages not included in the corpus, the weights degrade moderately while keeping strong multilingual properties. Therefore, we show that building high-quality models within a memory institution using somewhat noisy optical character recognition (OCR) content is feasible, and we hope to pave the way for other memory institutions to follow. △ Less

Submitted 19 April, 2021; originally announced April 2021.

Comments: Accepted to NoDaLiDa 2021

arXiv:2103.02651 [pdf, other]

doi 10.1109/ISCAS45731.2020.9180811

Experimental Body-input Three-stage DC offset Calibration Scheme for Memristive Crossbar

Authors: Charanraj Mohan, L. A. Camuñas-Mesa, Elisa Vianello, Carlo Reita, José M. de la Rosa, Teresa Serrano-Gotarredona, Bernabé Linares-Barranco

Abstract: Reading several ReRAMs simultaneously in a neuromorphic circuit increases power consumption and limits scalability. Applying small inference read pulses is a vain attempt when offset voltages of the read-out circuit are decisively more. This paper presents an experimental validation of a three-stage calibration scheme to calibrate the DC offset voltage across the rows of the memristive crossbar. T… ▽ More Reading several ReRAMs simultaneously in a neuromorphic circuit increases power consumption and limits scalability. Applying small inference read pulses is a vain attempt when offset voltages of the read-out circuit are decisively more. This paper presents an experimental validation of a three-stage calibration scheme to calibrate the DC offset voltage across the rows of the memristive crossbar. The proposed method is based on biasing the body terminal of one of the differential pair MOSFETs of the buffer through a series of cascaded resistor banks arranged in three stages: coarse, fine and finer stages. The circuit is designed in a 130 nm CMOS technology, where the OxRAM-based binary memristors are built on top of it. A dedicated PCB and other auxiliary boards have been designed for testing the chip. Experimental results validate the presented approach, which is only limited by mismatch and electrical noise. △ Less

Submitted 3 March, 2021; originally announced March 2021.

Comments: 5 pages, 9 figures, conference paper published in ISCAS20

ACM Class: B.7

arXiv:2103.01271 [pdf, other]

doi 10.1109/ISCAS51556.2021.9401159

Implementation of binary stochastic STDP learning using chalcogenide-based memristive devices

Authors: C. Mohan, L. A. Camuñas-Mesa, J. M. de la Rosa, T. Serrano-Gotarredona, B. Linares-Barranco

Abstract: The emergence of nano-scale memristive devices encouraged many different research areas to exploit their use in multiple applications. One of the proposed applications was to implement synaptic connections in bio-inspired neuromorphic systems. Large-scale neuromorphic hardware platforms are being developed with increasing number of neurons and synapses, having a critical bottleneck in the online l… ▽ More The emergence of nano-scale memristive devices encouraged many different research areas to exploit their use in multiple applications. One of the proposed applications was to implement synaptic connections in bio-inspired neuromorphic systems. Large-scale neuromorphic hardware platforms are being developed with increasing number of neurons and synapses, having a critical bottleneck in the online learning capabilities. Spike-timing-dependent plasticity (STDP) is a widely used learning mechanism inspired by biology which updates the synaptic weight as a function of the temporal correlation between pre- and post-synaptic spikes. In this work, we demonstrate experimentally that binary stochastic STDP learning can be obtained from a memristor when the appropriate pulses are applied at both sides of the device. △ Less

Submitted 1 March, 2021; originally announced March 2021.

Journal ref: 2021 IEEE International Symposium on Circuits and Systems (ISCAS), 2021, pp. 1-5

arXiv:2011.09567 [pdf, ps, other]

Predicting metrical patterns in Spanish poetry with language models

Authors: Javier de la Rosa, Salvador Ros, Elena González-Blanco

Abstract: In this paper, we compare automated metrical pattern identification systems available for Spanish against extensive experiments done by fine-tuning language models trained on the same task. Despite being initially conceived as a model suitable for semantic tasks, our results suggest that BERT-based models retain enough structural information to perform reasonably well for Spanish scansion. In this paper, we compare automated metrical pattern identification systems available for Spanish against extensive experiments done by fine-tuning language models trained on the same task. Despite being initially conceived as a model suitable for semantic tasks, our results suggest that BERT-based models retain enough structural information to perform reasonably well for Spanish scansion. △ Less

Submitted 18 November, 2020; originally announced November 2020.

Comments: LXAI Workshop @ NeurIPS 2020

arXiv:1611.05360 [pdf]

The Life of Lazarillo de Tormes and of His Machine Learning Adversities

Authors: Javier de la Rosa, Juan-Luis Suárez

Abstract: Summit work of the Spanish Golden Age and forefather of the so-called picaresque novel, The Life of Lazarillo de Tormes and of His Fortunes and Adversities still remains an anonymous text. Although distinguished scholars have tried to attribute it to different authors based on a variety of criteria, a consensus has yet to be reached. The list of candidates is long and not all of them enjoy the sam… ▽ More Summit work of the Spanish Golden Age and forefather of the so-called picaresque novel, The Life of Lazarillo de Tormes and of His Fortunes and Adversities still remains an anonymous text. Although distinguished scholars have tried to attribute it to different authors based on a variety of criteria, a consensus has yet to be reached. The list of candidates is long and not all of them enjoy the same support within the scholarly community. Analyzing their works from a data-driven perspective and applying machine learning techniques for style and text fingerprinting, we shed light on the authorship of the Lazarillo. As in a state-of-the-art survey, we discuss the methods used and how they perform in our specific case. According to our methodology, the most likely author seems to be Juan Arce de Otálora, closely followed by Alfonso de Valdés. The method states that not certain attribution can be made with the given corpus. △ Less

Submitted 16 November, 2016; originally announced November 2016.

Comments: 66 pages, 11 figures

Journal ref: Lemir: Revista de Literatura Española Medieval y del Renacimiento, 20 (2016)

arXiv:1611.03896 [pdf, ps, other]

doi 10.3847/1538-4357/aa831d

The Supernovae Analysis Application (SNAP)

Authors: Amanda J. Bayless, Chris L. Fryer, Brandon Wiggins, Wesley Even, Ryan Wollaeger, Janie de la Rosa, Peter W. A. Roming, Lucy Frey, Patrick A. Young, Rob Thorpe, Luke Powell, Rachel Landers, Heather D. Persson, Rebecca Hay

Abstract: The SuperNovae Analysis aPplication (SNAP) is a new tool for the analysis of SN observations and validation of SN models. SNAP consists of an open source relational database with (a) observational light curve, (b) theoretical light curve, and (c) correlation table sets, statistical comparison software, and a web interface available to the community. The theoretical models are intended to span a gr… ▽ More The SuperNovae Analysis aPplication (SNAP) is a new tool for the analysis of SN observations and validation of SN models. SNAP consists of an open source relational database with (a) observational light curve, (b) theoretical light curve, and (c) correlation table sets, statistical comparison software, and a web interface available to the community. The theoretical models are intended to span a gridded range of parameter space. The goal is to have users to upload new SN models or new SN observations and run the comparison software to determine correlations via the web site. There are looming problems on the horizon that SNAP begins to solve. Namely, large surveys will discover thousands of SNe annually. Frequently, the parameter space of a new SN event is unbounded. SNAP will be a resource to constrain parameters and determine if an event needs follow-up without spending resources to create new light curve models from scratch. Secondly, there is not a rapidly available, systematic way to determine degeneracies between parameters or even what physics is needed to model a realistic SNe. The correlations made within the SNAP system begin to solve these problems. △ Less

Submitted 11 November, 2016; originally announced November 2016.

Comments: Submitted to AAS publishing, 22 pages, 8 figures

arXiv:1606.09025 [pdf, other]

doi 10.1051/0004-6361/201629968

SN 2015bh: NGC 2770's 4th supernova or a luminous blue variable on its way to a Wolf-Rayet star?

Authors: C. C. Thöne, A. de Ugarte Postigo, G. Leloudas, C. Gall, Z. Cano, K. Maeda, S. Schulze, S. Campana, K. Wiersema, J. Groh, J. de la Rosa, F. E. Bauer, D. Malesani, J. Maund, N. Morrell, Y. Beletsky

Abstract: Very massive stars in the final phases of their lives often show unpredictable outbursts that can mimic supernovae, so-called, "SN impostors", but the distinction is not always straigthforward. Here we present observations of a luminous blue variable (LBV) in NGC 2770 in outburst over more than 20 years that experienced a possible terminal explosion as type IIn SN in 2015, named SN 2015bh. This po… ▽ More Very massive stars in the final phases of their lives often show unpredictable outbursts that can mimic supernovae, so-called, "SN impostors", but the distinction is not always straigthforward. Here we present observations of a luminous blue variable (LBV) in NGC 2770 in outburst over more than 20 years that experienced a possible terminal explosion as type IIn SN in 2015, named SN 2015bh. This possible SN or "main event" was preceded by a precursor peaking $\sim$ 40 days before maximum. The total energy release of the main event is $\sim$1.8$\times$10$^{49}$ erg, which can be modeled by a $<$ 0.5 M$_\odot$ shell plunging into a dense CSM. All emission lines show a single narrow P-Cygni profile during the LBV phase and a double P-Cygni profile post maximum suggesting an association of this second component with the possible SN. Since 1994 the star has been redder than during a typical S-Dor like outburst. SN 2015bh lies within a spiral arm of NGC 2770 next to a number of small star-forming regions with a metallicity of $\sim$ 0.5 solar and a stellar population age of 7-10 Myr. SN 2015bh shares many similarities with SN 2009ip, which, together with other examples may form a new class of objects that exhibit outbursts a few decades prior to "hyper-eruption" or final core-collapse. If the star survives this event it is undoubtedly altered, and we suggest that these "zombie stars" may evolve from an LBV to a Wolf Rayet star over a very short timescale of only a few years. The final fate of these types of massive stars can only be determined with observations years after the possible SN. △ Less

Submitted 29 June, 2016; originally announced June 2016.

Comments: 29 pages, 20 figures

Journal ref: A&A 599, A129 (2017)

arXiv:1605.09660 [pdf, other]

doi 10.1117/12.2233069

FRIDA: diffraction-limited imaging and integral-field spectroscopy for the GTC

Authors: Alan M. Watson, José A. Acosta-Pulido, Luis C. Álvarez-Núñez, Vicente Bringas-Rico, Nicolás Cardiel, Salvador Cuevas, Oscar Chapa, José Javier Díaz García, Stephen S. Eikenberry, Carlos Espejo, Rubén A. Flores-Meza, Jorge Fuentes-Fernández, Jesús Gallego, José Leonardo Garcés Medina, Francisco Garzón López, Peter Hammersley, Carolina Keiman, Gerardo Lara, José Alberto López, Pablo L. López, Diana Lucero, Heidy Moreno Arce, Sergio Pascual Ramirez, Jesús Patrón Recio, Almudena Prieto , et al. (5 additional authors not shown)

Abstract: FRIDA is a diffraction-limited imager and integral-field spectrometer that is being built for the adaptive-optics focus of the Gran Telescopio Canarias. In imaging mode FRIDA will provide scales of 0.010, 0.020 and 0.040 arcsec/pixel and in IFS mode spectral resolutions of 1500, 4000 and 30,000. FRIDA is starting systems integration and is scheduled to complete fully integrated system tests at the… ▽ More FRIDA is a diffraction-limited imager and integral-field spectrometer that is being built for the adaptive-optics focus of the Gran Telescopio Canarias. In imaging mode FRIDA will provide scales of 0.010, 0.020 and 0.040 arcsec/pixel and in IFS mode spectral resolutions of 1500, 4000 and 30,000. FRIDA is starting systems integration and is scheduled to complete fully integrated system tests at the laboratory by the end of 2017 and to be delivered to GTC shortly thereafter. In this contribution we present a summary of its design, fabrication, current status and potential scientific applications. △ Less

Submitted 31 May, 2016; originally announced May 2016.

Comments: To appear in the proceedings of SPIE conference 9908 "Ground-based and Airborne Instrumentation for Astronomy VI". 8 pages

arXiv:1002.3582 [pdf]

doi 10.1155/2010/869810

T35: a small automatic telescope for long-term observing campaigns

Authors: Susana Martin-Ruiz, Francisco J. Aceituno, Miguel Abril, Luis P. Costillo, Antonio Garcia, Jose Luis de la Rosa, Isabel Bustamante, Juan Gutierrez-Soto, Hector Magan, Jose Luis Ramos, Marcos Ubierna

Abstract: The T35 is a small telescope (14") equipped with a large format CCD camera installed in the Sierra Nevada Observatory (SNO) in Southern Spain. This telescope will be a useful tool for the detecting and studying pulsating stars, particularly, in open clusters. In this paper, we describe the automation process of the T35 and show also some images taken with the new instrumentation. The T35 is a small telescope (14") equipped with a large format CCD camera installed in the Sierra Nevada Observatory (SNO) in Southern Spain. This telescope will be a useful tool for the detecting and studying pulsating stars, particularly, in open clusters. In this paper, we describe the automation process of the T35 and show also some images taken with the new instrumentation. △ Less

Submitted 18 February, 2010; originally announced February 2010.

Comments: 13 pages, 9 figures. Accepted for publication in the special issue "Robotic Astronomy" of Advances of Astronomy

arXiv:0807.4035 [pdf]

doi 10.1117/12.788192

UDP: an integral management system of embedded scripts implemented into the IMaX instrument of the Sunrise mission

Authors: R. Morales Munoz, P. Mellado, J. Marco de la Rosa, IMaX Team

Abstract: The UDP (User Defined Program) system is a scripting framework for controlling and extending instrumentation software. It has been specially designed for air- and space-borne instruments with flexibility, error control, reuse, automation, traceability and ease of development as its main objectives. All the system applications are connected through a database containing the valid script commands… ▽ More The UDP (User Defined Program) system is a scripting framework for controlling and extending instrumentation software. It has been specially designed for air- and space-borne instruments with flexibility, error control, reuse, automation, traceability and ease of development as its main objectives. All the system applications are connected through a database containing the valid script commands including descriptive information and source code. The system can be adapted to different projects without changes in the framework tools, thus achieving great level of flexibility and reusability. The UDP system comprises: an embedded system for the execution of scripts by the instrument software; automatic tools for aiding in the creation, modification, documentation and tracing of new scripting language commands; and interfaces for the creation of scripts and execution control. △ Less

Submitted 25 July, 2008; originally announced July 2008.

Comments: This paper has been presented in the SPIE 2008, Marselle, France

Journal ref: Proc.SPIEInt.Soc.Opt.Eng.7019:701916,2008

Showing 1–23 of 23 results for author: de la Rosa, J