Showing 1–36 of 36 results for author: Jorge, A

Searching in archive cs.
  1. arXiv:2502.14394

    cs.CL

    Enhancing Portuguese Variety Identification with Cross-Domain Approaches

    Authors: Hugo Sousa, Rúben Almeida, Purificação Silvano, Inês Cantante, Ricardo Campos, Alípio Jorge

    Abstract: Recent advances in natural language processing have raised expectations for generative models to produce coherent text across diverse language varieties. In the particular case of the Portuguese language, the predominance of Brazilian Portuguese corpora online introduces linguistic biases in these models, limiting their applicability outside of Brazil. To address this gap and promote the creation…

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: AAAI 2025

  2. arXiv:2502.14385

    cs.CL

    Tradutor: Building a Variety Specific Translation Model

    Authors: Hugo Sousa, Satya Almasian, Ricardo Campos, Alípio Jorge

    Abstract: Language models have become foundational to many widely used systems. However, these seemingly advantageous models are double-edged swords. While they excel in tasks related to resource-rich languages like English, they often lose the fine nuances of language forms, dialects, and varieties that are inherent to languages spoken in multiple regions of the world. Languages like European Portuguese ar…

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: AAAI 2025

  3. arXiv:2408.16932

    cs.CL cs.AI

    Event Extraction for Portuguese: A QA-driven Approach using ACE-2005

    Authors: Luís Filipe Cunha, Ricardo Campos, Alípio Jorge

    Abstract: Event extraction is an Information Retrieval task that commonly consists of identifying the central word for the event (trigger) and the event's arguments. This task has been extensively studied for English but lags behind for Portuguese, partly due to the lack of task-specific annotated corpora. This paper proposes a framework in which two separated BERT-based models were fine-tuned to identify a…

    Submitted 29 August, 2024; originally announced August 2024.

    Journal ref: Progress in Artificial Intelligence. EPIA 2023. Lecture Notes in Computer Science, vol 14115. Springer, Cham

  4. ACE-2005-PT: Corpus for Event Extraction in Portuguese

    Authors: Luís Filipe Cunha, Purificação Silvano, Ricardo Campos, Alípio Jorge

    Abstract: Event extraction is an NLP task that commonly involves identifying the central word (trigger) for an event and its associated arguments in text. ACE-2005 is widely recognised as the standard corpus in this field. While other corpora, like PropBank, primarily focus on annotating predicate-argument structure, ACE-2005 provides comprehensive information about the overall event structure and semantics…

    Submitted 29 August, 2024; originally announced August 2024.

    Journal ref: SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (2024)

  5. TEI2GO: A Multilingual Approach for Fast Temporal Expression Identification

    Authors: Hugo Sousa, Ricardo Campos, Alípio Jorge

    Abstract: Temporal expression identification is crucial for understanding texts written in natural language. Although highly effective systems such as HeidelTime exist, their limited runtime performance hampers adoption in large-scale applications and production environments. In this paper, we introduce the TEI2GO models, matching HeidelTime's effectiveness but with significantly improved runtime, supportin…

    Submitted 25 March, 2024; originally announced March 2024.

  6. arXiv:2401.15400

    cs.CL cs.IR

    Indexing Portuguese NLP Resources with PT-Pump-Up

    Authors: Rúben Almeida, Ricardo Campos, Alípio Jorge, Sérgio Nunes

    Abstract: The recent advances in natural language processing (NLP) are linked to training processes that require vast amounts of corpora. Access to this data is commonly not a trivial process due to resource dispersion and the need to maintain these infrastructures online and up-to-date. New developments in NLP are often compromised due to the scarcity of data or lack of a shared repository that works as an…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Demo Track, 3 pages

    MSC Class: 68P20

    ACM Class: I.7.1

    Journal ref: PROPOR 2024

  7. Physio: An LLM-Based Physiotherapy Advisor

    Authors: Rúben Almeida, Hugo Sousa, Luís F. Cunha, Nuno Guimarães, Ricardo Campos, Alípio Jorge

    Abstract: The capabilities of the most recent language models have increased the interest in integrating them into real-world applications. However, the fact that these models generate plausible, yet incorrect text poses a constraint when considering their use in several domains. Healthcare is a prime example of a domain where text-generative trustworthiness is a hard requirement to safeguard patient well-b…

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: Demo, ECIR 2024, 3rd Sword AI challenge 2023

    MSC Class: 68T07

    ACM Class: I.2; J.3

    Journal ref: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, vol 14612. Springer, Cham

  8. arXiv:2312.16363

    cs.CG

    Polygon Detection from a Set of Lines

    Authors: Alfredo Ferreira Jr., Manuel J. Fonseca, Joaquim A. Jorge

    Abstract: Detecting polygons defined by a set of line segments in a plane is an important step in analyzing vector drawings. This paper presents an approach combining several algorithms to detect basic polygons from arbitrary line segments. The resulting algorithm runs in polynomial time and space, with complexities of $O\bigl((N + M)^4\bigr)$ and $O\bigl((N + M)^2\bigr)$, where $N$ is the number of line se…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, 1 table

  9. GPT Struct Me: Probing GPT Models on Narrative Entity Extraction

    Authors: Hugo Sousa, Nuno Guimarães, Alípio Jorge, Ricardo Campos

    Abstract: The importance of systems that can extract structured information from textual data becomes increasingly pronounced given the ever-increasing volume of text produced on a daily basis. Having a system that can effectively extract such information in an interoperable manner would be an asset for several domains, be it finance, health, or legal. Recent developments in natural language processing led…

    Submitted 24 November, 2023; originally announced November 2023.

  10. A Biomedical Entity Extraction Pipeline for Oncology Health Records in Portuguese

    Authors: Hugo Sousa, Arian Pasquali, Alípio Jorge, Catarina Sousa Santos, Mário Amorim Lopes

    Abstract: Textual health records of cancer patients are usually protracted and highly unstructured, making it very time-consuming for health professionals to get a complete overview of the patient's therapeutic course. As such limitations can lead to suboptimal and/or inefficient treatment procedures, healthcare providers would greatly benefit from a system that effectively summarizes the information of tho…

    Submitted 18 April, 2023; originally announced April 2023.

  11. tieval: An Evaluation Framework for Temporal Information Extraction Systems

    Authors: Hugo Sousa, Alípio Jorge, Ricardo Campos

    Abstract: Temporal information extraction (TIE) has attracted a great deal of interest over the last two decades, leading to the development of a significant number of datasets. Despite its benefits, having access to a large volume of corpora makes it difficult when it comes to benchmark TIE systems. On the one hand, different datasets have different annotation schemes, thus hindering the comparison between…

    Submitted 24 November, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: 10 pages

  12. arXiv:2210.06376

    cs.CL cs.AI

    Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary

    Authors: Daniel Loureiro, Alípio Mário Jorge

    Abstract: Progress on commonsense reasoning is usually measured from performance improvements on Question Answering tasks designed to require commonsense knowledge. However, fine-tuning large Language Models (LMs) on these specific tasks does not directly evaluate commonsense learned during pre-training. The most direct assessments of commonsense knowledge in pre-trained LMs are arguably cloze-style tasks t…

    Submitted 12 October, 2022; originally announced October 2022.

  13. arXiv:2206.11339

    cs.SI

    Precipitation event-based networks: an analysis of the relations between network metrics and meteorological properties

    Authors: Aurelienne A. S. Jorge, Douglas Uba, Alex A. Fernandes, Izabelly C. Costa, Leonardo B. L. Santos

    Abstract: The study of complex systems in nature is essential to understand the interactions between different elements and how they influence one another. Complex network theory is a powerful tool that helps us to analyze these interactions and gain insights into the behavior of such systems. Surprisingly, this theory has been underutilized in the field of weather science, which focuses on the immediate st…

    Submitted 8 May, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  14. arXiv:2201.05503

    cs.SI physics.soc-ph

    Global-threshold and backbone high-resolution weather radar networks are significantly complementary in a watershed

    Authors: Aurelienne A. S. Jorge, Iuri da Silva Diniz, Vander L. S. Freitas, Izabelly C. Costa, Leonardo B. L. Santos

    Abstract: There are several criteria for building up networks from time series related to different points in geographical space. The most used criterion is the Global-Threshold (GT). Using a weather radar dataset, this paper shows that the Backbone (BB) - a local-threshold criterion - generates networks whose geographical configuration is complementary to the GT networks. We compare the results for two wel…

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 7 pages, 6 figures. To be submitted to Computers and Geosciences (Elsevier)

  15. arXiv:2201.05156   

    cs.IR cs.AI cs.LG

    Proceedings of the 4th Workshop on Online Recommender Systems and User Modeling -- ORSUM 2021

    Authors: João Vinagre, Alípio Mário Jorge, Marie Al-Ghossein, Albert Bifet

    Abstract: Modern online services continuously generate data at very fast rates. This continuous flow of data encompasses content - e.g., posts, news, products, comments -, but also user feedback - e.g., ratings, views, reads, clicks -, together with context data - user device, spatial or temporal data, user task or activity, weather. This can be overwhelming for systems and algorithms designed to train in b…

    Submitted 17 January, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

  16. The CirCor DigiScope Dataset: From Murmur Detection to Murmur Classification

    Authors: Jorge Oliveira, Francesco Renna, Paulo Dias Costa, Marcelo Nogueira, Cristina Oliveira, Carlos Ferreira, Alipio Jorge, Sandra Mattos, Thamine Hatem, Thiago Tavares, Andoni Elola, Ali Bahrami Rad, Reza Sameni, Gari D Clifford, Miguel T. Coimbra

    Abstract: Cardiac auscultation is one of the most cost-effective techniques used to detect and identify many heart conditions. Computer-assisted decision systems based on auscultation can support physicians in their decisions. Unfortunately, the application of such systems in clinical trials is still minimal since most of them only aim to detect the presence of extra or abnormal waves in the phonocardiogram…

    Submitted 24 December, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 12 pages, 6 tables, 8 figures, in IEEE Journal of Biomedical and Health Informatics

  17. LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond

    Authors: Daniel Loureiro, Alípio Mário Jorge, Jose Camacho-Collados

    Abstract: Distributional semantics based on neural approaches is a cornerstone of Natural Language Processing, with surprising connections to human meaning representation as well. Recent Transformer-based Language Models have proven capable of producing contextual word representations that reliably convey sense-specific information, simply as a product of self-supervision. Prior work has shown that these co…

    Submitted 1 April, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted to Artificial Intelligence Journal (AIJ)

    Journal ref: Artificial Intelligence Volume 305, April 2022, 103661

  18. Towards augmented reality for corporate training

    Authors: Bruno R. Martins, Joaquim A. Jorge, Ezequiel R. Zorzal

    Abstract: Corporate training relates to employees acquiring essential skills to operate equipment or effectively performing required tasks both competently and safely. Unlike formal education, training can be incorporated into the task workflow and performed during working hours. Increasingly, organizations adopt different technologies to develop both individual skills and improve their organization. Studie…

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: This paper is published in the Journal of Interactive Learning Environments (Routledge) 2021

    Journal ref: Interactive Learning Environments 0 (2021) 1-19

  19. A Review on Deep Learning in UAV Remote Sensing

    Authors: Lucas Prado Osco, José Marcato Junior, Ana Paula Marques Ramos, Lúcio André de Castro Jorge, Sarah Narges Fatholahi, Jonathan de Andrade Silva, Edson Takashi Matsubara, Hemerson Pistori, Wesley Nunes Gonçalves, Jonathan Li

    Abstract: Deep Neural Networks (DNNs) learn representation from data with an impressive capability, and brought important breakthroughs for processing images, time-series, natural language, audio, video, and many others. In the remote sensing field, surveys and literature revisions specifically involving DNNs algorithms' applications have been conducted in an attempt to summarize the amount of information p…

    Submitted 20 August, 2023; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: 27 pages, 10 figures

    Journal ref: International Journal of Applied Earth Observation and Geoinformation, 2022

  20. Improving Portuguese Semantic Role Labeling with Transformers and Transfer Learning

    Authors: Sofia Oliveira, Daniel Loureiro, Alípio Jorge

    Abstract: The Natural Language Processing task of determining "Who did what to whom" is called Semantic Role Labeling. For English, recent methods based on Transformer models have allowed for major improvements in this task over the previous state of the art. However, for low resource languages, like Portuguese, currently available semantic role labeling models are hindered by scarce training data. In this…

    Submitted 30 October, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: 30 pages, 3 figures; Fixed broken links in References

    Journal ref: 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA), 2021, pp. 1-9

  21. A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows from UAV Imagery

    Authors: Lucas Prado Osco, Mauro dos Santos de Arruda, Diogo Nunes Gonçalves, Alexandre Dias, Juliana Batistoti, Mauricio de Souza, Felipe David Georges Gomes, Ana Paula Marques Ramos, Lúcio André de Castro Jorge, Veraldo Liesenberg, Jonathan Li, Lingfei Ma, José Marcato Junior, Wesley Nunes Gonçalves

    Abstract: In this paper, we propose a novel deep learning method based on a Convolutional Neural Network (CNN) that simultaneously detects and geolocates plantation-rows while counting its plants considering highly-dense plantation configurations. The experimental setup was evaluated in a cornfield with different growth stages and in a Citrus orchard. Both datasets characterize different plant density scena…

    Submitted 14 February, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: 27 pages, 12 figures, 9 tables

    ACM Class: J.2

    Journal ref: ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING 174 (2021) 1-17

  22. arXiv:2005.06748

    cs.IR cs.CY

    ECIR 2020 Workshops: Assessing the Impact of Going Online

    Authors: Sérgio Nunes, Suzanne Little, Sumit Bhatia, Ludovico Boratto, Guillaume Cabanac, Ricardo Campos, Francisco M. Couto, Stefano Faralli, Ingo Frommholz, Adam Jatowt, Alípio Jorge, Mirko Marras, Philipp Mayr, Giovanni Stilo

    Abstract: ECIR 2020 https://ecir2020.org/ was one of the many conferences affected by the COVID-19 pandemic. The Conference Chairs decided to keep the initially planned dates (April 14-17, 2020) and move to a fully online event. In this report, we describe the experience of organizing the ECIR 2020 Workshops in this scenario from two perspectives: the workshop organizers and the workshop participants. We pr…

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 10 pages, 3 figures, submitted to ACM SIGIR Forum

  23. arXiv:1909.06264

    eess.IV cs.CV cs.LG

    A superpixel-driven deep learning approach for the analysis of dermatological wounds

    Authors: Gustavo Blanco, Agma J. M. Traina, Caetano Traina Jr., Paulo M. Azevedo-Marques, Ana E. S. Jorge, Daniel de Oliveira, Marcos V. N. Bedo

    Abstract: Background. The image-based identification of distinct tissues within dermatological wounds enhances patients' care since it requires no intrusive evaluations. This manuscript presents an approach, we named QTDU, that combines deep learning models with superpixel-driven segmentation methods for assessing the quality of tissues from dermatological ulcers. Method. QTDU consists of a three-stage pi…

    Submitted 20 September, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

  24. arXiv:1906.10007

    cs.CL

    Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

    Authors: Daniel Loureiro, Alipio Jorge

    Abstract: Contextual embeddings represent a new generation of semantic representations learned from Neural Language Modelling (NLM) that addresses the issue of meaning conflation hampering traditional word embeddings. In this work, we show that contextual embeddings can be used to achieve unprecedented gains in Word Sense Disambiguation (WSD) tasks. Our approach focuses on creating sense-level embeddings wi…

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Accepted to ACL 2019. Code and data: https://github.com/danlou/lmms

  25. arXiv:1906.10002

    cs.CL cs.AI

    LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)

    Authors: Daniel Loureiro, Alipio Jorge

    Abstract: This paper describes the LIAAD system that was ranked second place in the Word-in-Context challenge (WiC) featured in SemDeep-5. Our solution is based on a novel system for Word Sense Disambiguation (WSD) using contextual embeddings and full-inventory sense embeddings. We adapt this WSD system, in a straightforward manner, for the present task of detecting whether the same sense occurs in a pair o…

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Accepted at the SemDeep-5 Workshop in IJCAI 2019. Code and data: https://github.com/danlou/LMMS

  26. Preference rules for label ranking: Mining patterns in multi-target relations

    Authors: Cláudio Rebelo de Sá, Paulo Azevedo, Carlos Soares, Alípio Mário Jorge, Arno Knobbe

    Abstract: In this paper we investigate two variants of association rules for preference data, Label Ranking Association Rules and Pairwise Association Rules. Label Ranking Association Rules (LRAR) are the equivalent of Class Association Rules (CAR) for the Label Ranking task. In CAR, the consequent is a single class, to which the example is expected to belong to. In LRAR, the consequent is a ranking of the…

    Submitted 20 March, 2019; originally announced March 2019.

    Journal ref: Information Fusion, Volume 40, March 2018, Pages 112-125

  27. arXiv:1809.00589

    cs.CL

    Affordance Extraction and Inference based on Semantic Role Labeling

    Authors: Daniel Loureiro, Alípio Mário Jorge

    Abstract: Common-sense reasoning is becoming increasingly important for the advancement of Natural Language Processing. While word embeddings have been very successful, they cannot explain which aspects of 'coffee' and 'tea' make them similar, or how they could be related to 'shop'. In this paper, we propose an explicit word representation that builds upon the Distributional Hypothesis to represent meaning…

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted at FEVER - EMNLP 2018

  28. arXiv:1711.05879

    cs.OH

    (geo)graphs - Complex Networks as a shapefile of nodes and a shapefile of edges for different applications

    Authors: Leonardo B L Santos, Aurelienne A S Jorge, Marcio Rossato, Jessica D Santos, Onofre A Candido, Wilson Seron, Charles N de Santana

    Abstract: Spatial dependency and spatial embedding are basic physical properties of many phenomena modeled by networks. The most indicated computational environment to deal with spatial information is to use Georeferenced Information System (GIS) and Geographical Database Management Systems (GDBMS). Several models have been proposed in this direction, however there is a gap in the literature in generic fram…

    Submitted 15 November, 2017; originally announced November 2017.

  29. arXiv:1705.06345

    cs.OH

    An Overview of Data Mining Applications in Oil and Gas Exploration: Structural Geology and Reservoir Property-Issues

    Authors: Hamed Nikhalat Jahromi, Alpio M. Jorge

    Abstract: Low oil prices have motivated energy executives to look into cost reduction in their supply chains more seriously. To this end, a new technology that is experimentally considered in hydrocarbon exploration is data mining. There are two major categories of geoscientific problems in which data mining is applied: structural geology and reservoir property-issues. This research overviews these categori…

    Submitted 12 May, 2017; originally announced May 2017.

    Comments: Part of DM4OG 2017 proceedings (arXiv:1705.03451)

  30. arXiv:1705.03669

    cs.AI stat.ML

    Mind the Gap: A Well Log Data Analysis

    Authors: Rui L. Lopes, Alípio Jorge

    Abstract: The main task in oil and gas exploration is to gain an understanding of the distribution and nature of rocks and fluids in the subsurface. Well logs are records of petro-physical data acquired along a borehole, providing direct information about what is in the subsurface. The data collected by logging wells can have significant economic consequences, due to the costs inherent to drilling wells, an…

    Submitted 10 May, 2017; originally announced May 2017.

    Comments: Part of DM4OG 2017 proceedings (arXiv:1705.03451)

  31. Proceedings of the Workshop on Data Mining for Oil and Gas

    Authors: Alipio Jorge, German Larrazabal, Pablo Guillen, Rui L. Lopes

    Abstract: The process of exploring and exploiting Oil and Gas (O&G) generates a lot of data that can bring more efficiency to the industry. The opportunities for using data mining techniques in the "digital oil-field" remain largely unexplored or uncharted. With the high rate of data expansion, companies are scrambling to develop ways to develop near-real-time predictive analytics, data mining and machine l…

    Submitted 26 May, 2017; v1 submitted 9 May, 2017; originally announced May 2017.

  32. arXiv:1612.09535

    cs.IR cs.CL

    PAMPO: using pattern matching and pos-tagging for effective Named Entities recognition in Portuguese

    Authors: Conceição Rocha, Alípio Jorge, Roberta Sionara, Paula Brito, Carlos Pimenta, Solange Rezende

    Abstract: This paper deals with the entity extraction task (named entity recognition) of a text mining process that aims at unveiling non-trivial semantic structures, such as relationships and interaction between entities or communities. In this paper we present a simple and efficient named entity extraction algorithm. The method, named PAMPO (PAttern Matching and POs tagging based algorithm for NER), relie…

    Submitted 30 December, 2016; originally announced December 2016.

  33. Improving incremental recommenders with online bagging

    Authors: João Vinagre, Alípio Mário Jorge, João Gama

    Abstract: Online recommender systems often deal with continuous, potentially fast and unbounded flows of data. Ensemble methods for recommender systems have been used in the past in batch algorithms, however they have never been studied with incremental algorithms that learn from data streams. We evaluate online bagging with an incremental matrix factorization algorithm for top-N recommendation with positiv…

    Submitted 26 March, 2018; v1 submitted 2 November, 2016; originally announced November 2016.

    Comments: Submitted to EPIA 2017

    Journal ref: In: Oliveira E., Gama J., Vale Z., Lopes Cardoso H. (eds) Progress in Artificial Intelligence. EPIA 2017. Lecture Notes in Computer Science, vol 10423. Springer, Cham

  34. Accelerating Recommender Systems using GPUs

    Authors: André Valente Rodrigues, Alípio Jorge, Inês Dutra

    Abstract: We describe GPU implementations of the matrix recommender algorithms CCD++ and ALS. We compare the processing time and predictive ability of the GPU implementations with existing multi-core versions of the same algorithms. Results on the GPU are better than the results of the multi-core versions (maximum speedup of 14.8).

    Submitted 7 November, 2015; originally announced November 2015.

    Journal ref: SAC '15 Proceedings of the 30th Annual ACM Symposium on Applied Computing Pages 879-884 ACM New York, NY, USA

  35. Evaluation of recommender systems in streaming environments

    Authors: João Vinagre, Alípio Mário Jorge, João Gama

    Abstract: Evaluation of recommender systems is typically done with finite datasets. This means that conventional evaluation methodologies are only applicable in offline experiments, where data and models are stationary. However, in real world systems, user feedback is continuously generated, at unpredictable rates. Given this setting, one important issue is how to evaluate algorithms in such a streaming dat…

    Submitted 30 April, 2015; originally announced April 2015.

    Comments: Workshop on 'Recommender Systems Evaluation: Dimensions and Design' (REDD 2014), held in conjunction with RecSys 2014. October 10, 2014, Silicon Valley, United States

  36. arXiv:1111.2948

    cs.LG cs.IR

    Using Contextual Information as Virtual Items on Top-N Recommender Systems

    Authors: Marcos A. Domingues, Alipio Mario Jorge, Carlos Soares

    Abstract: Traditionally, recommender systems for the Web deal with applications that have two dimensions, users and items. Based on access logs that relate these dimensions, a recommendation model can be built and used to identify a set of N items that will be of interest to a certain user. In this paper we propose a method to complement the information in the access logs with contextual information without…

    Submitted 15 November, 2011; v1 submitted 12 November, 2011; originally announced November 2011.

    Comments: Workshop on Context-Aware Recommender Systems (CARS'09) in conjunction with the 3rd ACM Conference on Recommender Systems (RecSys'09)

    ACM Class: I.2.6