Skip to main content

Showing 1–9 of 9 results for author: Tellez, E S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.07917  [pdf, other

    cs.IR cs.AI

    Similarity search on neighbor's graphs with automatic Pareto optimal performance and minimum expected quality setups based on hyperparameter optimization

    Authors: Eric S. Tellez, Guillermo Ruiz

    Abstract: This manuscript introduces an autotuned algorithm for searching nearest neighbors based on neighbor graphs and optimization metaheuristics to produce Pareto-optimal searches for quality and search speed automatically; the same strategy is also used to produce indexes that achieve a minimum quality. Our approach is described and benchmarked with other state-of-the-art similarity search methods, sho… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted to a peer reviewed journal

  2. arXiv:2110.06128  [pdf, other

    cs.CL cs.CY cs.SI

    Regionalized models for Spanish language variations based on Twitter

    Authors: Eric S. Tellez, Daniela Moctezuma, Sabino Miranda, Mario Graff, Guillermo Ruiz

    Abstract: Spanish is one of the most spoken languages in the globe, but not necessarily Spanish is written and spoken in the same way in different countries. Understanding local language variations can help to improve model performances on regional tasks, both understanding local structures and also improving the message's content. For instance, think about a machine learning engineer who automatizes some l… ▽ More

    Submitted 9 December, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  3. A Case Study of Spanish Text Transformations for Twitter Sentiment Analysis

    Authors: Eric S. Tellez, Sabino Miranda-Jiménez, Mario Graff, Daniela Moctezuma, Oscar S. Siodia, Elio A. Villaseñor

    Abstract: Sentiment analysis is a text mining task that determines the polarity of a given text, i.e., its positiveness or negativeness. Recently, it has received a lot of attention given the interest in opinion mining in micro-blogging platforms. These new forms of textual expressions present new challenges to analyze text given the use of slang, orthographic and grammatical errors, among others. Along wit… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

  4. arXiv:2009.01826  [pdf, other

    cs.CL

    A Python Library for Exploratory Data Analysis on Twitter Data based on Tokens and Aggregated Origin-Destination Information

    Authors: Mario Graff, Daniela Moctezuma, Sabino Miranda-Jiménez, Eric S. Tellez

    Abstract: Twitter is perhaps the social media more amenable for research. It requires only a few steps to obtain information, and there are plenty of libraries that can help in this regard. Nonetheless, knowing whether a particular event is expressed on Twitter is a challenging task that requires a considerable collection of tweets. This proposal aims to facilitate, to a researcher interested, the process o… ▽ More

    Submitted 24 November, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

  5. arXiv:1907.06258  [pdf, other

    cs.LG stat.ML

    Improving classification performance by feature space transformations and model selection

    Authors: Jose Ortiz-Bejar, Eric S. Tellez, Mario Graff

    Abstract: Improving the performance of classifiers is the realm of feature mapping, prototype selection, and kernel function transformations; these techniques aim for reducing the complexity, and also, improving the accuracy of models. In particular, our objective is to combine them to transform data's shape into another more convenient distribution; such that some simple algorithms, such as Naïve Bayes or… ▽ More

    Submitted 2 October, 2019; v1 submitted 14 July, 2019; originally announced July 2019.

  6. arXiv:1812.02307  [pdf, other

    cs.CL cs.LG stat.ML

    EvoMSA: A Multilingual Evolutionary Approach for Sentiment Analysis

    Authors: Mario Graff, Sabino Miranda-Jiménez, Eric S. Tellez, Daniela Moctezuma

    Abstract: Sentiment analysis (SA) is a task related to understanding people's feelings in written text; the starting point would be to identify the polarity level (positive, neutral or negative) of a given text, moving on to identify emotions or whether a text is humorous or not. This task has been the subject of several research competitions in a number of languages, e.g., English, Spanish, and Arabic, amo… ▽ More

    Submitted 30 September, 2019; v1 submitted 29 November, 2018; originally announced December 2018.

  7. A scalable solution to the nearest neighbor search problem through local-search methods on neighbor graphs

    Authors: Eric S. Tellez, Guillermo Ruiz, Edgar Chavez, Mario Graff

    Abstract: Near neighbor search (NNS) is a powerful abstraction for data access; however, data indexing is troublesome even for approximate indexes. For intrinsically high-dimensional data, high-quality fast searches demand either indexes with impractically large memory usage or preprocessing time. In this paper, we introduce an algorithm to solve a nearest-neighbor query $q$ by minimizing a kernel functio… ▽ More

    Submitted 29 June, 2021; v1 submitted 29 May, 2017; originally announced May 2017.

    Journal ref: Pattern Analysis and Applications 24 763--777 2021

  8. arXiv:1704.01975  [pdf, other

    cs.CL cs.AI stat.ML

    An Automated Text Categorization Framework based on Hyperparameter Optimization

    Authors: Eric S. Tellez, Daniela Moctezuma, Sabino Miranda-Jímenez, Mario Graff

    Abstract: A great variety of text tasks such as topic or spam identification, user profiling, and sentiment analysis can be posed as a supervised learning problem and tackle using a text classifier. A text classifier consists of several subprocesses, some of them are general enough to be applied to any supervised learning problem, whereas others are specifically designed to tackle a particular task, using c… ▽ More

    Submitted 14 September, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

  9. arXiv:1612.05270  [pdf, other

    cs.CL cs.LG stat.ML

    A Simple Approach to Multilingual Polarity Classification in Twitter

    Authors: Eric S. Tellez, Sabino Miranda Jiménez, Mario Graff, Daniela Moctezuma, Ranyart R. Suárez, Oscar S. Siordia

    Abstract: Recently, sentiment analysis has received a lot of attention due to the interest in mining opinions of social media users. Sentiment analysis consists in determining the polarity of a given text, i.e., its degree of positiveness or negativeness. Traditionally, Sentiment Analysis algorithms have been tailored to a specific language given the complexity of having a number of lexical variations and e… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.