-
CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação
Authors:
Washington Cunha,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
Progress in Natural Language Processing (NLP) has been dictated by the rule of more: more data, more computing power and more complexity, best exemplified by the Large Language Models. However, training (or fine-tuning) large dense models for specific applications usually requires significant amounts of computing resources. This Ph.D. dissertation focuses on Instance Selection (IS), an under-investigated NLP data engineering technique whose potential is enormous in the current scenario. The goal of IS is to reduce the training set size by removing noisy or redundant instances while maintaining the effectiveness of the trained models and reducing the cost of the training process. We provide a comprehensive and scientifically sound comparison of IS methods applied to an essential NLP task -- Automatic Text Classification (ATC), considering several classification solutions and many datasets. Our findings reveal a significant untapped potential for IS solutions. We also propose two novel IS solutions that are noise-oriented and redundancy-aware, specifically designed for large datasets and transformer architectures. Our final solution achieved an average reduction of 41% in training sets, while maintaining the same levels of effectiveness in all datasets. Importantly, our solutions demonstrated speedup improvements of 1.67x (up to 2.46x), making them scalable for datasets with hundreds of thousands of documents.
Submitted 8 June, 2025;
originally announced June 2025.
-
Holography and charmonium structure in a finite density plasma
Authors:
Nelson R. F. Braga,
William S. Cunha,
Yan F. Ferreira
Abstract:
It has recently been proposed that the extra dimension in holographic models for charmonium is related to its internal structure. Representing the interaction between the quark-antiquark pair by a string inside the background used in these models, the linear term of the Cornell potential was obtained. Moreover, the dissociation in the thermal medium is also described in a consistent way. Here we extend this study to the case of a plasma with finite density. The combined effects of density and temperature are analyzed from the point of view of the quark-antiquark interaction, and the results obtained are consistent with the ones derived previously using spectral functions.
Submitted 5 May, 2025;
originally announced May 2025.
-
A thorough benchmark of automatic text classification: From traditional approaches to large language models
Authors:
Washington Cunha,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
Automatic text classification (ATC) has experienced remarkable advancements in the past decade, best exemplified by recent small and large language models (SLMs and LLMs), leveraged by Transformer architectures. Despite recent effectiveness improvements, a comprehensive cost-benefit analysis investigating whether the effectiveness gains of these recent approaches compensate for their much higher costs when compared to more traditional text classification approaches such as SVMs and Logistic Regression is still missing in the literature. In this context, this work's main contributions are twofold: (i) we provide a scientifically sound comparative analysis of the cost-benefit of twelve traditional and recent ATC solutions including five open LLMs, and (ii) a large benchmark comprising 22 datasets, including sentiment analysis and topic classification, with their (train-validation-test) partitions based on folded cross-validation procedures, along with documentation and code. The release of code, data, and documentation enables the community to replicate experiments and advance the field in a more scientifically sound manner. Our comparative experimental results indicate that LLMs outperform traditional approaches (up to 26%-7.1% on average) and SLMs (up to 4.9%-1.9% on average) in terms of effectiveness. However, LLMs incur significantly higher computational costs due to fine-tuning, being, on average, 590x and 8.5x slower than traditional methods and SLMs, respectively. The results suggest the following recommendations: (1) LLMs for applications that require the best possible effectiveness and can afford the costs; (2) traditional methods such as Logistic Regression and SVM for resource-limited applications or those that cannot afford the cost of tuning large LLMs; and (3) SLMs like RoBERTa for a near-optimal effectiveness-efficiency trade-off.
Submitted 2 April, 2025;
originally announced April 2025.
-
Holography and the internal structure of charmonium
Authors:
Nelson R. F. Braga,
Yan F. Ferreira,
William S. Cunha
Abstract:
Holographic models that consider classical vector fields in a 5-d background provide successful effective descriptions for heavy vector meson spectra. This holds both in the vacuum and in a thermal medium, like the quark gluon plasma. However, the way these phenomenological models work is somewhat mysterious. In particular, what is the role of the fifth dimension, and what is the relation between the holographic 5-d background and the physical (4-d) heavy mesons? Hadrons, in contrast to leptons, are composite particles with some internal structure that depends on the energy at which they are observed. In this work, a static meson is represented by a heavy quark-antiquark pair with an interaction described by a Nambu-Goto string living in the same 5-d background that provides field solutions leading to masses and decay constants of charmonium states. The interaction potential that shows up is linear for large distances, with a string tension consistent with the effective Cornell potential. Introducing temperature $T$ in the background, it is found, for the $J/ψ$ case, that there is a deconfining transition at some critical value of $T$. The results obtained indicate that the 5-d background is effectively representing the internal structure of the (static) charmonium (quasi) states.
Submitted 14 May, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification
Authors:
Claudio M. V. de Andrade,
Washington Cunha,
Davi Reis,
Adriana Silvina Pagano,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
Transformer models have achieved state-of-the-art results, with Large Language Models (LLMs), an evolution of first-generation transformers (1stTR), being considered the cutting edge in several NLP tasks. However, the literature has yet to conclusively demonstrate that LLMs consistently outperform 1stTRs across all NLP tasks. This study compares three 1stTRs (BERT, RoBERTa, and BART) with two open LLMs (Llama 2 and Bloom) across 11 sentiment analysis datasets. The results indicate that open LLMs may moderately outperform or match 1stTRs in 8 out of 11 datasets but only when fine-tuned. Given this substantial cost for only moderate gains, the practical applicability of these models in cost-sensitive scenarios is questionable. In this context, a confidence-based strategy that seamlessly integrates 1stTRs with open LLMs based on prediction certainty is proposed. High-confidence documents are classified by the more cost-effective 1stTRs, while uncertain cases are handled by LLMs in zero-shot or few-shot modes, at a much lower cost than fine-tuned versions. Experiments in sentiment analysis demonstrate that our solution not only outperforms 1stTRs, zero-shot, and few-shot LLMs but also competes closely with fine-tuned LLMs at a fraction of the cost.
Submitted 18 August, 2024;
originally announced August 2024.
-
PATopics: An automatic framework to extract useful information from pharmaceutical patents documents
Authors:
Pablo Cecilio,
Antônio Perreira,
Juliana Santos Rosa Viegas,
Washington Cunha,
Felipe Viegas,
Elisa Tuler,
Fabiana Testa Moura de Carvalho Vicentini,
Leonardo Rocha
Abstract:
Pharmaceutical patents play an important role by protecting innovation from copying, but they also drive researchers to innovate, create new products, and promote disruptive innovations focusing on collective health. The study of patent management usually involves an exhaustive manual search. This happens because patent documents are complex, with a lot of detail regarding the claims and the explanation of the invention's methodology and results. To mitigate the manual search, we propose PATopics, a framework specially designed to extract relevant information from pharmaceutical patents. PATopics is composed of four building blocks that extract textual information from the patents, build relevant topics that are capable of summarizing the patents, correlate these topics with useful patent characteristics, and then summarize the information in a friendly web interface for end users. The general contributions of PATopics are its ability to centralize patents and to organize patents into groups based on their similarities. We extensively analyzed the framework using 4,832 pharmaceutical patents concerning 809 molecules patented by 478 companies. In our analysis, we evaluate the use of the framework considering the demands of three user profiles -- researchers, chemists, and companies. We also designed four real-world use cases to evaluate the framework's applicability. Our analysis showed how practical and helpful PATopics is in the pharmaceutical scenario.
Submitted 12 August, 2024;
originally announced August 2024.
-
Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks
Authors:
Lucas Félix,
Washington Cunha,
Jussara Almeida
Abstract:
In today's digital era, the use of Social Networks (SNs) and Location-Based SNs (LBSNs) has become integral for travelers seeking Points of Interest (POI) and sharing travel experiences. This trend is supported by the fact that a significant majority of American travelers utilize SNs during their trips. However, the abundance of information available on these platforms presents a challenge in identifying the best options. To address this issue, Recommender Systems (RS) are commonly employed to suggest POIs based on user history, with the integration of contextual information enhancing the quality of recommendations. Notably, incorporating user travel purpose, which is often overlooked but holds potential in characterizing travelers' behavior, can lead to more tailored recommendations. In this study, we propose a model to predict whether a trip is leisure or work-related, utilizing state-of-the-art Automatic Text Classification (ATC) models such as BERT, RoBERTa, and BART to enhance the understanding of user travel purposes and improve recommendation accuracy in specific travel scenarios.
Submitted 12 August, 2024;
originally announced August 2024.
-
A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks
Authors:
Fabiano Belém,
Washington Cunha,
Celso França,
Claudio Andrade,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
This is the first work to investigate the effectiveness of BERT-based contextual embeddings in active learning (AL) tasks on cold-start scenarios, where traditional fine-tuning is infeasible due to the absence of labeled data. Our primary contribution is the proposal of a more robust fine-tuning pipeline - DoTCAL - that diminishes the reliance on labeled data in AL using two steps: (1) fully leveraging unlabeled data through domain adaptation of the embeddings via masked language modeling and (2) further adjusting model weights using labeled data selected by AL. Our evaluation contrasts BERT-based embeddings with other prevalent text representation paradigms, including Bag of Words (BoW), Latent Semantic Indexing (LSI), and FastText, at two critical stages of the AL process: instance selection and classification. Experiments conducted on eight ATC benchmarks with varying AL budgets (number of labeled instances) and number of instances (about 5,000 to 300,000) demonstrate DoTCAL's superior effectiveness, achieving up to a 33% improvement in Macro-F1 while reducing labeling efforts by half compared to the traditional one-step method. We also found that in several tasks, BoW and LSI (due to information aggregation) produce results superior (up to 59%) to BERT, especially in low-budget scenarios and hard-to-classify tasks, which is quite surprising.
Submitted 24 July, 2024;
originally announced July 2024.
-
TPDR: A Novel Two-Step Transformer-based Product and Class Description Match and Retrieval Method
Authors:
Washington Cunha,
Celso França,
Leonardo Rocha,
Marcos André Gonçalves
Abstract:
There is a niche of companies responsible for intermediating the purchase of large batches of varied products for other companies, for which the main challenge is to perform product description standardization, i.e., matching an item described by a client with a product described in a catalog. The problem is complex since the client's product description may be: (1) potentially noisy; (2) short and uninformative (e.g., missing information about model and size); and (3) cross-language. In this paper, we formalize this problem as a ranking task: given an initial client product specification (IS, the query), return the most appropriate standardized descriptions (SD, the response). We propose TPDR, a two-step Transformer-based Product and Class Description Retrieval method that is able to explore the semantic correspondence between IS and SD by exploiting attention mechanisms and contrastive learning. First, TPDR employs the transformers as two encoders sharing the embedding vector space: one for encoding the IS and another for the SD, in which corresponding pairs (IS, SD) must be close in the vector space. Closeness is further enforced by a contrastive learning mechanism leveraging a specialized loss function. TPDR also exploits a (second) re-ranking step based on syntactic features that are very important for the exact matching (model, dimension) of certain products and that may have been neglected by the transformers. To evaluate our proposal, we consider 11 datasets from a real company, covering different application contexts. Our solution was able to retrieve the correct standardized product before the 5th ranking position in 71% of the cases and its correct category in the first position in 80% of the situations. Moreover, the effectiveness gains over purely syntactic or semantic baselines reach up to 3.7 times, solving cases that none of the approaches can solve in isolation.
Submitted 5 October, 2023;
originally announced October 2023.
-
Irida-Graphene: A New 2D Carbon Allotrope
Authors:
M. L. Pereira Junior,
W. F. da Cunha,
W. F. Giozza,
R. T. de Sousa Junior,
L. A. Ribeiro Junior
Abstract:
Several 2D carbon-based materials have been computationally designed in recent years due to the success achieved by graphene. Here, we propose a new 2D all-sp$^2$ carbon allotrope, named Irida-Graphene (IG), using a bottom-up approach. IG is composed of fused rings containing 3-6-8 carbon atoms. We employed density functional theory calculations and reactive (ReaxFF) molecular dynamics simulations to examine its mechanical, structural, electronic, and optical properties. Results showed that IG exhibits good dynamical and thermal stabilities. Its estimated elastic modulus varies between 80 and 113 GPa. IG is a metallic material and presents a Dirac cone above the Fermi level in the center of the band. The intense optical activity of IG is restricted to the infrared and violet regions. IG can act as a violet collector for photon energies of about 3.0 eV since it presents very low reflectivity and a refractive index greater than one.
Submitted 18 August, 2022;
originally announced August 2022.
-
On the Multidimensional Random Subset Sum Problem
Authors:
Luca Becchetti,
Arthur Carvalho Walraven da Cunha,
Andrea Clementi,
Francesco d'Amore,
Hicham Lesfari,
Emanuele Natale,
Luca Trevisan
Abstract:
In the Random Subset Sum Problem, given $n$ i.i.d. random variables $X_1, ..., X_n$, we wish to approximate any point $z \in [-1,1]$ as the sum of a suitable subset $X_{i_1(z)}, ..., X_{i_s(z)}$ of them, up to error $\varepsilon$. Despite its simple statement, this problem is of fundamental interest to both theoretical computer science and statistical mechanics. More recently, it gained renewed attention for its implications in the theory of Artificial Neural Networks. An obvious multidimensional generalisation of the problem is to consider $n$ i.i.d. $d$-dimensional random vectors, with the objective of approximating every point $\mathbf{z} \in [-1,1]^d$. In 1998, G. S. Lueker showed that, in the one-dimensional setting, $n=\mathcal{O}(\log \frac 1\varepsilon)$ samples guarantee the approximation property with high probability. In this work, we prove that, in $d$ dimensions, $n = \mathcal{O}(d^3\log \frac 1\varepsilon \cdot (\log \frac 1\varepsilon + \log d))$ samples suffice for the approximation property to hold with high probability. As an application highlighting the potential interest of this result, we prove that a recently proposed neural network model exhibits universality: with high probability, the model can approximate any neural network within a polynomial overhead in the number of parameters.
Submitted 17 November, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
On non-compact gradient solitons
Authors:
Antonio W. Cunha,
Erin Griffin
Abstract:
In this paper we extend existing results for generalized solitons, called $q$-solitons, to the complete case by considering non-compact solitons. By placing regularity conditions on the vector field $X$ and curvature conditions on $M$, we are able to use the chosen properties of the tensor $q$ to see that such non-compact $q$-solitons are stationary and $q$-flat.
We conclude by applying our results to the examples of ambient obstruction solitons, Cotton solitons, and Bach solitons to demonstrate the utility of these general theorems for various flows.
Submitted 9 June, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
On the Thermomechanical Properties and Fracture Patterns of the Novel Nonbenzenoid Carbon Allotrope (Biphenylene Network): A Reactive Molecular Dynamics Study
Authors:
M. L. Pereira Júnior,
W. F. da Cunha,
R. T. de Sousa Junior,
G. D. Amvame Nze,
D. S. Galvão,
L. A. Ribeiro Júnior
Abstract:
Recently, a new two-dimensional carbon allotrope, named biphenylene network (BPN), was experimentally realized. The BPN structure is composed of four-, six-, and eight-membered rings of sp$^2$-hybridized carbon atoms. In this work, we carried out fully-atomistic reactive (ReaxFF) molecular dynamics simulations to study the thermomechanical properties and fracture patterns of non-defective and defective (nanocracks) BPN. Our results show that under uniaxial tensile loading, BPN is converted into four distinct morphologies before fracture starts. This conversion process is dependent on the stretching direction. Some of the resulting structures consist mainly of eight-membered rings, which have different shapes in each morphology. In one of them, a graphitization process was observed before the complete fracture. Importantly, in the presence of nanocracks, no new morphologies are formed. BPN exhibits a distinct fracture process when contrasted to graphene. After the critical strain threshold, graphene transitions from an elastic to a brittle regime, while BPN can exhibit different inelastic stages. These stages are associated with the appearance of new morphologies. However, BPN shares some of the exceptional graphene properties. Its calculated Young's modulus and melting point values are comparable to the graphene ones, about 1019.4 GPa and 4024 K, respectively.
Submitted 23 September, 2021;
originally announced September 2021.
-
Dynamics and Structural Transformations of Carbon Onion-Like under High-Velocity Impacts
Authors:
M. L. Pereira Júnior,
W. F. da Cunha,
R. T. de Sousa Júnior,
G. D. Amvame Nzeb,
D. S. Galvão,
L. A. Ribeiro Júnior
Abstract:
Carbon nano-onions (CNO) are multi-shell fullerenes. In the present work, we used fully atomistic reactive (ReaxFF) molecular dynamics simulations to study the dynamics and structural transformations of CNO structures under high-velocity impacts against a fixed and rigid substrate. We considered single and multi-shell CNO (up to six shells) at different impact velocities (from 2 up to 7 km/s). Our results indicated three regimes formed after the CNO impact: slightly deformed CNO (quasi-elastic collision, below 2.0 km/s), collapsed CNO (inelastic collisions, between 3.0 and 5.0 km/s) forming a diamondoid-like core, and fragmented CNO yielding linear atomic carbon chains (above 5.0 km/s). We also discussed the dynamical reconfiguration of carbon-carbon bonds during the collision process. The impact of CNO against the substrate yielded $sp^3$-like bond types for all the initial velocities used. At intermediate velocities (between 3.0 and 5.0 km/s), the inelastic collision forms diamondoid-like cores by converting a substantial quantity of $sp^2$ bonds into $sp^3$ ones. In the high-velocity regime, the total numbers of $sp^1$, $sp^2$, and $sp^3$ bonds tend to be similar.
Submitted 14 September, 2021;
originally announced September 2021.
-
Some characterizations of $ρ$-Einstein solitons
Authors:
Absos Ali Shaikh,
Antonio W. Cunha,
Prosenjit Mandal
Abstract:
In this article we have shown that a gradient $ρ$-Einstein soliton with a vector field of bounded norm and satisfying some other conditions is isometric to the Euclidean sphere. Later, we have proved that a non-trivial complete gradient $ρ$-Einstein soliton with finite weighted Dirichlet integral and a certain restriction on Ricci curvature must be of constant scalar curvature and steady Ricci flat. Finally, we have proved that a non-shrinking or non-expanding gradient traceless Ricci soliton satisfying some conditions must be steady.
Submitted 8 December, 2020;
originally announced January 2021.
-
A Reactive Molecular Dynamics Study on the Mechanical Properties of a Recently Synthesized Amorphous Carbon Monolayer Converted into a Nanotube/Nanoscroll
Authors:
Marcelo L. Pereira Júnior,
Wiliam F. Cunha,
Douglas S. Galvão,
Luiz A. Ribeiro Júnior
Abstract:
Recently, laser-assisted chemical vapor deposition was used to synthesize a free-standing, continuous, and stable monolayer amorphous carbon (MAC). MAC is a pure carbon structure composed of randomly distributed five, six, seven, and eight atom rings, which differs from disordered graphene. More recently, amorphous MAC-based nanotubes (a-CNT) and nanoscrolls (a-CNS) were proposed. In this work, we have investigated (through fully atomistic reactive molecular dynamics simulations) the mechanical properties and melting points of a-CNT and a-CNS and of their pristine counterparts (CNT and CNS). Results showed that a-CNT and a-CNS have distinct elastic properties and fracture patterns compared with their pristine analogs. Both a-CNT and a-CNS presented a non-elastic regime before their total rupture, whereas the CNT and CNS undergo a direct conversion to fractured forms after a critical strain threshold. The critical strains for the fracture of the a-CNT and a-CNS are about 30% and 25%, respectively, and they are lower than in the corresponding CNT and CNS cases. Although less resilient to tension, the amorphous tubular structures have similar thermal stability in relation to the pristine cases, with melting points of 5500 K, 6300 K, 5100 K, and 5900 K for a-CNT, CNT, a-CNS, and CNS, respectively. An interesting result is that the behavior of the pristine systems differs substantially depending on whether the system is a nanotube or a nanoscroll, indicating that the topology plays an important role, whereas the same is not true for the amorphous versions of the nanostructures, indicating that the structural disorder overcomes the topological features.
Submitted 14 December, 2020;
originally announced December 2020.
-
Electronic and Structural Properties of Janus SMoSe/MoX$_2$ (X=S,Se) In-plane Heterojunctions: A DFT Study
Authors:
Ramiro M. dos Santos,
Wiliam F. da Cunha,
William F. Giozza,
Rafael T. de Sousa Júnior,
Luiz F. Roncaratti Júnior,
Luiz A. Ribeiro Júnior
Abstract:
The electronic and structural properties of Janus MoSSe/MoX$_2$ (X=S,Se) in-plane heterojunctions, endowed with single-atom vacancies, were studied using density functional theory calculations. The stability of these structures was verified from cohesion energy calculations. Results showed that single-atom vacancies induce the appearance of flat midgap states, and a substantial amount of charge is localized in the vicinity of these defects. As a consequence, these heterojunctions presented an intrinsic dipole moment. No bond reconstructions were noted by removing an atom from the lattice, regardless of its chemical species. Our calculations predicted indirect electronic bandgap values between 1.6-1.7 eV.
Submitted 10 December, 2020;
originally announced December 2020.
-
Charge Density Wave Transport in Porous Graphene Nanoribbons
Authors:
Wiliam F. da Cunha,
Marcelo L. Pereira Júnior,
William F. Giozza,
Rafael T. de Sousa Junior,
Luiz A. Ribeiro Júnior,
Geraldo M. e Silva
Abstract:
Porous graphene (PG) forms a class of graphene-related materials with nanoporous architectures. Their unique atomic arrangements present interconnected networks with high surface area and high pore volume. Some remarkable properties of PG, such as high mechanical strength and good thermal stability, have been widely studied. However, their electrical conductivity and, most importantly, their charge transport mechanism are still not fully understood. Herein, we employed a numerical approach based on a 2D tight-binding model Hamiltonian to first reveal the nature of the charge transport mechanism in PG nanoribbons. Results showed that the charge transport in these materials is mediated by charge density waves. These carrier species are dynamically stable and present very shallow lattice distortions. The porosity offers an alternative to the usual formation of polaron-like charge carriers, and it can preserve the PG semiconducting character even in broader nanoribbons. The charge density waves move in PG within the optical regime with terminal velocities varying from 0.50 up to 1.15 Å/fs. These velocities are lower than the ones for polarons in conventional graphene nanoribbons (2.2-5.1 Å/fs).
Submitted 4 November, 2020;
originally announced November 2020.
-
Tuning Magnetic Properties of Penta-Graphene Bilayers Through Doping with Boron and Oxygen
Authors:
Ramiro M. dos Santos,
Wiliam F. da Cunha,
Rafael T. de Sousa Junior,
William F. Giozza,
Luiz A. Ribeiro Junior
Abstract:
Penta-graphene (PG) is a carbon allotrope that has recently attracted the attention of the materials science community due to its interesting properties for renewable energy applications. Although unstable in its pure form, it has been shown that functionalization may stabilize its structure. A question that arises is whether its outstanding electronic properties could also be further improved using such a procedure. As PG bilayers present both sp$^2$ and sp$^3$ carbon planes, they constitute a flexible candidate for functionalization tuning of electromagnetic properties. In this work, we perform density functional theory simulations to investigate how the electronic and structural properties of PG bilayers can be tuned as a result of substitution doping. Specifically, we observed the emergence of different magnetic properties when boron was used as the dopant species. On the other hand, in the case of doping with oxygen, the rupture of bonds in the sp$^2$ planes does not induce a magnetic moment in the material.
Submitted 22 October, 2020; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Intrinsic Properties of Bipolarons in Armchair Graphene Nanoribbons
Authors:
Gesiel G. Silva,
Wiliam F. da Cunha,
Marcelo L. Pereira Junior,
Luiz F. Roncaratti Junior,
Luiz A. Ribeiro Junior
Abstract:
We performed an investigation concerning bipolaron dynamics in armchair graphene nanoribbons (AGNRs) under the influence of different electric field and electron-phonon coupling regimes. By studying the response to the electric excitation, we were able to determine the effective mass and terminal velocity of this quasiparticle in AGNRs. Remarkably, bipolarons in narrower AGNRs move as fast as the ones in conjugated polymers. Our findings pave the way to enhance the understanding of the behavior of charge carriers in graphene nanoribbons.
Submitted 20 July, 2020;
originally announced July 2020.
-
Charge Transport Mechanism in Chevron--Graphene Nanoribbons
Authors:
Marcelo Lopes Pereira Junior,
Wiliam Ferreira da Cunha,
Rafael Timoteo de Sousa Junior,
William Ferreira Giozza,
Geraldo Magela e Silva,
Luiz Antonio Ribeiro Junior
Abstract:
From the moment atomic precision control of the growth process of graphene was achieved, more elaborate carbon allotropes were proposed, opening new channels for flat optoelectronics at the nanoscale. A special type of this material presenting a V-shape (or "kinked" pattern) was recently synthesized and named Chevron-graphene nanoribbons (C-GNRs). To realize the reach of C-GNRs in developing new applications, the formation and transport of charge carriers in their lattices should first be understood. Here, we investigate the static and dynamical properties of quasiparticles in C-GNRs. We study the effects of electron-phonon coupling and doping on the system. We also determine the kind of charge carriers present in C-GNRs. It is observed that a phase transition occurs between a delocalized regime of conduction and regimes mediated by charge carriers. Such a phase transition is highly dependent on the doping concentration. Remarkably, crucial differences from the transport in standard graphene nanoribbons are identified. These factors are noted to have a profound impact on the mobility in the system, which, in turn, should decisively impact the performance of electronic devices based on C-GNRs.
Submitted 14 July, 2020;
originally announced July 2020.
-
Fitting potential energy surface of reactive system via genetic algorithm
Authors:
Wiliam Ferreira da Cunha,
Luiz Fernando Roncaratti,
Ricardo Gargano,
Geraldo Magela e Silva
Abstract:
In this work, we present a new fitting of the Na+HF potential energy surface (PES) utilizing a new optimization method based on a genetic algorithm. Topology studies, such as isoenergetic contours and the Minimum Energy Path (MEP), show that the quality of this new PES is comparable to the best PES in the literature. These facts suggest that this new approach can be utilized as a new tool to fit the PES of reactive systems.
Submitted 14 November, 2005;
originally announced November 2005.