Search | arXiv e-print repository

OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials

Authors: Peter Eastman, Raimondas Galvelis, Raúl P. Peláez, Charlles R. A. Abreu, Stephen E. Farr, Emilio Gallicchio, Anton Gorenko, Michael M. Henry, Frank Hu, Jing Huang, Andreas Krämer, Julien Michel, Joshua A. Mitchell, Vijay S. Pande, João PGLM Rodrigues, Jaime Rodriguez-Guerra, Andrew C. Simmonett, Sukrit Singh, Jason Swails, Philip Turner, Yuanqing Wang, Ivy Zhang, John D. Chodera, Gianni De Fabritiis, Thomas E. Markland

Abstract: Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general… ▽ More Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general purpose, pretrained potential functions. A collection of optimized CUDA kernels and custom PyTorch operations greatly improves the speed of simulations. We demonstrate these features on simulations of cyclin-dependent kinase 8 (CDK8) and the green fluorescent protein (GFP) chromophore in water. Taken together, these features make it practical to use machine learning to improve the accuracy of simulations at only a modest increase in cost. △ Less

Submitted 29 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: 16 pages, 5 figures

ACM Class: J.2; J.3

arXiv:2308.00992 [pdf, other]

The evolution of Complexity co-occurring keywords: bibliometric analysis and network approach

Authors: Tanya Araújo, Alexandre Abreu, Francisco Louçã

Abstract: Bibliometric studies based on the Web of Science (WOS) database have become an increasingly popular method for analysing the structure of scientific research. So do network approaches, which, based on empirical data, make it possible to characterize the emergence of topological structures over time and across multiple research areas. Our paper is a contribution to interweaving these two lines of r… ▽ More Bibliometric studies based on the Web of Science (WOS) database have become an increasingly popular method for analysing the structure of scientific research. So do network approaches, which, based on empirical data, make it possible to characterize the emergence of topological structures over time and across multiple research areas. Our paper is a contribution to interweaving these two lines of research that have progressed in separate ways but whose common applications have been increasingly more frequent. Among other attributes, Author Keywords and Keywords Plus are used as units of analysis that enable us to identify changes in the topics of interest and related bibliography. By considering the co-occurrence of those keywords with the Author Keyword \texttt{Complexity}, we provide an overview of the evolution of studies on Complexity Sciences, and compare this evolution in seven scientific fields. The results show a considerable increase in the number of papers dealing with complexity, as well as a general tendency across different disciplines for this literature to move from a more foundational, general and conceptual to a more applied and specific set of co-occurring keywords. Moreover, we provide evidence of changing topologies of networks of co-occurring keywords, which are described through the computation of some topological coefficients. In so doing, we emphasize the distinguishing structures that characterize the networks of the seven research areas. △ Less

Submitted 2 August, 2023; originally announced August 2023.

Comments: 25 pages, 7 figures

arXiv:2307.11918 [pdf]

Bibliometric Analysis of Publisher and Journal Instructions to Authors on Generative-AI in Academic and Scientific Publishing

Authors: Conner Ganjavi, Michael B. Eppler, Asli Pekcan, Brett Biedermann, Andre Abreu, Gary S. Collins, Inderbir S. Gill, Giovanni E. Cacciamani

Abstract: We aim to determine the extent and content of guidance for authors regarding the use of generative-AI (GAI), Generative Pretrained models (GPTs) and Large Language Models (LLMs) powered tools among the top 100 academic publishers and journals in science. The websites of these publishers and journals were screened from between 19th and 20th May 2023. Among the largest 100 publishers, 17% provided g… ▽ More We aim to determine the extent and content of guidance for authors regarding the use of generative-AI (GAI), Generative Pretrained models (GPTs) and Large Language Models (LLMs) powered tools among the top 100 academic publishers and journals in science. The websites of these publishers and journals were screened from between 19th and 20th May 2023. Among the largest 100 publishers, 17% provided guidance on the use of GAI, of which 12 (70.6%) were among the top 25 publishers. Among the top 100 journals, 70% have provided guidance on GAI. Of those with guidance, 94.1% of publishers and 95.7% of journals prohibited the inclusion of GAI as an author. Four journals (5.7%) explicitly prohibit the use of GAI in the generation of a manuscript, while 3 (17.6%) publishers and 15 (21.4%) journals indicated their guidance exclusively applies to the writing process. When disclosing the use of GAI, 42.8% of publishers and 44.3% of journals included specific disclosure criteria. There was variability in guidance of where to disclose the use of GAI, including in the methods, acknowledgments, cover letter, or a new section. There was also variability in how to access GAI guidance and the linking of journal and publisher instructions to authors. There is a lack of guidance by some top publishers and journals on the use of GAI by authors. Among those publishers and journals that provide guidance, there is substantial heterogeneity in the allowable uses of GAI and in how it should be disclosed, with this heterogeneity persisting among affiliated publishers and journals in some instances. The lack of standardization burdens authors and threatens to limit the effectiveness of these regulations. There is a need for standardized guidelines in order to protect the integrity of scientific output as GAI continues to grow in popularity. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: Pages 16, 1 figure, 2 tables

MSC Class: A.0

arXiv:2207.05620 [pdf]

LudVision -- Remote Detection of Exotic Invasive Aquatic Floral Species using Drone-Mounted Multispectral Data

Authors: António J. Abreu, Luís A. Alexandre, João A. Santos, Filippo Basso

Abstract: Remote sensing is the process of detecting and monitoring the physical characteristics of an area by measuring its reflected and emitted radiation at a distance. It is being broadly used to monitor ecosystems, mainly for their preservation. Ever-growing reports of invasive species have affected the natural balance of ecosystems. Exotic invasive species have a critical impact when introduced into n… ▽ More Remote sensing is the process of detecting and monitoring the physical characteristics of an area by measuring its reflected and emitted radiation at a distance. It is being broadly used to monitor ecosystems, mainly for their preservation. Ever-growing reports of invasive species have affected the natural balance of ecosystems. Exotic invasive species have a critical impact when introduced into new ecosystems and may lead to the extinction of native species. In this study, we focus on Ludwigia peploides, considered by the European Union as an aquatic invasive species. Its presence can negatively impact the surrounding ecosystem and human activities such as agriculture, fishing, and navigation. Our goal was to develop a method to identify the presence of the species. We used images collected by a drone-mounted multispectral sensor to achieve this, creating our LudVision data set. To identify the targeted species on the collected images, we propose a new method for detecting Ludwigia p. in multispectral images. The method is based on existing state-of-the-art semantic segmentation methods modified to handle multispectral data. The proposed method achieved a producer's accuracy of 79.9% and a user's accuracy of 95.5%. △ Less

Submitted 13 July, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

arXiv:2012.02530 [pdf, other]

Logic Synthesis Meets Machine Learning: Trading Exactness for Generalization

Authors: Shubham Rai, Walter Lau Neto, Yukio Miyasaka, Xinpei Zhang, Mingfei Yu, Qingyang Yi Masahiro Fujita, Guilherme B. Manske, Matheus F. Pontes, Leomar S. da Rosa Junior, Marilton S. de Aguiar, Paulo F. Butzen, Po-Chun Chien, Yu-Shan Huang, Hoa-Ren Wang, Jie-Hong R. Jiang, Jiaqi Gu, Zheng Zhao, Zixuan Jiang, David Z. Pan, Brunno A. de Abreu, Isac de Souza Campos, Augusto Berndt, Cristina Meinhardt, Jonata T. Carvalho, Mateus Grellert , et al. (15 additional authors not shown)

Abstract: Logic synthesis is a fundamental step in hardware design whose goal is to find structural representations of Boolean functions while minimizing delay and area. If the function is completely-specified, the implementation accurately represents the function. If the function is incompletely-specified, the implementation has to be true only on the care set. While most of the algorithms in logic synthes… ▽ More Logic synthesis is a fundamental step in hardware design whose goal is to find structural representations of Boolean functions while minimizing delay and area. If the function is completely-specified, the implementation accurately represents the function. If the function is incompletely-specified, the implementation has to be true only on the care set. While most of the algorithms in logic synthesis rely on SAT and Boolean methods to exactly implement the care set, we investigate learning in logic synthesis, attempting to trade exactness for generalization. This work is directly related to machine learning where the care set is the training set and the implementation is expected to generalize on a validation set. We present learning incompletely-specified functions based on the results of a competition conducted at IWLS 2020. The goal of the competition was to implement 100 functions given by a set of care minterms for training, while testing the implementation using a set of validation minterms sampled from the same function. We make this benchmark suite available and offer a detailed comparative analysis of the different approaches to learning △ Less

Submitted 15 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

Comments: In this 23 page manuscript, we explore the connection between machine learning and logic synthesis which was the main goal for International Workshop on logic synthesis. It includes approaches applied by ten teams spanning 6 countries across the world

arXiv:2002.08992 [pdf, other]

Total tessellation cover and quantum walk

Authors: Alexandre Abreu, Luís Cunha, Celina de Figueiredo, Franklin Marquezino, Daniel Posner, Renato Portugal

Abstract: We propose the total staggered quantum walk model and the total tessellation cover of a graph. This model uses the concept of total tessellation cover to describe the motion of the walker who is allowed to hop both to vertices and edges of the graph, in contrast with previous models in which the walker hops either to vertices or edges. We establish bounds on $T_t(G)$, which is the smallest number… ▽ More We propose the total staggered quantum walk model and the total tessellation cover of a graph. This model uses the concept of total tessellation cover to describe the motion of the walker who is allowed to hop both to vertices and edges of the graph, in contrast with previous models in which the walker hops either to vertices or edges. We establish bounds on $T_t(G)$, which is the smallest number of tessellations required in a total tessellation cover of $G$. We highlight two of these lower bounds $T_t(G) \geq ω(G)$ and $T_t(G)\geq is(G)+1$, where $ω(G)$ is the size of a maximum clique and $is(G)$ is the number of edges of a maximum induced star subgraph. Using these bounds, we define the good total tessellable graphs with either $T_t(G)=ω(G)$ or $T_t(G)=is(G)+1$. The $k$-total tessellability problem aims to decide whether a given graph $G$ has $T_t(G) \leq k$. We show that $k$-total tessellability is in $\mathcal{P}$ for good total tessellable graphs. We establish the $\mathcal{NP}$-completeness of the following problems when restricted to the following classes: ($is(G)+1$)-total tessellability for graphs with $ω(G) = 2$; $ω(G)$-total tessellability for graphs $G$ with $is(G)+1 = 3$; $k$-total tessellability for graphs $G$ with $\max\{ω(G), is(G)+1\}$ far from $k$; and $4$-total tessellability for graphs $G$ with $ω(G) = is(G)+1 = 4$. As a consequence, we establish hardness results for bipartite graphs, line graphs of triangle-free graphs, universal graphs, planar graphs, and $(2,1)$-chordal graphs. △ Less

Submitted 20 February, 2020; originally announced February 2020.

MSC Class: 05C ACM Class: G.2.1; G.2.2

arXiv:1908.10844 [pdf, other]

The Tessellation Cover Number of Good Tessellable Graphs

Authors: Alexandre Abreu, Luís Cunha, Celina de Figueiredo, Luis Kowada, Franklin Marquezino, Renato Portugal, Daniel Posner

Abstract: A tessellation of a graph is a partition of its vertices into vertex disjoint cliques. A tessellation cover of a graph is a set of tessellations that covers all of its edges, and the tessellation cover number, denoted by $T(G)$, is the size of a smallest tessellation cover. The \textsc{$t$-tessellability} problem aims to decide whether a graph $G$ has $T(G)\leq t$ and is $\mathcal{NP}$-complete fo… ▽ More A tessellation of a graph is a partition of its vertices into vertex disjoint cliques. A tessellation cover of a graph is a set of tessellations that covers all of its edges, and the tessellation cover number, denoted by $T(G)$, is the size of a smallest tessellation cover. The \textsc{$t$-tessellability} problem aims to decide whether a graph $G$ has $T(G)\leq t$ and is $\mathcal{NP}$-complete for $t\geq 3$. Since the number of edges of a maximum induced star of $G$, denoted by $is(G)$, is a lower bound on $T(G)$, we define good tessellable graphs as the graphs~$G$ such that $T(G)=is(G)$. The \textsc{good tessellable recognition (gtr)} problem aims to decide whether $G$ is a good tessellable graph. We show that \textsc{gtr} is $\mathcal{NP}$-complete not only if $T(G)$ is known or $is(G)$ is fixed, but also when the gap between $T(G)$ and $is(G)$ is large. As a byproduct, we obtain graph classes that obey the corresponding computational complexity behaviors. △ Less

Submitted 28 August, 2019; originally announced August 2019.

Comments: 14 pages, 3 figures

arXiv:1712.07476 [pdf, other]

doi 10.1016/j.tcs.2019.09.013

The graph tessellation cover number: extremal bounds, efficient algorithms and hardness

Authors: A. Abreu, L. Cunha, T. Fernandes, C. de Figueiredo, L. Kowada, F. Marquezino, D. Posner, R. Portugal

Abstract: A tessellation of a graph is a partition of its vertices into vertex disjoint cliques. A tessellation cover of a graph is a set of tessellations that covers all of its edges. The $t$-tessellability problem aims to decide whether there is a tessellation cover of the graph with $t$ tessellations. This problem is motivated by its applications to quantum walk models, in especial, the evolution operato… ▽ More A tessellation of a graph is a partition of its vertices into vertex disjoint cliques. A tessellation cover of a graph is a set of tessellations that covers all of its edges. The $t$-tessellability problem aims to decide whether there is a tessellation cover of the graph with $t$ tessellations. This problem is motivated by its applications to quantum walk models, in especial, the evolution operator of the staggered model is obtained from a graph tessellation cover. We establish upper bounds on the tessellation cover number given by the minimum between the chromatic index of the graph and the chromatic number of its clique graph and we show graph classes for which these bounds are tight. We prove $\mathcal{NP}$-completeness for $t$-tessellability if the instance is restricted to planar graphs, chordal (2,1)-graphs, (1,2)-graphs, diamond-free graphs with diameter five, or for any fixed $t$ at least 3. On the other hand, we improve the complexity for 2-tessellability to a linear-time algorithm. △ Less

Submitted 20 December, 2017; originally announced December 2017.

Comments: 13 pages, 5 figs, accepted in Latin 2018

Journal ref: Theoretical Computer Science 801, 175-191, 2020

arXiv:1711.03362 [pdf, other]

Estimation of optimal encoding ladders for tiled 360° VR video in adaptive streaming systems

Authors: Cagri Ozcinar, Ana De Abreu, Sebastian Knorr, Aljosa Smolic

Abstract: Given the significant industrial growth of demand for virtual reality (VR), 360° video streaming is one of the most important VR applications that require cost-optimal solutions to achieve widespread proliferation of VR technology. Because of its inherent variability of data-intensive content types and its tiled-based encoding and streaming, 360° video requires new encoding ladders in adaptive str… ▽ More Given the significant industrial growth of demand for virtual reality (VR), 360° video streaming is one of the most important VR applications that require cost-optimal solutions to achieve widespread proliferation of VR technology. Because of its inherent variability of data-intensive content types and its tiled-based encoding and streaming, 360° video requires new encoding ladders in adaptive streaming systems to achieve cost-optimal and immersive streaming experiences. In this context, this paper targets both the provider's and client's perspectives and introduces a new content-aware encoding ladder estimation method for tiled 360° VR video in adaptive streaming systems. The proposed method first categories a given 360° video using its features of encoding complexity and estimates the visual distortion and resource cost of each bitrate level based on the proposed distortion and resource cost models. An optimal encoding ladder is then formed using the proposed integer linear programming (ILP) algorithm by considering practical constraints. Experimental results of the proposed method are compared with the recommended encoding ladders of professional streaming service providers. Evaluations show that the proposed encoding ladders deliver better results compared to the recommended encoding ladders in terms of objective quality for 360° video, providing optimal encoding ladders using a set of service provider's constraint parameters. △ Less

Submitted 9 November, 2017; originally announced November 2017.

Comments: The 19th IEEE International Symposium on Multimedia (ISM 2017), Taichung, Taiwan

Journal ref: The 19th IEEE International Symposium on Multimedia (ISM 2017), Taichung, Taiwan

arXiv:1711.02386 [pdf, other]

Viewport-aware adaptive 360° video streaming using tiles for virtual reality

Authors: Cagri Ozcinar, Ana De Abreu, Aljosa Smolic

Abstract: 360° video is attracting an increasing amount of attention in the context of Virtual Reality (VR). Owing to its very high-resolution requirements, existing professional streaming services for 360° video suffer from severe drawbacks. This paper introduces a novel end-to-end streaming system from encoding to displaying, to transmit 8K resolution 360° video and to provide an enhanced VR experience us… ▽ More 360° video is attracting an increasing amount of attention in the context of Virtual Reality (VR). Owing to its very high-resolution requirements, existing professional streaming services for 360° video suffer from severe drawbacks. This paper introduces a novel end-to-end streaming system from encoding to displaying, to transmit 8K resolution 360° video and to provide an enhanced VR experience using Head Mounted Displays (HMDs). The main contributions of the proposed system are about tiling, integration of the MPEG-Dynamic Adaptive Streaming over HTTP (DASH) standard, and viewport-aware bitrate level selection. Tiling and adaptive streaming enable the proposed system to deliver very high-resolution 360° video at good visual quality. Further, the proposed viewport-aware bitrate assignment selects an optimum DASH representation for each tile in a viewport-aware manner. The quality performance of the proposed system is verified in simulations with varying network bandwidth using realistic view trajectories recorded from user experiments. Our results show that the proposed streaming system compares favorably compared to existing methods in terms of PSNR and SSIM inside the viewport. △ Less

Submitted 7 November, 2017; originally announced November 2017.

Comments: IEEE International Conference on Image Processing (ICIP) 2017

Journal ref: 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017

arXiv:1705.09014 [pdf, ps, other]

doi 10.1016/j.tcs.2019.09.013

The tessellation problem of quantum walks

Authors: A. Abreu, L. Cunha, T. Fernandes, C. de Figueiredo, L. Kowada, F. Marquezino, D. Posner, R. Portugal

Abstract: Quantum walks have received a great deal of attention recently because they can be used to develop new quantum algorithms and to simulate interesting quantum systems. In this work, we focus on a model called staggered quantum walk, which employs advanced ideas of graph theory and has the advantage of including the most important instances of other discrete-time models. The evolution operator of th… ▽ More Quantum walks have received a great deal of attention recently because they can be used to develop new quantum algorithms and to simulate interesting quantum systems. In this work, we focus on a model called staggered quantum walk, which employs advanced ideas of graph theory and has the advantage of including the most important instances of other discrete-time models. The evolution operator of the staggered model is obtained from a tessellation cover, which is defined in terms of a set of partitions of the graph into cliques. It is important to establish the minimum number of tessellations required in a tessellation cover, and what classes of graphs admit a small number of tessellations. We describe two main results: (1) infinite classes of graphs where we relate the chromatic number of the clique graph to the minimum number of tessellations required in a tessellation cover, and (2) the problem of deciding whether a graph is $k$-tessellable for $k\ge 3$ is NP-complete. △ Less

Submitted 24 May, 2017; originally announced May 2017.

Comments: 10 pages, 7 figs

Journal ref: Theoretical Computer Science 801, 175-191, 2020

arXiv:1603.06123 [pdf, other]

Optimal Lagrange Multipliers for Dependent Rate Allocation in Video Coding

Authors: Ana De Abreu, Gene Cheung, Pascal Frossard, Fernando Pereira

Abstract: In a typical video rate allocation problem, the objective is to optimally distribute a source rate budget among a set of (in)dependently coded data units to minimize the total distortion of all units. Conventional Lagrangian approaches convert the lone rate constraint to a linear rate penalty scaled by a multiplier in the objective, resulting in a simpler unconstrained formulation. However, the se… ▽ More In a typical video rate allocation problem, the objective is to optimally distribute a source rate budget among a set of (in)dependently coded data units to minimize the total distortion of all units. Conventional Lagrangian approaches convert the lone rate constraint to a linear rate penalty scaled by a multiplier in the objective, resulting in a simpler unconstrained formulation. However, the search for the "optimal" multiplier, one that results in a distortion-minimizing solution among all Lagrangian solutions that satisfy the original rate constraint, remains an elusive open problem in the general setting. To address this problem, we propose a computation-efficient search strategy to identify this optimal multiplier numerically. Specifically, we first formulate a general rate allocation problem where each data unit can be dependently coded at different quantization parameters (QP) using a previous unit as predictor, or left uncoded at the encoder and subsequently interpolated at the decoder using neighboring coded units. After converting the original rate constrained problem to the unconstrained Lagrangian counterpart, we design an efficient dynamic programming (DP) algorithm that finds the optimal Lagrangian solution for a fixed multiplier. Finally, within the DP framework, we iteratively compute neighboring singular multiplier values, each resulting in multiple simultaneously optimal Lagrangian solutions, to drive the rates of the computed Lagrangian solutions towards the bit budget. We terminate when a singular multiplier value results in two Lagrangian solutions with rates below and above the bit budget. In extensive monoview and multiview video coding experiments, we show that our DP algorithm and selection of optimal multipliers on average outperform comparable rate control solutions used in video compression standards such as HEVC that do not skip frames in Y-PSNR. △ Less

Submitted 19 March, 2016; originally announced March 2016.

arXiv:1506.07823 [pdf, ps, other]

Optimal Layered Representation for Adaptive Interactive Multiview Video Streaming

Authors: Ana De Abreu, Laura Toni, Nikolaos Thomos, Thomas Maugey, Fernando Pereira, Pascal Frossard

Abstract: We consider an interactive multiview video streaming (IMVS) system where clients select their preferred viewpoint in a given navigation window. To provide high quality IMVS, many high quality views should be transmitted to the clients. However, this is not always possible due to the limited and heterogeneous capabilities of the clients. In this paper, we propose a novel adaptive IMVS solution base… ▽ More We consider an interactive multiview video streaming (IMVS) system where clients select their preferred viewpoint in a given navigation window. To provide high quality IMVS, many high quality views should be transmitted to the clients. However, this is not always possible due to the limited and heterogeneous capabilities of the clients. In this paper, we propose a novel adaptive IMVS solution based on a layered multiview representation where camera views are organized into layered subsets to match the different clients constraints. We formulate an optimization problem for the joint selection of the views subsets and their encoding rates. Then, we propose an optimal and a reduced computational complexity greedy algorithms, both based on dynamic-programming. Simulation results show the good performance of our novel algorithms compared to a baseline algorithm, proving that an effective IMVS adaptive solution should consider the scene content and the client capabilities and their preferences in navigation. △ Less

Submitted 25 June, 2015; originally announced June 2015.

arXiv:1501.00149 [pdf]

iReclass - An automatic system for recording classes

Authors: Edson Lopes, José Caetano, António Abreu, Frederico Grilo

Abstract: This paper presents the details of a system capable of recording on video a traditional class. By traditional class it is meant a teacher, a blackboard and a white canvas where course notes are projected. The system is able to track the movements of the lecturer, while recording it on video at the required frame rate (e.g., 25 fps). The system is also capable of understanding five arm gestures mad… ▽ More This paper presents the details of a system capable of recording on video a traditional class. By traditional class it is meant a teacher, a blackboard and a white canvas where course notes are projected. The system is able to track the movements of the lecturer, while recording it on video at the required frame rate (e.g., 25 fps). The system is also capable of understanding five arm gestures made by the lecturer with the intent of controlling which scenario is recorded: himself, the blackboard or the white canvas. The remaining two gestures are for start/stop the recorder. The system is composed by a Kinect sensor, a video camera, a microphone, one pan-tilt system and one pan system, using a total of three step motors. △ Less

Submitted 31 December, 2014; originally announced January 2015.

Comments: 12 pages

Showing 1–14 of 14 results for author: Abreu, A