-
Science cited in policy documents: Evidence from the Overton database
Authors:
Zhichao Fang,
Jonathan Dudek,
Ed Noyons,
Rodrigo Costas
Abstract:
To reflect the extent to which science is cited in policy documents, this paper explores the presence of policy document citations for over 18 million Web of Science-indexed publications published between 2010 and 2019. Enabled by the policy document citation data provided by Overton, a searchable index of policy documents worldwide, the results show that there are 3.9% of publications in the data…
▽ More
To reflect the extent to which science is cited in policy documents, this paper explores the presence of policy document citations for over 18 million Web of Science-indexed publications published between 2010 and 2019. Enabled by the policy document citation data provided by Overton, a searchable index of policy documents worldwide, the results show that there are 3.9% of publications in the dataset cited at least once by policy documents. Policy document citations present a citation delay towards newly published publications and show a stronger predominance to the document types of review and article. Based on the Overton database, publications in the field of Social Sciences and Humanities have the highest relative presence in policy document citations, followed by Life and Earth Sciences and Biomedical and Health Sciences. Our findings shed light not only on the impact of scientific knowledge on the policy-making process, but also on the particular focus of policy documents indexed by Overton on specific research areas.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
Co-link analysis as a monitoring tool: A webometric use case to map the web relationships of research projects
Authors:
Jonathan Dudek,
David G. Pina,
Rodrigo Costas
Abstract:
This study explores the societal embeddedness of the websites of research projects. It combines two aims: characterizing research projects based on their weblink relationships, and discovering external societal actors that relate to the projects via weblinks. The study was based on a set of 121 EU-funded research projects and their websites. Domains referring to the websites of the research projec…
▽ More
This study explores the societal embeddedness of the websites of research projects. It combines two aims: characterizing research projects based on their weblink relationships, and discovering external societal actors that relate to the projects via weblinks. The study was based on a set of 121 EU-funded research projects and their websites. Domains referring to the websites of the research projects were collected and used in visualizations of co-link relationships. These analyses revealed clusters of topical similarity among the research projects as well as among referring entities. Furthermore, a first step into unveiling potentially relevant stakeholders around research projects was made. Weblink analysis is discussed as an insightful tool for monitoring the internal and external linkages of research projects, representing a relevant application of webometric methods.
△ Less
Submitted 8 October, 2021;
originally announced October 2021.
-
The role of scientific output in public debates in times of crisis: A case study of the reopening of schools during the COVID-19 pandemic
Authors:
Gabriela F. Nane,
François van Schalkwyk,
Jonathan Dudek,
Daniel Torres-Salinas,
Rodrigo Costas,
Nicolas Robinson-Garcia
Abstract:
Situations in which no scientific consensus has been reached due to either insufficient, inconclusive or contradicting findings place strain on governments and public organizations which are forced to take action under circumstances of uncertainty. In this chapter, we focus on the case of COVID-19, its effects on children and the public debate around the reopening of schools. The aim is to better…
▽ More
Situations in which no scientific consensus has been reached due to either insufficient, inconclusive or contradicting findings place strain on governments and public organizations which are forced to take action under circumstances of uncertainty. In this chapter, we focus on the case of COVID-19, its effects on children and the public debate around the reopening of schools. The aim is to better understand the relationship between policy interventions in the face of an uncertain and rapidly changing knowledge landscape and the subsequent use of scientific information in public debates related to the policy interventions. Our approach is to combine scientific information from journal articles and preprints with their appearance in the popular media, including social media. First, we provide a picture of the different scientific areas and approaches, by which the effects of COVID-19 on children are being studied. Second, we identify news media and social media attention around the COVID-19 scientific output related to children and schools. We focus on policies and media responses in three countries: Spain, South Africa and the Netherlands. These countries have followed very different policy actions with regard to the reopening of schools and represent very different policy approaches to the same problem. We analyse the activity in (social) media around the debate between COVID-19, children and school closures by focusing on the use of references to scientific information in the debate. Finally, we analyse the dominant topics that emerge in the news outlets and the online debates. We draw attention to illustrative cases of miscommunication related to scientific output and conclude the chapter by discussing how information from scientific publication, the media and policy actions shape the public discussion in the context of a global health pandemic.
△ Less
Submitted 8 January, 2021;
originally announced January 2021.
-
Taming Discrete Integration via the Boon of Dimensionality
Authors:
Jeffrey M. Dudek,
Dror Fried,
Kuldeep S. Meel
Abstract:
Discrete integration is a fundamental problem in computer science that concerns the computation of discrete sums over exponentially large sets. Despite intense interest from researchers for over three decades, the design of scalable techniques for computing estimates with rigorous guarantees for discrete integration remains the holy grail. The key contribution of this work addresses this scalabili…
▽ More
Discrete integration is a fundamental problem in computer science that concerns the computation of discrete sums over exponentially large sets. Despite intense interest from researchers for over three decades, the design of scalable techniques for computing estimates with rigorous guarantees for discrete integration remains the holy grail. The key contribution of this work addresses this scalability challenge via an efficient reduction of discrete integration to model counting. The proposed reduction is achieved via a significant increase in the dimensionality that, contrary to conventional wisdom, leads to solving an instance of the relatively simpler problem of model counting.
Building on the promising approach proposed by Chakraborty et al, our work overcomes the key weakness of their approach: a restriction to dyadic weights. We augment our proposed reduction, called DeWeight, with a state of the art efficient approximate model counter and perform detailed empirical analysis over benchmarks arising from neural network verification domains, an emerging application area of critical importance. DeWeight, to the best of our knowledge, is the first technique to compute estimates with provable guarantees for this class of benchmarks.
△ Less
Submitted 20 October, 2020;
originally announced October 2020.
-
DPMC: Weighted Model Counting by Dynamic Programming on Project-Join Trees
Authors:
Jeffrey M. Dudek,
Vu H. N. Phan,
Moshe Y. Vardi
Abstract:
We propose a unifying dynamic-programming framework to compute exact literal-weighted model counts of formulas in conjunctive normal form. At the center of our framework are project-join trees, which specify efficient project-join orders to apply additive projections (variable eliminations) and joins (clause multiplications). In this framework, model counting is performed in two phases. First, the…
▽ More
We propose a unifying dynamic-programming framework to compute exact literal-weighted model counts of formulas in conjunctive normal form. At the center of our framework are project-join trees, which specify efficient project-join orders to apply additive projections (variable eliminations) and joins (clause multiplications). In this framework, model counting is performed in two phases. First, the planning phase constructs a project-join tree from a formula. Second, the execution phase computes the model count of the formula, employing dynamic programming as guided by the project-join tree. We empirically evaluate various methods for the planning phase and compare constraint-satisfaction heuristics with tree-decomposition tools. We also investigate the performance of different data structures for the execution phase and compare algebraic decision diagrams with tensors. We show that our dynamic-programming model-counting framework DPMC is competitive with the state-of-the-art exact weighted model counters cachet, c2d, d4, and miniC2D.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Parallel Weighted Model Counting with Tensor Networks
Authors:
Jeffrey M. Dudek,
Moshe Y. Vardi
Abstract:
A promising new algebraic approach to weighted model counting makes use of tensor networks, following a reduction from weighted model counting to tensor-network contraction. Prior work has focused on analyzing the single-core performance of this approach, and demonstrated that it is an effective addition to the current portfolio of weighted-model-counting algorithms.
In this work, we explore the…
▽ More
A promising new algebraic approach to weighted model counting makes use of tensor networks, following a reduction from weighted model counting to tensor-network contraction. Prior work has focused on analyzing the single-core performance of this approach, and demonstrated that it is an effective addition to the current portfolio of weighted-model-counting algorithms.
In this work, we explore the impact of multi-core and GPU use on tensor-network contraction for weighted model counting. To leverage multiple cores, we implement a parallel portfolio of tree-decomposition solvers to find an order to contract tensors. To leverage a GPU, we use TensorFlow to perform the contractions. We compare the resulting weighted model counter on 1914 standard weighted model counting benchmarks and show that it significantly improves the virtual best solver.
△ Less
Submitted 14 June, 2021; v1 submitted 28 June, 2020;
originally announced June 2020.
-
The stability of Twitter metrics: A study on unavailable Twitter mentions of scientific publications
Authors:
Zhichao Fang,
Jonathan Dudek,
Rodrigo Costas
Abstract:
This paper investigates the stability of Twitter counts of scientific publications over time. For this, we conducted an analysis of the availability statuses of over 2.6 million Twitter mentions received by the 1,154 most tweeted scientific publications recorded by Altmetric.com up to October 2017. Results show that of the Twitter mentions for these highly tweeted publications, about 14.3% have be…
▽ More
This paper investigates the stability of Twitter counts of scientific publications over time. For this, we conducted an analysis of the availability statuses of over 2.6 million Twitter mentions received by the 1,154 most tweeted scientific publications recorded by Altmetric.com up to October 2017. Results show that of the Twitter mentions for these highly tweeted publications, about 14.3% have become unavailable by April 2019. Deletion of tweets by users is the main reason for unavailability, followed by suspension and protection of Twitter user accounts. This study proposes two measures for describing the Twitter dissemination structures of publications: Degree of Originality (i.e., the proportion of original tweets received by a paper) and Degree of Concentration (i.e., the degree to which retweets concentrate on a single original tweet). Twitter metrics of publications with relatively low Degree of Originality and relatively high Degree of Concentration are observed to be at greater risk of becoming unstable due to the potential disappearance of their Twitter mentions. In light of these results, we emphasize the importance of paying attention to the potential risk of unstable Twitter counts, and the significance of identifying the different Twitter dissemination structures when studying the Twitter metrics of scientific publications.
△ Less
Submitted 21 January, 2020;
originally announced January 2020.
-
Efficient Contraction of Large Tensor Networks for Weighted Model Counting through Graph Decompositions
Authors:
Jeffrey M. Dudek,
Leonardo Dueñas-Osorio,
Moshe Y. Vardi
Abstract:
Constrained counting is a fundamental problem in artificial intelligence. A promising new algebraic approach to constrained counting makes use of tensor networks, following a reduction from constrained counting to the problem of tensor-network contraction. Contracting a tensor network efficiently requires determining an efficient order to contract the tensors inside the network, which is itself a…
▽ More
Constrained counting is a fundamental problem in artificial intelligence. A promising new algebraic approach to constrained counting makes use of tensor networks, following a reduction from constrained counting to the problem of tensor-network contraction. Contracting a tensor network efficiently requires determining an efficient order to contract the tensors inside the network, which is itself a difficult problem.
In this work, we apply graph decompositions to find contraction orders for tensor networks. We prove that finding an efficient contraction order for a tensor network is equivalent to the well-known problem of finding an optimal carving decomposition. Thus memory-optimal contraction orders for planar tensor networks can be found in cubic time. We show that tree decompositions can be used both to find carving decompositions and to factor tensor networks with high-rank, structured tensors.
We implement these algorithms on top of state-of-the-art solvers for tree decompositions and show empirically that the resulting weighted model counter is quite effective and useful as part of a portfolio of counters.
△ Less
Submitted 27 April, 2020; v1 submitted 12 August, 2019;
originally announced August 2019.
-
ADDMC: Weighted Model Counting with Algebraic Decision Diagrams
Authors:
Jeffrey M. Dudek,
Vu H. N. Phan,
Moshe Y. Vardi
Abstract:
We present an algorithm to compute exact literal-weighted model counts of Boolean formulas in Conjunctive Normal Form. Our algorithm employs dynamic programming and uses Algebraic Decision Diagrams as the primary data structure. We implement this technique in ADDMC, a new model counter. We empirically evaluate various heuristics that can be used with ADDMC. We then compare ADDMC to state-of-the-ar…
▽ More
We present an algorithm to compute exact literal-weighted model counts of Boolean formulas in Conjunctive Normal Form. Our algorithm employs dynamic programming and uses Algebraic Decision Diagrams as the primary data structure. We implement this technique in ADDMC, a new model counter. We empirically evaluate various heuristics that can be used with ADDMC. We then compare ADDMC to state-of-the-art exact weighted model counters (Cachet, c2d, d4, and miniC2D) on 1914 standard model counting benchmarks and show that ADDMC significantly improves the virtual best solver.
△ Less
Submitted 2 June, 2020; v1 submitted 11 July, 2019;
originally announced July 2019.
-
The Hard Problems Are Almost Everywhere For Random CNF-XOR Formulas
Authors:
Jeffrey M. Dudek,
Kuldeep S. Meel,
Moshe Y. Vardi
Abstract:
Recent universal-hashing based approaches to sampling and counting crucially depend on the runtime performance of SAT solvers on formulas expressed as the conjunction of both CNF constraints and variable-width XOR constraints (known as CNF-XOR formulas). In this paper, we present the first study of the runtime behavior of SAT solvers equipped with XOR-reasoning techniques on random CNF-XOR formula…
▽ More
Recent universal-hashing based approaches to sampling and counting crucially depend on the runtime performance of SAT solvers on formulas expressed as the conjunction of both CNF constraints and variable-width XOR constraints (known as CNF-XOR formulas). In this paper, we present the first study of the runtime behavior of SAT solvers equipped with XOR-reasoning techniques on random CNF-XOR formulas. We empirically demonstrate that a state-of-the-art SAT solver scales exponentially on random CNF-XOR formulas across a wide range of XOR-clause densities, peaking around the empirical phase-transition location. On the theoretical front, we prove that the solution space of a random CNF-XOR formula 'shatters' at all nonzero XOR-clause densities into well-separated components, similar to the behavior seen in random CNF formulas known to be difficult for many SAT algorithms.
△ Less
Submitted 17 October, 2017;
originally announced October 2017.
-
Combining the $k$-CNF and XOR Phase-Transitions
Authors:
Jeffrey M. Dudek,
Kuldeep S. Meel,
Moshe Y. Vardi
Abstract:
The runtime performance of modern SAT solvers on random $k$-CNF formulas is deeply connected with the 'phase-transition' phenomenon seen empirically in the satisfiability of random $k$-CNF formulas. Recent universal hashing-based approaches to sampling and counting crucially depend on the runtime performance of SAT solvers on formulas expressed as the conjunction of both $k$-CNF and XOR constraint…
▽ More
The runtime performance of modern SAT solvers on random $k$-CNF formulas is deeply connected with the 'phase-transition' phenomenon seen empirically in the satisfiability of random $k$-CNF formulas. Recent universal hashing-based approaches to sampling and counting crucially depend on the runtime performance of SAT solvers on formulas expressed as the conjunction of both $k$-CNF and XOR constraints (known as $k$-CNF-XOR formulas), but the behavior of random $k$-CNF-XOR formulas is unexplored in prior work. In this paper, we present the first study of the satisfiability of random $k$-CNF-XOR formulas. We show empirical evidence of a surprising phase-transition that follows a linear trade-off between $k$-CNF and XOR constraints. Furthermore, we prove that a phase-transition for $k$-CNF-XOR formulas exists for $k = 2$ and (when the number of $k$-CNF constraints is small) for $k > 2$.
△ Less
Submitted 27 February, 2017;
originally announced February 2017.