Showing 1–15 of 15 results for author: Chacko, G

  1. arXiv:2503.06579  [pdf, other]

    cs.SI

    An Agent-based Model of Citation Behavior

    Authors: George Chacko, Minhyuk Park, Vikram Ramavarapu, Ananth Grama, Pablo Robles-Granda, Tandy Warnow

    Abstract: Whether citations can be objectively and reliably used to measure productivity and scientific quality of articles and researchers can, and should, be vigorously questioned. However, citations are widely used to estimate the productivity of researchers and institutions, effectively creating a 'grubby' motivation to be well-cited. We model citation growth, and this grubby interest using an agent-bas…

    Submitted 9 March, 2025; originally announced March 2025.

  2. EC-SBM Synthetic Network Generator

    Authors: The-Anh Vu-Le, Lahari Anne, George Chacko, Tandy Warnow

    Abstract: Generating high-quality synthetic networks with realistic community structure is vital to effectively evaluate community detection algorithms. In this study, we propose a new synthetic network generator called the Edge-Connected Stochastic Block Model (EC-SBM). The goal of EC-SBM is to take a given clustered real-world network and produce a synthetic network that resembles the clustered real-world…

    Submitted 5 February, 2025; originally announced February 2025.

  3. arXiv:2502.02050  [pdf, ps, other]

    cs.SI

    RECCS: Realistic Cluster Connectivity Simulator for Synthetic Network Generation

    Authors: Lahari Anne, The-Anh Vu-Le, Minhyuk Park, Tandy Warnow, George Chacko

    Abstract: The limited availability of useful ground-truth communities in real-world networks presents a challenge to evaluating and selecting a "best" community detection method for a given network or family of networks. The use of synthetic networks with planted ground-truths is one way to address this challenge. While several synthetic network generators can be used for this purpose, Stochastic Block Mode…

    Submitted 4 February, 2025; originally announced February 2025.

  4. arXiv:2502.00686  [pdf, ps, other]

    cs.SI

    Improved Community Detection using Stochastic Block Models

    Authors: Minhyuk Park, Daniel Wang Feng, Siya Digra, The-Anh Vu-Le, Lahari Anne, George Chacko, Tandy Warnow

    Abstract: Identifying edge-dense communities that are also well-connected is an important aspect of understanding community structure. Prior work has shown that community detection methods can produce poorly connected communities, and some can even produce internally disconnected communities. In this study we evaluate the connectivity of communities obtained using Stochastic Block Models. We find that SBMs…

    Submitted 13 February, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: See arXiv:2408.10464 for a previous version of this manuscript

  5. arXiv:2408.13647  [pdf, other]

    cs.SI

    Synthetic Networks That Preserve Edge Connectivity

    Authors: Lahari Anne, The-Anh Vu-Le, Minhyuk Park, Tandy Warnow, George Chacko

    Abstract: Since true communities within real-world networks are rarely known, synthetic networks with planted ground truths are valuable for evaluating the performance of community detection methods. Of the synthetic network generation tools available, Stochastic Block Models (SBMs) produce networks with ground truth clusters that well approximate input parameters from real-world networks and clusterings. H…

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 12 pages, 5 figures

  6. arXiv:2408.10464  [pdf, ps, other]

    cs.SI

    Improved Community Detection using Stochastic Block Models

    Authors: Minhyuk Park, Daniel Wang Feng, Siya Digra, The-Anh Vu-Le, George Chacko, Tandy Warnow

    Abstract: Community detection approaches resolve complex networks into smaller groups (communities) that are expected to be relatively edge-dense and well-connected. The stochastic block model (SBM) is one of several approaches used to uncover community structure in graphs. In this study, we demonstrate that SBM software applied to various real-world and synthetic networks produces poorly-connected to disco…

    Submitted 13 February, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: See arXiv:2502.00686 for an extended version of this manuscript submitted for review

  7. arXiv:2303.02813  [pdf, other]

    cs.SI cs.DL

    Well-Connected Communities in Real-World and Synthetic Networks

    Authors: Minhyuk Park, Yasamin Tabatabaee, Vikram Ramavarapu, Baqiao Liu, Vidya Kamath Pailodi, Rajiv Ramachandran, Dmitriy Korobskiy, Fabio Ayres, George Chacko, Tandy Warnow

    Abstract: Integral to the problem of detecting communities through graph clustering is the expectation that they are "well connected". In this respect, we examine five different community detection approaches optimizing different criteria: the Leiden algorithm optimizing the Constant Potts Model, the Leiden algorithm optimizing modularity, Iterative K-Core Clustering (IKC), Infomap, and Markov Clustering (M…

    Submitted 14 August, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

  8. AOC: Assembling Overlapping Communities

    Authors: Akhil Jakatdar, Baqiao Liu, Tandy Warnow, George Chacko

    Abstract: Through discovery of meso-scale structures, community detection methods contribute to the understanding of complex networks. Many community finding methods, however, rely on disjoint clustering techniques, in which node membership is restricted to one community or cluster. This strict requirement limits the ability to inclusively describe communities since some nodes may reasonably be assigned to…

    Submitted 4 October, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: This version submitted to Quantitative Science Studies

    Journal ref: Quantitative Science Studies (2022)

  9. arXiv:2111.07410  [pdf, other]

    cs.DL cs.SI physics.soc-ph

    Center-Periphery Structure in Communities: Extracellular Vesicles

    Authors: Eleanor Wedell, Minhyuk Park, Dmitriy Korobskiy, Tandy Warnow, George Chacko

    Abstract: Clustering and community detection in networks are of broad interest and have been the subject of extensive research that spans several fields. We are interested in the relatively narrow question of detecting communities of scientific publications that are linked by citations. These publication communities can be used to identify scientists with shared interests who form communities of researchers…

    Submitted 14 November, 2021; originally announced November 2021.

    Journal ref: Quantitative Science Studies (2022)

  10. arXiv:2007.14452  [pdf, other]

    cs.DL cs.SI physics.soc-ph

    Finding Scientific Communities In Citation Graphs: Convergent Clustering

    Authors: Shreya Chandrasekharan, Mariam Zaka, Stephen Gallo, Tandy Warnow, George Chacko

    Abstract: Understanding the nature and organization of scientific communities is of broad interest. The 'Invisible College' is a historical metaphor for one such type of community and the search for such 'colleges' can be framed as the detection and analysis of small groups of scientists working on problems of common interests. Case studies have previously been conducted on individual communities with respe…

    Submitted 28 July, 2020; originally announced July 2020.

    Journal ref: Quantitative Science Studies (2021)

  11. Delayed Recognition: the Co-citation Perspective

    Authors: Wenxi Zhao, Dmitriy Korobskiy, George Chacko

    Abstract: A Sleeping Beauty is a publication that is apparently unrecognized for some period of time before experiencing sudden recognition by citation. Various reasons, including resistance to new ideas, have been attributed to such delayed recognition. We examine this phenomenon in the special case of co-citations, which represent new ideas generated through the combination of existing ones. Using relativ…

    Submitted 30 June, 2020; originally announced June 2020.

    Journal ref: Frontiers in Research Metrics and Analytics (2021)

  12. Frequently Co-cited Publications: Features and Kinetics

    Authors: Sitaram Devarakonda, James Bradley, Dmitriy Korobskiy, Tandy Warnow, George Chacko

    Abstract: Co-citation measurements can reveal the extent to which a concept representing a novel combination of existing ideas evolves towards a specialty. The strength of co-citation is represented by its frequency, which accumulates over time. Of interest is whether underlying features associated with the strength of co-citation can be identified. We use the proximal citation network for a given pair of a…

    Submitted 10 May, 2020; originally announced May 2020.

    MSC Class: 01A85; 01A90 ACM Class: H.3.7

    Journal ref: Quantitative Science Studies (2020)

  13. Viewing Computer Science through Citation Analysis: Salton and Bergmark Redux

    Authors: Sitaram Devarakonda, Dmitriy Korobskiy, Tandy Warnow, George Chacko

    Abstract: Computer science has experienced dramatic growth and diversification over the last twenty years. Towards a current understanding of the structure of this discipline, we analyze a cohort of the computer science literature using the DBLP database. For insight on the features of this cohort and the relationship within its components, we constructed article level clusters based on either direct citati…

    Submitted 22 December, 2019; originally announced December 2019.

    MSC Class: K.2 ACM Class: K.2

  14. arXiv:1911.08775  [pdf]

    cs.DL physics.soc-ph

    Do disruption index indicators measure what they propose to measure? The comparison of several indicator variants with assessments by peers

    Authors: Lutz Bornmann, Sitaram Devarakonda, Alexander Tekles, George Chacko

    Abstract: Recently, Wu, Wang, and Evans (2019) and Bu, Waltman, and Huang (2019) proposed a new family of indicators, which measure whether a scientific publication is disruptive to a field or tradition of research. Such disruptive influences are characterized by citations to a focal paper, but not its cited references. In this study, we are interested in the question of convergent validity, i.e., whether t…

    Submitted 20 November, 2019; originally announced November 2019.

  15. Co-citations in context: disciplinary heterogeneity is relevant

    Authors: James Bradley, Sitaram Devarakonda, Avon Davey, Dmitriy Korobskiy, Siyu Liu, Djamil Lakhdar-Hamina, Tandy Warnow, George Chacko

    Abstract: Citation analysis of the scientific literature has been used to study and define disciplinary boundaries, to trace the dissemination of knowledge, and to estimate impact. Co-citation, the frequency with which pairs of publications are cited, provides insight into how documents relate to each other and across fields. Co-citation analysis has been used to characterize combinations of prior work as c…

    Submitted 18 September, 2019; originally announced September 2019.

    Journal ref: Quantitative Science Studies Oct 11, 2019