-
Comparing User Activity on X and Mastodon
Authors:
Shiori Hironaka,
Mitsuo Yoshida,
Kazuyuki Shudo
Abstract:
The "Fediverse", a federation of decentralized social media servers, has emerged after a decade in which centralized platforms like X (formerly Twitter) have dominated the landscape. The structure of a federation should affect user activity, as a user selects a server to access the Fediverse and posts are distributed along the structure. This paper reports on the differences in user activity betwe…
▽ More
The "Fediverse", a federation of decentralized social media servers, has emerged after a decade in which centralized platforms like X (formerly Twitter) have dominated the landscape. The structure of a federation should affect user activity, as a user selects a server to access the Fediverse and posts are distributed along the structure. This paper reports on the differences in user activity between Twitter and Mastodon, a prominent example of decentralized social media. The target of the analysis is Japanese posts because both Twitter and Mastodon are actively used especially in Japan. Our findings include a larger number of replies on Twitter, more consistent user engagement on mstdn.jp, and different topic preferences on each server.
△ Less
Submitted 10 March, 2025;
originally announced March 2025.
-
Clustering Coefficient Reflecting Pairwise Relationships within Hyperedges
Authors:
Rikuya Miyashita,
Shiori Hironaka,
Kazuyuki Shudo
Abstract:
Hypergraphs are generalizations of simple graphs that allow for the representation of complex group interactions beyond pairwise relationships. Clustering coefficients quantify local link density in networks and have been widely studied for both simple graphs and hypergraphs. However, existing clustering coefficients for hypergraphs treat each hyperedge as a distinct unit rather than a collection…
▽ More
Hypergraphs are generalizations of simple graphs that allow for the representation of complex group interactions beyond pairwise relationships. Clustering coefficients quantify local link density in networks and have been widely studied for both simple graphs and hypergraphs. However, existing clustering coefficients for hypergraphs treat each hyperedge as a distinct unit rather than a collection of potentially related node pairs, failing to capture intra-hyperedge pairwise relationships and incorrectly assigning zero values to nodes with meaningful clustering patterns. We propose a novel clustering coefficient that addresses this fundamental limitation by transforming hypergraphs into weighted graphs, where edge weights reflect relationship strength between nodes based on hyperedge connections. Our definition satisfies three key conditions: values in the range $[0,1]$, consistency with simple graph clustering coefficients, and effective capture of intra-hyperedge pairwise relationships -- a capability absent from existing approaches. Theoretical evaluation on higher-order motifs demonstrates that our definition correctly assigns values to motifs where existing definitions fail (motifs III, IV-a, IV-b of order 3), while empirical evaluation on three real-world datasets shows similar overall clustering tendencies with more detailed measurements, especially for hypergraphs with larger hyperedges. The proposed clustering coefficient enables accurate quantification of local density in complex networks, revealing structural characteristics missed by existing definitions in systems where group membership implies connections between members, such as social communities and co-authorship networks.
△ Less
Submitted 4 July, 2025; v1 submitted 31 October, 2024;
originally announced October 2024.
-
Estimation of Graph Features Based on Random Walks Using Neighbors' Properties
Authors:
Tsuyoshi Hasegawa,
Shiori Hironaka,
Kazuyuki Shudo
Abstract:
Using random walks for sampling has proven advantageous in assessing the characteristics of large and unknown social networks. Several algorithms based on random walks have been introduced in recent years. In the practical application of social network sampling, there is a recurrent reliance on an application programming interface (API) for obtaining adjacent nodes. However, owing to constraints r…
▽ More
Using random walks for sampling has proven advantageous in assessing the characteristics of large and unknown social networks. Several algorithms based on random walks have been introduced in recent years. In the practical application of social network sampling, there is a recurrent reliance on an application programming interface (API) for obtaining adjacent nodes. However, owing to constraints related to query frequency and associated API expenses, it is preferable to minimize API calls during the feature estimation process. In this study, considering the acquisition of neighboring nodes as a cost factor, we introduce a feature estimation algorithm that outperforms existing algorithms in terms of accuracy. Through experiments that simulate sampling on known graphs, we demonstrate the superior accuracy of our proposed algorithm when compared to existing alternatives.
△ Less
Submitted 17 September, 2024; v1 submitted 13 September, 2024;
originally announced September 2024.
-
Follower--Followee Ratio Category and User Vector for Analyzing Following Behavior
Authors:
Hayato Oshimo,
Shiori Hironaka,
Mitsuo Yoshida,
Kyoji Umemura
Abstract:
Analyzing following behavior is important in many applications. Following behavior may depend on the main intention of the follower. Users may either follow their friends or they may follow celebrities to know more about them. It is difficult to estimate users' intention from their following relationships. In this paper, we propose an approach to analyze following relationships. First, we investig…
▽ More
Analyzing following behavior is important in many applications. Following behavior may depend on the main intention of the follower. Users may either follow their friends or they may follow celebrities to know more about them. It is difficult to estimate users' intention from their following relationships. In this paper, we propose an approach to analyze following relationships. First, we investigated the similarity between users. Similar followers and followees are likely to be friends. However, when the follower and followee are not similar, it is likely that follower seeks to obtain more information on the followee. Second, we categorized users by the network structure. We then proposed analysis of following behavior based on similarity and category of users estimated from tweets and user data. We confirmed the feasibility of the proposed method through experiments. Finally, we examined users in different categories and analyzed their following behavior.
△ Less
Submitted 25 October, 2022;
originally announced October 2022.
-
Comparison of Indicators of Location Homophily Using Twitter Follow Graph
Authors:
Shiori Hironaka,
Mitsuo Yoshida,
Kyoji Umemura
Abstract:
Location homophily is a tendency of Twitter users whose followers tend to be in the same or nearby areas. Intuitively, although users with a higher number of follower relationships might have negative homophily indicators, it is worth consulting actual Twitter data. Moreover, there may be certain functions regarding the numbers of friends and followers that are more directly correlated to the homo…
▽ More
Location homophily is a tendency of Twitter users whose followers tend to be in the same or nearby areas. Intuitively, although users with a higher number of follower relationships might have negative homophily indicators, it is worth consulting actual Twitter data. Moreover, there may be certain functions regarding the numbers of friends and followers that are more directly correlated to the homophily. In this study, the ratio of the number of friends to the number of followers is shown to be a more effective negative indicator of homophily, and the results for 10 different countries are verified.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
User's Centrality Analysis for Home Location Estimation
Authors:
Shiori Hironaka,
Mitsuo Yoshida,
Kyoji Umemura
Abstract:
User attributes, such as home location, are useful for many applications. Many researchers have been tackling how to estimate users' home locations using relationships among users. It is known that the home locations of certain users, such as celebrities, are hard to estimate using relationships. However, because estimating the home locations of all celebrities is not actually hard, it is importan…
▽ More
User attributes, such as home location, are useful for many applications. Many researchers have been tackling how to estimate users' home locations using relationships among users. It is known that the home locations of certain users, such as celebrities, are hard to estimate using relationships. However, because estimating the home locations of all celebrities is not actually hard, it is important to clarify the characteristics of users whose home locations are hard to estimate. We analyze whether centralities, which represent users' characteristics, and the tendency to have the same home locations as friends are related. The results indicate that PageRank and HITS scores are related to whether users have the same home location as friends, and that users with higher HITS scores have the same home location as their friends less often. This result indicates that there are two types of users whose home locations are difficult to estimate: hub users who follow many celebrities and authority users who are celebrities.
△ Less
Submitted 29 October, 2019;
originally announced October 2019.
-
Analysis of Home Location Estimation with Iteration on Twitter Following Relationship
Authors:
Shiori Hironaka,
Mitsuo Yoshida,
Kyoji Umemura
Abstract:
User's home locations are used by numerous social media applications, such as social media analysis. However, since the user's home location is not generally open to the public, many researchers have been attempting to develop a more accurate home location estimation. A social network that expresses relationships between users is used to estimate the users' home locations. The network-based home l…
▽ More
User's home locations are used by numerous social media applications, such as social media analysis. However, since the user's home location is not generally open to the public, many researchers have been attempting to develop a more accurate home location estimation. A social network that expresses relationships between users is used to estimate the users' home locations. The network-based home location estimation method with iteration, which propagates the estimated locations, is used to estimate more users' home locations. In this study, we analyze the function of network-based home location estimation with iteration while using the social network based on following relationships on Twitter. The results indicate that the function that selects the most frequent location among the friends' location has the best accuracy. Our analysis also shows that the 88% of users, who are in the social network based on following relationships, has at least one correct home location within one-hop (friends and friends of friends). According to this characteristic of the social network, we indicate that twice is sufficient for iteration.
△ Less
Submitted 30 August, 2016;
originally announced August 2016.
-
Negative Beta Encoder
Authors:
Tohru Kohda,
Satoshi Hironaka,
Kazuyuki Aihara
Abstract:
A new class of analog-to-digital (A/D) and digital-to-analog (D/A) converters using a flaky quantiser, called the $β$-encoder, has been shown to have exponential bit rate accuracy while possessing a self-correction property for fluctuations of the amplifier factor $β$ and the quantiser threshold $ν$. The probabilistic behavior of such a flaky quantiser is explained as the deterministic dynamics…
▽ More
A new class of analog-to-digital (A/D) and digital-to-analog (D/A) converters using a flaky quantiser, called the $β$-encoder, has been shown to have exponential bit rate accuracy while possessing a self-correction property for fluctuations of the amplifier factor $β$ and the quantiser threshold $ν$. The probabilistic behavior of such a flaky quantiser is explained as the deterministic dynamics of the multi-valued Rényi map. That is, a sample $x$ is always confined to a contracted subinterval while successive approximations of $x$ are performed using $β$-expansion even if $ν$ may vary at each iteration. This viewpoint enables us to get the decoded sample, which is equal to the midpoint of the subinterval, and its associated characteristic equation for recovering $β$ which improves the quantisation error by more than $3{dB}$ when $β>1.5$. The invariant subinterval under the Rényi map shows that $ν$ should be set to around the midpoint of its associated greedy and lazy values. %in terms of its quantisation MSE (mean square error). Furthermore, a new A/D converter is introduced called the negative $β$-encoder, which further improves the quantisation error of the $β$-encoder. A two-state Markov chain describing the $β$-encoder suggests that a negative eigenvalue of its associated transition probability matrix reduces the quantisation error.
△ Less
Submitted 28 July, 2009; v1 submitted 19 August, 2008;
originally announced August 2008.