Retweet graphs of political hashtags in 2021 Iranian presidential election
Data files
Apr 15, 2026 version files 657.53 MB
-
README.md
2.12 KB
-
Retweet_Data.zip
657.52 MB
Abstract
Social networks have been a huge point of interest in recent decades in multiple research areas, studied from various perspectives, including the way information is spread between users on social platforms. This dataset is a collection of retweet networks of 16 political hashtags from former Farsi Twitter (now Farsi X), gathered in the time span of April 29, 2021, to June 24, 2021, in the vicinity of the 2021 Iranian presidential election.
Dataset DOI: 10.5061/dryad.34tmpg4wq
Description of the data and file structure
File: Retweet_Data.zip
Description:
Each hashtag contains 11 graphml files corresponding to 11 sub-intervals of the total time span and two other pre-processed graphml files, which are the combination of the first half and the second half of the total time interval.
Each of the 11 original graphml files contains multidigraphs of nodes as users and edges as retweets. The nodes have attributes:
- 'name'
- 'screenname'
- 'user_id'
- 'id',
while the edges have only one attribute:
- 'created_at',
which is a timestamp of the retweet time. Name, screenname and user_id are the only unique user attributes among different time intervals and hashtags.
The remaining two graphml files are weighted digraphs with link weights being the number of times a retweet has happened between the predecessor and the succesor nodes.
Name of the hastags have been replaced with alphabet letters for privacy reasons. For example a file named 'E_c_5-18_5-25' means it belongs to the hashtag 'E' and it stores the retweets from 5-18 till 5-25 of that hashtag which is the 'third' sub-interval of the total time span (shown by the small letter 'c' as the 'third' alphabet letter). The files with suffix '1sthalf' include a merge of the retweets of the first 5 sub-intervals (letters a-e) while those with '2ndhalf' are a merge of theretweets of the last 6 sub-intervals (letters f-k).
Code/software
The presented data are of the extension '.graphml' and have been originally studied in the related research using the python package networkx. These files can also be viewed and studied using other graph analysis tools, such as python packages like igraph and graph-tool or softwares like Gephi.
Access information
The presented files are processed from raw data originally gathered via the former Twitter API by CCNET group members Parham Moradi and Parsa Bigdeli, courtesy of the CCNET group. For more information, visit CCNET webpage.
