Multirelational Twitter Bot Detection using Graph Neural Networks

Chris Pollett
CS Dept, SJSU
Joint Work with:
Ketan Jadhav (Graduate), Petros Potikas, and Katerina Potika

July 22, 2025

Introduction

What are Bots?

Bots are automated programs and can be controlled by algorithms to post certain content or behave in a certain way.
Initial uses of social media bots involved posting weather updates of a local area by automated accounts.
As technology grew, bots were used for customer service, subtasks on X or even virtual assistants.
Bots generate between 20.8% and 29.2% of the content posted to Twitter [1] and huge revenue through automated ads
Have also been used maliciously throughout its history
Spam Bots are used to post large volume of tweets to influence people in a certain way

Motivation

Usage of Bot Accounts

Good useful cases of bot accounts (WeatherBot, Customer Query)
Malicious purposes of bot usage
- Spreading misinformation (Spam Bots)
- Cyberbullying
- Scams, Frauds
- Cyber attacks
During the COVID 19 pandemic, more than 40% of the accounts tweeting about opening were bots. [2]
In context of the 2018 US Midterms, [3] classified 21.1 percent of the accounts as bots, which in turn generated 30.6 percent of the tweets
Scam bot accounts have stolen over $90K from deceived users according to [4]

Related Work

Prior Work Summary

Title	Year	Approach	Datasets
BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection [6]	2022	User profile data and relations classified using ML algorithms	ByteDance Security AI Challenge
MRLBot: Multi-Dimensional Representation Learning for Social Media Bot Detection [7]	2023	User relation with neural networks	Cresci-15
Detect Me If You Can: Spam Bot Detection Using Inductive Representation Learning [8]	2019	Profile properties with neighborhood relations with representational learning	Cresci-15
SATAR: A Self-supervised Approach to Twitter Account Representation Learning and its Application in Bot Detection [9]	2021	Tweet, metadata and neighborhood relations with self-supervised learning	TwiBot-20, Cresci-17, PAN-19
Heterogeneity-Aware Twitter Bot Detection with Relational Graph Transformers [10]	2021	Focus on neighborhood relation graphs with GNNs	Twibot-20

Techniques

Graph Neural Networks

Used for deep node embeddings using structural as well as semantic information
Message passing
- Each node transmits its information to neighboring nodes
- Information received from other nodes is aggregated and updated its own information
- This process goes on iteratively for each node to learn more information
Information can include node as well as edge information depending on the type of GNN algorithm

GraphSAGE and GAT

GraphSAGE uses sampling and aggregation, meaning starts with a small subset for learning, and keeps expanding to include more node information
Nodes are sampled i.e some selected for aggregation
Flexibility in selecting aggregating function
Leads to inclusion of local and global properties

GAT uses attention mechanism, by assigning importances to neighboring node features and iteratively updating them through learning node information
Can start learning at multiple random points (attention heads) simultaneously, facilitating independent learning

GNN Approaches Summary

	GCN	R-GCN	GraphSAGE	GAT
Use Case	Basic convolutional graph-based learning	Handles graphs with multiple edge types	Supports scalable inductive learning for large graphs	If need attention mechanism for varying neighbor importance
How Works	Captures local graph structure	Captures multi-relational graph structure	Captures local structure and supports inductive learning	Captures complex relationships with attention
Aggregation	Weighted sum of neighbor features	Weighted sum of neighbor features for each relation type	Sample and aggregate (mean, LSTM, pooling)	Attention-weighted sum of neighbor features

Natural Language Processing

RoBERTa: A Robustly Optimized BERT Pretraining Approach [15]
Extended Pretraining and Data Scale:
- RoBERTa extends BERT's pretraining time and uses ten times more data
- 160GB of uncompressed text compared to 16GB used to train BERT
Optimized Training Techniques:
- Eliminates the Next Sentence Prediction (NSP) task,
- Focuses on more dynamic masking of input tokens, and
- Uses larger mini-batches
Dimension Size:
- 768 for the base models (RoBERTa-base)

Methodology

Dataset

	Users	Bots	Tweets	Edges
Twibot-22 [16]	860057	139943	86764167	170185937
Subset Used	139943	139943	N/A	2349098

Relation	Edges
Following	2626979
Follower	1116655
Post	40887365
Like	595794

Feature Vector

For us, these will be tuples of data associated with a node in a graph (such a node corresponds to a user).
Four types of information processed to obtain feature vectors:
- User bio using NLP technique RoBERTa
- Numerical properties of user profile like follower count, following count, number of active days, number of tweets, screen name length
- Categorical properties of user profile like does account have a profile image, is the account verified, is the account protected
This information is concatenated in the initial layer of the architecture to be used as GNN features.

Experiment Set-Ups

Features Extracted from Dataset

Categorical Properties:
- is the user account protected
- is the user account verified
- does the user have a default profile image or a custom one
Edge Index and Edge type:
- From user relations

Numerical Properties:
- followers count
- active days
- length of the user screen name
- following count
- number of tweets posted
Description:
- User bio

GNN models

Used pyTorch Geometric neural networks library for GNN models
Defined each model as a combination of layers that processed and combined the various input features along with the GNN model
Used the following hyper parameters for training through tuning:
- embedding size = 32
- dropout = 0.1
- learning rate = 0.01
- weight decay = 0.05
- loss = Cross Entropy Loss
- activation = ReLu

Results

Evaluation Metrics

Accuracy: Accuracy measures the proportion of correct predictions out of all predictions made.
$Accuracy := \frac{Number of correctly classified instances}{Total Number of Instances}$
Precision: Precision measures the proportion of true positive predictions out of all positive
predictions made.
$Precision := \frac{True Positives}{True Positives + False Positives}$
Recall: Recall (or Sensitivity) measures the proportion of true positive predictions out of all actual positive cases.
$Recall := \frac{True Positives}{True Positives + False Negatives}$
F1 Score: The F1 Score is the harmonic mean of precision and recall, balancing the two metrics
$F_{1} -Score := 2 × \frac{Precision × Recall}{Precision + Recall}$

Conclusion and Future Work

Conclusion

Twitter bots can be used for good and malice, identifying bots is important.
Various techniques can be used to identify bots:
- User profile details and Machine Learning
- Natural Language Processing
- User relations and Graph Neural Networks
Leveraged GNNs to understand complex relationships of the account social network.
Combined text based data, user account details and graphs to develop strong classification models.
Used accuracy, precision, recall and F1-score to evaluate the models
Graph Attention Network model with interaction relation constructed by combining the post and like relation gave the highest accuracy of 73.83

References

[1] ''Estimating twitter's bot-free monetizable daily active users,'' https://www.similarweb.com/blog/insights/social-media-news/twitter-bot-research/, 2022.

[2] Alison Grace Johansen. ''What's a Twitter bot and how to spot one'', https://us.norton.com/blog/emerging-threats/what-are-twitter-bots-and-how-to-spot-them

[3] Luceri, et al. ''View of Evolution of bot and human behavior during elections'' (2019)

[4] Nizzoli, et al. ''Charting the Landscape of Online Cryptocurrency Manipulation'', (2020)

[5] - K.-C. Yang, E. Ferrara, and F. Menczer. ''Botometer 101: Social bot practicum for computational social scientists,'' Journal of Computational Social Science, vol. 5, no. 2, pp. 1511--1528, 2022.

[6] Li, Shudong. Zhao, Chuanyu. Li, Qing. Huang, Jiuming. Zhao, Dawei. Zhu, Pei can. ''BotFinder: A Novel Framework for Social Bots Detection in Online Social Networks Based on Graph Embedding and Community Detection.'' 10.21203/rs.3.rs-1871702/v1. (2022).

[7] Zeng, F. Sun, Y. Li, Y. ''MRLBot: Multi-Dimensional Representation Learning for Social Media Bot Detection''. Electronics 2023, 12, 2298. https://doi.org/10.3390/electronics12102298 (2023)

[8] Alhosseini, Seyed. Bin Tareaf, Raad. Najafi, Pejman. Meinel, Christoph. ''Detect Me If You Can: Spam Bot Detection Using Inductive Representation Learning.'' 10.1145/3308560.3316504. (2019).

References

[9] Shangbin Feng. Herun Wan. Ningnan Wang. Jundong Li. Minnan Luo. SATAR: A Self-supervised Approach to Twitter Account Representation Learning and its Application in Bot Detection. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM '21). Association for Computing Machinery, New York, NY, USA, 3808--3817. https://doi.org/10.1145/3459637.3481949 (2021)

[10] Shangbin Feng. Zhaoxuan Tan. Rui Li. Minnan Luo. ''Heterogeneity-aware Twitter Bot Detection with Relational Graph Transformers''. https://doi.org/10.48550/arXiv.2109.02927

[11] T. N. Kipf and M. Welling, ''Semi-supervised classification with graph convolutional networks,'' CoRR, vol. abs/1609.02907 (2016)

[12] M. Schlichtkrull. T. N. Kipf. P. Bloem. R. van den Berg. I. Titov, and M. Welling. ''Modeling relational data with graph convolutional networks.'' (2017)

[13] P. Velickovic. G. Cucurull. A. Casanova. A. Romero. P. Lio. Y. Bengio. ''Graph attention networks.'' (2018)

[14] W. L. Hamilton. R. Ying. J. Leskovec. ''Inductive representation learning on large graphs.'' CoRR, vol. abs/1706.02216, (2017)

[15] Y. Liu. M. Ott. N. Goyal. J. Du. M. Joshi. D. Chen. O. Levy. M. Lewis. L. Zettlemoyer. V. Stoyanov. ''Roberta: A robustly optimized BERT pretraining approach.'' CoRR, vol. abs/1907.11692 (2019)

[16] S. Feng, et al. ''Twibot-22: Towards graph-based twitter bot detection.'' (2023)

Multirelational Twitter Bot Detection using Graph Neural Networks

Outline

Introduction

Social media and 'X' aka Twitter

What are Bots?

An Example of a bot generated tweet vs that of a real user.

Motivation

Usage of Bot Accounts

Results of Botometer [5] application when given a few user ids as input

Related Work

Previous Work

Prior Work Summary

Techniques

Graph Neural Networks

Different GNN algorithms

GCNs and RGCNs

GraphSAGE and GAT

GNN Approaches Summary

Natural Language Processing

Methodology

Process Flowchart: Multirelational Twitter Bot Detection using Graph Neural Networks

Dataset

Dataset

Feature Vector

Experiment Set-Ups

Features Extracted from Dataset

GNN models

Results

Evaluation Metrics

Accuracy Results

Precision Results

Recall Results

F1-Score Results

Conclusion and Future Work

Conclusion

Future Work

References

References

References

Thank You

Q & A

Process Flowchart:
Multirelational Twitter Bot Detection using Graph Neural Networks

F₁-Score Results