Chris Pollett
CS Dept, SJSU
Joint Work with:
Ketan Jadhav (Graduate), Petros Potikas, and Katerina Potika
July 22, 2025
Title | Year | Approach | Datasets |
---|---|---|---|
BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection [6] | 2022 | User profile data and relations classified using ML algorithms | ByteDance Security AI Challenge |
MRLBot: Multi-Dimensional Representation Learning for Social Media Bot Detection [7] | 2023 | User relation with neural networks | Cresci-15 |
Detect Me If You Can: Spam Bot Detection Using Inductive Representation Learning [8] | 2019 | Profile properties with neighborhood relations with representational learning | Cresci-15 |
SATAR: A Self-supervised Approach to Twitter Account Representation Learning and its Application in Bot Detection [9] | 2021 | Tweet, metadata and neighborhood relations with self-supervised learning | TwiBot-20, Cresci-17, PAN-19 |
Heterogeneity-Aware Twitter Bot Detection with Relational Graph Transformers [10] | 2021 | Focus on neighborhood relation graphs with GNNs | Twibot-20 |
GCN | R-GCN | GraphSAGE | GAT | |
---|---|---|---|---|
Use Case | Basic convolutional graph-based learning | Handles graphs with multiple edge types | Supports scalable inductive learning for large graphs | If need attention mechanism for varying neighbor importance |
How Works | Captures local graph structure | Captures multi-relational graph structure | Captures local structure and supports inductive learning | Captures complex relationships with attention |
Aggregation | Weighted sum of neighbor features | Weighted sum of neighbor features for each relation type | Sample and aggregate (mean, LSTM, pooling) | Attention-weighted sum of neighbor features |
Users | Bots | Tweets | Edges | |
---|---|---|---|---|
Twibot-22 [16] | 860057 | 139943 | 86764167 | 170185937 |
Subset Used | 139943 | 139943 | N/A | 2349098 |
Relation | Edges |
---|---|
Following | 2626979 |
Follower | 1116655 |
Post | 40887365 |
Like | 595794 |
We also create a synthetic relationship, we call Interaction to indicate whether a given user liked a post of another given user. We can modify this further to track the number of liked posts.
Takeaways: GraphSAGE performs similar on all relations GAT for interaction relation is the best followed by RGCN for follower + following
Takeaways: GraphSAGE performs similar on all relations GAT for interaction relation is the best followed by RGCN for follower + following + interaction
Takeaways: GAT for follower + following + interaction relation is the best followed by RGCN for follower + following + interaction
Takeaways: GraphSAGE performs similar on all relations GAT for interaction relation is the best followed by RGCN for follower + following
[1] ''Estimating twitter's bot-free monetizable daily active users,'' https://www.similarweb.com/blog/insights/social-media-news/twitter-bot-research/, 2022.
[2] Alison Grace Johansen. ''What's a Twitter bot and how to spot one'', https://us.norton.com/blog/emerging-threats/what-are-twitter-bots-and-how-to-spot-them
[3] Luceri, et al. ''View of Evolution of bot and human behavior during elections'' (2019)
[4] Nizzoli, et al. ''Charting the Landscape of Online Cryptocurrency Manipulation'', (2020)
[5] - K.-C. Yang, E. Ferrara, and F. Menczer. ''Botometer 101: Social bot practicum for computational social scientists,'' Journal of Computational Social Science, vol. 5, no. 2, pp. 1511--1528, 2022.
[6] Li, Shudong. Zhao, Chuanyu. Li, Qing. Huang, Jiuming. Zhao, Dawei. Zhu, Pei can. ''BotFinder: A Novel Framework for Social Bots Detection in Online Social Networks Based on Graph Embedding and Community Detection.'' 10.21203/rs.3.rs-1871702/v1. (2022).
[7] Zeng, F. Sun, Y. Li, Y. ''MRLBot: Multi-Dimensional Representation Learning for Social Media Bot Detection''. Electronics 2023, 12, 2298. https://doi.org/10.3390/electronics12102298 (2023)
[8] Alhosseini, Seyed. Bin Tareaf, Raad. Najafi, Pejman. Meinel, Christoph. ''Detect Me If You Can: Spam Bot Detection Using Inductive Representation Learning.'' 10.1145/3308560.3316504. (2019).
[9] Shangbin Feng. Herun Wan. Ningnan Wang. Jundong Li. Minnan Luo. SATAR: A Self-supervised Approach to Twitter Account Representation Learning and its Application in Bot Detection. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM '21). Association for Computing Machinery, New York, NY, USA, 3808--3817. https://doi.org/10.1145/3459637.3481949 (2021)
[10] Shangbin Feng. Zhaoxuan Tan. Rui Li. Minnan Luo. ''Heterogeneity-aware Twitter Bot Detection with Relational Graph Transformers''. https://doi.org/10.48550/arXiv.2109.02927
[11] T. N. Kipf and M. Welling, ''Semi-supervised classification with graph convolutional networks,'' CoRR, vol. abs/1609.02907 (2016)
[12] M. Schlichtkrull. T. N. Kipf. P. Bloem. R. van den Berg. I. Titov, and M. Welling. ''Modeling relational data with graph convolutional networks.'' (2017)
[13] P. Velickovic. G. Cucurull. A. Casanova. A. Romero. P. Lio. Y. Bengio. ''Graph attention networks.'' (2018)
[14] W. L. Hamilton. R. Ying. J. Leskovec. ''Inductive representation learning on large graphs.'' CoRR, vol. abs/1706.02216, (2017)
[15] Y. Liu. M. Ott. N. Goyal. J. Du. M. Joshi. D. Chen. O. Levy. M. Lewis. L. Zettlemoyer. V. Stoyanov. ''Roberta: A robustly optimized BERT pretraining approach.'' CoRR, vol. abs/1907.11692 (2019)
[16] S. Feng, et al. ''Twibot-22: Towards graph-based twitter bot detection.'' (2023)