Skip to content

SMDT: Social Media Data Toolkit|

What is SMDT?

SMDT (Social Media Data Toolkit) is a comprehensive Python library designed to streamline the ingestion, standardization, and analysis of social media data. It provides a unified interface for handling data from diverse platforms, enabling researchers to focus on analysis rather than data wrangling.

Why use SMDT?

  • Unified Schema: Convert messy JSON dumps from various platforms into a consistent, queryable format.
  • Modular Design: Easily plug in new data readers, enrichers, or analysis modules.
  • Research Ready: Built specifically for computational social science workflows, supporting reproducibility and scalability.

Citation

If you use SMDT in your research, please cite the following paper:

bibtex
@article{smdt2026,
  title={Social Media Data Toolkit: Standardization and Anonymization of Social Network Datasets},
  author={Najafi, Ali and Iannucci, Letizia and Kivelä, Mikko and Varol, Onur},
  journal={arXiv preprint arXiv:2604.27710},
  year={2026}
}