Data Guide
Navigating SMAT data fields.
Data is delivered via JSON blobs that look like this when making an API request.

SMAT API JSON blob from a search for "Bannon" on Gettr
To get to the actual data, you need to navigate into
hits.hits
and then each result will be under a number in the nested structure.When we add a post to our database we generate a number of fields that can be found at the level above the actual data responses. So if we are looking in response "0", the meta-fields continue until "_source" which is the beginning of the actual data. These are generated by Elasticsearch and are not part of the data from the site itself. Learn more about these fields in Elasticsearch's guide.

Red line highlighting some meta-fields in SMAT data
Each dataset will be described in detail in the following sections with more fields laid out but this chart can function as a quick-start guide to a few key fields. The "Key Fields" sub-header in each data source is not every interesting field just a few, so do look into the data more as well.
- Platform: This is just the plain text readable name of the platforms not the name of the platform field in the api.
- Platform Endpoint: This is the name of that platform when you're making an API request.
- Username: This is the handle (especially user slug) for the account that is posting the message though not necessarily the author of the post contained (i.e. if it's a forward). This is their @ not their full name.
- Post content: This is the actual body of an individual post or message.
Platform | API Site Parameter | Username Field | Content Field |
---|---|---|---|
telegram | userinfo.username | message | |
gab | account.acct | content | |
tiktok_video | author | desc | |
tiktok_comment | author | text | |
parler | username | body | |
gettr | uinf.username | txt | |
truth_social | account.acct or account.username | content_cleaned | |
4chan | name | htmlparsedcom | |
win | author | content | |
8kun | name | htmlparsedcom | |
kiwifarms | author_username | post_text | |
bitchute_comment | fullname | content | |
bitchute_video | channel_slug | N/A but look at meta.description or meta.title | |
rumble_comment | username | text | |
rumble_video | channel_id | N/A but look at full_description and metadata.name | |
vk | author | text | |
lbry_comment | channel_name | comment | |
lbry_video | signing_channel.value.title | N/A but look at value.title and value.description | |
poal | user | content | |
wimkin | author_username | content | |
minds | user.username or user.name | body | |
mewe | username | content |
Description
Collected Data
Key Fields
Additional Resources
Telegram is a freemium, cross-platform, cloud-based messaging application and network. Telegram’s operational center is based in Dubai, UAE. Telegram data schema consists of channels, which users can join and post messages, images, videos or other media. Users and channels can also forward content across channels or private groups. Channels can be private or public depending upon whether they require a user invite to join. Telegram is an infamous messaging application because of its use by far-right and neo-Nazi groups in the United States, for providing news related to the Russian invasion of Ukraine, as well as other authoritarian regimes like the Myanmar Tatmadaw. It is contended as to the specifics on whether Telegram data is transported and stored in an end-to-end encrypted manner.
SMAT targets specific Telegram channels for collection due to the vast volume of Telegram users and content. SMAT utilizes subject matter experts and community volunteers to source hundreds of channels for crawling in addition to automated methods yielding over 65,000 channels being crawled. SMAT crawls users metadata, messages, all media, and channel metadata from the seed crawling set. Some of the categories of these channels are: Russian state and Russian affiliated propaganda, white nationalist groups, and European far-right groups like Querdenker.
Platform endpoint:
telegram
- Username :
userinfo.username
- The name of the channel:
channelusername
- The title of the channel:
channeltitle
- The self-description of the channel:
channelabout
- Post content :
message