Talkwalker: Most-used fields
The table below lists the most-used fields that you can import from Talkwalker. Other fields might also be available in Adverity.
The fields that you can fetch in Adverity are updated regularly to reflect updates to data source APIs.
| API name | Adverity UI name | Description | Use in Adverity |
|---|---|---|---|
| Engagement | engagement | A key metric representing the sum of actions made by others on an article or post. Can be used as a histogram type or for sorting search results. | metric |
| Published | published | The timestamp indicating when an article or post was originally published. Can be used as a histogram type or for sorting search results. | dimension |
| Reach | reach | The potential number of people who were exposed to an article or post. This metric is available in histogram data. | metric |
| Search indexed | search_indexed | The timestamp indicating when an article or post was indexed by Talkwalker. Can be used as a histogram type or for sorting search results. | dimension |
| Sentiment | sentiment | The overall emotional tone or opinion expressed in the content, often categorized as positive, neutral, or negative. Can be used for breakdown in histograms. | metric |
| article_extended_attributes.facebook_likes | article_extended_attributes.facebook_likes | The number of ‘likes’ an article received on Facebook. | metric |
| article_extended_attributes.facebook_shares | article_extended_attributes.facebook_shares | The number of times an article was shared on Facebook. | metric |
| article_extended_attributes.num_comments | article_extended_attributes.num_comments | The total number of comments an article or post received. | metric |
| article_extended_attributes.twitter_shares | article_extended_attributes.twitter_shares | The number of times an article was shared (retweeted) on Twitter. | metric |
| cluster_id | cluster_id | Unique identifier for a cluster of similar content or mentions. | dimension |
| content | content | The full text content of the article or post. | dimension |
| content_snippet | content_snippet | A short excerpt or summary of the article’s content. | dimension |
| domain_url | domain_url | The root domain URL of the website where the content was published. | dimension |
| entity_url | entity_url | The URL of the content entity or document. | dimension |
| extra_author_attributes.gender | extra_author_attributes.gender | The gender of the author of the content. This can be used for demographic distribution in histograms. | dimension |
| extra_author_attributes.id | extra_author_attributes.id | Unique identifier for the author of the content. | dimension |
| extra_author_attributes.name | extra_author_attributes.name | The name of the author of the content. | dimension |
| extra_source_attributes.id | extra_source_attributes.id | A unique identifier for the source (e.g., website, social media profile) of the content. | dimension |
| extra_source_attributes.name | extra_source_attributes.name | The name of the source (e.g., website name, social media profile name) of the content. | dimension |
| extra_source_attributes.world_data.city | extra_source_attributes.world_data.city | The city associated with the source of the content, used for geographical analysis. This is available in source-based distributions. | dimension |
| extra_source_attributes.world_data.continent | extra_source_attributes.world_data.continent | The continent where the content source is located. | dimension |
| extra_source_attributes.world_data.country | extra_source_attributes.world_data.country | The country associated with the geographical location of the content source. | dimension |
| extra_source_attributes.world_data.country_code | extra_source_attributes.world_data.country_code | The ISO 3166-1 alpha-2 country code associated with the geographical location of the content source. | dimension |
| extra_source_attributes.world_data.latitude | extra_source_attributes.world_data.latitude | The latitude coordinate of the content source’s geographical location. | metric |
| extra_source_attributes.world_data.longitude | extra_source_attributes.world_data.longitude | The longitude coordinate of the content source’s geographical location. | metric |
| extra_source_attributes.world_data.region | extra_source_attributes.world_data.region | The region associated with the source of the content, used for geographical analysis. This is available in source-based distributions. | dimension |
| extra_source_attributes.world_data.resolution | extra_source_attributes.world_data.resolution | The level of geographical detail for the source’s location data. | dimension |
| fluency_level | fluency_level | A numerical score indicating the linguistic fluency or quality of the content. | metric |
| host_url | host_url | The URL of the host website or platform where the content is published. | dimension |
| images | images | A list of image objects associated with the content. | dimension |
| indexed | indexed | Indicates whether the content has been indexed by Talkwalker. | dimension |
| lang | lang | The language in which the content is written. This can be used for language-based distributions in histograms. | dimension |
| parent_url | parent_url | The URL of the parent page or post from which the current content originated or is linked. | dimension |
| porn_level | porn_level | A numerical score indicating the likelihood of the content containing pornographic material. | metric |
| post_type | post_type | The type of the social media post or content, e.g., ‘TEXT’, ‘IMAGE’, ‘VIDEO’. | dimension |
| report_date | report_date | The publication date of the content, represented as a timestamp in milliseconds. | dimension |
| root_url | root_url | The base URL of the domain where the content is hosted. | dimension |
| source_extended_attributes.alexa_pageviews | source_extended_attributes.alexa_pageviews | The estimated number of pageviews for the source according to Alexa ranking data. | metric |
| source_extended_attributes.alexa_unique_visitors | source_extended_attributes.alexa_unique_visitors | The estimated number of unique visitors to the source according to Alexa ranking data. | metric |
| source_type | source_type | The category or platform of the data source (e.g., news site, blog, forum, social network). | dimension |
| spam_level | spam_level | A numerical score indicating the likelihood of the content being spam. | metric |
| tags_internal | tags_internal | Internal tags applied to the document within the Talkwalker project. These tags can be managed via the Talkwalker API and used for filtering search results. | dimension |
| title | title | The full title of the article or post. | dimension |
| title_snippet | title_snippet | A short excerpt or summary of the article’s title. | dimension |
| url | url | The direct URL of the article or post. | dimension |
| videos | videos | A list of video objects associated with the content. | dimension |
| word_count | word_count | The total number of words in the content. | metric |
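
Many of the field names above use dot notation (for example, extra_source_attributes.world_data.country_code), and report_date is described as a timestamp in milliseconds. The following is a minimal sketch of how such names and values might be handled in a downstream script. It assumes, without confirmation from this page, that a record arrives as nested JSON-like data whose nesting mirrors the dotted names. The `flatten` helper and the `sample` record are hypothetical and are not part of Talkwalker or Adverity.

```python
from datetime import datetime, timezone


def flatten(record, parent=""):
    """Turn nested dicts into a flat dict with dotted keys.

    Non-dict values (numbers, strings, lists) are kept as they are.
    """
    flat = {}
    for key, value in record.items():
        name = f"{parent}.{key}" if parent else key
        if isinstance(value, dict):
            flat.update(flatten(value, name))
        else:
            flat[name] = value
    return flat


# Hypothetical record, invented for illustration only.
sample = {
    "url": "https://example.com/post/1",
    "report_date": 1700000000000,  # assumed: milliseconds since the Unix epoch
    "extra_source_attributes": {
        "name": "Example Source",
        "world_data": {"country_code": "US", "latitude": 40.7, "longitude": -74.0},
    },
    "article_extended_attributes": {"num_comments": 3},
}

row = flatten(sample)

# Convert the millisecond timestamp to a timezone-aware datetime (UTC).
report_dt = datetime.fromtimestamp(row["report_date"] / 1000, tz=timezone.utc)

print(row["extra_source_attributes.world_data.country_code"])
print(report_dt.isoformat())
```

Keeping the dotted names as flat keys makes it straightforward to match values against the field names listed in the table.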