Twitter is truly the public square of the world. That includes all countries and people speaking in all languages. The Twitter product has support for all languages, and so has Symplur.
Tweets in different languages
We track tweets in all languages from English, to Norwegian, to Arabic.
You can even focus specifically on one language by employing the Tweet Language Filter. See the breakdown of languages used in tweets with the Language Report.
All reports support multiple languages out of the box. There is only one report where our Natural Language Processing (NLP) is created specifically for the English language and that is the Sentiment Report.
We find that the majority of tweets in the healthcare industry are written in the English language, even when the user is based in a country where English is not the primary language.
Profiles in different languages
People's profiles can be written in any language and we support all of them. Typically it is the Twitter bio that can be in a non-English language. Our Bio Description Filter works on all languages.
All Twitter profiles has a default language that control their preferred user interface language. We track this meta data and it can be a great indicator of the user's primary language. See a breakdown of this in the User Language Report. You can even filter for people's default language with the User Language Filter.
We frequently find that people with a default language other than English, still tweets primarily in English for their healthcare conversation engagement.
Tweet location, People location and Time Zones
For all the conversations we track we receive much meta data attached to each individual tweet. One of these meta data is the geo location of the user at the moment they tweeted. We cover this globally and you can view this data in the Geolocation Report.
All Twitter profiles has a location field that is a free form text field where they can write anything. We try to map this on a global map to visualize where their home base is at. You can view this data in the Location & Time Zones Report. Feel free to use the corresponding Mapped Location Filter and Time Zone Filter.
Healthcare Stakeholders in different languages
One of the most powerful features in Symplur Signals is our ability to categorize Twitter accounts according to certain stakeholder groups. Our categorization process supports multiple languages, and as a result you will find Twitter accounts in Arabic and other languages perfectly categorized.
We pay special attention to this for all healthcare conferences. No matter what country it took place or with what language it was conducted in, we will categorize the top influencers of that conference.
Our process is assisted by Google Translate technology for accurate translation of over 100 languages.
Databases in different languages
The data available in Symplur Signals comes from three sources; hashtags, keywords and user accounts. All three can be based in any language.
Hashtags are typically language agnostic since most of them do not consist of English words. But some broad hashtags are simply words like "#diabetes". We track a multitude of these English hashtag terms, but also hashtag terms in other languages like Spanish and German.
Keywords are terms or phrases that are mentioned in a tweet. We track diagnosis, symptoms, treatment phrases, drug names and more. Many are in English, many are Latin-based just like much of our medical terminology is and thus we cover a host of languages. Some keyword and phrases are specifically in other languages, and others are language agnostic such as certain drug names.
There are no limitations in terms of language or location that our system can track.