A dataset is the source and the starting point of your analysis. It is the data from which you will extract insights.
On the Symplur platform we have five types of datasets:
Keyword dataset types can consist of one or more words, for example, "diabetes," "lung cancer," "non-small cell lung carcinoma." All usage of this keyword or phrase will be found in that dataset. Twitter sends Symplur 100% of all tweets that include the phrase "lung cancer."
This is an easy dataset type. We get all tweets that contain a hashtag such as "#LCSM".
3. Twitter Accounts:
This dataset contains all tweets from a certain account and also all tweets mentioning that same Twitter account. In other words, this dataset type contains bi-direction conversations with an account. An example would be "@symplur".
This dataset type will contain all tweets that contain certain URLs regardless of the text in the tweet. These URLs can be a specific URL or they can be broad such as a top-level domain name. Examples:
This dataset contains all tweets containing a given stock ticker symbol (prefaced by a "$" sign) of a publicly traded company in the healthcare industry. An example would be $PFE (Pfizer).