This dataset consists of IDs of geotagged Twitter posts from within the United States. They are provided as files per day and state as well as per day and county. In addition, files containing the aggregated number of hashtags from these tweets are provided per day and state and per day and county. This data is organized as a ZIP-file per month containing several zip-files per day which hold the txt-files with the ID/hash information. Also part of the dataset are two shapefiles for the US counties and states and Python scripts for the data collection and sorting geotags into counties.
Aufzeichnung (mechanisch/elektronisch)
Recording
Geotagged tweets from the US
No user sampling. Selection of Tweets with geo-location within bounding box for the United States (-128.6, 24.5), (-59, 50). Sampling by Twitter API unknown.