Documentation
Here you'll find documentation on how to use Tractor.
What is Tractor?
Tractor is, primarily, a tweet retrieval tool based on Apify, a marketplace and service for scrapers.
Tractor allows you to do exactly the same things you'd do in Apify, just much more quickly and with less clicks.
How do I use Tractor?
Using Tractor is pretty easy. Let's sum it up in 3 steps:
- Setup your Apify Key
- Ask for the tweets you want to find
- Get those tweets or schedule a task to run periodically
Setting up your Apify key
There are some prerequisites to this step.You must create an Apify account and that account must not be free. You have to, at least, sign up for the lowest tier, which currently costs $50/month.
These $50 are not completely lost, though. These now become a kind of "credit" to use the tool. Anything that costs money inside Apify will use this money. That means you can scrape a huge amount of tweets for under $5, for example; and continue doing so during the month, until the subscription is renewed.
For more details, please check out Apify's pricing page.
Once you To setup your Apify key, you'll head to the Apify settings, and go under "Integrations". There, you will find a list of keys, think of them like passwords that ideitify you uniquely. Grab one of those and put it in the Apify Key field inside Tractor.
You will only need to follow this step once. We save the key so you can skip this step altogether next time you come into Tractor.Asking for tweets
Once setup, just ask for tweets you'd like to analyze. Anything and everything applies:
- Get tweets from @graphext containing the word "data"
- Get tweets containing the term "cat" but not the term "dog"
- Get tweets using the hashtag #chocolate that are not replies
- Get tweets that contain photos talking about cats
As you can see, this is quite powerful, since we can be quite specific about our needs. This will generate a list of so called "search terms", things you can search in Twitter's search bar.
Fine tuning the queries
When the search terms are generated, you can then scope them to a specific date range, change the frequency of the search (from daily to annually) and maybe add some curated lists.
Getting the data
Once the queries are in place, we can change the number of tweets we want to bring. 5000 is usually a good start, since the system will likely not bring that much data, although that depends heavily on what you are searching.
You can rest assured that you won't get more than the maximum you set (and as such, you won't be over charged), but you can definitely get less than the maximum.
If everything looks good, you can just click the "Get tweets" button and lots of gears will start spinning to fetch your data. Once it's finished, a new button will pop up, downloading your data in a Graphext-friendly format.
You can optionally set up a schedule, which would repeat the task periodically.
You can select the frequency of your schedule and hit the schedule button. This will generate two links:
- The first one, a link to the Apify Console where you can check the Storage object where all your data will live
- The second one, a direct link to the raw data, very convenient to just plug into Graphext as a "Remote File". This will fetch the data from the Storage and create a Graphext project. Afterwards, you can optionally also turn Recurrence, which tells Graphext to fetch data periodically from the specified link, for a spectacular combo: a full Graphext project updated on the latest twitter data.