name: apify-skill description: Run web scrapers and extract data from websites and social media platforms using Apify actors. Supports Instagram, TikTok, Twitter/X, LinkedIn, Facebook, YouTube, Google Search, and general web crawling. allowed-tools: "apify_actor" metadata: author: machina version: "1.0" category: web
Apify Web Scraping Skill
Run pre-built web scrapers (Actors) to extract data from websites and social media platforms.
How It Works
This skill provides instructions for the Apify Actor tool node. Connect the Apify Actor node to Zeenie's input-tools handle to enable web scraping capabilities.
apify_run_actor Tool
Run any Apify actor and retrieve scraped results.
Schema Fields
| Field | Type | Required | Description |
|---|---|---|---|
| actor_id | string | Yes | Actor ID from Apify Store (e.g., apify/instagram-scraper) |
| input_json | string | No | Actor input as JSON string (default: {}) |
| max_results | integer | No | Maximum items to return (default: 100) |
Popular Actors
| Platform | Actor ID | Use Case |
|---|---|---|
apify/instagram-scraper |
Profiles, posts, hashtags, comments | |
| TikTok | clockworks/tiktok-scraper |
Videos, profiles, trends, hashtags |
| Twitter/X | apidojo/tweet-scraper |
Tweets, profiles, search results |
curious_coder/linkedin-profile-scraper |
Profiles, companies | |
apify/facebook-posts-scraper |
Posts, pages, groups | |
| YouTube | apify/youtube-scraper |
Videos, channels, comments |
| Google Search | apify/google-search-scraper |
SERP results, organic listings |
| Google Maps | apify/google-maps-scraper |
Places, reviews, business info |
| Web Crawler | apify/website-content-crawler |
Any website content |
| Web Scraper | apify/web-scraper |
Custom scraping with selectors |
Response Format
{
"run_id": "abc123xyz",
"actor_id": "apify/instagram-scraper",
"status": "SUCCEEDED",
"items": [
{ "id": "123", "text": "Post content...", "likes": 500 }
],
"item_count": 50,
"dataset_id": "dataset-id-here",
"compute_units": 0.5,
"started_at": "2026-02-23T10:00:00Z",
"finished_at": "2026-02-23T10:02:30Z"
}
Examples
Scrape Instagram profile:
{
"actor_id": "apify/instagram-scraper",
"input_json": "{\"directUrls\": [\"https://instagram.com/natgeo\"], \"resultsLimit\": 50}",
"max_results": 50
}
Search TikTok hashtag:
{
"actor_id": "clockworks/tiktok-scraper",
"input_json": "{\"hashtags\": [\"trending\", \"fyp\"], \"resultsPerPage\": 30}",
"max_results": 30
}
Search Twitter/X:
{
"actor_id": "apidojo/tweet-scraper",
"input_json": "{\"searchTerms\": [\"AI automation\"], \"maxItems\": 100}",
"max_results": 100
}
Google Search:
{
"actor_id": "apify/google-search-scraper",
"input_json": "{\"queries\": \"best AI tools 2026\", \"maxPagesPerQuery\": 3}",
"max_results": 30
}
Crawl website content:
{
"actor_id": "apify/website-content-crawler",
"input_json": "{\"startUrls\": [{\"url\": \"https://docs.example.com\"}], \"maxCrawlDepth\": 2, \"maxCrawlPages\": 50}",
"max_results": 50
}
Scrape Google Maps places:
{
"actor_id": "apify/google-maps-scraper",
"input_json": "{\"searchStringsArray\": [\"restaurants in San Francisco\"], \"maxCrawledPlaces\": 20}",
"max_results": 20
}
Common Input Parameters by Actor
Instagram Scraper
| Parameter | Type | Description |
|---|---|---|
| directUrls | array | Instagram URLs to scrape |
| resultsLimit | number | Max results per URL (1-1000) |
| maxComments | number | Comments per post (0-100) |
| searchType | string | Type of search: hashtag, user, place |
TikTok Scraper
| Parameter | Type | Description |
|---|---|---|
| profiles | array | TikTok usernames to scrape |
| hashtags | array | Hashtags to scrape (without #) |
| resultsPerPage | number | Results per request |
Twitter/X Scraper
| Parameter | Type | Description |
|---|---|---|
| searchTerms | array | Search queries or hashtags |
| twitterHandles | array | Usernames (without @) |
| maxItems | number | Maximum tweets to fetch |
Google Search Scraper
| Parameter | Type | Description |
|---|---|---|
| queries | string | Search query |
| maxPagesPerQuery | number | Pages to scrape (10 results/page) |
| languageCode | string | Language filter (e.g., en) |
| countryCode | string | Country filter (e.g., us) |
Website Content Crawler
| Parameter | Type | Description |
|---|---|---|
| startUrls | array | URLs to crawl from (objects with url key) |
| maxCrawlDepth | number | Link depth (0 = start URLs only) |
| maxCrawlPages | number | Total pages limit |
Error Responses
Actor not found:
{
"success": false,
"error": "Actor 'invalid/actor-id' not found"
}
Timeout:
{
"success": false,
"error": "Actor run timed out after 300 seconds"
}
Run failed:
{
"success": false,
"error": "Actor run failed with status FAILED"
}
Run Status Values
| Status | Meaning |
|---|---|
| SUCCEEDED | Run completed successfully |
| FAILED | Run encountered an error |
| ABORTED | Run was manually stopped |
| TIMED-OUT | Run exceeded timeout limit |
| RUNNING | Run still in progress |
Use Cases
| Use Case | Actor | Description |
|---|---|---|
| Social listening | tweet-scraper | Monitor brand mentions |
| Competitor research | instagram-scraper | Analyze competitor posts |
| Lead generation | linkedin-profile-scraper | Extract business contacts |
| SEO research | google-search-scraper | Analyze SERP rankings |
| Content aggregation | website-content-crawler | Collect articles from sites |
| Price monitoring | web-scraper | Track product prices |
| Review analysis | google-maps-scraper | Gather customer reviews |
Guidelines
- Actor IDs: Use format
username/actor-namefrom Apify Store - Input JSON: Must be valid JSON string with actor-specific parameters
- Timeouts: Default 300 seconds, configurable on the node
- Max Results: Limits items returned (not items scraped)
- Memory: Higher memory = faster execution, higher cost
Pricing Notes
Apify uses compute units (CU) based on memory and duration:
- CU = Memory (GB) x Duration (hours)
- Free tier: $5/month credits
- Check actor pricing on Apify Store before large scrapes
Setup Requirements
- Connect the Apify Actor node to Zeenie's
input-toolshandle - Configure your Apify API token in Credentials Modal
- Select an actor or enter custom actor ID
- Provide actor-specific input as JSON
Security Notes
- Respect website terms of service
- Use reasonable scraping rates to avoid IP bans
- Don't scrape personal data without consent
- Store scraped data securely
- Comply with GDPR and data protection laws