apify-skill - SKILL.md Agent Skill

name: apify-skill description: Run web scrapers and extract data from websites and social media platforms using Apify actors. Supports Instagram, TikTok, Twitter/X, LinkedIn, Facebook, YouTube, Google Search, and general web crawling. allowed-tools: "apify_actor" metadata: author: machina version: "1.0" category: web

Apify Web Scraping Skill

Run pre-built web scrapers (Actors) to extract data from websites and social media platforms.

How It Works

This skill provides instructions for the Apify Actor tool node. Connect the Apify Actor node to Zeenie's input-tools handle to enable web scraping capabilities.

apify_run_actor Tool

Run any Apify actor and retrieve scraped results.

Schema Fields

Field	Type	Required	Description
actor_id	string	Yes	Actor ID from Apify Store (e.g., `apify/instagram-scraper`)
input_json	string	No	Actor input as JSON string (default: `{}`)
max_results	integer	No	Maximum items to return (default: 100)

Popular Actors

Platform	Actor ID	Use Case
Instagram	`apify/instagram-scraper`	Profiles, posts, hashtags, comments
TikTok	`clockworks/tiktok-scraper`	Videos, profiles, trends, hashtags
Twitter/X	`apidojo/tweet-scraper`	Tweets, profiles, search results
LinkedIn	`curious_coder/linkedin-profile-scraper`	Profiles, companies
Facebook	`apify/facebook-posts-scraper`	Posts, pages, groups
YouTube	`apify/youtube-scraper`	Videos, channels, comments
Google Search	`apify/google-search-scraper`	SERP results, organic listings
Google Maps	`apify/google-maps-scraper`	Places, reviews, business info
Web Crawler	`apify/website-content-crawler`	Any website content
Web Scraper	`apify/web-scraper`	Custom scraping with selectors

Response Format

{
  "run_id": "abc123xyz",
  "actor_id": "apify/instagram-scraper",
  "status": "SUCCEEDED",
  "items": [
    { "id": "123", "text": "Post content...", "likes": 500 }
  ],
  "item_count": 50,
  "dataset_id": "dataset-id-here",
  "compute_units": 0.5,
  "started_at": "2026-02-23T10:00:00Z",
  "finished_at": "2026-02-23T10:02:30Z"
}

Examples

Scrape Instagram profile:

{
  "actor_id": "apify/instagram-scraper",
  "input_json": "{\"directUrls\": [\"https://instagram.com/natgeo\"], \"resultsLimit\": 50}",
  "max_results": 50
}

Search TikTok hashtag:

{
  "actor_id": "clockworks/tiktok-scraper",
  "input_json": "{\"hashtags\": [\"trending\", \"fyp\"], \"resultsPerPage\": 30}",
  "max_results": 30
}

Search Twitter/X:

{
  "actor_id": "apidojo/tweet-scraper",
  "input_json": "{\"searchTerms\": [\"AI automation\"], \"maxItems\": 100}",
  "max_results": 100
}

Google Search:

{
  "actor_id": "apify/google-search-scraper",
  "input_json": "{\"queries\": \"best AI tools 2026\", \"maxPagesPerQuery\": 3}",
  "max_results": 30
}

Crawl website content:

{
  "actor_id": "apify/website-content-crawler",
  "input_json": "{\"startUrls\": [{\"url\": \"https://docs.example.com\"}], \"maxCrawlDepth\": 2, \"maxCrawlPages\": 50}",
  "max_results": 50
}

Scrape Google Maps places:

{
  "actor_id": "apify/google-maps-scraper",
  "input_json": "{\"searchStringsArray\": [\"restaurants in San Francisco\"], \"maxCrawledPlaces\": 20}",
  "max_results": 20
}

Common Input Parameters by Actor

Instagram Scraper

Parameter	Type	Description
directUrls	array	Instagram URLs to scrape
resultsLimit	number	Max results per URL (1-1000)
maxComments	number	Comments per post (0-100)
searchType	string	Type of search: `hashtag`, `user`, `place`

TikTok Scraper

Parameter	Type	Description
profiles	array	TikTok usernames to scrape
hashtags	array	Hashtags to scrape (without #)
resultsPerPage	number	Results per request

Twitter/X Scraper

Parameter	Type	Description
searchTerms	array	Search queries or hashtags
twitterHandles	array	Usernames (without @)
maxItems	number	Maximum tweets to fetch

Google Search Scraper

Parameter	Type	Description
queries	string	Search query
maxPagesPerQuery	number	Pages to scrape (10 results/page)
languageCode	string	Language filter (e.g., `en`)
countryCode	string	Country filter (e.g., `us`)

Website Content Crawler

Parameter	Type	Description
startUrls	array	URLs to crawl from (objects with `url` key)
maxCrawlDepth	number	Link depth (0 = start URLs only)
maxCrawlPages	number	Total pages limit

Error Responses

Actor not found:

{
  "success": false,
  "error": "Actor 'invalid/actor-id' not found"
}

Timeout:

{
  "success": false,
  "error": "Actor run timed out after 300 seconds"
}

Run failed:

{
  "success": false,
  "error": "Actor run failed with status FAILED"
}

Run Status Values

Status	Meaning
SUCCEEDED	Run completed successfully
FAILED	Run encountered an error
ABORTED	Run was manually stopped
TIMED-OUT	Run exceeded timeout limit
RUNNING	Run still in progress

Use Cases

Use Case	Actor	Description
Social listening	tweet-scraper	Monitor brand mentions
Competitor research	instagram-scraper	Analyze competitor posts
Lead generation	linkedin-profile-scraper	Extract business contacts
SEO research	google-search-scraper	Analyze SERP rankings
Content aggregation	website-content-crawler	Collect articles from sites
Price monitoring	web-scraper	Track product prices
Review analysis	google-maps-scraper	Gather customer reviews

Guidelines

Actor IDs: Use format username/actor-name from Apify Store
Input JSON: Must be valid JSON string with actor-specific parameters
Timeouts: Default 300 seconds, configurable on the node
Max Results: Limits items returned (not items scraped)
Memory: Higher memory = faster execution, higher cost

Pricing Notes

Apify uses compute units (CU) based on memory and duration:

CU = Memory (GB) x Duration (hours)
Free tier: $5/month credits
Check actor pricing on Apify Store before large scrapes

Setup Requirements

Connect the Apify Actor node to Zeenie's input-tools handle
Configure your Apify API token in Credentials Modal
Select an actor or enter custom actor ID
Provide actor-specific input as JSON

Security Notes

Respect website terms of service
Use reasonable scraping rates to avoid IP bans
Don't scrape personal data without consent
Store scraped data securely
Comply with GDPR and data protection laws