Web scraping and AI-powered search API for extracting data from any website
Author: Anakin
Version: 0.0.2
Type: Tool Plugin
Anakin is a powerful web scraping and AI-powered search plugin for Dify. Built on the AnakinScraper API, it enables your AI applications to extract data from any website, perform intelligent web searches, and conduct deep research using advanced AI pipelines.
Scrapes a single URL, returning HTML, markdown, and optionally structured JSON.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| url | string | Yes | - | Target URL to scrape (HTTP/HTTPS) |
| country | string | No | "us" | Proxy location from 207 countries |
| use_browser | boolean | No | false | Enable headless Chrome for JavaScript-heavy sites |
| generate_json | boolean | No | false | Use AI to extract structured data |
| session_id | string | No | - | Browser session ID for authenticated pages |
Response includes:
Scrape up to 10 URLs simultaneously in parallel.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| urls | string | Yes | - | Comma-separated list of URLs (1-10) |
| country | string | No | "us" | Proxy location from 207 countries |
| use_browser | boolean | No | false | Enable headless Chrome for JavaScript-heavy sites |
| generate_json | boolean | No | false | Use AI to extract structured data from each page |
Synchronous AI-powered web search returning results with citations and relevance scoring.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| prompt | string | Yes | - | Search query or question |
| limit | number | No | 5 | Maximum results to return |
Response includes:
Note: This is a synchronous endpoint - results are returned immediately without polling.
Multi-stage automated research pipeline combining search, scraping, and AI synthesis.
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Research question or topic |
Response includes:
Note: This is an async operation that typically takes 1-5 minutes.
Execute pre-configured scraper templates for domain-specific structured data extraction.
| Parameter | Type | Required | Description |
|---|---|---|---|
| url | string | Yes | Target URL to scrape |
| scraper_code | string | Yes | Configuration identifier |
| scraper_params | string | No | JSON string of scraper-specific parameters |
Response: Structured JSON matching the scraper's defined schema.
Returns structured product data automatically extracted by AI.
Scrapes pages that require login using your saved browser session.
| Tool | Type | Typical Duration |
|---|---|---|
| URL Scraper | Async | 3-15 seconds |
| Batch Scraper | Async | 5-30 seconds |
| AI Search | Sync | Immediate |
| Deep Research | Async | 1-5 minutes |
| Custom Scraper | Async | 3-15 seconds |
| Code | Meaning | Action |
|---|---|---|
| 400 | Invalid parameters | Check your input |
| 401 | Invalid API key | Verify your API key |
| 402 | Plan upgrade required | Upgrade your Anakin plan |
| 404 | Job not found | Job may have expired |
| 429 | Rate limit exceeded | Wait and retry |
| 5xx | Server error | Retry with backoff |
Proxy routing supports 207 countries. Common codes:
This plugin is provided by Anakin. Usage is subject to Anakin's Terms of Service and Privacy Policy.