scrapegraph_start_smartscraper

Scrapegraphai API

Tool

Extract content from a webpage using AI by providing a natural language prompt and a URL.

Pricing

Per call: $0.08

Model: flat

Pay only for what you use. No subscriptions.
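As a rough budgeting aid, the flat per-call price above can be turned into a small estimate. This is a minimal sketch: `estimate_cost` is a hypothetical helper, and it assumes every invocation is billed once at $0.08 regardless of options such as `total_pages`, which this page does not confirm.

```python
from decimal import Decimal

# Flat per-call price listed in the Pricing section.
PRICE_PER_CALL = Decimal("0.08")


def estimate_cost(num_calls: int) -> Decimal:
    """Return the estimated total in USD for num_calls invocations.

    Assumption: each call is billed exactly once at the flat price.
    """
    if num_calls < 0:
        raise ValueError("num_calls must be non-negative")
    return PRICE_PER_CALL * num_calls


print(estimate_cost(250))  # prints 20.00
```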

Inputs

headers (object)
website_markdown (string)
steps (string)

object

user_prompt * (string)
total_pages (number)
render_heavy_js (boolean)
number_of_scrolls (number)
website_html (string)
website_url * (string)
stealth (boolean)
output_schema (object)
mock (boolean)

Input Parameters

headers

Optional custom HTTP headers to send with the request. Useful for setting User-Agent, cookies, authentication tokens, and other request metadata. Example: {"User-Agent": "Mozilla/5.0...", "Cookie": "session=abc123"}
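The following sketch shows one way the `headers` object might be assembled alongside the two starred inputs. `build_arguments` is a hypothetical helper, the URL and prompt are placeholders, and the code only builds the input dictionary; how it is submitted to the tool is outside this example.

```python
from typing import Dict, Optional


def build_arguments(
    website_url: str,
    user_prompt: str,
    headers: Optional[Dict[str, str]] = None,
) -> dict:
    """Build the tool inputs, adding `headers` only when some are given."""
    arguments = {"website_url": website_url, "user_prompt": user_prompt}
    if headers:
        # Copy into a fresh dict of strings so the caller's object is untouched.
        arguments["headers"] = {str(k): str(v) for k, v in headers.items()}
    return arguments


arguments = build_arguments(
    "https://example.com/account",  # placeholder URL
    "List the order numbers and totals shown on the page",
    headers={"User-Agent": "Mozilla/5.0", "Cookie": "session=abc123"},
)
```

Leaving `headers` out entirely when none are supplied keeps the request identical to the default behavior.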

website_markdown

Raw Markdown content to process directly (max 2MB). Mutually exclusive with website_url and website_html. Perfect for extracting structured data from Markdown documentation, README files, or any content already in Markdown format.

steps

Optional array of interaction steps to perform on the webpage before extraction. Each step is a string describing the action to take (e.g., “click on filter button”, “wait for results to load”). Example: ["click on search button", "type query in search box", "wait for results"]
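A minimal sketch of preparing `steps`, following the description above (a list of plain-language action strings). Note that the Inputs list gives the type as string, so the exact wire format is an assumption here. `normalize_steps` is a hypothetical helper, and the URL and prompt are placeholders.

```python
from typing import Iterable, List


def normalize_steps(steps: Iterable[str]) -> List[str]:
    """Trim whitespace and drop empty entries, preserving order."""
    cleaned = []
    for step in steps:
        text = step.strip()
        if text:
            cleaned.append(text)
    return cleaned


arguments = {
    "website_url": "https://example.com/search",  # placeholder URL
    "user_prompt": "Extract the title and URL of each search result",
    # Assumption: steps are passed as a list of strings, per the description.
    "steps": normalize_steps(
        ["click on search button", " type query in search box ", "", "wait for results"]
    ),
}
```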

Optional cookies object for authentication and session management. Useful for accessing authenticated pages or maintaining session state. Example: {"session_id": "abc123", "auth_token": "xyz789"}

user_prompt

Natural language description of what information you want to extract from the webpage.

total_pages

Optional parameter to enable pagination and scrape multiple pages. Specify the number of pages to extract data from. Default: 1. Range: 1-100.

render_heavy_js

number_of_scrolls

Optional parameter for infinite scroll pages. Specify how many times to scroll down to load more content before extraction. Default: 0. Range: 0-50.
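The documented defaults and ranges for `total_pages` (default 1, range 1-100) and `number_of_scrolls` (default 0, range 0-50) can be checked locally before a call is made. This is a sketch only: `resolve_paging` is a hypothetical helper, and whether the service rejects or clamps out-of-range values is not stated on this page.

```python
from typing import Optional


def resolve_paging(
    total_pages: Optional[int] = None,
    number_of_scrolls: Optional[int] = None,
) -> dict:
    """Apply the documented defaults and reject out-of-range values."""
    pages = 1 if total_pages is None else total_pages
    scrolls = 0 if number_of_scrolls is None else number_of_scrolls
    if not 1 <= pages <= 100:
        raise ValueError("total_pages must be between 1 and 100")
    if not 0 <= scrolls <= 50:
        raise ValueError("number_of_scrolls must be between 0 and 50")
    return {"total_pages": pages, "number_of_scrolls": scrolls}


paging = resolve_paging(total_pages=5)
```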

website_html

Raw HTML content to process directly (max 2MB). Mutually exclusive with website_url and website_markdown. Useful when you already have HTML content cached or want to process modified HTML.

website_url

The URL of the webpage you want to extract information from. You must provide exactly one of: website_url, website_html, or website_markdown.
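The "exactly one of" rule and the 2MB limit on inline content can be enforced before sending a request. The sketch below uses a hypothetical `check_source` helper and assumes "2MB" means 2 × 1024 × 1024 bytes of UTF-8 text; the service may measure the limit differently.

```python
# Assumption: the 2MB limit is 2 MiB measured on the UTF-8 encoded content.
MAX_INLINE_BYTES = 2 * 1024 * 1024
SOURCE_FIELDS = ("website_url", "website_html", "website_markdown")


def check_source(arguments: dict) -> str:
    """Return the name of the single source field, or raise ValueError."""
    provided = [name for name in SOURCE_FIELDS if arguments.get(name)]
    if len(provided) != 1:
        raise ValueError(
            "provide exactly one of website_url, website_html, website_markdown; "
            f"got {provided or 'none'}"
        )
    field = provided[0]
    if field != "website_url":
        size = len(arguments[field].encode("utf-8"))
        if size > MAX_INLINE_BYTES:
            raise ValueError(f"{field} is {size} bytes, above the 2MB limit")
    return field


source = check_source(
    {"website_html": "<h1>Hello</h1>", "user_prompt": "Extract the heading"}
)
```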

stealth

output_schema

Optional schema to structure the output. If provided, the AI will attempt to format the results according to this schema.
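As an illustration, an `output_schema` object might look like the following. The page does not say which schema dialect is expected, so the JSON Schema style used here is an assumption, and the field names, URL, and prompt are placeholders.

```python
import json

# Assumption: a JSON Schema style object; the accepted dialect is not documented here.
output_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
        "in_stock": {"type": "boolean"},
    },
    "required": ["title"],
}

arguments = {
    "website_url": "https://example.com/product/42",  # placeholder URL
    "user_prompt": "Extract the product title, price, and stock status",
    "output_schema": output_schema,
}

# Serialize to confirm the inputs form a valid JSON document.
payload = json.dumps(arguments, indent=2)
```

Since the description says the AI will only attempt to follow the schema, it is prudent to validate the returned data on the client side as well.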

mock

Cost per run

Execution cost: $0.08

Deducted from your xPay allowance.
