Pricing
Per call
$0.02
Model
flat
Pay only for what you use. No subscriptions.
Inputs
remove_images
booleancountry
stringmetadata
objectformats
arrayremove_css_selectors
stringtransformer
stringwait_before_scraping
integerlinks_on_page
objectllm_extract
objectparser
objectscreen_size
objecturl_to_scrape *
stringactions
arrayremove_class_names
arrayTry It
API
MCP Config
Input Parameters
Residential country to load the request from. Supported values are: * US (United States) * CA (Canada) * IT (Italy) * IN (India) * GB (England) * JP (Japan) * MX (Mexico) * AU (Australia) * ID (Indonesia) * UA (UAE) * RU (Russia) * RANDOM Some operations, like scraping Google Search and Google News, support all countries.
User-defined metadata. Not supported yet
Formats in which you want the content.
Option to remove certain CSS selectors from the content. Optionally, you can also pass a JSON stringified array of specific selectors you want to remove. The CSS selectors removed when this option is set to default are ['nav','footer','script','style','noscript','svg',[role=alert],[role=banner],[role=dialog],[role=alertdialog],[role=region][aria-label*=skip i],[aria-modal=true]] Available options: `default`, `none`, `array`
Specify the HTML transformer to use, if any. Postlight's Mercury Parser library is used to remove ads and other unwanted content from the scraped content. Available options: `postlight`, `none`
Time to wait in milliseconds before starting the scraping.
With this option, you can get all the links present on the page you scrape.
When defining json as a format, you can use this parameter to specify the parser to use. Parsers are useful to extract structured content from web pages. Olostep has a few parsers built in for most common web pages, and you can also create your own parsers.
Configuration for screen size. Preset dimensions are available through screen_type: desktop (1920x1080), mobile (414x896), or default (768x1024).
The URL to start scraping from.
Actions to perform on the page before getting the content.
List of class names to remove from the content.

