Scraper
Scrape
Scrape a URL. Output is returned in the response.
POST /scraper/scrape
Scrape any webpage URL and get text, HTML, or Markdown. A Chrome browser with JavaScript enabled is used to ensure every webpage can be scraped. Set advanced_proxy to true to enable our advanced scraping proxy, which will circumvent most bot detection and blocking systems (using the advanced proxy does not currently cost any extra).
Authorizations
Authorization
string, header, required
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
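As a minimal sketch, the authenticated request could be built with Python's standard library as shown below. The base URL https://api.example.com and the helper name build_scrape_request are placeholders, not values from this reference; substitute the real API host. The request is only constructed here, not sent.

```python
import json
import urllib.request

# Hypothetical base URL: the real API host is not given in this reference.
BASE_URL = "https://api.example.com"


def build_scrape_request(token: str, url: str) -> urllib.request.Request:
    """Build (but do not send) a POST /scraper/scrape request."""
    body = json.dumps({"url": url}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + "/scraper/scrape",
        data=body,
        method="POST",
        headers={
            # Bearer authentication header of the form "Bearer <token>".
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
```

Passing the returned object to urllib.request.urlopen would send it; keeping construction separate makes the header and body easy to inspect first.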
Body
application/json
url
string, required
URL to scrape
format
enum<string>, default: text
Format of the scraped page content
Available options: text, html, markdown
extract_object
object
advanced_proxy
boolean, default: false
Use the advanced proxy
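The body fields above can be assembled and checked before sending. The following is a sketch only: the helper name build_scrape_body is hypothetical, and extract_object is left out because its shape is not described in this reference.

```python
import json

# Options listed for the "format" field in this reference.
ALLOWED_FORMATS = ("text", "html", "markdown")


def build_scrape_body(url: str, fmt: str = "text", advanced_proxy: bool = False) -> str:
    """Return the JSON request body as a string, validating "format" locally."""
    if not url:
        raise ValueError("url is required")
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"format must be one of {ALLOWED_FORMATS}, got {fmt!r}")
    return json.dumps(
        {
            "url": url,
            "format": fmt,                     # documented default: text
            "advanced_proxy": advanced_proxy,  # documented default: false
        }
    )
```

Validating format on the client side is a design choice for this sketch; the API may well perform its own validation and report errors differently.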
Response
200 - application/json
status
enum<string>, required
Scraping status
Available options: scraped, failed
meta
object, required
Page metadata
text
string
Scraped content
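A response with these fields could be handled roughly as follows. This is a sketch under assumptions: the helper name read_scrape_response is hypothetical, the contents of meta are not documented here so they are ignored, and text is treated as optional because the reference does not mark it as required.

```python
import json
from typing import Optional


def read_scrape_response(raw: str) -> Optional[str]:
    """Return the scraped content if status is "scraped", otherwise None."""
    data = json.loads(raw)
    if data.get("status") == "scraped":
        # "text" is not marked required, so it may be absent.
        return data.get("text")
    return None
```

For example, a body such as {"status": "failed", "meta": {}} would yield None, so callers can decide whether to retry, perhaps with advanced_proxy set to true.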