Oxylabs

Get data from any website with Oxylabs.

Oxylabs is a web scraping service that retrieves public web data at scale, with tools designed to navigate regional restrictions.

Oxylabs Docuemnt Loader Node

Features

  • Retrieve data from Google, Amazon and any other website

  • Set geolocation

  • Utilize the browser rendering

  • Parse the data

  • Specify User Agent types

  • Process content with text splitters

Required Parameters

  • Connect Credential: Oxylabs API credentials

  • Query: Search query or URL

  • Source: One of the available sources:

    • Universal - scrape any website

    • Google Search - scrape Google Search results

    • Amazon Product - scrape Amazon Product information

    • Amazon Search - scrape Amazon Search results

Optional Parameters

  • Geolocation: Sets the proxy's geo location to retrieve data. See documentation for more details.

  • Render: Enables JavaScript rendering when set to true.

  • Parse: Returns parsed data when set to true, as long as a dedicated parser exists for the submitted URL's page type.

  • User Agent Type: Device type and browser.

Outputs

  • Document: Array of document objects containing metadata and pageContent

  • Text: Concatenated string from pageContent of documents

Document Structure

Each document contains:

  • pageContent: Extracted page content

Last updated