Scrape Tool
Extracts text or attribute values from the current page using a selector (and optional attribute), returning the matched values.
Arguments
| Name | Type | Description |
|---|---|---|
| task | DownloadTask | Required. A task from the previous Start or Crawl tool response |
| selector | String | Required. Selector for getting interesting data on a web page |
| attributeName | String | Optional. Attribute name to get data from. Use val to get inner text. Default value: val |
Remarks
The selector argument is a selector of the following format: CSS|XPATH: selector. The first part defines the selector type, the second one should be a selector in the corresponding type.
Supported types:
DownloadTask
Represents a single page download request produced by a crawl or scrape job.
Fields:
| Name | Type | Description |
|---|---|---|
| Id | String | Required. Task Id |
| Url | String | Required. Page URL |
Return Type
Array of String