Scrape parse configs / Generate

Test or generate scrape parse configurations.

Route: /api/{tenant:minlength(2)}/v{version:apiVersion}/scrape_parse_configs/generate

Method: POST

Authorisation: Minimum role: ADMIN
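
As a minimal sketch, the route template above can be filled in as follows. The base URL, the helper name `build_generate_url` and the example tenant and version values are assumptions made for illustration; only the path layout and the minimum tenant length of 2 come from the route definition. How the ADMIN credentials are supplied is not covered here, so the sketch stops at building the URL.

```python
from urllib.parse import quote

# Path layout taken from the documented route; the constraint names are dropped.
ROUTE_TEMPLATE = "/api/{tenant}/v{version}/scrape_parse_configs/generate"


def build_generate_url(base_url: str, tenant: str, version: str) -> str:
    """Fill the route template (hypothetical helper, not part of the API)."""
    # The route constraint minlength(2) requires at least two characters.
    if len(tenant) < 2:
        raise ValueError("tenant must be at least 2 characters long")
    path = ROUTE_TEMPLATE.format(
        tenant=quote(tenant, safe=""),
        version=quote(version, safe=""),
    )
    return base_url.rstrip("/") + path


# Hypothetical host, tenant and version.
url = build_generate_url("https://api.example.com", "acme", "1")
print("POST", url)
```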

Request Arguments
Name | Type | Source
input | DTO_scrape_parse_config_generate_input | Body

  Request body example
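
A possible request body for TEST mode might look like the sketch below. Only the field names come from the DTO tables; every value (domain id, URL, XPath, URL pattern) is hypothetical, and sending enums as their integer values is an assumption, since the serialisation format is not stated here.

```python
import json

# Hypothetical values throughout; field names follow
# DTO_scrape_parse_config_generate_input and DTO_scrape_parse_config.
request_body = {
    "scrape_domain_id": 12,
    "mode": 1,  # TEST (assumed to be sent as its integer value)
    "test_urls": ["https://example.com/articles/1"],
    "configs": [
        {
            "scrape_domain_id": 12,
            "apply_on_url_pattern": "/articles/",  # pattern syntax is assumed
            "unique_key": "title",
            "xpath": "//h1",
            "is_collection": False,
            "extraction_mode": 2,  # INNER_TEXT
            "post_process_rule": 0,  # NOT_SET
            "sub_configs": [],
        }
    ],
    "human_explained_extractions": [],
}

print(json.dumps(request_body, indent=2))
```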

Response object

Response status: 200 (OK)

Response type: DTO_scrape_parse_config_generate_output

Wrapper: DTO_response_wrap

Other response statuses: 400 (BadRequest), 403 (Forbidden)

  Response body example
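
The documented statuses could be handled along these lines. The helper name `check_status` and the choice of exception types are assumptions of this sketch; the suggestion that a 403 relates to the ADMIN role requirement is an inference from the authorisation rule, not a documented guarantee.

```python
from http import HTTPStatus


def check_status(status_code: int) -> bool:
    """Return True on 200 (OK); raise for any other status."""
    if status_code == HTTPStatus.OK:
        return True
    if status_code == HTTPStatus.FORBIDDEN:
        # Possibly a caller below the minimum ADMIN role (assumption).
        raise PermissionError("403 Forbidden")
    if status_code == HTTPStatus.BAD_REQUEST:
        raise ValueError("400 BadRequest: the request was rejected")
    # Anything else is not listed for this endpoint.
    raise RuntimeError(f"undocumented status {status_code}")
```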

DTO_scrape_parse_config_generate_input
Name | Type | ReadOnly | Description
scrape_domain_id | Int32 | | Scrape domain id
configs | DTO_scrape_parse_config[] | | Configs
test_urls | String[] | | Test urls
mode | ENUM scraper_parse_generate_mode | | Mode
human_explained_extractions | String[] | | For generate mode, explain what should be extracted.
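
Since `human_explained_extractions` is described as relevant to generate mode, a client-side builder could attach it only in that mode. This is a sketch under assumptions: the helper `build_generate_input` is hypothetical, requiring at least one explanation in generate mode is a choice of this sketch rather than a documented server rule, and enum values are assumed to be sent as integers.

```python
from typing import Any, Optional

MODE_TEST = 1      # scraper_parse_generate_mode.TEST
MODE_GENERATE = 2  # scraper_parse_generate_mode.GENERATE


def build_generate_input(
    scrape_domain_id: int,
    mode: int,
    test_urls: list,
    configs: Optional[list] = None,
    explanations: Optional[list] = None,
) -> dict[str, Any]:
    """Assemble a DTO_scrape_parse_config_generate_input-shaped dict."""
    body: dict[str, Any] = {
        "scrape_domain_id": scrape_domain_id,
        "mode": mode,
        "test_urls": list(test_urls),
        "configs": list(configs or []),
    }
    if mode == MODE_GENERATE:
        # Sketch-level check: generate mode without a description of what
        # to extract is treated as a caller error.
        if not explanations:
            raise ValueError("generate mode needs at least one explanation")
        body["human_explained_extractions"] = list(explanations)
    return body
```

For example, `build_generate_input(12, MODE_GENERATE, ["https://example.com/a"], explanations=["The article title"])` produces a body with one explanation and an empty `configs` list.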

DTO_scrape_parse_config_generate_output
Name | Type | ReadOnly | Description
configs | DTO_scrape_parse_config[] | | Configs
extractions | DTO_scrape_parse_config_generate_output_extraction[] | | Extractions

DTO_scrape_parse_config
Name | Type | ReadOnly | Description
created_by | DTO_reference_user | ReadOnly | Created by
last_updated_by | DTO_reference_user | ReadOnly | Last updated by
sub_configs | DTO_scrape_parse_config[] | | Sub configs
id | Int32 | ReadOnly | Leave empty on input
scrape_domain_id | Int32 | | Scrape domain id
apply_on_url_pattern | String | | Apply on url pattern
unique_key | String | | Unique key
xpath | String | | XPath
is_collection | Boolean | | Is collection
extraction_mode | ENUM scraper_parse_extraction_mode | | Extraction mode
post_process_rule | ENUM scraper_parse_post_process | | Post process rule
used_last | DateTime (nullable) | | Last used. Only updated when the previous value is more than roughly 10 minutes old, to avoid too many updates.
created | DateTime | ReadOnly | Created
last_updated | DateTime | ReadOnly | Last updated
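
Configs returned by the API carry read-only fields, and `id` is marked "Leave empty on input". One way to reuse a returned config as input is to drop those fields recursively, including in `sub_configs`. The helper `strip_read_only` is hypothetical, and whether the server accepts omitted (rather than null) read-only fields is an assumption.

```python
# Fields flagged ReadOnly in the DTO_scrape_parse_config table.
READ_ONLY_FIELDS = {"created_by", "last_updated_by", "id", "created", "last_updated"}


def strip_read_only(config: dict) -> dict:
    """Return a copy of a config without read-only fields, recursing into sub_configs."""
    cleaned = {
        key: value
        for key, value in config.items()
        if key not in READ_ONLY_FIELDS and key != "sub_configs"
    }
    # sub_configs holds nested DTO_scrape_parse_config objects.
    cleaned["sub_configs"] = [
        strip_read_only(sub) for sub in config.get("sub_configs") or []
    ]
    return cleaned
```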

DTO_reference_user
Name | Type | ReadOnly | Description
profile_pic | String | ReadOnly | Profile pic
email | String | ReadOnly | Email
id | Int32 | | Id
label | Object | ReadOnly | Label

DTO_scrape_parse_config_generate_output_extraction
Name | Type | ReadOnly | Description
key_index | Int32 | | Key index
key | String | | Key
sub_key | String | | Sub key
values | DTO_scrape_parse_config_generate_output_extraction_value[] | | Values

DTO_scrape_parse_config_generate_output_extraction_value
Name | Type | ReadOnly | Description
test_url | String | | Test url
value | String | | Value
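
To inspect results per test URL, the `extractions` list can be regrouped as in the sketch below. The helper `values_by_url` is hypothetical, joining `key` and `sub_key` with a dot is a naming choice of this sketch, and it assumes `sub_key` is empty or null when there is no sub config. Values are collected into lists because one key may yield several values for the same URL.

```python
from collections import defaultdict


def values_by_url(extractions: list) -> dict:
    """Group extraction values as {test_url: {key or key.sub_key: [values]}}."""
    grouped = defaultdict(lambda: defaultdict(list))
    for extraction in extractions:
        name = extraction["key"]
        if extraction.get("sub_key"):
            name = f"{name}.{extraction['sub_key']}"
        for item in extraction.get("values") or []:
            grouped[item["test_url"]][name].append(item["value"])
    # Convert the nested defaultdicts to plain dicts for the caller.
    return {url: dict(fields) for url, fields in grouped.items()}
```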

ENUM scraper_parse_extraction_mode
Value | Name | Description
0 | NOT_SET | NOT_SET
1 | INNER_HTML | INNER_HTML
2 | INNER_TEXT | INNER_TEXT
3 | ATTRIBUTE_HREF | ATTRIBUTE_HREF
4 | ATTRIBUTE_SRC | ATTRIBUTE_SRC
5 | DATETIME | DATETIME
6 | ARIA_LABEL | ARIA_LABEL

ENUM scraper_parse_post_process
Value | Name | Description
0 | NOT_SET | NOT_SET
1 | PATH_JOIN | PATH_JOIN

ENUM scraper_parse_generate_mode
Value | Name | Description
0 | UNKNOWN | UNKNOWN
1 | TEST | TEST
2 | GENERATE | GENERATE
3 | WARM_UP | WARM_UP
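
On the client side, the three enums can be mirrored as `IntEnum` classes so that names and integer values stay in sync with the tables. The Python class names are this sketch's own; the members and values are copied from the enum tables.

```python
from enum import IntEnum


class ScraperParseExtractionMode(IntEnum):
    NOT_SET = 0
    INNER_HTML = 1
    INNER_TEXT = 2
    ATTRIBUTE_HREF = 3
    ATTRIBUTE_SRC = 4
    DATETIME = 5
    ARIA_LABEL = 6


class ScraperParsePostProcess(IntEnum):
    NOT_SET = 0
    PATH_JOIN = 1


class ScraperParseGenerateMode(IntEnum):
    UNKNOWN = 0
    TEST = 1
    GENERATE = 2
    WARM_UP = 3


# Look up a member by value or by name.
mode = ScraperParseGenerateMode(2)
same_mode = ScraperParseGenerateMode["GENERATE"]
```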