Generic XML (and RSS) Support

Currently, scraper doesn't work very with normal XML documents, specifically - RSS.

There's a few ways to solve this.

1. The included CSS selector engine (https://github.com/PuerkitoBio/goquery) doesn't parse XML properly it seems. We could modify the HTML to make it conform. OR

1. Each configured path could have a `mode`, where:

    1. `html` implies `selector` is a CSS selector (and is the default `mode`)
    1. `xml` implies `selector` is an XPATH selector (eww) OR
    1. `xml` implies `selector` is a new format: `foo bar`, simply traverses into `<foo>` then into `<bar>` OR
    1. `xml` removes all other settings, and simply converts XML into JSON directly (this is probably the easiest)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generic XML (and RSS) Support #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Generic XML (and RSS) Support #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions