Updated Scraper Reference (markdown)

pull/964/head
Thibaut Courouble 11 years ago
parent a8ffebca24
commit 09c71d78f6

@@ -59,7 +59,7 @@ Configuration is done via class attributes and divided into three main categorie
* `base_url` [String] **(required in `UrlScraper`)**
The documents' location. Only URLs _inside_ the `base_url` will be scraped. "Inside" more or less means "starting with", except that `/docs` is outside `/doc` (but `/doc/` is inside).
- `FileScraper`'s default is `localhost`. (Note: any iframe, image, or skipped link pointing to localhost will be removed by the `CleanLocalUrls` filter; the value should be overridden if the documents are available online.)
+ Defaults to `localhost` in `FileScraper`. (Note: any iframe, image, or skipped link pointing to localhost will be removed by the `CleanLocalUrls` filter; the value should be overridden if the documents are available online.)
Unless `root_path` is set, the root/initial URL is equal to `base_url`.
* `root_path` [String] **(inherited)**
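
The "inside" rule described for `base_url` can be approximated with a small standalone check. This is only a sketch of the rule as worded above, not the scraper's actual implementation: the helper name `inside_base_url?` and the `example.com` URLs are hypothetical, and the scheme/host/port comparison is an assumption about what "inside" implies.

```ruby
require 'uri'

# Hypothetical helper (not part of the scraper API): returns true when `url`
# is "inside" `base_url`, i.e. it is the base itself or sits below it on a
# path-segment boundary.
def inside_base_url?(url, base_url)
  url  = URI.parse(url)
  base = URI.parse(base_url)

  # Assumption: a URL on a different scheme, host, or port is never inside.
  return false unless url.scheme == base.scheme &&
                      url.host == base.host &&
                      url.port == base.port

  # Normalize the base path so "/doc" and "/doc/" behave the same.
  base_path = base.path.chomp('/')

  # Either the base itself, or something under "<base>/".
  # A plain prefix test would wrongly accept "/docs" for a base of "/doc".
  url.path == base_path || url.path.start_with?("#{base_path}/")
end

inside_base_url?('http://example.com/doc/',     'http://example.com/doc') # => true
inside_base_url?('http://example.com/doc/page', 'http://example.com/doc') # => true
inside_base_url?('http://example.com/docs',     'http://example.com/doc') # => false
```

Comparing against `"#{base_path}/"` rather than the bare prefix is what makes `/docs` fall outside `/doc` while `/doc/` stays inside.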
