From 6de2a2fca80f4fb4cc8f07d31d4bb8f73d7c768a Mon Sep 17 00:00:00 2001 From: Thibaut Courouble Date: Thu, 12 Dec 2013 08:48:56 -0800 Subject: [PATCH] Updated Scraper Reference (markdown) --- Scraper-Reference.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/Scraper-Reference.md b/Scraper-Reference.md index 65ba94f5..6a117395 100644 --- a/Scraper-Reference.md +++ b/Scraper-Reference.md @@ -65,6 +65,10 @@ Configuration is done via class attributes and divided into three main categorie * `root_path` [String] **(inherited)** The path from the `base_url` of the root URL. +* `initial_paths` [Array] **(inherited)** + A list of paths (from the `base_url`) to add to the initial queue. Useful for scraping isolated documents. + Defaults to `[]`. _(Note: the `root_path` is added to the array at runtime.)_ + * `dir` [String] **(required, `FileScraper` only)** The absolute path where the files are located on the local filesystem. _Note: `FileScraper` works exactly like `UrlScraper` (manipulating the same kind of URLs) except that it substitutes `base_url` with `dir` in order to read files instead of making HTTP requests._