Safe Haskell | Safe |
---|---|
Language | Haskell2010 |
Documentation
data CrawlDirective Source
SimpleDirective (String -> [CrawlAction]) | access content to find absolute follow-up urls |
RelativeDirective (String -> [CrawlAction]) | as simple, but found relative urls are completed |
FollowUpDirective (CrawlResult -> [CrawlAction]) | as simple, but with access to complete result |
DelayDirective Int CrawlDirective | wait additional seconds before executing |
RetryDirective Int CrawlDirective | if given directive yields no results use add. retries |
AlternativeDirective CrawlDirective CrawlDirective | fallback to second argument if first yields no results |
RestartChainDirective (CrawlAction, CrawlDirective) | the possibility to start a new chain (when using alternative) |
GuardDirective (CrawlAction -> Bool) | not crawling anything, just a blacklisting option |
DirectiveSequence [CrawlDirective] | chaining of directives |