noccylabs/pagescalpel

PageScalpel

Extract information from webpages

  Page
  Operation
   '-ScalpelInterface

Default Scalpels

| Scraper         | Alias  | Description                            |
|-----------------|--------|----------------------------------------|
| MetadataScraper | meta   | Scrapes `<meta>` and related tags.     |
| ImageScraper    | images | Finds all available `<img>` sources.   |
| FeedScraper     | feeds  | Finds RSS and Atom feeds on the page.  |
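To make the three default scalpels more concrete, here is a rough Python sketch of the kind of information each one looks for. It is not PageScalpel's code or API: `PageSketch` and `scrape` are hypothetical names, the sketch uses only Python's standard `html.parser`, and the exact tags and attributes the real scalpels inspect may differ.

```python
from html.parser import HTMLParser

# MIME types conventionally used to advertise RSS and Atom feeds.
FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class PageSketch(HTMLParser):
    """Hypothetical helper: collects metadata, image sources and feed links."""

    def __init__(self):
        super().__init__()
        self.meta = {}    # roughly what the "meta" alias covers
        self.images = []  # roughly what the "images" alias covers
        self.feeds = []   # roughly what the "feeds" alias covers

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta":
            # <meta name="..."> and Open Graph style <meta property="...">
            key = a.get("name") or a.get("property")
            if key and a.get("content") is not None:
                self.meta[key] = a["content"]
        elif tag == "img" and a.get("src"):
            self.images.append(a["src"])
        elif tag == "link" and a.get("href"):
            # Feed autodiscovery: <link rel="alternate" type="application/rss+xml">
            rels = (a.get("rel") or "").lower().split()
            link_type = (a.get("type") or "").lower()
            if "alternate" in rels and link_type in FEED_TYPES:
                self.feeds.append(a["href"])


def scrape(html):
    """Parse an HTML string and return results keyed by scalpel alias."""
    parser = PageSketch()
    parser.feed(html)
    parser.close()
    return {"meta": parser.meta, "images": parser.images, "feeds": parser.feeds}
```

The result keys mirror the aliases in the table only for readability. URLs are returned as found in the page, so relative paths are not resolved against the page address.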

Site-specific Scalpels