- https://github.com/scrapinghub/portia
- http://blog.florian-hopf.de/2014/07/scrapy-and-elasticsearch.html
- http://nrabinowitz.github.io/pjscrape/
- http://toddhayton.com/2015/03/20/scraping-with-casperjs/
- https://github.com/ruipgil/scraperjs/blob/master/README.md
- http://www.camroncade.com/casperjs-tips-and-tricks/
- http://albertoarena.co.uk/grabql-a-query-language-for-data-scraping/
- http://stackoverflow.com/questions/3207418/crawler-vs-scraper
- https://en.wikipedia.org/wiki/Web_scraping
- https://www.openhub.net/p/scraperwiki
- https://www.openhub.net/p/fabpots_Goutte
- http://www.seekquarry.com/
- https://github.com/nrabinowitz/pjscrape
- PhantomJS
- http://webscraper.io/
PluginSnarf and Integrator are a step in this direction.
Also see: Html Feed
Related: Link checker