Skip to content

a fully distributed web client, bread-first traversal, and <a> expression parser for crawling content from static html documents. supports broken-link detection and content invalidation.

Notifications You must be signed in to change notification settings

clay-curry/python-html-crawler

Repository files navigation

Crawler

No instructions; just the following advice:

EXPECT BUGS

About

a fully distributed web client, bread-first traversal, and <a> expression parser for crawling content from static html documents. supports broken-link detection and content invalidation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages