The program takes a list of URLs (from a file or stdin), extracts all hyperlinks from markup, and prints them as a list to stdout. Based on http://www.ibm.com/developerworks/aix/library/au-threadingpython/
- nose
- coverage
- bs4 >= 4.5.0
- requests >= 2.10.0
nosetests -v --with-coverage --cover-package main tests.py
cat ../data/urls.txt | python main.py
python main.py ../data/urls.txt