You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The exact code i used to test this articles/website
Made a script called can_parse.py and ran with each of the urls as an arg with current master. Might be worth adding to the repository as a test script.
Traceback (most recent call last):
File "/home/palfrey/src/newspaper4k/can_parse.py", line 8, in <module>
article.parse()
File "/home/palfrey/src/newspaper4k/newspaper/article.py", line 466, in parse
authors = self.extractor.get_authors(self.doc)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/palfrey/src/newspaper4k/newspaper/extractors/content_extractor.py", line 59, in get_authors
return self.author_extractor.parse(doc)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/palfrey/src/newspaper4k/newspaper/extractors/authors_extractor.py", line 99, in parse
if "@graph" in script_tag:
^^^^^^^^^^^^^^^^^^^^^^
TypeError: argument of type 'NoneType' is not iterable
The text was updated successfully, but these errors were encountered:
First please check that it is really an issue with the library, and not some special case of website:
Your report as follows:
Website that does not parse correctly:
Some sample urls that I have tried
The exact code i used to test this articles/website
Made a script called
can_parse.py
and ran with each of the urls as an arg with current master. Might be worth adding to the repository as a test script.Other information, remarks, messages, etc:
The text was updated successfully, but these errors were encountered: