-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
formatted names not recognized #53
Comments
1 and 2 are not found, while 3 is found. Hm, this is a grey area to me. I see gnfinder as a tool that finds names in For example it definitely does not support PDF, MS Doc, Excel spreasheets etc. Following this logic XML, HTML, JSON are marked up texts and need to be converted first to a plain text. |
Hmmm ok...sad... I thought rich text would have been ok...my bad then Thank you for your answer! |
From other side |
Hi @dimus! Shall I keep it open? |
Yes, please keep it open, I did not get to it yet, was concentrated on gnverifier for a while. I do want to find a good solution for this. |
Hi again,
Using always more your amazing tool, I went through following issue:
When names are formatted, they do not get recognized.
Here are the different inputs:
input1.txt
input2.txt
input3.txt
and resulting outputs:
1
{ "metadata": { "date": "2020-06-02T12:34:06.580524+02:00", "gnfinderVersion": "v0.11.0", "withBayes": true, "tokensAround": 0, "language": "eng", "detectLanguage": false, "totalWords": 3, "totalCandidates": 1, "totalNames": 0 }, "names": null }
2
{ "metadata": { "date": "2020-06-02T12:35:54.584728+02:00", "gnfinderVersion": "v0.11.0", "withBayes": true, "tokensAround": 0, "language": "eng", "detectLanguage": false, "totalWords": 3, "totalCandidates": 1, "totalNames": 0 }, "names": null }
3
Do you think it is easily doable to recognize them?
Otherwise I'll have to find a way of substracting the
<i> </i>
and so on before submitting the test to gnfinder.The text was updated successfully, but these errors were encountered: