Skip to content

Commit 7c2f7c3

Browse files
Annotation is better
1 parent 6ac1755 commit 7c2f7c3

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

annotate_text.py

+2-2
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@
1919
qui estoit senescaus de la tiere ,%.%.
2020
Robiers ses freres ,%.%. Gautiers de voignori ,%.%. Gautiers de Mombelyart ,%.%.
2121
Eustasces d'escouflans ,%.%. Guis dou plaissie %,%. et ses freres ,%% Henris D'ardillieres ,%.%. Ogiers de saint chienon ,%.%.""".replace(
22-
"%", "").replace("\n", " ").replace(" ", "")
22+
"%", "").replace("\n", " ")
2323

2424
print(input_text)
2525

@@ -30,4 +30,4 @@
3030
logger.setLevel(logging.DEBUG)
3131

3232
tokenizer = Seq2SeqTokenizer.load("/home/thibault/dev/boudams/models/linear-conv2019-05-24--14:08:58-0.0001.tar", device="cpu")
33-
print("".join(tokenizer.annotate_text(input_text)))
33+
print(" ".join(tokenizer.annotate_text(input_text.replace(" ", ""))))

0 commit comments

Comments
 (0)