Releases: explosion/spacy-layout
v0.0.12
- Add as_tuples argument in spaCyLayout.pipe, analogous to spaCy's Language.pipe. (#36)
- Add support for document index tables. (#34, @mkessy)
- Fix typos in docs. (#33, @svlandeg)
- Add code snippet for visualizing a page with matplotlib to docs. (#26, @wjbmattingly)
- Add serialization section to docs with note on deserializing with extension attributes.
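
The as_tuples argument from the first item above can be sketched as follows. This is a minimal illustration, assuming the option mirrors Language.pipe: inputs are (source, context) tuples and the results are (doc, context) tuples. The helper names, file paths and context keys are hypothetical and not part of the library.

```python
def build_sources(paths):
    """Pair each input path with a context dict for use with as_tuples."""
    # The "source" and "index" keys are hypothetical; any context object works.
    return [(path, {"source": path, "index": i}) for i, path in enumerate(paths)]


def process_with_context(paths):
    """Yield (doc, context) pairs for the given document paths."""
    # Imported lazily so build_sources stays usable without these packages.
    import spacy
    from spacy_layout import spaCyLayout

    layout = spaCyLayout(spacy.blank("en"))
    # Assumption: with as_tuples=True, pipe takes (source, context) tuples
    # and yields (doc, context) tuples, like spaCy's Language.pipe.
    for doc, context in layout.pipe(build_sources(paths), as_tuples=True):
        yield doc, context
```

Carrying the context alongside each document makes it possible to match a result back to its source file without relying on output order.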
v0.0.11
- Fix regression that caused incorrect pagination numbers to be reported.
v0.0.10
- Allow DoclingDocument as input to spaCyLayout.__call__ to convert already processed documents to spaCy Doc objects.
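
A rough sketch of this workflow, assuming the DoclingDocument comes from Docling's DocumentConverter and is then handed to spaCyLayout directly. The two helper functions are hypothetical names used only for illustration.

```python
def convert_with_docling(source):
    """Run Docling's converter and return the resulting DoclingDocument."""
    # Imported lazily; requires the docling package.
    from docling.document_converter import DocumentConverter

    return DocumentConverter().convert(source).document


def to_spacy_doc(docling_document):
    """Turn an already processed DoclingDocument into a spaCy Doc."""
    import spacy
    from spacy_layout import spaCyLayout

    layout = spaCyLayout(spacy.blank("en"))
    # Assumption: calling the layout object with a DoclingDocument skips
    # the conversion step and only builds the Doc.
    return layout(docling_document)
```

Splitting conversion from Doc creation lets the expensive Docling step run once, with its result reused later.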
v0.0.9
- Add Doc._.markdown with Markdown representation of the document.
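
As a small illustration, the attribute can be read like any other spaCy extension attribute. The sketch below assumes doc was produced by spaCyLayout; write_markdown is a hypothetical helper, not part of the library.

```python
from pathlib import Path


def write_markdown(doc, out_path):
    """Write the Markdown representation of a processed Doc to a file."""
    # Doc._.markdown is the extension attribute named in the release note.
    markdown = doc._.markdown
    Path(out_path).write_text(markdown, encoding="utf-8")
    # Return the number of characters written, for quick sanity checks.
    return len(markdown)
```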
v0.0.8
- Fix serialization of extension attributes and pandas.DataFrame via spaCy's DocBin. (#11, #14)
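
A minimal sketch of a DocBin round trip for processed documents. It assumes that initializing spaCyLayout registers the extension attributes, so this has to happen before stored data is loaded. The function names and the file path argument are hypothetical.

```python
def save_docs(docs, path):
    """Serialize processed docs, including extension attribute data."""
    from spacy.tokens import DocBin

    # store_user_data=True keeps the values of custom extension attributes.
    DocBin(docs=docs, store_user_data=True).to_disk(path)


def load_docs(path):
    """Load docs back from disk with extension attributes available."""
    import spacy
    from spacy.tokens import DocBin
    from spacy_layout import spaCyLayout

    nlp = spacy.blank("en")
    # Assumption: creating spaCyLayout registers Doc._.layout and related
    # attributes, which must exist before the stored data is read.
    spaCyLayout(nlp)
    doc_bin = DocBin(store_user_data=True).from_disk(path)
    return list(doc_bin.get_docs(nlp.vocab))
```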
v0.0.7
- Fix bounding boxes for top left origin, refactor and add tests.
v0.0.6
- Add support for tables as layout spans and via shortcut Doc._.tables.
- Add Span._.data for table data as a pandas.DataFrame.
- Allow customizing table display text in Doc.text via display_table callback option.
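
The three items above fit together roughly as sketched below. This assumes display_table is passed as a keyword argument when creating spaCyLayout and receives the table's DataFrame. The placeholder wording and the table_shapes and build_layout helpers are hypothetical.

```python
def display_table(df):
    """Return the placeholder text shown in Doc.text for a table."""
    # df is expected to be the table's pandas.DataFrame.
    columns = ", ".join(str(name) for name in df.columns)
    return f"Table with columns: {columns}"


def table_shapes(doc):
    """Collect (rows, columns) for every table in a processed Doc."""
    # Doc._.tables lists the table spans; Span._.data holds each DataFrame.
    return [table._.data.shape for table in doc._.tables]


def build_layout():
    """Create a spaCyLayout instance that uses the custom callback."""
    import spacy
    from spacy_layout import spaCyLayout

    # Assumption: display_table is accepted as a keyword argument here.
    return spaCyLayout(spacy.blank("en"), display_table=display_table)
```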
v0.0.5
- Improve bounding box calculation for bottom left origin.
v0.0.4
- Fix bounding boxes for bottom left origin.
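
The bounding box entries in v0.0.4 through v0.0.7 concern the coordinate origin of the extracted boxes. The sketch below is not the library's implementation. It only illustrates the general idea of normalizing edge coordinates to a top-left origin, and the Box class, the function and its parameters are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Hypothetical top-left-origin box: (x, y) is the upper-left corner."""
    x: float
    y: float
    width: float
    height: float


def to_top_left(left, top, right, bottom, page_height, bottom_left_origin):
    """Convert edge coordinates to a top-left-origin Box."""
    if bottom_left_origin:
        # y grows upward from the page bottom, so flip both vertical edges.
        top, bottom = page_height - top, page_height - bottom
    return Box(x=left, y=top, width=right - left, height=bottom - top)
```

With a bottom-left origin the top edge has the larger y value, so flipping both edges against the page height yields a positive height in top-left terms.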
v0.0.3
- Add spaCyLayout.pipe to process multiple documents.
- Use nlp.pipe internally for tokenization.
- Also accept bytes as input.
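
A short sketch of batch processing with spaCyLayout.pipe, assuming it accepts an iterable of file paths or raw bytes and yields one Doc per input. The helper names and the as_bytes flag are hypothetical.

```python
from pathlib import Path


def load_inputs(paths, as_bytes=False):
    """Return inputs for pipe: file paths as given, or their raw bytes."""
    if as_bytes:
        return [Path(p).read_bytes() for p in paths]
    return list(paths)


def process_many(paths, as_bytes=False):
    """Yield one Doc per input using spaCyLayout.pipe."""
    # Imported lazily so load_inputs stays usable without these packages.
    import spacy
    from spacy_layout import spaCyLayout

    layout = spaCyLayout(spacy.blank("en"))
    # Assumption: pipe accepts both paths and bytes objects as sources.
    yield from layout.pipe(load_inputs(paths, as_bytes=as_bytes))
```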