PDF417 decoder for Python

A small Python project that can decode, to an extent, 2D PDF417 barcodes from a PNG image.

I wrote this because I wanted to know what was in the barcode in my emailed driver's license and I couldn't find another OSS package to do this.

Some caveats:

yes, it works on PNG images, but it won't work on photos -- the input has to be a screenshot of a high-quality source (no, a screenshot of a photo doesn't count. you can't wish for more wishes)
has only been tested on a single screenshot I took of my driver's license
doesn't fully implement the standard -- in particular, it only decodes the text encoding, not the byte or numeric encodings
I sourced the spec from Wikipedia and this PDF, and copied and pasted the lookup tables from OCR output -- they may be incomplete or wrong

That said, it has lots of asserts and has worked at least once.

Usage

# help text / flags
./decoder.py -h
# parse a PNG file
./decoder.py barcode-screenshot.png

There are some CSVs in here with PDF417 lookup tables.

cluster-{index}-bs.csv hold the bsbsbsbs (alternating bar and space width) lookups for clusters 0, 3 and 6
text-codes.csv has the lookup for converting high & low text codes to ASCII chars / state commands
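
As a rough illustration of what these tables are keyed on, here is a minimal sketch based on the published PDF417 rules rather than on the code in decoder.py. The function names are hypothetical, and the assumption that a pattern arrives as eight widths in bsbsbsbs order is mine; the actual CSV layout may differ. A text-mode codeword (0..899) carries two values of 0..29 each (the "high" and "low" codes), and the cluster of a codeword can be recomputed from its four bar widths.

```python
def split_text_codeword(codeword: int) -> tuple[int, int]:
    """Split a text-compaction codeword into its (high, low) values.

    Assumption: the codeword is a data codeword in 0..899; values of
    900 and above are mode switches and are not handled here.
    """
    if not 0 <= codeword <= 899:
        raise ValueError(f"not a text data codeword: {codeword}")
    # high = codeword // 30, low = codeword % 30
    return divmod(codeword, 30)


def cluster_from_widths(widths: list[int]) -> int:
    """Recompute the cluster number from eight bsbsbsbs module widths.

    Hypothetical helper: expects [b1, s1, b2, s2, b3, s3, b4, s4].
    """
    if len(widths) != 8:
        raise ValueError("expected 4 bar widths and 4 space widths")
    if sum(widths) != 17:
        raise ValueError("a PDF417 codeword is 17 modules wide")
    if any(not 1 <= w <= 6 for w in widths):
        raise ValueError("each bar or space is 1 to 6 modules wide")
    b1, b2, b3, b4 = widths[0::2]  # bars sit at the even positions
    return (b1 - b2 + b3 - b4 + 9) % 9


def expected_cluster_for_row(row: int) -> int:
    """Rows cycle through clusters 0, 3, 6 (row is zero-based)."""
    return (row % 3) * 3


if __name__ == "__main__":
    print(split_text_codeword(899))                       # high and low codes
    print(cluster_from_widths([3, 1, 1, 1, 1, 1, 3, 6]))  # example widths
    print(expected_cluster_for_row(4))
```

A check like cluster_from_widths(...) == expected_cluster_for_row(row) is one cheap way to notice an OCR-damaged table entry or a misread row before looking anything up.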

Things that would make this better, in particular:

tests: either unit tests of individual routines, or tests that link it up to a barcode generator
decode more of the standard, or use another library to decode the standard once the codewords have been parsed
bugfixes as always
link to a better command-line barcode reader so I can archive this repo
support non-screenshot images (i.e. photos), preferably by using an external library to do the image scanning
use standard intermediate formats for barcode scans if those exist
git grep todo
