Recognize units in HTML tables
headers.py — extract headers from (unpublished) HTML tables dump
headers.txt — output of headers.py
vocabulary.py — generate Pint vocabulary using QUDT ontology
vocabulary.txt — output of vocabulary.py
units.py — parse units from headers.txt
units.txt — output of units.py
Unit names are as in QUDT, but without prefix
http://qudt.org/vocab/unit#