https://github.com/pdfliberation/OCRToolkit
Seems interesting.