⸗but cannot blacklist
-, what would I do?
If I want hyphens to be encoded with ⸗ but cannot blacklist -, what would I do?
Train a specific post-correction model! I did this for
ſ and it "repaired" from 3% CER to 0.2% with cor-asv-ann.
@stweil Have you ever trained a model with the data provided at https://github.com/jze/ocropus-model_fraktur?
No, we are still busy with GT4HistOCR and the ÖNB data, mixing both in one model after enhancing the ÖNB texts with long s.