⸗
but cannot blacklist -
, what would I do?
If I want hyphens to be encoded with ⸗ but cannot blacklist -, what would I do?
Train a specific post-correction model! I did this for s
→ ſ
and it "repaired" from 3% CER to 0.2% with cor-asv-ann.
ocrd_repair_inconsistencies
≠ ocrd-segment-repair
!
sbb-textline-detector
.
@stweil Have you ever trained a model with the data provided at https://github.com/jze/ocropus-model_fraktur?
No, we are still busy with GT4HistOCR and the ÖNB data, mixing both in one model after enhancing the ÖNB texts with long s.