hi @beniz ! just caught jolibrain/deepdetect#543 going by and was curious if there is any info related to trained models and/or language support? i'm not up to date on CTC and NCNN so any info available would be greatly appreciated. thanks again for the great work you all do!
Hi @cchadowitz-pf how are things? Yes, we've patched NCNN so that OCR runs on the RPi3 and on embedded devices at large. Expect ~600ms for a ResNet-18 + LSTM + CTC. Basically, we are treating NCNN as an inference-only, CPU-only, lightweight version of Caffe.
Now, on the doc + models side, an update is coming soon. Some of us are busy on it right now: a refreshed website, an Open Source platform for training, and the release of many models, including text detection and good OCR.
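For anyone not up to date on CTC: at inference time the network emits one score vector per frame, and the simplest decoder picks the best class per frame, collapses repeats, then drops the blank symbol. A minimal sketch in plain Python, assuming class 0 is the blank and class `i` maps to `alphabet[i - 1]` (both are assumptions for illustration, not necessarily how the DeepDetect/NCNN models are laid out):

```python
def ctc_greedy_decode(frame_scores, alphabet, blank=0):
    """Greedy (best-path) CTC decoding.

    frame_scores: list of per-frame score lists, one score per class.
    alphabet: string of characters; class i (i >= 1) maps to alphabet[i - 1].
    blank: index of the CTC blank class (assumed to be 0 here).
    """
    # Best class per frame.
    best_path = [max(range(len(f)), key=f.__getitem__) for f in frame_scores]

    chars = []
    prev = None
    for idx in best_path:
        # Keep a class only when it differs from the previous frame
        # (collapse repeats) and is not the blank.
        if idx != prev and idx != blank:
            chars.append(alphabet[idx - 1])
        prev = idx
    return "".join(chars)


# Toy example: 5 frames, classes = [blank, 'h', 'i'].
frames = [
    [0.1, 0.8, 0.1],    # h
    [0.1, 0.7, 0.2],    # h (repeat, collapsed)
    [0.9, 0.05, 0.05],  # blank
    [0.1, 0.1, 0.8],    # i
    [0.2, 0.1, 0.7],    # i (repeat, collapsed)
]
decoded = ctc_greedy_decode(frames, "hi")
```

The blank is what lets CTC output doubled letters: a best path of `h, blank, h` decodes to `hh`, while `h, h` collapses to a single `h`.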
:+1: very cool. are there reference models/data that you're working with, or is this a model that you're building internally? i'm personally interested in multi-lingual OCR as I'm relying on Tesseract at the moment.
that all sounds super exciting, i can't wait to check it all out :)
@cchadowitz-pf I'm not sure I have your email. If you can shoot a quick message to firstname.lastname@example.org, we will be talking to people we know in order to get feedback on the website, platform and model selection.
our OCR models are trained with Caffe, then converted to NCNN for customers that need embedded models. The full pipeline will be documented, though some of the stuff will come out earlier.
If you have metrics for Tesseract on some tiny dataset you can share, that'd be interesting to us.
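If it helps to compare numbers, one common OCR metric is the character error rate (CER): total edit distance between reference and predicted strings, divided by the total number of reference characters. A small self-contained sketch, where the `(reference, prediction)` pair format is just an assumption for illustration:

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings (insert/delete/substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution (free if equal)
            ))
        prev = cur
    return prev[-1]


def character_error_rate(pairs):
    """CER over an iterable of (reference, prediction) string pairs."""
    pairs = list(pairs)
    errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    chars = sum(len(ref) for ref, _ in pairs)
    return errors / chars if chars else 0.0
```

Running the same function on Tesseract output and on another model's output over the same ground-truth lines gives directly comparable numbers.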
we will provide a model that detects text, and an OCR model to run on the crops
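Roughly, the two-stage flow looks like the sketch below. `detect` and `recognize` are hypothetical placeholders for the two models, not DeepDetect API calls, and the image is assumed to be a plain list of pixel rows with boxes given as `(x0, y0, x1, y1)`:

```python
def crop(image, box):
    """Cut a (x0, y0, x1, y1) box out of an image stored as a list of rows."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def read_text(image, detect, recognize):
    """Run a text detector, then an OCR model on each detected crop.

    detect: hypothetical callable, image -> list of (x0, y0, x1, y1) boxes.
    recognize: hypothetical callable, cropped image -> decoded string.
    """
    results = []
    for box in detect(image):
        results.append((box, recognize(crop(image, box))))
    return results


# Toy stand-ins so the sketch runs end to end.
image = [[0] * 8 for _ in range(4)]
fake_detect = lambda img: [(0, 0, 4, 2), (4, 2, 8, 4)]
fake_recognize = lambda patch: f"{len(patch[0])}x{len(patch)}"
results = read_text(image, fake_detect, fake_recognize)
```

Keeping the two stages as separate callables means either model can be swapped without touching the other.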