A small C++ implementation of LSTM networks, focused on OCR.
I'm looking for a way of reliably extracting single lines from tables using something like clstm or ocropus. I think I want to just restrict the page segmentation to the Y axis and assume that X is always full width. Is there a way of achieving this?
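One way to approximate "segment only along Y, assume full width" without touching the segmenter is a plain horizontal projection profile. The sketch below is not clstm or ocropus API; `split_rows`, `min_ink`, and the toy `page` are made-up names for illustration. It assumes a binarized image given as rows of 0/1 values (1 = ink) and that rows are separated by at least one blank pixel row. Ruled table lines would count as ink and would have to be removed first.

```python
def split_rows(image, min_ink=1):
    """Return (top, bottom) bands of consecutive pixel rows containing ink.

    image: list of pixel rows, each a list of 0/1 values (1 = ink).
    bottom is exclusive, so image[top:bottom] is the full-width line crop.
    """
    bands = []
    start = None
    for y, row in enumerate(image):
        has_ink = sum(row) >= min_ink  # projection of this row onto the Y axis
        if has_ink and start is None:
            start = y                  # a new text band begins
        elif not has_ink and start is not None:
            bands.append((start, y))   # the band ended on the previous row
            start = None
    if start is not None:              # band runs to the bottom of the page
        bands.append((start, len(image)))
    return bands


# Toy 5x4 "page": two text bands separated by a blank row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 1],
]
lines = [page[top:bottom] for top, bottom in split_rows(page)]
```

Each entry in `lines` is a full-width strip that could then be passed to the line recognizer; raising `min_ink` makes the split less sensitive to speckle noise.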
Good morning to everyone
I'm looking for an OCR engine capable of reading a tax form. Is this the right project? Is there any C++ API documentation?
Does anyone know how to read and write the net with protobuf?
I am trying to install clstm on Ubuntu 14.04 and I am getting errors. I saw a closed bug about it, but it doesn't help
clstm_proto.cc:18:22: fatal error: clstm.pb.h: No such file or directory
don't be shy :smile:
well, never mind, I found the error
I am training with run-uw3-500
it's been training for 12 hours, when does it stop? Do I need to stop it, or will it run indefinitely?
also, did you guys see «nvidia digits»? Can you use it with clstm?
@moskiteau Hi, Sébastien, I am also trying to install clstm on Ubuntu 14.04. Would you mind sharing your installation steps? Thanks for your help!
Usually, you train for several days and decide when to stop based on a test set
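A rough sketch of what "decide when to stop based on a test set" can look like in practice. This is generic early stopping, not something clstm does for you: `best_checkpoint`, `patience`, and the error values are hypothetical, and it assumes you evaluate each saved model on a held-out test set and log the error rate yourself.

```python
def best_checkpoint(test_errors, patience=3):
    """Return (index, error) of the best checkpoint seen before stopping.

    test_errors: test-set error rates, one per saved checkpoint, in order.
    Stops once `patience` evaluations in a row fail to beat the best error.
    """
    best_i, best_err = None, float("inf")
    for i, err in enumerate(test_errors):
        if err < best_err:
            best_i, best_err = i, err      # new best model, keep going
        elif i - best_i >= patience:
            break                          # no improvement for too long
    return best_i, best_err


# Hypothetical error rates logged after each checkpoint.
errors = [0.30, 0.21, 0.15, 0.12, 0.13, 0.12, 0.14, 0.10]
index, error = best_checkpoint(errors)
```

With `patience=3` this stops at the seventh evaluation and keeps checkpoint 3; a larger `patience` trades more training time for the chance of catching a late improvement such as the last value in the list.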
@kba I could reproduce your segfault in a Jessie VM
it's weird, it only happens if clstm is installed into the system-wide python environment
And something is seriously wrong with the Python paths on this box: /usr/lib overriding /usr/local/lib, 2.7 modules importing 3.4 modules. It's mayhem, annoying and unproductive, so I've stopped. I can confirm that it works with pip-installed pillow in Python 2.
@kba do you think the training script belongs in clstm? I'm not so sure, since there's already the C++ CLI and the run_uw300 script for demonstration purposes
on the other hand, maybe we could just throw out the somewhat idiosyncratic command-line utils and replace them with nice Python-based ones
i have some code for live-plotting train/dev errors with bokeh
Mostly, I just wanted something with a cleaner CLI, now. It's not much more than an extended version of the run_uw3 test script.
no offense, @tmbdev
I had a mix of a typo and an overly harsh expression; it was intended as an edit
I wouldn't replace the original tools, more like "augment" them :-) To train directly, or as example Python code to build your own things with. Mostly I was thinking of testing and CI, with all values configurable.
I would keep the original CLI tools as they are, since people might be using them in their setups; no need to break those.