Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Activity
Richard Eckart de Castilho
@reckart
at some point, we plan to make tokens/sentence boundaries editable, but not quite there yet
for layers with sentence-level granularity, the OpenNLP doccat recommender should be offered
lisa563
@lisa563
Thank you very much.
lisa563
@lisa563
Can the relation between entities be predicted? How should I configure it?
Richard Eckart de Castilho
@reckart
Relation prediction is being worked on on a branch
Richard Eckart de Castilho
@reckart
@lisa563 btw, a proper confidence threshold setting on the recommender sidebar is also pretty much done now and will be in the 0.19.0 release when it comes out
lisa563
@lisa563
Thank you.
Ivan Habernal
@habernal
hey all, I'm trying to find out whether there is a quick fix for wrapping longer annotation spans. Looking into the HTML code, it seems it's a SVG text tag, which doesn't look like supporting text wrapping - so there is perhaps no quick fix, correct?
Richard Eckart de Castilho
@reckart
At the Moment you only have the option of prewrapping the text before Import and then using the “ brat line oriented” display mode
Ivan Habernal
@habernal
Alright, I see, that's what I thought
I tested also the HTML mode which is in fact not bad, the only thing is that you cannot select and remove existing annotation
Richard Eckart de Castilho
@reckart
It is a known issue even in brat
You can
When you move the mouse over an annotation a popup appears
Click on that to select the annotation
Ivan Habernal
@habernal
yeah... well hidden :)
Richard Eckart de Castilho
@reckart
Not the best UX but it works
Ivan Habernal
@habernal
confirmed
ok, say I prepare my documents as HTML with just paragraphs <p> - how about implicit pre-processing (tokenization etc.), will that work?
I mean, I could simply try out but maybe there's some warning signs
Richard Eckart de Castilho
@reckart
implicit tokenization & sentence splitting works with the usual quality provided by the Java BreakIterator
Preparing the HTML with a different tokenizer would also be possible to do externally (via DKPro Core), but not in INCEpTION at this time.
Ivan Habernal
@habernal
Thanks! I'll give it a shot
Ivan Habernal
@habernal
I tested it with a very simple HTML (three paragraphs), and there are some issues - I annoated three words, on the left-hand site (the document view) it highlighted something different, and in the tool window (right-hand side), the Text field contains also something different. So it's not really working :(
Richard Eckart de Castilho
@reckart
which version?
Ivan Habernal
@habernal
0.18.3 (2021-03-09 20:02:49, build bf16e970)
Richard Eckart de Castilho
@reckart
I had tested with a simple two paragraph HTML ;) looked ok for me. so we'll have to look again. Could you provide your test document in an issue?
Ivan Habernal
@habernal
These are not really public documents
Richard Eckart de Castilho
@reckart
ok "very simple HTML" sounded like you had cooked up a test document
Ivan Habernal
@habernal
yes, but with some real texts to see how that works :)