Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Repo info
Activity
  • 15:11
    shangsu starred Erudika/para
  • Dec 12 17:42
    chakrakan starred Erudika/para
  • Dec 09 00:42
    dunglason6789p starred Erudika/para
  • Dec 08 14:44
    repanda starred Erudika/para
  • Dec 07 10:05

    albogdano on master

    fixed javadoc readme added user-agent header to clie… and 1 more (compare)

  • Dec 06 04:55
    jimtje starred Erudika/para
  • Nov 28 12:59

    albogdano on master

    exclude unused dependencies fro… Merge pull request #69 from ces… (compare)

  • Nov 28 12:59
    albogdano closed #69
  • Nov 28 12:34
    cesarsotovalero opened #69
  • Nov 28 12:20
    Travis cesarsotovalero/para (master) passed (1)
  • Nov 28 12:14
    cesarsotovalero starred Erudika/para
  • Nov 28 12:10
    Travis cesarsotovalero/para (master) fixed (2)
  • Nov 28 11:21
    Travis cesarsotovalero/para (master) failed (1)
  • Nov 25 16:13
    albogdano commented #68
  • Nov 25 15:46
    heprotecbuthealsoattac commented #68
  • Nov 25 13:44
    albogdano commented #66
  • Nov 25 13:43
    albogdano closed #68
  • Nov 25 13:43
    albogdano labeled #68
  • Nov 22 19:36
    albogdano commented #68
  • Nov 22 19:00
    albogdano commented #68
Alex Bogdanovski
@albogdano
the same should also work for Scoold
J. R. Schmid
@sixtyfive
hi
just signed up for paraio.com to try something out with para-cli, but trying to create my items results in "input/Kahhala_ges.html is not a file or is too big (max. 400 KB)" (no shit, it has ~20MB). is that a limitation of the unpaid account or something i can change somewhere?
J. R. Schmid
@sixtyfive
also, i added some excerpts from my full-size files (via para-cli create "test/*.html" --type "book" --sanitize" and tried to search by using "para-cli search 'blah'" and get no results. the "blah" is in Arabic letters, though. is that the reason, does it only deal with Latin letters?
Alex Bogdanovski
@albogdano
@sixtyfive you can't change that limit
J. R. Schmid
@sixtyfive
@albogdano okay, so i'd have to do my own install of Para?
Alex Bogdanovski
@albogdano
that's a limitation of the underlying database used in ParaIO.com which is DynamoDB
Arabic should work out of the box
I'll probably refactor para-cli at some point to remove that limit
it's hardcoded there at the moment
J. R. Schmid
@sixtyfive
oooic
well if i can't get any search results out of even the excerpt files then it's not much use
Alex Bogdanovski
@albogdano
try different queries
J. R. Schmid
@sixtyfive
did
Alex Bogdanovski
@albogdano
like "blah*"
J. R. Schmid
@sixtyfive
also just single-word queries that i verified beforehand should yield results
aha
asterisk...
also let me put an actual, latin-letter "blah" in one of the files
okay, "blah" works just as well as "blah", but neither "أحمد" nor "أحمد" (which is the word right next to "blah" now inside of one of the HTML tags yiel any result.
also, the result JSON includes the Arabic as a string of numbers ... which can be parsed back into unicode i'm guessing, so that wouldn't be the problem.
Alex Bogdanovski
@albogdano
you should check the actual data stored in Para with a GET /v1/book/id to make sure it's indexed correctly
J. R. Schmid
@sixtyfive
interesting
Alex Bogdanovski
@albogdano
I think the encoding is lost along the way
J. R. Schmid
@sixtyfive
there's two objects even though i re-uploaded that same file 3 times.
but ParaObject 1568721642159 doesn't seem to be anything i uploaded
is the "text" attribute of the JSON that's returned as a response to GET the full text that was stored? because that's nowhere even close to the 40-ish kB that it should be scratches head
anyways, doesn't matter much given you said the 400kB limit is hardcoded and i have a whole folder of 20-40MB files
i'll try one of those JavaScript full-text search thingies
thank you!
Alex Bogdanovski
@albogdano
I'll have to do some more testing with Arabic
can you reencode the file to be UTF-8
and try reindexing it
J. R. Schmid
@sixtyfive
and Hebraic, and Persian, and all the Indian languages, etc, etc ;-)...
oh, it is UTF-8
if we dealt in any other currency than UTF-8 here, we'd all have ended up in asylums long ago
Alex Bogdanovski
@albogdano
ok I'm going to write this down as a bug for now and will work on it later
J. R. Schmid
@sixtyfive
cool :-)
i'll say good bye for now!
Alex Bogdanovski
@albogdano
bye
Alex Bogdanovski
@albogdano
@sixtyfive your issue has been fixed in para-cli@1.11.0
prog20901
@prog20901
hi
i am new to para
Alex Bogdanovski
@albogdano
Hello!
prog20901
@prog20901
can someone help me
Alex Bogdanovski
@albogdano
How can I help you?
prog20901
@prog20901
any demo is there to see how para works? any show case?
Alex Bogdanovski
@albogdano
prog20901
@prog20901
this is the main page which i already know...where i can see the demos or showcase of web-sites which uses para