Vinayak Mehta
@vinayak-mehta
@pravarag This weekend for sure, sorry for the lateness.
Pravar Agrawal
@pravarag
No problem @vinayak-mehta, I'm enjoying the festival season too :D
Pravar Agrawal
@pravarag
@vinayak-mehta did you check the PR?
abhishekasodaria
@abhishekasodaria
Hello, I am trying to install camelot, but the cv version shows "has no matching distribution"
@vortex_ape_twitter
abhishekasodaria
@abhishekasodaria
It's not working properly; some kind of error occurs at every step
Vinayak Mehta
@vinayak-mehta
@pravarag Yes, got busy with something else
@abhishekasodaria Did you follow the steps mentioned in the docs?
Vinayak Mehta
@vinayak-mehta
@pravarag Reviewed the PR, requested some changes. Sorry for the late review on this.
Pravar Agrawal
@pravarag
@vinayak-mehta thanks for the update. I'll go for the requested changes then.
phdkiran
@phdkiran
Is there an option to complement strip_text, something like replace_text?
I am trying to insert a space in place of each newline during table detection
Mariano Rodriguez
@marianorodriguez
Hello! Can someone explain to me the difference between pip install camelot-py and pip install camelot-py[cv]? Which of those should I install to use Camelot inside a Python script?
Vinayak Mehta
@vinayak-mehta
@phdkiran You can do that in the pandas dataframe itself
@marianorodriguez Please use pip install camelot-py[cv]
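For the strip_text question above, a minimal sketch of doing the replacement in the pandas DataFrame itself. The sample data is made up for illustration; with Camelot the frame would come from something like tables[0].df.

```python
import pandas as pd

# Stand-in for a Camelot table's DataFrame (hypothetical sample data).
df = pd.DataFrame({0: ["Name", "Jane\nDoe"], 1: ["City", "New\nYork"]})

# Replace every newline inside every string cell with a single space.
cleaned = df.replace(r"\n", " ", regex=True)

print(cleaned)
```

Unlike strip_text, which removes the listed characters, this keeps the words on either side of the line break separated.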
Arky
@arky
@vinayak-mehta I'm trying to adopt Excalibur for my workshops in SE Asia. Is it possible to build executables for easy install on Windows, Mac, and Linux, both 32-bit and 64-bit, perhaps using PyInstaller or something similar? This would save me a lot of time and also drive adoption.
Vinayak Mehta
@vinayak-mehta
Hi @arky did you try the existing Win and Linux (should work on macOS) executables? https://github.com/camelot-dev/excalibur/releases/tag/v0.4.0
Ghostscript still needs to be installed separately for these
Arky
@arky
@vinayak-mehta Sweet, I wasn't aware these were already available. I am going to give them a shot.
Vinayak Mehta
@vinayak-mehta
Please let me know if you face any problems. And if you have any ideas for the following issues, please comment. Solving these issues will make installation easy. I haven't been getting time from the day job to work on them.
Arky
@arky
Will do. For starters, I think it is much simpler to do user education: providing clear instructions on how to install and where to get the executables would solve most issues. Perhaps a wiki page or a dedicated website/page, i.e. get-camelot.github.io, with a big blue button to get the exe for the target OS along with links to dependencies, would drive adoption.
slhappyls
@slhappyls
Chinese user is saying hi
Arky
@arky
@vinayak-mehta Any update on camelot-dev/excalibur#99?
@vinayak-mehta These are quite important for wider adoption of the tools in civil societies out here
Arky
@arky
@vinayak-mehta Are these the correct commands to generate the current Excalibur executables for GNU/Linux and macOS?
pyi-makespec --paths=excalibur/executors/celery_executor.py arthur.py
pyinstaller --onefile --add-data "excalibur/www/templates:excalibur/www/templates" --add-data "excalibur/www/static:excalibur/www/static" --add-data "excalibur/config_templates:excalibur/config_templates" arthur.py
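A rough sketch of building the same pyinstaller invocation for more than one OS. It assumes the --add-data separator should follow os.pathsep (":" on GNU/Linux and macOS, ";" on Windows), which is how PyInstaller has traditionally documented it; pyinstaller_args is a hypothetical helper, not part of Excalibur or PyInstaller.

```python
import os

# Data directories taken from the command quoted above.
DATA_DIRS = [
    "excalibur/www/templates",
    "excalibur/www/static",
    "excalibur/config_templates",
]


def pyinstaller_args(entry="arthur.py", sep=os.pathsep):
    """Return the pyinstaller command as an argument list (hypothetical helper)."""
    args = ["pyinstaller", "--onefile"]
    for data_dir in DATA_DIRS:
        # Source and destination are the same relative path, joined by the separator.
        args += ["--add-data", f"{data_dir}{sep}{data_dir}"]
    args.append(entry)
    return args


print(" ".join(pyinstaller_args()))
```

The list can be passed to subprocess.run(..., check=True) on a machine where PyInstaller is installed.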
Vinayak Mehta
@vinayak-mehta
Arky
@arky
Thanks. Not sure the Makefile works for multiple OSes without modification.
Arky
@arky
Can someone tell me what the solution is for the 'No module named click' issue on GNU/Linux?
./excalibur-ubuntu-latest-x64
Traceback (most recent call last):
  File "arthur.py", line 5, in <module>
  File "/opt/hostedtoolcache/Python/3.8.3/x64/lib/python3.8/site-packages/PyInstaller/loader/pyimod03_importers.py", line 623, in exec_module
  File "excalibur/cli.py", line 5, in <module>
ModuleNotFoundError: No module named 'click'
[12776] Failed to execute script arthur
Vinayak Mehta
@vinayak-mehta
Oh you'll need to do pip install click for that
Arky
@arky
Hey, I have made a usability suggestion to better handle excalibur missing requirements on MS Windows. Looking forward to your feedback camelot-dev/excalibur#111
Arky
@arky
Please download and test the latest Excalibur 0.4.3 https://github.com/camelot-dev/excalibur/releases/tag/v0.4.3
Arky
@arky
@vinayak-mehta I have been trying to test Excalibur with Python 3 (32-bit) on Windows 7. I got stuck with 'backports' module not found errors. Still trying to figure it out.
Vinayak Mehta
@vinayak-mehta
Can you post the full traceback here?
Or in a gist / pastebin
nftopham
@nftopham
Hello, I am getting a huge amount of debug messages when running Camelot. The extraction works fine, but passing suppress_warnings=True does not do anything.
They are all log/debug messages from pdfminer
nftopham
@nftopham
I have disabled them manually via logging.getLogger("pdfminer").setLevel(logging.WARNING) but this is not really desirable
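A small sketch of that workaround wrapped in a function. It relies only on the standard logging module and on the logger name "pdfminer" from the message above; quiet_pdfminer is a hypothetical helper name, not a Camelot or pdfminer API.

```python
import logging


def quiet_pdfminer(level=logging.WARNING):
    """Raise the threshold of the "pdfminer" logger (hypothetical helper)."""
    logger = logging.getLogger("pdfminer")
    # Child loggers such as "pdfminer.psparser" inherit this level
    # unless they have been given a level of their own.
    logger.setLevel(level)
    return logger


# Call once, before camelot.read_pdf(...).
quiet_pdfminer()
```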
Vinayak Mehta
@vinayak-mehta
@nftopham I understand, thanks for reporting it here. I'll start work on fixing logging and the CLI's terminal output in general soon.
nftopham
@nftopham
Thanks, it's great otherwise
nftopham
@nftopham
Hi @vinayak-mehta, I wanted to share with you a problem I had with Camelot and the solution
So I was getting a NotImplementedError because the PDF version I was reading had an unsupported encryption protocol, as stated in the Camelot docs
I searched for some solutions and ended up rewriting the file using Ghostscript and downgrading the version. This actually completely removed the encryption, which is quite funny. So much for password-protected PDFs!
Here is my solution, a bit messy right now but you get the gist. It would be great if this could be included in future releases, as there are only going to be more PDFs written with versions above 1.4 and PyPDF2 does not seem to be interested in a fix
import os
import camelot

try:
    tables = camelot.read_pdf(**camelot_params)
except NotImplementedError:
    # Rewrite the PDF at compatibility level 1.4 with Ghostscript, then retry.
    os.system('gswin64c -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dSAFER -dNOPAUSE -dBATCH -o temp.pdf C:/Users/User/Desktop/input.pdf')
    url = os.path.join(os.getcwd(), "temp.pdf")
    # get_camelot_params and meta are my own helpers, not part of Camelot.
    camelot_params = get_camelot_params(meta, url)
    tables = camelot.read_pdf(**camelot_params)
Vinayak Mehta
@vinayak-mehta
@nftopham Did you also try qpdf, as mentioned in the docs? https://camelot-py.readthedocs.io/en/master/user/quickstart.html#reading-encrypted-pdfs
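A minimal sketch of the qpdf route, assuming qpdf is installed and on PATH and that the password is known. The command line follows the usual qpdf form (qpdf --password=<PASSWORD> --decrypt input.pdf output.pdf); both function names are hypothetical helpers, not part of Camelot.

```python
import shutil
import subprocess


def build_qpdf_decrypt_cmd(src, dst, password=""):
    """Return the qpdf argument list that writes a decrypted copy of src to dst."""
    return ["qpdf", f"--password={password}", "--decrypt", src, dst]


def decrypt_pdf(src, dst, password=""):
    """Run qpdf and return the path of the decrypted file (hypothetical helper)."""
    if shutil.which("qpdf") is None:
        raise FileNotFoundError("qpdf not found on PATH")
    # check=True raises CalledProcessError if qpdf exits with an error.
    subprocess.run(build_qpdf_decrypt_cmd(src, dst, password), check=True)
    return dst
```

The decrypted copy can then be passed to camelot.read_pdf in place of the original file.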