Where communities thrive
your own community
Explore more communities
Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)
hey! is there any way to convert a PDF to HTML and maintain it's styles so it renders similarly to the PDF?
i'm assuming not, given Tika docs say "Note that the XHTML format is used here only to convey structural information, not to render the documents for browsing!"
are you still maintaining yomu? It would be great to know if you are in a position to merge PRs - I understand totally either way -- but it would just be great to know :)