CMS MADE SIMPLE FORGE

Document Search

 

[#8977] choice of method to extract text

avatar
Created By: Ursula Prager-Ramsa (ulli)
Date Submitted: 2013-02-25 04:32

Assigned To:
Resolution: None
State: Open
Summary:
choice of method to extract text
Detailed Description:
i have with text2pdf (tried several  on the internet available versions of this
file) the problem that it extracts only a small amount of the included text. But
as an example "texttopdf" as a part of xpdf gives a much better result but with
the big disadvatage that it can only be used via shell-exec.
It would be useful to have the choice what method is used on an uploaded file to
extract the text.  Best would be if you can define a method bound on the file
extentsion - so you can add other files eg. excel, powerpoint...

History

Comments
avatar
Date: 2013-02-25 05:19
Posted By: Oliver Seddon (oliverseddon)

Hi Ursula,

I've added an updated version, 1.4.1 which has what appears to be a more
reliable PDF2text class. Could you give it a go and see what result you get?





Thanks
      
Updates

Updated: 2013-02-25 05:19
resolution_id: => 5