PDF: text searchable with Preview but no text content in Foxtrot Pro [message #1777] |
Wed, 06 March 2024 17:50 |
Artighel
Messages: 10 Registered: November 2021
Junior Member |
Hello all,
I am currently trialing FT Pro 8 and stumbled upon a perplexing issue:
I have a PDF whose text is selectable and searchable in plain old Preview, yet when it is indexed in FT, no text content is shown in the metadata (viewable with the "text only" FT view); it is therefore not searchable in FT!
I gather that this is an image-only PDF and that PDFKit may be doing on-the-fly OCR in Preview? In that case, is FT unable to do the same? That would avoid having to track down image-only PDF files and run OCR on them.
Thank you for any pointers!
Re: PDF: text searchable with Preview but no text content in Foxtrot Pro [message #1778 is a reply to message #1777] |
Wed, 06 March 2024 18:03 |
FoxTrot Engineering
Messages: 406 Registered: April 2020
Senior Member |
Indeed, Preview.app automatically performs OCR when you open PDF files containing images, leading the user to believe that the file contains indexable text.
FoxTrot can also perform OCR on PDF files, but it won't do this automatically at indexing time, as it would slow down indexing considerably. Therefore, you have to manually perform OCR on these files to convert them to OCR'ed PDF files that can be indexed.
To do so, search for PDF files by name or by type, select the files that you want to convert, and choose "PDF Optical Character Recognition" from the contextual menu (right-click).
You can specifically search for large PDF files with no textual content, which are very likely to contain scanned text: see our FoxTrot Tips page.
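As a rough illustration of the "large PDF with no textual content" idea, here is a minimal Python sketch of such a filter. It is not FoxTrot's actual search criteria: the `PdfInfo` record, the function names and both thresholds are assumptions made up for this example, and the extracted-text length is expected to come from whatever indexer or text extractor you already use.

```python
from dataclasses import dataclass


@dataclass
class PdfInfo:
    """Hypothetical record describing one PDF file."""
    path: str
    size_bytes: int   # file size on disk
    text_chars: int   # number of characters of extractable text (0 for image-only PDFs)


def likely_needs_ocr(info, min_size_bytes=200_000, max_text_chars=20):
    """Heuristic: a large file with (almost) no extractable text is probably a scan.

    Both thresholds are arbitrary assumptions; tune them for your own files.
    """
    return info.size_bytes >= min_size_bytes and info.text_chars <= max_text_chars


def ocr_candidates(infos, **thresholds):
    """Return the paths of the files that the heuristic flags for OCR."""
    return [info.path for info in infos if likely_needs_ocr(info, **thresholds)]
```

A small file with no text (for instance a blank page) is deliberately not flagged, since the size test is what separates scans from merely empty documents.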
Jérôme - FoxTrot Engineering
[Updated on: Wed, 06 March 2024 18:07]