Re: FoxTrot not indexing files containing utf16 tag [message #300 is a reply to message #299] |
Thu, 28 May 2015 10:58 |
FoxTrot Engineering
Messages: 406 Registered: April 2020
Senior Member |
elsacha wrote:
> We ran into a problem building an index with html files containing a utf16
> tag.
> If both lines and , are in the file, then FoxTrot indexes only the name of the
> file and does not index their contents at all.
FoxTrot relies on the Spotlight importers to extract indexable text from files; unfortunately, the RichText importer has some bugs or limitations concerning UTF-16 HTML files and/or HTML files with inconsistent charset tags.
If removing the charset=UTF-16 tag fixes the problem, maybe you can do a multi-file search and replace using, for example, TextWrangler?
If the files are really UTF-16, you may have to convert them to UTF-8; TextBatchConv can do this.
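If you would rather script it, here is a minimal sketch in Python of the two fixes suggested above: converting files that really are UTF-16 to UTF-8, and rewriting the charset=UTF-16 tag so it matches. This is not how FoxTrot, TextWrangler or TextBatchConv work internally; the function names `fix_charset_tag` and `convert_file` are made up for illustration. It assumes a file that starts with a byte-order mark is UTF-16, and that any other file is already UTF-8 with a wrong tag (a file in some other encoding will raise an error rather than be converted).

```python
import re
from pathlib import Path

# Matches charset=UTF-16 (optionally quoted, optional LE/BE suffix), any case.
CHARSET_RE = re.compile(r'(charset\s*=\s*["\']?)utf-?16(?:le|be)?', re.IGNORECASE)

def fix_charset_tag(html: str) -> str:
    """Rewrite any charset=UTF-16 declaration to charset=UTF-8."""
    return CHARSET_RE.sub(r'\1UTF-8', html)

def convert_file(path: Path) -> bool:
    """Convert one HTML file in place; return True if it was changed."""
    raw = path.read_bytes()
    if raw.startswith((b'\xff\xfe', b'\xfe\xff')):
        # A UTF-16 byte-order mark: the file really is UTF-16.
        text = raw.decode('utf-16')
    else:
        # Assumption: no BOM means the file is UTF-8 with a misleading tag.
        text = raw.decode('utf-8')
    new_raw = fix_charset_tag(text).encode('utf-8')
    if new_raw == raw:
        return False  # nothing to do
    path.write_bytes(new_raw)
    return True
```

To run it over a whole folder, something like `for p in Path("my_html_folder").rglob("*.html"): convert_file(p)` should do, where `my_html_folder` is a placeholder for your own path. The files are overwritten in place, so work on a copy first.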
Jérôme - CTM Engineering
------------------------------------------------------------ ---------
"FoxTrot is the one app with which I would have to go back to the PC
because Spotlight is so profoundly useless for serious research (don't
get me started). FoxTrot steps in and does about everything I need to
and does it quickly and with grace. Everytime I have emailed the devs,
I get a timely and responsive answer. I have a few quibbles of course
such as the use of non-standard Boolean operators (| instead of OR for
example) but overall I am very, very pleased.
Believe me, I am a serous researcher and this is what you want!"
FoxTrot Personal Search user comment on www.versiontracker.com
Download a demo version from www.foxtrot.ch
------------------------------------------------------------ ---------
Jérôme - FoxTrot Engineering