Gumbo is not indexing entire file [message #1553]
Sun, 06 November 2022 15:25
Atlas
Messages: 140 Registered: August 2009
Senior Member
1. I have an HTML file that's over 100 MB (yes, I know it's big, but it's an archive export), and I've made Gumbo my default HTML indexer with `defaults write com.ctmdev.FoxTrotShared PreferGumbo -bool YES`.
2. I noticed that FoxTrot could not find certain words that I know are in the file.
3. When I view the HTML file as plain text in FoxTrot, I see that it contains text from only the first fifth of the file. Unsurprisingly, anything that isn't extracted as text isn't searchable in FoxTrot either.
4. As a sanity check, I ran the same search on a second machine that doesn't have Gumbo turned on, and there I can search the file just fine.
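For anyone who wants to reproduce step 3, here is a minimal Python sketch that reports where a search term first appears in the raw file, as a fraction of the file size. If the missing words all sit past roughly the 0.2 mark, that would be consistent with the extracted text being cut off there. The function name and the file name in the usage line are hypothetical, and this is a plain byte search on the raw HTML, not a reproduction of how FoxTrot or Gumbo extract text.

```python
import mmap
import os


def first_offset_fraction(path, term):
    """Return where `term` first occurs in the file as a fraction of the
    file size (0.0 = start of file), or None if it is not found.

    Assumption: the term is stored as UTF-8 in the raw HTML and is not
    split by tags or written with entities; the match is case-sensitive.
    """
    needle = term.encode("utf-8")
    size = os.path.getsize(path)
    if size == 0 or not needle:
        return None
    with open(path, "rb") as fh:
        # Memory-map the file so a 100 MB+ export is not read into RAM at once.
        with mmap.mmap(fh.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            pos = mm.find(needle)
    return None if pos < 0 else pos / size
```

Usage would look like `first_offset_fraction("export.html", "someword")`, run once for a word that FoxTrot finds and once for a word it misses.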
Question: Am I missing something in my settings? I know there's a size limit for text files, but is there also a size limit for HTML files when Gumbo is used?
Thank you.