FoxTrot Search Forum
FoxTrot Search for macOS Forum

Home » Public Forums » FoxTrot Search User Forum » Broken words at the end of line (How to remove the carriage return)
Re: Broken words at the end of line [message #1271 is a reply to message #1269] Wed, 29 September 2021 15:53 Go to previous messageGo to previous message
FoxTrot Engineering
Messages: 420
Registered: April 2020
Senior Member
Depending on how the PDF file has been created, it may be possible (but I am actually not sure of this) that hyphenated words are handled as a single word split on two lines; but usually, they are instead handled by the software generating the PDF as two distinct words separated by an hyphen and a line feed, and in this case, FoxTrot won't find the full word.

If you want to specifically search for a given hyphenated form of a word, you may search for a quoted string: ["be haviorism"].

If you want to search for different hyphenated forms of a word, you may search for a multiple quoted string using the | operator: ["behaviorism"|"be haviorism"|"behavio rism"].

If you are a regular expression expert, you may also try something like:
[any document of type] [PDF]
[then apply advanced filter] [contents] [contains the regular expression] [be(\p{Pd}\s+)?ha(\p{Pd}\s+)?vio(\p{Pd}\s+)?ri(\p{Pd}\s+)?sm ]
(in this regex, \p{Pd} matches any kind of dash character (minus, hyphen, dash etc) and \s+ any spacing character, including line feed or return).
Hope this helps


Jérôme - FoxTrot Engineering
 
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Previous Topic: how to find all non searchable pdf
Next Topic: Foxtrot Search over VPN from different locations (Foxtrot server)
Goto Forum:
  


Current Time: Thu Aug 21 14:44:56 GMT+2 2025