FoxTrot Search Forum: FoxTrot Search User Forum » Foxtrot seems confused with certain unicode characters

Re: Foxtrot seems confused with certain unicode characters [message #1745 is a reply to message #1744]

Thu, 07 December 2023 11:32

FoxTrot Engineering
You are right: "hyphen" and "non-breaking hyphen" are considered equivalent when "Ignore Composition" is enabled. This is, however, the intended behavior, because Unicode defines the "non-breaking hyphen" character (U+2011) as decomposing to the "hyphen" character (U+2010) in a non-breaking variant.
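
For anyone who wants to check this decomposition, it can be read from the Unicode Character Database, for example with Python's standard unicodedata module. This is only a sketch of what the Unicode data says; it does not show how FoxTrot implements the option internally.

```python
import unicodedata

NB_HYPHEN = "\u2011"  # NON-BREAKING HYPHEN

# Raw decomposition mapping from the Unicode Character Database:
# a <noBreak> compatibility tag followed by the target codepoint.
mapping = unicodedata.decomposition(NB_HYPHEN)

# Compatibility decomposition (NFKD) drops the tag and yields the target.
decomposed = unicodedata.normalize("NFKD", NB_HYPHEN)

print(mapping)                       # <noBreak> 2010
print(unicodedata.name(decomposed))  # HYPHEN
print(decomposed == "-")             # False: U+2010 is not the ASCII hyphen-minus U+002D
```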

We have updated the FAQ accordingly:
Ignore Composition: in Unicode, some characters can be encoded either as a single codepoint or as a sequence of codepoints. This is especially the case for accented lowercase Roman vowels (those that are part of ISO-8859-1) and for Korean letters. In addition, some characters can be decomposed to an “equivalent” character or sequence, e.g. ¼ to 1/4, ² to 2, ④ to 4, 𝒄 to c, non-breaking hyphen to hyphen, etc. When this option is enabled, both forms are considered equal.
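
As a rough illustration of the idea in the FAQ entry, the comparison can be approximated by normalizing both strings to Unicode compatibility decomposition (NFKD) before comparing them. The helper names fold_composition and equal_ignoring_composition are hypothetical, and using NFKD is an assumption made for this sketch; FoxTrot's actual matching rules may differ.

```python
import unicodedata


def fold_composition(text: str) -> str:
    """Hypothetical helper: reduce text to its compatibility decomposition (NFKD)."""
    return unicodedata.normalize("NFKD", text)


def equal_ignoring_composition(a: str, b: str) -> bool:
    """Hypothetical helper: compare two strings after folding composition."""
    return fold_composition(a) == fold_composition(b)


# Single codepoint versus a sequence of codepoints.
print(equal_ignoring_composition("\u00e9", "e\u0301"))              # True: é
print(equal_ignoring_composition("\ud55c", "\u1112\u1161\u11ab"))   # True: Korean syllable vs. jamo

# Characters with an "equivalent" decomposition.
print(equal_ignoring_composition("²", "2"))             # True
print(equal_ignoring_composition("④", "4"))             # True
print(equal_ignoring_composition("𝒄", "c"))             # True
print(equal_ignoring_composition("\u2011", "\u2010"))   # True: non-breaking hyphen vs. hyphen

# Under plain NFKD, ¼ becomes 1, FRACTION SLASH (U+2044), 4.
print(equal_ignoring_composition("¼", "1\u20444"))      # True
print(equal_ignoring_composition("¼", "1/4"))           # False: "/" is U+002F, not U+2044
```

The last line shows a limit of this sketch: matching ¼ against a typed "1/4" would need extra folding on top of NFKD, for instance mapping the fraction slash to the ordinary slash.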

Jérôme - FoxTrot Engineering
