Is there a standard/preferred list order for non-alphanumeric characters?

58008@lemmy.world · 7 months ago

Is there a standard/preferred list order for non-alphanumeric characters?

CountVon@sh.itjust.works · edit-2 7 months ago

There is a Unicode Technical Standard for this, called the Unicode Collation Algorithm. Whether everyone uses it, I can’t say. As it says on the linked page:

Conformance to the Unicode Standard does not imply conformance to any UTS.

So in other words it’s possible to conform to the Unicode Standard without adhering to the Unicode Collation Algorithm.

whatever this is: ¦

That is the pipe symbol, or vertical bar. When it has a gap in the middle it may be known as the broken pipe symbol or broken bar. It’s considered the same symbol with or without the gap. Early terminals displayed it with a gap to make it distinguishable from lower-case L characters.

elmicha@feddit.de · 7 months ago

The vertical bar (pipe) and broken bar are not the same symbol. Wikipedia has a whole section about it (“Solid vertical bar versus broken bar”). Only the pipe character can be used for pipes in Linux/Windows/Mac terminals.

RegalPotoo@lemmy.world · 7 months ago

This is the technically correct answer, and like lots of things is waaaaay more complicated than you’d expect.

Shadow@lemmy.ca · 7 months ago

Ascii numbers?

fubo@lemmy.world · 7 months ago

If your input is limited to ASCII, sure.

But ASCII is only a 7-bit standard, and only supports those characters needed by American English computer users in the 1960s. Lots of characters you might see in “plain text” are not part of ASCII; including all accented characters, all non-Latin alphabets, and many common symbols and punctuation marks including these: £€¢©™°

(Yes, you could get accented characters in the pre-Unicode days using 8-bit “extended ASCII”, e.g. IBM/Windows code pages. However, those are not really ASCII and they will break if the text is interpreted as the wrong code page.)

Unicode collation is the Right Thing today.

Pronell@lemmy.world · 7 months ago

That’s the best standard I can think of.