#Unicode

Jim DeLaHunt

@jdlh@mstdn.ca

My quest at #fedicon2025 is to find #Fediverse services and handles with non-Latin characters. Can you link me to examples?
I hear there are many #Japan ese people active in Fediverse, but all the examples I see have only Latin script. #Unicode #Fedicon #Mastodon #UniversalAcceptance

Jim DeLaHunt

@jdlh@mstdn.ca

Jim DeLaHunt

@jdlh@mstdn.ca

Jim DeLaHunt

@jdlh@mstdn.ca

Michel Mariani

@mikaeru@mastodon.social

Beautifully crafted BabelStone Han font, by Andrew West 魏安

#BabelStone Han v. 15.1.3 is a free #Unicode #CJK #font with over 57,000 Han characters (#hanzi, #kanji, #hanja), and 62,061 Unicode characters in total. It is a Song/Ming style (宋体/明體) font, with glyphs modelled on the official character forms used in the People's Republic of China, and is primarily intended for writing Modern Standard #Chinese, Classical Chinese, and various Sinitic languages and dialects.

🔗 https://www.babelstone.co.uk/Fonts/Han.html

ALT text details

Repeated: 龙 U+9F99 U+31342 U+2EE5D

Michel Mariani

@mikaeru@mastodon.social

New in the CJK Variations utility of Unicopedia Sinica:

- Support for the latest Ideographic Variation Database (IVD 2025), adding the new CAAPH Collection.

- Support for the updated BabelStone Collection (unregistered), based on the latest BabelStone Han font (v17.0.0 BETA), by Andrew C. West (魏安), 1960-2025 RIP (安息吧).

🔗 https://https://codeberg.org/tonton-pixel/unicopedia-sinica

#Unicopedia #Unicode #Unihan #CJK #IdeographicVariationDatabase #IVD #CAAPH #BabelStone

ALT text details

Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+3AB4

ALT text details

Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+4E9B

Michel Mariani

@mikaeru@mastodon.social

New in the CJK Variations utility of Unicopedia Sinica:

- Support for the latest Ideographic Variation Database (IVD 2025), adding the new CAAPH Collection.

- Support for the updated BabelStone Collection (unregistered), based on the latest BabelStone Han font (v17.0.0 BETA), by Andrew C. West (魏安), 1960-2025 RIP (安息吧).

🔗 https://https://codeberg.org/tonton-pixel/unicopedia-sinica

#Unicopedia #Unicode #Unihan #CJK #IdeographicVariationDatabase #IVD #CAAPH #BabelStone

ALT text details

Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+3AB4

ALT text details

Screenshot of the CJK Variations utility of Unicopedia Sinica for Unicode character U+4E9B

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文, #漢文)!

#introduction

SnoopJ

@SnoopJ@hachyderm.io

I am once again talking about how in Swedish the emoji 🐙 and 🦑 may both be referred to as "bläckfisk" (and are in this sense 'the same') and how this is a good example of #Unicode problems someone who speaks only English may not even realize exist

Timwi

@Timwi@nerdculture.de

I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous

Paris Web

@ParisWeb@mamot.fr

Avec @MoritzBrouhaha, découvrez l'histoire du standard informatique Unicode, utilisé par tout le monde à travers le globe dans nos communications quotidiennes.

https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode

#unicode #standards #typographie #internationalisation

ALT text details

« À la découverte du monde au travers de l’Unicode » par Loïc Marleix

Paris Web

@ParisWeb@mamot.fr

Avec @MoritzBrouhaha, découvrez l'histoire du standard informatique Unicode, utilisé par tout le monde à travers le globe dans nos communications quotidiennes.

https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode

#unicode #standards #typographie #internationalisation

ALT text details

« À la découverte du monde au travers de l’Unicode » par Loïc Marleix

Paris Web

@ParisWeb@mamot.fr

Avec @MoritzBrouhaha, découvrez l'histoire du standard informatique Unicode, utilisé par tout le monde à travers le globe dans nos communications quotidiennes.

https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode

#unicode #standards #typographie #internationalisation

ALT text details

« À la découverte du monde au travers de l’Unicode » par Loïc Marleix

Paul Melis

@paulmelis@social.edu.nl

The recycling symbol ♻ in a git branch name, what a time to be alive 😎

Also, nice of #github to warn about possibly hidden characters, but not sure it applies in this case

https://github.com/JuliaLang/julia/pull/58418

#unicode

Paul Melis

@paulmelis@social.edu.nl

The recycling symbol ♻ in a git branch name, what a time to be alive 😎

Also, nice of #github to warn about possibly hidden characters, but not sure it applies in this case

https://github.com/JuliaLang/julia/pull/58418

#unicode

Michel Mariani

@mikaeru@mastodon.social · Reply to Michel Mariani's post

@electronjs

No Electron support for the latest Unicode version is a major hindrance for my open-source Unicopedia Plus application, which I have to keep in Beta version for a long time because of that...

https://codeberg.org/tonton-pixel/unicopedia-plus

#Unicode #Support #Unicopedia

Michel Mariani

@mikaeru@mastodon.social · Reply to Michel Mariani's post

@electronjs

No Electron support for the latest Unicode version is a major hindrance for my open-source Unicopedia Plus application, which I have to keep in Beta version for a long time because of that...

https://codeberg.org/tonton-pixel/unicopedia-plus

#Unicode #Support #Unicopedia

Head·word /ˈhedˌwɜː(ɹ)d/ n.

@headword@lingo.lol · Reply to Unicode Watch �🔍's post

@UnicodeWatch

Interesting to see letters like :Dania_LongI: , :Phonotypic_ith: , and :Phonotypic_oi: proposed for inclusion in Unicode! :Unicode:

#EnglishPhonotypicAlphabet #PhonotypicAlphabet #Phonotypic #Dania #Phonetic #Phonetics #PhoneticTranscription #Unicode

SnoopJ

@SnoopJ@hachyderm.io

TIL that in #Unicode, U+23BE through U+23CC are a series of symbols dedicated to the notation (?!) of dentistry

ALT text details

Dentistry notation symbols, listing Unicode characters from 23BE through 23CC, all of which have names beginning with "DENTISTRY SYMBOL" The symbols appear to indicate different directions and have overlaid shapes (circle, triangle, wave)

SnoopJ

@SnoopJ@hachyderm.io

TIL that in #Unicode, U+23BE through U+23CC are a series of symbols dedicated to the notation (?!) of dentistry

ALT text details

Michel Mariani

@mikaeru@mastodon.social

The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.

Current and future IRG source prefixes used to be listed in the main IRG homepage, but are now available in a separate dedicated page:

🔗 https://www.unicode.org/irg/prefixes.html

#unicode #unihan #irg #cjk #cjkv #cjkui

Michel Mariani

@mikaeru@mastodon.social · Reply to Le Monde.fr's post

@lemonde

<U+1F1E7, U+1F1EC> 🇧🇬 #Bulgarie #Bulgaria
<U+1F1ED, U+1F1FA> 🇭🇺 #Hongrie #Hungary

#Unicode #Emoji #Drapeaux #Flags

ALT text details

Emoji: drapeaux de la Bulgarie [BG] et de la Hongrie [HU]

Michel Mariani

@mikaeru@mastodon.social · Reply to Le Monde.fr's post

@lemonde

<U+1F1E7, U+1F1EC> 🇧🇬 #Bulgarie #Bulgaria
<U+1F1ED, U+1F1FA> 🇭🇺 #Hongrie #Hungary

#Unicode #Emoji #Drapeaux #Flags

ALT text details

Emoji: drapeaux de la Bulgarie [BG] et de la Hongrie [HU]

Stephan Michels

@michels@mastodon.social

I added typographic guides to my Unicode viewer. I first tried the new TextRenderer, but found it too limited. I then switched back to CoreText. However, I then noticed that SwiftUI was cutting off some parts of the glyphs. It seems that they don’t expect the glyphs to extend beyond their bounding box.

#SwiftUI #Unicode

Stephan Michels

@michels@mastodon.social

#SwiftUI #Unicode

Michel Mariani

@mikaeru@mastodon.social

Apart from the issue of line formatting of plain text in the new Unicode contact form <https://support.unicode.org/osticket/open.php>, it appears that some pretty innocuous characters such as the vertical bar | or the degree sign ° are getting stripped out from the latest reports, in <https://www.unicode.org/review/pri526/> for instance.

Ironically enough, it seems that the Unicode contact form is not Unicode-conformant/compliant then. Maybe some kind of "Make ASCII Great Again" thing?

#Unicode #ContactForm #MakeASCIIGreatAgain

ALT text details

Example of vertical bar | character getting stripped out from a PRI 526 report

ALT text details

Example of degree sign ° character getting stripped out from a PRI 526 report

Michel Mariani

@mikaeru@mastodon.social

Ironically enough, it seems that the Unicode contact form is not Unicode-conformant/compliant then. Maybe some kind of "Make ASCII Great Again" thing?

#Unicode #ContactForm #MakeASCIIGreatAgain

ALT text details

Example of vertical bar | character getting stripped out from a PRI 526 report

ALT text details

Example of degree sign ° character getting stripped out from a PRI 526 report

Aaron “#e14n pro” Madlon-Kay

@amake@mastodon.social

The iOS 18.5 SDK finally came out and the only change for Unicode coverage is the *removal* of a bunch of Sinhala codepoints:

ඁ෦෧෨෩෪෫෬෭෮෯𑇡𑇢𑇣𑇤𑇥𑇦𑇧𑇨𑇩𑇪𑇫𑇬𑇭𑇮𑇯𑇰𑇱𑇲𑇳𑇴

(Those of you on iOS 18.4: Enjoy seeing those glyphs while you can!)

#ios #unicode

Aaron “#e14n pro” Madlon-Kay

@amake@mastodon.social

The iOS 18.5 SDK finally came out and the only change for Unicode coverage is the *removal* of a bunch of Sinhala codepoints:

ඁ෦෧෨෩෪෫෬෭෮෯𑇡𑇢𑇣𑇤𑇥𑇦𑇧𑇨𑇩𑇪𑇫𑇬𑇭𑇮𑇯𑇰𑇱𑇲𑇳𑇴

(Those of you on iOS 18.4: Enjoy seeing those glyphs while you can!)

#ios #unicode

Michel Mariani

@mikaeru@mastodon.social · Reply to Michel Mariani's post

Unicode's new contact form at <https://support.unicode.org/osticket/open.php> is apparently an HTML editor "in disguise"; the only way I found to force it to keep the formatting of my plain text messages was to select the HTML mode and paste the text inside a <pre></pre> tag...

Still, some contents gets unexpectedly stripped out after submission of the report, like text between "<" and ">".

#Unicode #ContactForm #Sabotage

Michel Mariani

@mikaeru@mastodon.social · Reply to Michel Mariani's post

Still, some contents gets unexpectedly stripped out after submission of the report, like text between "<" and ">".

#Unicode #ContactForm #Sabotage

Timwi

@Timwi@nerdculture.de

I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous

Timwi

@Timwi@nerdculture.de

I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous

Michel Mariani

@mikaeru@mastodon.social

New utilities in Unicopedia Ægypta:

- Hieroglyph Picture Book
- Hieroglyph Taxonomy

🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta

#unicopedia #egyptian #hieroglyphs #taxonomy #picturebook #javascript #desktopapplication #electronjs #unicode

ALT text details

Hieroglyph Picture Book utility screenshot

ALT text details

Hieroglyph Taxonomy utility screenshot

Michel Mariani

@mikaeru@mastodon.social

New utilities in Unicopedia Ægypta:

- Hieroglyph Picture Book
- Hieroglyph Taxonomy

🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta

#unicopedia #egyptian #hieroglyphs #taxonomy #picturebook #javascript #desktopapplication #electronjs #unicode

ALT text details

Hieroglyph Picture Book utility screenshot

ALT text details

Hieroglyph Taxonomy utility screenshot

Michel Mariani

@mikaeru@mastodon.social

In case my feedback to the UTC gets garbled once again, here are the links to the plain text messages I attempted to submit through copy-paste from their new contact page <https://support.unicode.org/osticket/open.php>: no truly WYSIWYG editor, no basic preview mode either...

https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-19.txt
https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-18.txt
https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-13.txt

I'm dreaming of a simple world without technology wanting to "help" us so much. We shouldn't have to struggle to achieve simple tasks...

#Unicode #ContactForm #BadDesign

Michel Mariani

@mikaeru@mastodon.social

I'm dreaming of a simple world without technology wanting to "help" us so much. We shouldn't have to struggle to achieve simple tasks...

#Unicode #ContactForm #BadDesign

Ian Wagner

@ianthetechie@fosstodon.org

Today's fun with Unicode, OpenStreetMap, Foursquare OS Places, and giving users useful search results :)

https://ianwwagner.com/til/unicode-normalization-forms

#unicode #programming #openstreetmap

ALT text details

A list of search results showing the same restaurant twice. The first result has a different appearance, being comprised of mathematical unicode symbols.

SnoopJ

@SnoopJ@hachyderm.io

SnoopJ

@SnoopJ@hachyderm.io

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

Michel Mariani

@mikaeru@mastodon.social

From time to time (since this represents a tremendous amount of translation/adaptation work), a French version of the "code charts" gets published by the Unicode Consortium: the latest one is for Unicode 16.0:

https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf

This is especially useful for French speakers in #Canada, #France, #Belgium, #Switzerland, etc. but may soon be obsolete for #Quebec, in case it gets "absorbed" by a neighboring country whose official language is now English only...

#unicode #codecharts #french

Michel Mariani

@mikaeru@mastodon.social

https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf

#unicode #codecharts #french

Michel Mariani

@mikaeru@mastodon.social

De temps en temps (cela représente un énorme travail d'adaptation), une version française des "code charts" est publiée par le Consortium Unicode, la dernière en date est pour Unicode 16.0:

https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf

Malheureusement, celle-ci risque d'être bientôt obsolète pour les francophones de la belle province de Québec, dans le cas où celle-ci serait «absorbée» par un pays voisin dont la langue officielle est désormais uniquement l'anglais...

#unicode #codecharts #français #québec #canada

Michel Mariani

@mikaeru@mastodon.social

De temps en temps (cela représente un énorme travail d'adaptation), une version française des "code charts" est publiée par le Consortium Unicode, la dernière en date est pour Unicode 16.0:

https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf

#unicode #codecharts #français #québec #canada

Steve Faulkner

@SteveFaulkner@mastodon.social

👁️short note on emoji text alternative variations

"Unicode symbols do not have inbuilt text alternatives. They are exposed in the browser accessibility tree as a text symbol"

#emoji #screenreaders #a11y #unicode #webDev

https://html5accessibility.com/stuff/2022/01/17/short-note-on-emoji-text-alternative-variations/

Steve Faulkner

@SteveFaulkner@mastodon.social

👁️short note on emoji text alternative variations

"Unicode symbols do not have inbuilt text alternatives. They are exposed in the browser accessibility tree as a text symbol"

#emoji #screenreaders #a11y #unicode #webDev

https://html5accessibility.com/stuff/2022/01/17/short-note-on-emoji-text-alternative-variations/

Steve Faulkner

@SteveFaulkner@mastodon.social

👁️short note on emoji text alternative variations

"Unicode symbols do not have inbuilt text alternatives. They are exposed in the browser accessibility tree as a text symbol"

#emoji #screenreaders #a11y #unicode #webDev

https://html5accessibility.com/stuff/2022/01/17/short-note-on-emoji-text-alternative-variations/

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Anatolica is a developer-oriented set of #Unicode utilities related to Anatolian hieroglyphs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-anatolica

#anatolian #hieroglyphs #unicopedia #javascript #unicode #characters #codepoints #codecharts #desktopapplication #electronjs #glyphs #localfonts

ALT text details

Unicopedia Anatolica Social Preview

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Anatolica is a developer-oriented set of #Unicode utilities related to Anatolian hieroglyphs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-anatolica

#anatolian #hieroglyphs #unicopedia #javascript #unicode #characters #codepoints #codecharts #desktopapplication #electronjs #glyphs #localfonts

ALT text details

Unicopedia Anatolica Social Preview

Michel Mariani

@mikaeru@mastodon.social

Considerations about Egyptian Hieroglyph legacy characters, by Michel Suignard, proposing to add a new kEH_AltMapping property to the Unikemet database (UAX#57):

🔗 https://www.unicode.org/L2/L2025/25110-egyptian.pdf

#unicode #unikemet #hieroglyphs

ALT text details

Examples of Egyptian hieroglyph variants: 𓃺𓃹 𓅩𓅨 𓅫𓅪 𓂩𓂧

ALT text details

Examples of pairs of horizontally mirrored Egyptian hieroglyphs: 𓁜𓁝 𓁛𓁞 𓁩𓁪 𓁫𓁬

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Ægypta is a developer-oriented set of #Unicode utilities related to Egyptian hieroglyphs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta

#characters #codecharts #codepoints #desktopapplication #egyptian #electronjs #glyphs #hieroglyph #hieroglyphs #javascript #localfonts #unicode #unicopedia #unikemet

ALT text details

Unicopedia Ægypta Social Preview

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Plus is a developer-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-plus

#characters #chinese #cjk #codepoints #desktopapplication #electronjs #emoji #ivd #japanese #javascript #kangxi #kangxiradicals #korean #normalization #opensource #regex #segmentation #strokecount #unicode #unicopedia #unihan #unikemet

ALT text details

Unicopedia Plus Social Preview

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Sinica is a developer-oriented set of #Unicode utilities related to ideographs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica

#characters #chinese #cjk #cjkrelated #cjkv #codecharts #codepoints #components #confusables #desktopapplication #electronjs #glyphs #ideographs #ideographicdescriptionsequences #ids #japanese #javascript #kangxi #kangxiradicals #korean #localfonts #opensource #strokes #tangut #unicode #unicopedia #unihan #vietnamese

ALT text details

Unicopedia Sinica Social Preview

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Symbols: #Diversity

U+2640 FEMALE SIGN
U+2642 MALE SIGN
U+26A2 DOUBLED FEMALE SIGN
U+26A3 DOUBLED MALE SIGN
U+26A4 INTERLOCKED FEMALE AND MALE SIGN
U+26A5 MALE AND FEMALE SIGN
U+26A6 MALE WITH STROKE SIGN
U+26A7 MALE WITH STROKE AND MALE AND FEMALE SIGN
U+26A8 VERTICAL MALE WITH STROKE SIGN
U+26A9 HORIZONTAL MALE WITH STROKE SIGN
U+26B2 NEUTER

ALT text details

Unicode Symbols: Diversity ♀♂⚢⚣⚤⚥⚦⚧⚨⚩⚲

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Emoji: #Hearts #Galore

U+2764 U+FE0F U+1FA77 U+1F9E1 U+1F49B U+1F49A U+1F499 U+1FA75 U+1F49C U+1F90E U+1F5A4 U+1FA76 U+1F90D

U+1F49F U+2764 U+FE0F U+200D U+1F525 U+1F494 U+2764 U+FE0F U+200D U+1FA79 U+2763 U+FE0F U+1F498 U+1F493 U+1F497 U+1F496 U+1F49D U+1F495 U+1F49E

U+1F970 U+1F60D U+1F618 U+1F63B U+1F48C U+1FAF6 U+1FAF6 U+1F3FB U+1FAF6 U+1F3FC U+1FAF6 U+1F3FD U+1FAF6 U+1F3FE U+1FAF6 U+1F3FF U+1FAC0

ALT text details

Unicode Emoji: Hearts Galore ❤️🩷🧡💛💚💙🩵💜🤎🖤🩶🤍 💟❤️‍🔥💔❤️‍🩹❣️💘💓💗💖💝💕💞 🥰😍😘😻💌🫶🫶🏻🫶🏼🫶🏽🫶🏾🫶🏿🫀

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Emoji: #Sweat & #Tears

U+1F4A6 U+1F4A7 U+1F979 U+1F639 U+1F63F

U+1F602 U+1F605 U+1F613 U+1F622 U+1F625 U+1F62A U+1F62D U+1F630 U+1F923 U+1F972 U+1F975

ALT text details

Unicode Emoji: Sweat & Tears 💦💧🥹😹😿 😂😅😓😢😥😪😭😰🤣🥲🥵

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Emoji: #Skintones

U+1F473 U+1F473 U+1F3FB U+1F473 U+1F3FC U+1F473 U+1F3FD U+1F473 U+1F3FE U+1F473 U+1F3FF

U+1F478 U+1F478 U+1F3FB U+1F478 U+1F3FC U+1F478 U+1F3FD U+1F478 U+1F3FE U+1F478 U+1F3FF

ALT text details

Unicode Emoji: Skin Tones 👳➔👳🏻👳🏼👳🏽👳🏾👳🏿 👸➔👸🏻👸🏼👸🏽👸🏾👸🏿

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Emoji: #Math #Geekiness

<U+1F605> <U+1F4A7> <U+1F604>

ALT text details

Unicode Emoji: Math Geekiness log(😅) =💧log(😄)

Michel Mariani

@mikaeru@mastodon.social

#Unicode #Emoji: #Japanese #Buttons

U+1F201 U+1F202 U+FE0F U+1F233 U+1F237 U+FE0F U+1F236 U+1F21A U+1F251 U+1F238 U+1F23A U+000A U+1F22F U+1F250 U+1F239 U+1F232 U+1F234 U+3297 U+FE0F U+3299 U+FE0F U+1F235

ALT text details

Unicode Emoji: Japanese Buttons 🈁🈂️🈳🈷️🈶🈚🉑🈸🈺 🈯🉐🈹🈲🈴㊗️㊙️🈵

▇ ▃ ▃ ▇ ▇ ▄ ▃ ▆ [(-.-)]

@liilliil@im-in.space

Offering a new #FediverseSymbol: ꙮ

The previously suggested symbol ⁂ is good for depict group and unity, but is poor in terms of associations: “3 snowflakes”.

Polish fediusers have noticed a piece of an old Russian manuscript, it says about ‘many-eyed seraphim’ (серафим многоокий). An unknown 15th-century monk played with the combination of the letters oo, turning them into a multi-eyed creature. The character found in only 1 manuscript, but despite this, it has been added into #Unicode.

Not only does the symbol beautifully reflect the unity of the fediverse, but it also shows an all-seeing open-minded wise and powerful being (Ezekiel 1:18, 10:12 etc)

also: https://social.hackerspace.pl/@q3k/110446350216259023

Andrew C

@achadwick@urbanists.social

Hey, fedi #Unicode nerds! :boostRequest:

#OpenStreetMap's Andy Mabbett (@Pigsonthewing) is asking whether anyone knows about any instances of the #OrdnanceSurvey's bench mark symbol appearing in actual print, on a page. Looks a bit like ⭱ or ⤒ but a broader arrow. Usually found carved on stone or brick all over the UK/ROI.

Their goal is to propose it as a Unicode symbol! https://community.openstreetmap.org/t/os-bench-mark-symbol-in-printed-documents/128182

Any known international usage of this symbol would doubtless be appreciated too

@openstreetmap

ALT text details

Another non-print example. This is another form of the symbol. I don't know how common. This one's a pre-cast metal (?) plaque with a serial number set into a wall. The arrow lacks the top bar, but above it, along with the O and S of Ordnance Survey, are some very specific-looking slots. Perhaps the slots accepted some sort of surveying equipment. Photo by Gary Rogers on geograph. Links can be found in this thread below.

ALT text details

A non-print example. The most common form of the symbol, although other variants exist. It's carved into a smooth block of stone on the side of a building or monument plinth, and it looks like a capital T with an upside down V overlaid onto it so that the two angled lines from the V come together with the T's vertical stroke to meat its horizontal stroke at a single point. All the bottom lines taper toward that point. Looking at it another way, it's a horizontal line with an arrow pointing at it, saying "here, this level!" They were used for marking height reference points during various surveys of the British isles. Photo by Mike Taylor on geograph.org.uk, CC:by

Andrew C

@achadwick@urbanists.social

Hey, fedi #Unicode nerds! :boostRequest:

Their goal is to propose it as a Unicode symbol! https://community.openstreetmap.org/t/os-bench-mark-symbol-in-printed-documents/128182

Any known international usage of this symbol would doubtless be appreciated too

@openstreetmap

ALT text details

Michel Mariani

@mikaeru@mastodon.social · Reply to Michel Mariani's post

Today (April Fools' Day), Adobe is apparently back to the list of full members (voting) of the Unicode Consortium, but for how long this time: one full year?

« Ça s’en va et ça revient
C’est fait de tout petits riens
Ça se chante et ça se danse
Et ça revient, ça se retient
Comme une chanson populaire »

Full members (voting) of the Unicode Consortium: Adobe, Airbnb, Amazon, Apple, Google, Meta, Microsoft, Salesforce, Translated.

https://home.unicode.org/membership/members/

#unicode #members

ALT text details

Full members (voting) of the Unicode Consortium: Adobe, Airbnb, Amazon, Apple, Google, Meta, Microsoft, Salesforce, Translated.

SnoopJ

@SnoopJ@hachyderm.io

the most important part of #Unicode history is when a mouse fell out of a light fixture and got added to the count of members present at a Technical Committee meeting (9 Nov 2016)

https://www.unicode.org/L2/L2016/16325.htm#149-A94

ALT text details

Screenshot of meeting notes for UTC Meeting 149. Text reads: Mouse now present. 6.502 members represented. [149-A94] Action Item for Landlord: Capture and exile the mouse that just fell out of the light fixture.

Terence Eden

@Edent@mastodon.social

Which is your favourite #Unicode telephone?

Option	Voters
🕾	1 (1%)
🕿	5 (7%)
☏	18 (27%)
☎	43 (64%)

Terence Eden

@Edent@mastodon.social

Which is your favourite #Unicode telephone?

Option	Voters
🕾	1 (1%)
🕿	5 (7%)
☏	18 (27%)
☎	43 (64%)

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

sibaku

@sibaku@mas.to

Found out something interesting/annoying related to #unicode! There is an issue with the character 浅. You might see it one of two ways (see screenshots) depending on which font you use, which was the cause of my confusion. One form has 2 and the other 3 horizontal strokes. So why is that?

ALT text details

The simplified Chinese Hanzi equivalent of 浅

ALT text details

The Japanese kanji 浅

Michel Mariani

@mikaeru@mastodon.social

The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.

The IRG homepage is now including comprehensive lists of current and future IRG source prefixes...

🔗 https://www.unicode.org/irg/

#unicode #unihan #irg #cjk #cjkv #cjkui

Yngve Mardal Moe 🐍🐢🪡

@yngvem@fosstodon.org

It's happening, @marieroald and I are doing our third #PyConUS, this time with a tutorial on Packaging with uv and a talk about #Unicode in #Python!

Yngve Mardal Moe 🐍🐢🪡

@yngvem@fosstodon.org

It's happening, @marieroald and I are doing our third #PyConUS, this time with a tutorial on Packaging with uv and a talk about #Unicode in #Python!

sibaku

@sibaku@mas.to

ALT text details

The simplified Chinese Hanzi equivalent of 浅

ALT text details

The Japanese kanji 浅

Martin Owens :inkscape:

@doctormo@floss.social

It might have taken an ungodly amount of time. But getting these corner cases right in this PDF export is going to mean the world to a lot of people.

Arabic and Hebrew and non messing up the glyphs.

#inkscape #pdf #cmyk #arabic #language #unicode #text #glyphs #hewbrew

ALT text details

Sample Text on three PDF pages read: מילים נסתרות كلمات مخفية مرحبا بالعالم "Text on Path" curved on a thick line "تجربة نص على المنحى" curved on a thin line "What is Lorem Ipsum?" ... full text explaining lorum ipsum flowing around a large lack circle ... "Can we do Arabic?" ... A passage in arabic from the Quran flowing around a smaller black circle ...

Martin Owens :inkscape:

@doctormo@floss.social

It might have taken an ungodly amount of time. But getting these corner cases right in this PDF export is going to mean the world to a lot of people.

Arabic and Hebrew and non messing up the glyphs.

#inkscape #pdf #cmyk #arabic #language #unicode #text #glyphs #hewbrew

ALT text details

Jake in the desert

@jake4480@c.im

Some Pac-Man and other alien space invadery type symbols now in Unicode, via this Symbols for Legacy Computing Supplement: https://unicode.org/charts//PDF/Unicode-16.0/U160-1CC00.pdf

#Unicode

ALT text details

A portion from the linked PDF of Unicode symbols showing Pac-Man and Space Invaders type symbols

Jake in the desert

@jake4480@c.im

Some Pac-Man and other alien space invadery type symbols now in Unicode, via this Symbols for Legacy Computing Supplement: https://unicode.org/charts//PDF/Unicode-16.0/U160-1CC00.pdf

#Unicode

ALT text details

A portion from the linked PDF of Unicode symbols showing Pac-Man and Space Invaders type symbols

Jake in the desert

@jake4480@c.im

Some Pac-Man and other alien space invadery type symbols now in Unicode, via this Symbols for Legacy Computing Supplement: https://unicode.org/charts//PDF/Unicode-16.0/U160-1CC00.pdf

#Unicode

ALT text details

A portion from the linked PDF of Unicode symbols showing Pac-Man and Space Invaders type symbols

Dan 🌈

@phrawzty@hachyderm.io

Today I learned that there is a specific #unicode "record separator" symbol, formally known as "U+001E Information Separator Two".

https://codepoints.net/U+001E

It is meant to be used to indicate a separation between two units of information. An example of where this could be used is in a separated-value file, e.g. a CSV, but using this symbol instead of a comma.

This is interesting because there are vanishingly few instances where the record separator symbol would appear in most contexts, but many instances where a comma appears. Using this symbol instead of a comma (or a semi-colon, or an exclamation point, or any one of the usual separators) could make some data hygiene scenarios much more straightforward.

Dan 🌈

@phrawzty@hachyderm.io

Today I learned that there is a specific #unicode "record separator" symbol, formally known as "U+001E Information Separator Two".

https://codepoints.net/U+001E

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

Axel ⌨🐧🐪🚴😷 | R.I.P Natenom

@xtaran@chaos.social

#UserAgent based banning of #textmode browsers is sooooo lame.

$ lynx -useragent=🖕 https://[…]

#Lynx #TUI #Browser #TextModeBrowser #Unicode #UTF8 #Emoji

SnoopJ

@SnoopJ@hachyderm.io

After a long period of quiet, I have released an update to the `unicode-age` #Python package

https://pypi.org/project/unicode-age/

The package now supports #Unicode 16.0

Matthias Wiesmann

@thias@mastodon.social

Treasure Hunt – Braille Hints

So I prepared a treasure hunt for my older daughter, which involved some form of coded message. I found a braille table I could 3D-print, using a real system instead of some made-up code gave me the opportunity to explain how/why this was used in reality, you find braille codes in lifts, staircase handrails.

#3dprinting #braille #unicode #python

https://wiesmann.codiferes.net/wordpress/archives/37764

SnoopJ

@SnoopJ@hachyderm.io

TIL that the #Unicode Consortium is working on guidance for detecting "URLs"¹ in text:

https://www.unicode.org/L2/L2024/24217r2-uts58-working-draft.html

¹ scare quotes because URL is formally defined as ASCII-only, but "IRI" is a confusing term and everybody just wants to call the Unicode-aware equivalent a "URL"

Ausir

@ausir@meowr.me

brand new combining diacritics dropping soon in Unicode 17, to be used for transcribing rare historical uses, and even more so for really tryhard conlangs!

#linguistics #unicode #conlang

Simon Tatham

@simontatham@hachyderm.io

In the old #ASCII days, you could change a letter between upper and lower case by XORing its character code with 0x20. Of course, if you tried this with anything that wasn't a letter, you'd get nonsense results.

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Paul McGuire

@ptmcg@fosstodon.org

Here are some emojidentifiers for your next Python code:

import math
乁_ツ_ㄏ = None
乁_益_ㄏ = math.nan

def minnums(values: list | 乁_ツ_ㄏ = 乁_ツ_ㄏ):
if (
values is 乁_ツ_ㄏ
or not all(isinstance(n, (float, int))
for n in values)
):
return 乁_益_ㄏ
return min(values)

#python #unicode #emoji

SnoopJ

@SnoopJ@hachyderm.io

After a long period of quiet, I have released an update to the `unicode-age` #Python package

https://pypi.org/project/unicode-age/

The package now supports #Unicode 16.0

SnoopJ

@SnoopJ@hachyderm.io

After a long period of quiet, I have released an update to the `unicode-age` #Python package

https://pypi.org/project/unicode-age/

The package now supports #Unicode 16.0

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Simon Tatham

@simontatham@hachyderm.io

If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.

A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.

>>> chr(ord('🚗') ^ 0x20)
'🚷'

Frank Meerkötter

@fmeerkoetter@mountains.social

Love this book/comic the kids picked up from the library.

#comics #graphicnovel #unicode

Frank Meerkötter

@fmeerkoetter@mountains.social

Love this book/comic the kids picked up from the library.

#comics #graphicnovel #unicode

Markus Redeker

@mrdk@mathstodon.xyz · Reply to 0xDE's post

@11011110 At least these symbols have a meaning! But nobody knows what “Angzarr” (⍼) is and why it is in Unicode (https://en.wikipedia.org/wiki/Angzarr).

#Unicode #Angzarr #Mathematics

Markus Redeker

@mrdk@mathstodon.xyz · Reply to 0xDE's post

@11011110 At least these symbols have a meaning! But nobody knows what “Angzarr” (⍼) is and why it is in Unicode (https://en.wikipedia.org/wiki/Angzarr).

#Unicode #Angzarr #Mathematics

Revath S Kumar

@revathskumar@fosstodon.org · Reply to Revath S Kumar :javascript:'s post

Wrote a small web utility to visualize the different string normalization forms of a text.

https://string-normalize.surge.sh/?str=I+%e2%99%a5+K%c3%b6ln

Not the best design 😄 , but feedbacks are welcome.

#javascript #TypeScript #string #unicode

ALT text details

desktop view of string normalize web page, showing NFC, NFD, NFKC and NFKD normalization forms of text "I ♥ Köln" is visible

ALT text details

mobile view of string normalize web page, showing NFC, NFD and NFKC normalization forms of text "I ♥ Köln" is visible

Michel Mariani

@mikaeru@mastodon.social

New utility in Unicopedia Sinica:
- Pan-CJK Font Variants
(port from Unicopedia Plus, with Serif/明朝体 font style instead of Sans-Serif/ゴシック体)

🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica

#unicopedia #unicode #unihan #cjkfont #variants

ALT text details

Pan-CJK Font Variants utility screenshot

Michel Mariani

@mikaeru@mastodon.social

New utility in Unicopedia Plus:
- Unihan Phonetics

🔗 https://codeberg.org/tonton-pixel/unicopedia-plus

#unicopedia #unicode #unihan #phonetics

ALT text details

Unihan Phonetics utility screenshot

Revath S Kumar

@revathskumar@fosstodon.org · Reply to Revath S Kumar :javascript:'s post

Wrote a small web utility to visualize the different string normalization forms of a text.

https://string-normalize.surge.sh/?str=I+%e2%99%a5+K%c3%b6ln

Not the best design 😄 , but feedbacks are welcome.

#javascript #TypeScript #string #unicode

ALT text details

desktop view of string normalize web page, showing NFC, NFD, NFKC and NFKD normalization forms of text "I ♥ Köln" is visible

ALT text details

mobile view of string normalize web page, showing NFC, NFD and NFKC normalization forms of text "I ♥ Köln" is visible

SnoopJ

@SnoopJ@hachyderm.io

have you ever "naturally" (i.e. not discussion among #Unicode experts) encountered a font that correctly renders ꙮ?

Option	Voters
yes	0 (0%)
no	0 (0%)
what the hell are you talking about	0 (0%)

Revath S Kumar

@revathskumar@fosstodon.org

New blog post : "JavaScript : understanding string normalize"

https://blog.revathskumar.com/2025/01/javascript-understanding-string-normalize.html

#javascript #string #unicode

Qiita - 人気の記事

@qiita@rss-mstdn.studiofreesia.com

Unicode - 恩恵と厄介事
https://qiita.com/chai0917/items/16fa57fc3078c5314d8d?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items

#qiita #Windows #C #Unicode

Qiita - 人気の記事

@qiita@rss-mstdn.studiofreesia.com

[謹賀新年] 世界中に配置した Oracle Active Data Guard から新年のご挨拶
https://qiita.com/shirok/items/1da55c23b33c5228049a?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items

#qiita #oracle #Unicode #oci #dataguard #OracleDatabase

Paul McGuire

@ptmcg@fosstodon.org · Reply to Axel Rauschmayer's post

@rauschma Ah! I did something similar in Python - this is valid Python code:

def ℎ𝕖𝐥l𝙤():
try:
ℎ𝙚𝕝𝗹𝘰_ = "Hello"
w𝔬𝓇ˡ𝚍﹎ = "World"
𝖕𝘳𝒊𝖓𝑡(f"{𝗵𝒆𝘭𝓵𝚘﹍}, {𝑤º𝘳ｌ𝑑︴}!")
except Ｔ𝗒ₚ𝕖E𝗿𝗋𝗈𝓻 as ᵉ𝒙ⅽ:
𝐩ᵣ𝚒𝖓𝓉("failed: {}".𝕗𝕠r𝑚𝖺𝘵(ⅇ𝔵𝚌))

if _︳ｎ𝗮𝖒𝓮﹍︳ == "__main__":
h𝙚ⅼ𝐥𝕠()

https://ptmcg.pythonanywhere.com/font_mixer

#python #unicode

Scott Williams 🐧

@vwbusguy@mastodon.online

"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."

Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"

Senior level dev: "Oh, brilliant - a test of tolerance for pain by evaluating various code point chains with emoji, accents, and LTR/RTL markers. I'll start by writing some tests for 8-bit ord and char conversions with lookahead evals."

#unicode #programming

ALT text details

Python docs showing how the same one letter can count for one or two character lengths in unicode depending on the code point definition.

Scott Williams 🐧

@vwbusguy@mastodon.online

"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."

Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"

#unicode #programming

ALT text details

Python docs showing how the same one letter can count for one or two character lengths in unicode depending on the code point definition.

Scott Williams 🐧

@vwbusguy@mastodon.online

"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."

Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"

#unicode #programming

ALT text details

Python docs showing how the same one letter can count for one or two character lengths in unicode depending on the code point definition.

SiljeLB

@siljelb@snabelen.no

TIL that a proposal was made in 1997 to add #tengwar to #unicode. I'm disappointed it hasn't been made official yet though. Here's a link to the proposal document: https://www.unicode.org/wg2/docs/n1641.pdf #Tolkien #LordOfTheRings

omg! ubuntu

@omgubuntu@floss.social

Ubuntu LTS users will shortly be able to see and use the 8 new emoji included in Unicode 16.0.

https://www.omgubuntu.co.uk/2024/12/ubuntu-update-support-for-emoji-16-0

#ubuntu #unicode #opensource

Michel Mariani

@mikaeru@mastodon.social

In the open-source application `Unicopedia Sinica`, both data files used for the `CJK Components` and the `CJK Related` utilities are now in a consistent JSON format with MIT license: `cjk-ids.json` and `cjk-related.json` respectively.

🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica

#unicopedia #cjk #unihan #unicode #json

ALT text details

CJK Related utility screenshot

ALT text details

CJK Components utility screenshot

ALT text details

CJK Related utility screenshot

SnoopJ

@SnoopJ@hachyderm.io

HUH, #Unicode UAX#31 offers official guidance on hashtag identifiers, and I have somehow managed to miss that completely for several years (introduced along with Unicode 11.0 in 2018).

https://www.unicode.org/reports/tr31/#hashtag_identifiers

It's not like I re-read the whole document regularly or anything but yea huh

Aaron “#e14n pro” Madlon-Kay

@amake@mastodon.social · Reply to Aaron “#e14n pro” Madlon-Kay's post

iOS 18.2 did not add any new #Unicode coverage, at least at the code point level. Nevertheless, I have updated #IsItTofu

https://tofu.quest/?q=%F0%9F%A5%A8

洪民憙 (Hong Minhee)

@hongminhee@hollo.social

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.

#introduction

Eniko | Kitsune Tails out now!

@eniko@peoplemaking.games

Btw here's a little #gamedev unicode protip: unicode defines several character ranges as private use areas. You can map code points in these ranges to whatever glyph you want. This can be very handy for custom characters in your game that won't conflict with established unicode characters

In our games we use the PUA for keyboard and controller button glyphs

#unicode

Ausir

@ausir@meowr.me

brand new combining diacritics dropping soon in Unicode 17, to be used for transcribing rare historical uses, and even more so for really tryhard conlangs!

#linguistics #unicode #conlang

Michael Zöllner

@emnullfuenf@chaos.social

My study "Unicode Spaces" will be published in Slanted Magazine - Experimental Type 3!

#typography #unicode #whitespace

ALT text details

Listing of Unicode white space characters

ALT text details

Steamboat Willy formed with whitespaces in text.

ALT text details

Flower formed with whitespaces in text.

SnoopJ

@SnoopJ@hachyderm.io

TIL that the #Unicode Consortium is working on guidance for detecting "URLs"¹ in text:

https://www.unicode.org/L2/L2024/24217r2-uts58-working-draft.html

¹ scare quotes because URL is formally defined as ASCII-only, but "IRI" is a confusing term and everybody just wants to call the Unicode-aware equivalent a "URL"

Marcus Rohrmoser 🌻

@mro@digitalcourage.social · Reply to zirias (on snac)'s post

@zirias @stefano #hashtags are #unicode defined: https://www.unicode.org/reports/tr31/#D2

read 'em like this https://codeberg.org/seppo/seppo/src/commit/87bf300/lib/tag.ml#L31

Terence Eden

@Edent@mastodon.social

iOS 14 gets support for the Unicode Power Symbol!

https://shkspr.mobi/blog/2020/09/ios-14-gets-support-for-the-power-symbol/

#emoji #ios #power #unicode

Jim DeLaHunt

@jdlh@mstdn.ca · Reply to Jim DeLaHunt's post

A cool change is that the Core Specification of the Unicode Standard is now released as a static HTML subsite, backed up by an archiveable #PDF of 1,140 pages.

https://unicode.org/versions/Unicode16.0.0/core-spec/

You can now link to specific sections and paragraphs, e.g.

"Unicode is about plain text, see: https://unicode.org/versions/Unicode16.0.0/core-spec/chapter-2/#G642" .

I helped out in a small way with the project to produce the core spec as HTML + PDF. I think it is a marvellous improvement.

#18n #fonts #PDF #unicode

Jim DeLaHunt

@jdlh@mstdn.ca

Yay! #Unicode version 16.0 is released!

Announcement: https://blog.unicode.org/2024/09/announcing-unicode-standard-version-160.html

#18n #fonts #PDF #unicode

liilliil 🇫🇯🇱🇨

@liilliil@mastodon.online

Народ, айда форсить наш, славянский, кириллический #fediverseSymbol!
«Три снежинки» — ⁂ — потенциальный повод для многочисленных подъёбок

Польские ребята (@brie) нашли лучшего кандидата — ꙮ, «серафим многꙮкий». Символ, найденный в 1928 году только в одной (!) рукописи, и только из-за этого (!) добавленный в #Unicode несколько веков ждал своего часа
https://ru.wikipedia.org/wiki/Мультиокулярная_О

(English version https://im-in.space/@liilliil/113028392518272881 )

AmyFou 🥥🌴

@amyfou@lingo.lol

I am a #linguist (non-tenure track, uni) interested in every single thing about #languages, esp #Indigenous ones, #academics & #teaching Side gig in #ComunityBased #LanguageTech (#webdev #React #postgres #hasura #graphQL #nodeJS #nginx #linux #podman #kubernetes #docker #unicode lol). I love #animals and will ask you too many questions about your #dogs #cats #horses #sheep #goats #chickens #bunnies #piggies #cows etc . Proud #UglyDogs fan. Love #nature #birds #photography #art 👋

洪民憙 (Hong Minhee)

@hongminhee@fosstodon.org · Reply to 洪民憙 (Hong Minhee)'s post

こんにちは、私はソウルに住んでいる30代後半のオープンソースソフトウェアエンジニアで、自由・オープンソースソフトウェアとフェディバースの熱烈な支持者です。名前は洪民憙（ホン・ミンヒ）です。

私はTypeScript用のActivityPubサーバーフレームワークである「@fedify」と、1人用フェディバースのマイクロブログである「@hollo」の作成者でもあります。

私は東アジア言語（いわゆるCJK）とUnicodeにも興味が多いです。日本語、英語、韓国語で話しかけてください。（または、漢文でも！）

#自己紹介 #CJK #Unicode #日本語 #英語 #韓国語

洪民憙 (Hong Minhee)

@hongminhee@fosstodon.org

Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.

I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, and @hollo, a fediverse microblog for single users.

#introduction

Chunshek

@chunshek@prettyaweso.me

#Introduction post for my own Mastodon instance!

• I’m a 44-year-old jack-of-all-trades.
• I grew up in #HongKong, lived in the #US. My partner of 14 years and I moved to #Taiwan in 2020.
• We are “parents” to one remaining dog.
• I speak 6 #languages, and have dabbled in many others.
• Things I will nerd out about: #Unicode, #typography, #typhoons.
• I am a person of faith, but not a fan of organized religions.
• I type in #Dvorak.
• I curate pop music at @soniccruise.

ALT text details

A man happily holding a ripe yellow pineapple in his left hand, while pointing at the pineapple with his right hand, smiling at the camera.

ALT text details

A man standing in front of a wall covered in dozens of containers of various types of instant ramen and udon noodles. The man's facial expression shows amusement.

ALT text details

A man kneels down next to two tilted mailboxes in Taipei, Taiwan, pretending to be carrying one of the mailboxes on his back.

ALT text details

A top-down shot of a man lying down, looking into the eyes of a shiba inu dog. The dog has curled up into a resting position.

洪民憙 (Hong Minhee)

@hongminhee@fosstodon.org · Reply to 洪民憙 (Hong Minhee)'s post

If you believe that Chinese characters in #Chinese, #Korean, and #Japanese should all be divided into language-specific codes, then it is logical that the Latin characters in English, French, Italian, and German should all be divided into language-specific codes as well. Caveat: I don't believe so.

#Unicode

洪民憙 (Hong Minhee)

@hongminhee@fosstodon.org

Well, I vote for Han unification of #Unicode, and I rather think that more Chinese characters should have been unified (e.g., 高 & 髙, 產 & 産, 內 & 内). 🤷

#漢字 #hanzi #hanja #kanji

▇ ▃ ▃ ▇ ▇ ▄ ▃ ▆ [(-.-)]

@liilliil@im-in.space

Offering a new #FediverseSymbol: ꙮ

The previously suggested symbol ⁂ is good for depict group and unity, but is poor in terms of associations: “3 snowflakes”.

Not only does the symbol beautifully reflect the unity of the fediverse, but it also shows an all-seeing open-minded wise and powerful being (Ezekiel 1:18, 10:12 etc)

also: https://social.hackerspace.pl/@q3k/110446350216259023

xChaos

@xChaos@f.cz

Nebaví vás googlit unicode znaky pro subscript a superscript? Mě už taky ne :-)

Akordy pro psaní horního a dolního indexu (ve smyslu Unicode) na klávesnici Windows se dají snadno vygooglit. Pod Linuxem je to ovšem trochu věda:

1) nejdřív Pravý alt + pravý shift + backspace + 2 (ano, čtyřhmat)
2) potom znak, který má být dolní index, třeba číslovka (což ovšem na české klávesnici, na kterou jste přepnutí, taky s shiftem, takže dvouhmat).

H₂O

Pro horní index ve stejném čtyřhmatu akorát nahradíte tu dvojku trojkou:

a² + b² = c²

Slušné akordy, ne? problém je, že pokud čtyřhmat nedomáčknete přesně (?) tak ten Backspace má tendenci fungovat jako backspace, takže umaže jeden znak... no zkrátka, dělám to pokaždé na několikátý pokus, zatím :-)

Vůbec jsem nepochopil návod
https://www.abclinuxu.cz/blog/kenyho_stesky/2020/8/psani-hornich-a-dolnich-indexu-pres-compose-key
... asi proto, že nevím, která PC klávesa je "compose key", ale v komentářích čtenářů jsem si všiml návodu pro slovenskou klávesnici a funguje mi i pro český layout a tak to předávám dál.

#tipy #x11 #linux #czech #unicode

SnoopJ

@SnoopJ@hachyderm.io

the most important part of #Unicode history is when a mouse fell out of a light fixture and got added to the count of members present at a Technical Committee meeting (9 Nov 2016)

https://www.unicode.org/L2/L2016/16325.htm#149-A94

ALT text details

Nemo_bis 🌈

@nemobis@mamot.fr

Re-#introduction: recurring topics here.

#Wikimedia #Wikidata #Wikipedia #MediaWiki #OpenStreetMap #Wikimania #Wikisource #WikiCite #OpenRefine #wiki #Wiktionary #WikiLovesMonuments #Wikibase #Wikiquote

#i18n #L10n #translatewiki.net #Unicode #CLDR #languages

#Copyright #PublicDomain #PubblicoDominio #Copyleft #CreativeCommons #OpenData #UploadFilters #LicenzaLibera #DatiAperti

#InternetArchive #books #biblioteche #library #Koha #KohaILS #GLAM

#WikiTeam #digipres #ArchiveTeam #XSLT

1/4

Michel Mariani

@mikaeru@mastodon.social · Reply to Design Brouhaha's post

@MoritzBrouhaha

Je viens tout juste d'acquérir les cinq premiers numéros d’Unicode à Gogo ! Tous disponibles à la boutique du Musée de l'Imprimerie et de la Communication graphique.

Excellent ! 💮

#Unicode

ALT text details

Les cinq premiers numéros d’Unicode à Gogo !

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Ægypta is a developer-oriented set of #Unicode utilities related to Egyptian hieroglyphs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta

#characters #codecharts #codepoints #desktopapplication #egyptian #electronjs #glyphs #hieroglyph #hieroglyphs #javascript #localfonts #unicode #unicopedia #unikemet

ALT text details

Unicopedia Ægypta Social Preview

Matthias Wiesmann

@thias@mastodon.social

Treasure Hunt – Braille Hints

#3dprinting #braille #unicode #python

https://wiesmann.codiferes.net/wordpress/archives/37764

Michel Mariani

@mikaeru@mastodon.social

Beautifully crafted BabelStone Han font, by Andrew West 魏安

🔗 https://www.babelstone.co.uk/Fonts/Han.html

ALT text details

Repeated: 龙 U+9F99 U+31342 U+2EE5D

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Plus is a developer-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-plus

ALT text details

Unicopedia Plus Social Preview

Michel Mariani

@mikaeru@mastodon.social

Unicopedia Sinica is a developer-oriented set of #Unicode utilities related to ideographs, wrapped into one single app, built with #Electron.

Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica

ALT text details

Unicopedia Sinica Social Preview

꧁ᐊ𰻞ᵕ̣̣̣̣̣̣́́♛ᵕ̣̣̣̣̣̣́́𰻞ᐅ꧂

@idontlikenames@mastodon.gamedev.place

New 2d numeral system just dropped‽‽‽

It's based on ᚛ᚑᚌᚐᚋ᚜ & ☯ & bijective base 6, & works left→right or left←right

#math #unicode #linguistics #pixelart #ui #blackandwhite #design #inspiration #language

Aaron “#e14n pro” Madlon-Kay

@amake@mastodon.social

Newly covered #Unicode code points in #iOS 17.0:

ᜍ᜕ᜟ

My tooling also indicated that these are covered, but they don't actually show up on my iPhone:

􀑝

Gerrit Imsieke

@gimsieke@mastodon.cloud

Formatting people’s names correctly in a given context, for a given purpose, is hard. International linguists recently helped update the #Unicode Common Locale Data Repository (#CLDR). It will help programmers display person names correctly in many settings.
Mike McKenna wrote about it in “A Story Teller’s Case Study: Unlocking the Power of CLDR Person Name Formatting – A Solution for Formatting Names in a Globalized World” https://www.unicode.org/media/CLDR_Person_Name_White_Paper_June%202023.pdf