
Stephan Michels
@michels@mastodon.social
That was new to me. You can combine any character with COMBINING ENCLOSING KEYCAP (U+20E3) to get a character for a keyboard shortcut.
#Unicode
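A minimal Python sketch of the combination described above. The helper name `keycap` is made up for illustration, and how the result actually renders depends on the font in use.

```python
import unicodedata

# U+20E3 COMBINING ENCLOSING KEYCAP is an enclosing mark (general category "Me")
KEYCAP = "\u20e3"

def keycap(ch: str) -> str:
    """Return ch followed by the combining keycap mark (hypothetical helper)."""
    return ch + KEYCAP

# Show each combined string together with the code points it is made of
for ch in "A⌘1":
    combined = keycap(ch)
    print(combined, [f"U+{ord(c):04X}" for c in combined])

print(unicodedata.name(KEYCAP))
```

Note that the emoji keycaps additionally place U+FE0F (VARIATION SELECTOR-16) between the base character and U+20E3; the sketch leaves that out.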
@rnd@toot.cat
one thing i don't understand at all is why #unicode is specifically set up so codepoints larger than U+10FFFF are treated as invalid, not even "reserved for future use"
are we completely sure that we NEVER end up needing more than 1114112 codepoints? sure, right now we're at 159801, less than 15%, but who knows what will happen in the future
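For context, a small sketch of where the U+10FFFF ceiling comes from: it is the highest value a UTF-16 surrogate pair can address (1,024 high surrogates times 1,024 low surrogates, on top of the 65,536 code points of the Basic Multilingual Plane). The arithmetic below only illustrates that; it says nothing about whether the limit will ever need to be raised.

```python
# 17 planes of 65,536 code points each
PLANES = 17
PLANE_SIZE = 0x10000
total = PLANES * PLANE_SIZE

# UTF-16 reach: the BMP plus every (high, low) surrogate combination
utf16_reach = 0x10000 + 0x400 * 0x400

assigned = 159_801  # figure quoted in the post above

print(total, utf16_reach, f"{assigned / total:.1%}")

# Python itself refuses code points past the ceiling
try:
    chr(0x110000)
except ValueError as err:
    print("rejected:", err)
```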
@frontenddogma@mas.to
Targeting Specific Characters With CSS Rules, by @Edent:
https://shkspr.mobi/blog/2025/09/targetting-specific-characters-with-css-rules/
@hongminhee@hollo.social
Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.
I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.
I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文, #漢文)!
@amake@mastodon.social
Newly covered #Unicode code points in iOS 26.
I have to admit I have not updated anything to 26 yet. At least on Mac I usually wait for #MacPorts issues to be cleared up, but this one might take me a while...
㇀㇁㇂㇃㇄㇅㇆㇇㇈㇉㇊㇋㇌㇍㇎㇏㇐㇑㇒㇓㇔㇕㇖㇗㇘㇙㇚㇛㇜㇝㇞㇟㇠㇡㇢㇣𞓐𞓑𞓒𞓓𞓔𞓕𞓖𞓗𞓘𞓙𞓚𞓛𞓜𞓝𞓞𞓟𞓠𞓡𞓢𞓣𞓤𞓥𞓦𞓧𞓨𞓩𞓪𞓫𞓮𞓯𞓬𞓭𞓰𞓱𞓲𞓳𞓴𞓵𞓶𞓷𞓸𞓹𠁣𠃛𠊎𠖄𠖫𠗻𠘆𠜖𠞩𠞭𠠃𠠝𠠫𠢕𠴭𠺅𠺣𠻞𡌴𡟓𡨞𡳞𡽜𢄧𢎙𢒉𢓜𢛟𢜳𢬳𢯭𢯾𢱤𢲴𢳪𢶀𢺴𢻷𢼌𢼛𢿞𣁳𣍐𣗺𣦼𣩈𣮈𣲩𣸤𣼎𤁢𤊶𤍒𤐙𤐰𤖯𤘅𤞚𤡯𤲍𤶃𤸁𤺅𤺪𤿎𥉔𥌚𥍉𥏘𥐵𥯟𥯥𥰔𥴊𥽕𦃓𦉎𦊓𦒨𦘅𦜆𧉅𧉟𧌄𧜞𧩣𧮙𧰵𧺤𧻴𧿳𨂿𨅔𨒇𨢑𩏠𩑾𩔵𩚨𩛩𩜄𩜇𩜰𩟗𩣳𩨑𩵱𩸙𩼧𪀋𪐞𪖐𪖶𪘒𪜶𪢼𪳕𪹚𫓩𫝏𫝘𫝙𫝞𫝺𫝻𫞭𫞼𫟂𫟊𫟧𫠄𫠛𫣆𫰡𬈜𬏛𬠖𬤐𬦰𬬺𬮤𮀎𮣳𮭦𰣻𰵝𰵞𰵧𰹬𰾫𱂐𱮒𱱿𱳪𲂎
@bortzmeyer@mastodon.gougere.fr
In the Unicode 17 treasure chest: camels and a trombone: https://linuxfr.org/news/dans-le-coffre-aux-tresors-d-unicode-17-des-chameaux-et-un-trombone
@Edent@mastodon.social
Android will *not* be getting most of the Unicode 17 updates.
Some of its fonts are over a decade out of date - and Google refuses to re-use its own Noto font stack.
I've raised the issue at:
https://issuetracker.google.com/issues/366415133
If you're a Googler please ask someone to prioritise this issue. Can everyone else please hit the +1 button.
@triker@mstdn.plus
I just learned how to type unicode letters and dingbats in Linux!
Press Ctrl + Shift + U (all 3 keys at once), then let all three keys go.
Then type in the Unicode code point in hex and press Enter.
https://en.wikipedia.org/wiki/List_of_Unicode_characters
E.g.
Ctrl + Shift + U 2713 is a tick or check mark
✓
Similarly, I can write ñ (n tilde) with:
ctrl + shift + U 00f1
See dingbats block for more check mark choices.
https://en.wikipedia.org/wiki/Dingbats_(Unicode_block)
All of unicode here:
https://home.unicode.org/
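The same hex-to-character lookup can be sketched in Python; `from_hex` is a made-up helper name, and the two codes are the ones from the post.

```python
def from_hex(code: str) -> str:
    """Turn a hex code point string such as '2713' into its character."""
    return chr(int(code, 16))

print(from_hex("2713"))  # the check mark from the post
print(from_hex("00f1"))  # n with tilde
```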
@crickxson@post.lurk.org
Each time I use https://shapecatcher.com
I'm grateful to #BenjaminMilde for having built it and for keeping it running.
"You know what some #character looks like, but you've forgotten its name or its #Unicode code point. Now what do you do? #Shapecatcher is a new website, that helps you to find specific Unicode characters, just by #sketching their shape. Currently about 10000 of the most important Unicode characters are compared to your sketch and are analysed for similarities.
Under the hood, Shapecatcher uses so called "#shape contexts" to find similarities between two shapes. Shape contexts, a robust mathematical way of describing the concept of similarity between shapes, is a feature descriptor first proposed by #SergeBelongie and #JitendraMalik."
@xfq@w3c.social
Today marks 37 years since Joe Becker's landmark "Unicode 88" document!
@timbray@cosocial.ca
Three small announcements:
1. RFC 9839, a guide to which Unicode characters you should never use: https://www.rfc-editor.org/rfc/rfc9839.html
2. Blog piece with background and context, “RFC 9839 and Bad Unicode”: https://www.tbray.org/ongoing/When/202x/2025/08/14/RFC9839
3. A little Go library that implements 9839’s exclusion subsets: https://github.com/timbray/RFC9839
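As a rough illustration of the kinds of code points the RFC is concerned with, here is a Python sketch. It is not the Go library's API and not the RFC's normative definition: the function name and the exact ranges reflect one reading of the three problem classes (surrogates, legacy controls, noncharacters), so check the RFC itself before relying on it.

```python
def is_problematic(cp: int) -> bool:
    """Rough check for code points in the classes RFC 9839 warns about.

    Assumption: ranges follow a reading of the RFC's problem classes,
    not its authoritative subset definitions.
    """
    if 0xD800 <= cp <= 0xDFFF:                        # surrogates
        return True
    if cp < 0x20 and cp not in (0x09, 0x0A, 0x0D):    # C0 controls except tab, LF, CR
        return True
    if 0x7F <= cp <= 0x9F:                            # DEL and the C1 controls
        return True
    if 0xFDD0 <= cp <= 0xFDEF:                        # noncharacter block
        return True
    if (cp & 0xFFFE) == 0xFFFE:                       # last two code points of each plane
        return True
    return False

sample = "ok\x00\ufffe\n"
print([f"U+{ord(c):04X}" for c in sample if is_problematic(ord(c))])
```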
@idontlikenames@mastodon.gamedev.place
american: OwO
cyrillic: ꙮшꙮ
armenian: ՕաՕ
georgian: ტოტ ႣⴍႣ
gothic: 𐍈𐌸𐍈
greek: ΘωΘ ΩωΩ ΦωΦ ΟωΟ
coptic: ⲐⲱⲐ ⲪⲱⲪ ⲞⲱⲞ
hebrew: סשס
ge'ez: ዐሠዐ
chinese: 口山口
inuktitut: ᑭᓚᓗᑫ ᐁᓚᓗᐁ
vai: ꖘꕀꖘ ꖴꕀꖴ
khmer: ឰឃឰ ២ឃ២ ៙ឃ៙
sinhala: ඞ෴ඞ ට෴ට මයම
tibetan: ༠ྻ ༠ ༠ྏ ༠
jap: ᶘᵒᴥᵒᶅ
#owo #protoworld #linguistics #language #unicode #writing #kaomoji
@jdlh@mstdn.ca
My quest at #fedicon2025 is to find #Fediverse services and handles with non-Latin characters. Can you link me to examples?
I hear there are many #Japan ese people active in Fediverse, but all the examples I see have only Latin script. #Unicode #Fedicon #Mastodon #UniversalAcceptance
@mikaeru@mastodon.social
Beautifully crafted BabelStone Han font, by Andrew West 魏安
#BabelStone Han v. 15.1.3 is a free #Unicode #CJK #font with over 57,000 Han characters (#hanzi, #kanji, #hanja), and 62,061 Unicode characters in total. It is a Song/Ming style (宋体/明體) font, with glyphs modelled on the official character forms used in the People's Republic of China, and is primarily intended for writing Modern Standard #Chinese, Classical Chinese, and various Sinitic languages and dialects.
@mikaeru@mastodon.social
New in the CJK Variations utility of Unicopedia Sinica:
- Support for the latest Ideographic Variation Database (IVD 2025), adding the new CAAPH Collection.
- Support for the updated BabelStone Collection (unregistered), based on the latest BabelStone Han font (v17.0.0 BETA), by Andrew C. West (魏安), 1960-2025 RIP (安息吧).
🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica
#Unicopedia #Unicode #Unihan #CJK #IdeographicVariationDatabase #IVD #CAAPH #BabelStone
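For readers new to the IVD: an ideographic variation sequence is a base ideograph followed by one of the variation selectors VS17 to VS256 (U+E0100 to U+E01EF). The sketch below only builds such a sequence; the helper name and the sample character are illustrative assumptions, and whether a given sequence is actually registered has to be checked against the IVD data, which this code does not do.

```python
def variation_sequence(base: str, vs_number: int) -> str:
    """Append variation selector VS17..VS256 (U+E0100..U+E01EF) to base."""
    if not 17 <= vs_number <= 256:
        raise ValueError("ideographic variation selectors are VS17 through VS256")
    return base + chr(0xE0100 + (vs_number - 17))

# Sample base character chosen for illustration only
seq = variation_sequence("葛", 17)
print([f"U+{ord(c):04X}" for c in seq])
```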
@Timwi@nerdculture.de
I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous
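A short Python sketch for trying these out: it maps ASCII digits onto the range U+1FBF0 to U+1FBF9 mentioned above. The function name is made up, and a font covering the range (such as the one named in the post) is needed to see the result.

```python
SEGMENTED_ZERO = 0x1FBF0  # first code point of the range quoted in the post

def to_segmented(text: str) -> str:
    """Replace ASCII digits with the segmented digits U+1FBF0..U+1FBF9."""
    return "".join(
        chr(SEGMENTED_ZERO + int(ch)) if ch in "0123456789" else ch
        for ch in text
    )

print(to_segmented("12:34"))
```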
@ParisWeb@mamot.fr
With @MoritzBrouhaha, discover the history of the Unicode computing standard, used by everyone around the globe in our daily communications.
https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode
@paulmelis@social.edu.nl
The recycling symbol ♻ in a git branch name, what a time to be alive 😎
Also, nice of #github to warn about possibly hidden characters, but not sure it applies in this case
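How GitHub decides when to show that warning is not stated here. As one plausible approach, and purely as an assumption, a check could look for format characters (general category Cf, which covers zero-width and bidirectional controls). Under that assumption a visible symbol like ♻ would not be flagged; the branch names below are made up.

```python
import unicodedata

def hidden_characters(name: str) -> list[str]:
    """List format characters (category Cf), e.g. zero-width or bidi controls."""
    return [f"U+{ord(c):04X}" for c in name if unicodedata.category(c) == "Cf"]

print(hidden_characters("feature/♻-cleanup"))        # visible symbol only
print(hidden_characters("feature/\u200bcleanup"))    # contains ZERO WIDTH SPACE
```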
@mikaeru@mastodon.social · Reply to Michel Mariani's post
No Electron support for the latest Unicode version is a major hindrance for my open-source Unicopedia Plus application, which I have to keep in Beta version for a long time because of that...
@headword@lingo.lol · Reply to Unicode Watch 🔍's post
Interesting to see letters like ,
, and
proposed for inclusion in Unicode!
#EnglishPhonotypicAlphabet #PhonotypicAlphabet #Phonotypic #Dania #Phonetic #Phonetics #PhoneticTranscription #Unicode
@mikaeru@mastodon.social
The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.
Current and future IRG source prefixes used to be listed in the main IRG homepage, but are now available in a separate dedicated page:
@michels@mastodon.social
I added typographic guides to my Unicode viewer. I first tried the new TextRenderer, but found it too limited. I then switched back to CoreText. However, I then noticed that SwiftUI was cutting off some parts of the glyphs. It seems that they don’t expect the glyphs to extend beyond their bounding box.
@mikaeru@mastodon.social
Apart from the issue of line formatting of plain text in the new Unicode contact form <https://support.unicode.org/osticket/open.php>, it appears that some pretty innocuous characters such as the vertical bar | or the degree sign ° are getting stripped out from the latest reports, in <https://www.unicode.org/review/pri526/> for instance.
Ironically enough, it seems that the Unicode contact form is not Unicode-conformant/compliant then. Maybe some kind of "Make ASCII Great Again" thing?
@mikaeru@mastodon.social · Reply to Michel Mariani's post
Unicode's new contact form at <https://support.unicode.org/osticket/open.php> is apparently an HTML editor "in disguise"; the only way I found to force it to keep the formatting of my plain text messages was to select the HTML mode and paste the text inside a <pre></pre> tag...
Still, some content gets unexpectedly stripped out after submission of the report, like text between "<" and ">".
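One possible workaround, sketched in Python, is to escape the text before pasting it into the HTML mode so that "<" and ">" arrive as entities. Whether the form preserves those entities after submission is an untested assumption; the helper name and sample text are made up.

```python
import html

def as_pre_block(plain_text: str) -> str:
    """Escape &, < and > and wrap the result in <pre> for an HTML editor."""
    return "<pre>" + html.escape(plain_text, quote=False) + "</pre>"

print(as_pre_block("Range <U+4E00..U+9FFF> | 25 °C"))
```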
@mikaeru@mastodon.social
New utilities in Unicopedia Ægypta:
- Hieroglyph Picture Book
- Hieroglyph Taxonomy
🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta
#unicopedia #egyptian #hieroglyphs #taxonomy #picturebook #javascript #desktopapplication #electronjs #unicode
@mikaeru@mastodon.social
In case my feedback to the UTC gets garbled once again, here are the links to the plain text messages I attempted to submit through copy-paste from their new contact page <https://support.unicode.org/osticket/open.php>: no truly WYSIWYG editor, no basic preview mode either...
https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-19.txt
https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-18.txt
https://tonton-pixel.codeberg.page/PRI-519-Feedback-2025-05-13.txt
I'm dreaming of a simple world without technology wanting to "help" us so much. We shouldn't have to struggle to achieve simple tasks...
@mikaeru@mastodon.social
From time to time (since this represents a tremendous amount of translation/adaptation work), a French version of the "code charts" gets published by the Unicode Consortium: the latest one is for Unicode 16.0:
https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf
This is especially useful for French speakers in #Canada, #France, #Belgium, #Switzerland, etc. but may soon be obsolete for #Quebec, in case it gets "absorbed" by a neighboring country whose official language is now English only...
@mikaeru@mastodon.social
From time to time (this represents an enormous amount of adaptation work), a French version of the "code charts" is published by the Unicode Consortium; the latest one is for Unicode 16.0:
https://www.unicode.org/Public/16.0.0/charts/fr/CodeCharts.pdf
Unfortunately, it may soon become obsolete for the French speakers of the "belle province" of Quebec, should it be "absorbed" by a neighboring country whose official language is now English only...
@SteveFaulkner@mastodon.social
👁️short note on emoji text alternative variations
"Unicode symbols do not have inbuilt text alternatives. They are exposed in the browser accessibility tree as a text symbol"
#emoji #screenreaders #a11y #unicode #webDev
https://html5accessibility.com/stuff/2022/01/17/short-note-on-emoji-text-alternative-variations/
@mikaeru@mastodon.social
Unicopedia Anatolica is a developer-oriented set of #Unicode utilities related to Anatolian hieroglyphs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-anatolica
#anatolian #hieroglyphs #unicopedia #javascript #unicode #characters #codepoints #codecharts #desktopapplication #electronjs #glyphs #localfonts
@mikaeru@mastodon.social
Considerations about Egyptian Hieroglyph legacy characters, by Michel Suignard, proposing to add a new kEH_AltMapping property to the Unikemet database (UAX#57):
@mikaeru@mastodon.social
Unicopedia Ægypta is a developer-oriented set of #Unicode utilities related to Egyptian hieroglyphs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta
#characters #codecharts #codepoints #desktopapplication #egyptian #electronjs #glyphs #hieroglyph #hieroglyphs #javascript #localfonts #unicode #unicopedia #unikemet
@mikaeru@mastodon.social
Unicopedia Plus is a developer-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-plus
#characters #chinese #cjk #codepoints #desktopapplication #electronjs #emoji #ivd #japanese #javascript #kangxi #kangxiradicals #korean #normalization #opensource #regex #segmentation #strokecount #unicode #unicopedia #unihan #unikemet
@mikaeru@mastodon.social
Unicopedia Sinica is a developer-oriented set of #Unicode utilities related to ideographs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica
#characters #chinese #cjk #cjkrelated #cjkv #codecharts #codepoints #components #confusables #desktopapplication #electronjs #glyphs #ideographs #ideographicdescriptionsequences #ids #japanese #javascript #kangxi #kangxiradicals #korean #localfonts #opensource #strokes #tangut #unicode #unicopedia #unihan #vietnamese
@mikaeru@mastodon.social
U+2640 FEMALE SIGN
U+2642 MALE SIGN
U+26A2 DOUBLED FEMALE SIGN
U+26A3 DOUBLED MALE SIGN
U+26A4 INTERLOCKED FEMALE AND MALE SIGN
U+26A5 MALE AND FEMALE SIGN
U+26A6 MALE WITH STROKE SIGN
U+26A7 MALE WITH STROKE AND MALE AND FEMALE SIGN
U+26A8 VERTICAL MALE WITH STROKE SIGN
U+26A9 HORIZONTAL MALE WITH STROKE SIGN
U+26B2 NEUTER
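The list above can be reproduced from Python's bundled Unicode database; `describe` is a made-up helper name.

```python
import unicodedata

def describe(cp: int) -> str:
    """Format a code point as 'U+XXXX NAME' using the unicodedata module."""
    return f"U+{cp:04X} {unicodedata.name(chr(cp))}"

# The same code points as in the post: two singles, one run, one single
for cp in (0x2640, 0x2642, *range(0x26A2, 0x26AA), 0x26B2):
    print(describe(cp), chr(cp))
```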
@mikaeru@mastodon.social
#Unicode #Emoji: #Hearts #Galore
U+2764 U+FE0F U+1FA77 U+1F9E1 U+1F49B U+1F49A U+1F499 U+1FA75 U+1F49C U+1F90E U+1F5A4 U+1FA76 U+1F90D
U+1F49F U+2764 U+FE0F U+200D U+1F525 U+1F494 U+2764 U+FE0F U+200D U+1FA79 U+2763 U+FE0F U+1F498 U+1F493 U+1F497 U+1F496 U+1F49D U+1F495 U+1F49E
U+1F970 U+1F60D U+1F618 U+1F63B U+1F48C U+1FAF6 U+1FAF6 U+1F3FB U+1FAF6 U+1F3FC U+1FAF6 U+1F3FD U+1FAF6 U+1F3FE U+1FAF6 U+1F3FF U+1FAC0
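To see the actual emoji, the U+ notation can be turned back into a string. A minimal Python sketch follows; the function name is made up. Since the notation carries no grouping, multi-code-point sequences (those with U+FE0F, U+200D, or a skin-tone modifier) simply come out as adjacent code points, and the renderer joins them.

```python
import re

def from_u_notation(text: str) -> str:
    """Convert 'U+2764 U+FE0F' style notation into the actual string."""
    hex_codes = re.findall(r"U\+([0-9A-Fa-f]{4,6})", text)
    return "".join(chr(int(code, 16)) for code in hex_codes)

# One of the sequences from the post above
print(from_u_notation("U+2764 U+FE0F U+200D U+1F525"))
```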
@mikaeru@mastodon.social
U+1F473 U+1F473 U+1F3FB U+1F473 U+1F3FC U+1F473 U+1F3FD U+1F473 U+1F3FE U+1F473 U+1F3FF
U+1F478 U+1F478 U+1F3FB U+1F478 U+1F3FC U+1F478 U+1F3FD U+1F478 U+1F3FE U+1F478 U+1F3FF
@mikaeru@mastodon.social
#Unicode #Emoji: #Math #Geekiness
<U+1F605> <U+1F4A7> <U+1F604>
@liilliil@im-in.space
Offering a new #FediverseSymbol: ꙮ
The previously suggested symbol ⁂ is good for depicting groups and unity, but is poor in terms of associations: “3 snowflakes”.
Polish fediusers have noticed a fragment of an old Russian manuscript that speaks of the ‘many-eyed seraphim’ (серафим многоокий). An unknown 15th-century monk played with the combination of the letters oo, turning them into a multi-eyed creature. The character is found in only one manuscript, but despite this, it has been added to #Unicode.
Not only does the symbol beautifully reflect the unity of the fediverse, but it also shows an all-seeing open-minded wise and powerful being (Ezekiel 1:18, 10:12 etc)
@achadwick@urbanists.social
Hey, fedi #Unicode nerds!
#OpenStreetMap's Andy Mabbett (@Pigsonthewing) is asking whether anyone knows about any instances of the #OrdnanceSurvey's bench mark symbol appearing in actual print, on a page. Looks a bit like ⭱ or ⤒ but a broader arrow. Usually found carved on stone or brick all over the UK/ROI.
Their goal is to propose it as a Unicode symbol! https://community.openstreetmap.org/t/os-bench-mark-symbol-in-printed-documents/128182
Any known international usage of this symbol would doubtless be appreciated too
@mikaeru@mastodon.social · Reply to Michel Mariani's post
Today (April Fools' Day), Adobe is apparently back on the list of full members (voting) of the Unicode Consortium, but for how long this time: one full year?
« It goes away and it comes back
It's made of tiny little nothings
You sing it and you dance it
And it comes back, it sticks with you
Like a popular song »
Full members (voting) of the Unicode Consortium: Adobe, Airbnb, Amazon, Apple, Google, Meta, Microsoft, Salesforce, Translated.
@SnoopJ@hachyderm.io
the most important part of #Unicode history is when a mouse fell out of a light fixture and got added to the count of members present at a Technical Committee meeting (9 Nov 2016)
@Edent@mastodon.social
Which is your favourite #Unicode telephone?
Option | Voters |
---|---|
🕾 | 1 (1%) |
🕿 | 5 (7%) |
☏ | 18 (27%) |
☎ | 43 (64%) |
@sibaku@mas.to
Found out something interesting/annoying related to #unicode! There is an issue with the character 浅. You might see it one of two ways (see screenshots) depending on which font you use, which was the cause of my confusion. One form has 2 and the other 3 horizontal strokes. So why is that?
@mikaeru@mastodon.social
The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.
The IRG homepage now includes comprehensive lists of current and future IRG source prefixes...
@yngvem@fosstodon.org
It's happening, @marieroald and I are doing our third #PyConUS, this time with a tutorial on Packaging with uv and a talk about #Unicode in #Python!
@doctormo@floss.social
It might have taken an ungodly amount of time. But getting these corner cases right in this PDF export is going to mean the world to a lot of people.
Arabic and Hebrew, and not messing up the glyphs.
#inkscape #pdf #cmyk #arabic #language #unicode #text #glyphs #hewbrew
@jake4480@c.im
Some Pac-Man and other alien space invadery type symbols now in Unicode, via this Symbols for Legacy Computing Supplement: https://unicode.org/charts//PDF/Unicode-16.0/U160-1CC00.pdf
@phrawzty@hachyderm.io
Today I learned that there is a specific #unicode "record separator" symbol, formally known as "U+001E Information Separator Two".
It is meant to be used to indicate a separation between two units of information. An example of where this could be used is in a separated-value file, e.g. a CSV, but using this symbol instead of a comma.
This is interesting because there are vanishingly few instances where the record separator symbol would appear in most contexts, but many instances where a comma appears. Using this symbol instead of a comma (or a semi-colon, or an exclamation point, or any one of the usual separators) could make some data hygiene scenarios much more straightforward.
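A minimal Python sketch of the idea above, assuming records are separated by U+001E and, as an extra assumption not mentioned in the post, fields within a record by U+001F (the neighbouring "unit separator" control character). The helper names `dump_records` and `load_records` are hypothetical, and the sketch assumes the data itself never contains either control character.

```python
RECORD_SEP = "\x1e"  # U+001E INFORMATION SEPARATOR TWO (record separator)
UNIT_SEP = "\x1f"    # U+001F INFORMATION SEPARATOR ONE (unit separator), an assumption here


def dump_records(rows):
    """Join fields with U+001F and records with U+001E (hypothetical helper)."""
    return RECORD_SEP.join(UNIT_SEP.join(fields) for fields in rows)


def load_records(blob):
    """Split a blob produced by dump_records back into rows of fields."""
    if not blob:
        return []
    return [record.split(UNIT_SEP) for record in blob.split(RECORD_SEP)]


# Commas, semicolons and exclamation points inside fields need no quoting or escaping.
rows = [["Smith, John", "Hello, world!"], ["Doe; Jane", "a,b,c"]]
blob = dump_records(rows)
print(load_records(blob) == rows)
```

The trade-off is that most editors and spreadsheet tools do not display or type these control characters conveniently, so this suits machine-to-machine exchange better than hand-edited files.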
@xtaran@chaos.social
#UserAgent based banning of #textmode browsers is sooooo lame.
$ lynx -useragent=🖕 https://[…]
@thias@mastodon.social
Treasure Hunt – Braille Hints
So I prepared a treasure hunt for my older daughter, which involved some form of coded message. I found a braille table I could 3D-print. Using a real system instead of some made-up code gave me the opportunity to explain how and why it is used in reality: you find braille codes in lifts and on staircase handrails.
@ausir@meowr.me
brand new combining diacritics dropping soon in Unicode 17, to be used for transcribing rare historical uses, and even more so for really tryhard conlangs!
@simontatham@hachyderm.io
In the old #ASCII days, you could change a letter between upper and lower case by XORing its character code with 0x20. Of course, if you tried this with anything that wasn't a letter, you'd get nonsense results.
If you try that with #Unicode code points, it sometimes works, and sometimes doesn't. But Unicode can deliver much more impressive nonsense when it doesn't.
A fun example I just found: the "lower-case" version of CAR is NO PEDESTRIANS.
>>> chr(ord('🚗') ^ 0x20)
'🚷'
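A small Python sketch of the trick described above, for anyone who wants to poke at it. The helper name `xor_case` is made up for illustration. It flips bit 0x20 of a single code point and prints the official character names, which also shows that the formal Unicode name of 🚗 (U+1F697) is AUTOMOBILE.

```python
import unicodedata


def xor_case(ch: str) -> str:
    """Flip bit 0x20 of a single code point, as in the old ASCII trick."""
    return chr(ord(ch) ^ 0x20)


# "a" and "\u00e9" (é) happen to work; "\u0101" (ā) and the car emoji do not.
for ch in ["a", "\u00e9", "\u0101", "\U0001F697"]:
    flipped = xor_case(ch)
    print(
        f"U+{ord(ch):04X} {unicodedata.name(ch)} -> "
        f"U+{ord(flipped):04X} {unicodedata.name(flipped, '<unnamed>')}"
    )

# str.upper() / str.lower() use the real Unicode case mappings instead.
print("\u0101".upper() == "\u0100")
```

The XOR works for ASCII and most of Latin-1 because those blocks place upper and lower case exactly 0x20 apart. Elsewhere (for example Latin Extended-A, where case pairs are adjacent code points) it silently produces unrelated characters.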
@fmeerkoetter@mountains.social
Love this book/comic the kids picked up from the library.
@mrdk@mathstodon.xyz · Reply to 0xDE's post
@11011110 At least these symbols have a meaning! But nobody knows what “Angzarr” (⍼) is and why it is in Unicode (https://en.wikipedia.org/wiki/Angzarr).
@revathskumar@fosstodon.org · Reply to Revath S Kumar :javascript:'s post
Wrote a small web utility to visualize the different string normalization forms of a text.
https://string-normalize.surge.sh/?str=I+%e2%99%a5+K%c3%b6ln
Not the best design 😄, but feedback is welcome.
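For readers without a browser at hand, here is a rough Python equivalent of what such a visualizer shows, using the standard `unicodedata` module rather than the utility's own code (which is not shown in the post). The sample text is the one encoded in the URL above.

```python
import unicodedata

text = "I \u2665 K\u00f6ln"  # "I ♥ Köln", decoded from the URL's str parameter

for form in ("NFC", "NFD", "NFKC", "NFKD"):
    normalized = unicodedata.normalize(form, text)
    codepoints = " ".join(f"U+{ord(ch):04X}" for ch in normalized)
    print(f"{form}: {len(normalized)} code points: {codepoints}")
```

In the decomposed forms (NFD, NFKD) the ö becomes a plain o followed by U+0308 COMBINING DIAERESIS, so the string grows by one code point while looking the same on screen.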
@mikaeru@mastodon.social
New utility in Unicopedia Sinica:
- Pan-CJK Font Variants
(port from Unicopedia Plus, with Serif/明朝体 font style instead of Sans-Serif/ゴシック体)
@mikaeru@mastodon.social
New utility in Unicopedia Plus:
- Unihan Phonetics
@SnoopJ@hachyderm.io
have you ever "naturally" (i.e. not discussion among #Unicode experts) encountered a font that correctly renders ꙮ?
Option | Voters |
---|---|
yes | 0 (0%) |
no | 0 (0%) |
what the hell are you talking about | 0 (0%) |
@revathskumar@fosstodon.org
New blog post : "JavaScript : understanding string normalize"
https://blog.revathskumar.com/2025/01/javascript-understanding-string-normalize.html
@qiita@rss-mstdn.studiofreesia.com
[Happy New Year] New Year's greetings from Oracle Active Data Guard instances deployed around the world
https://qiita.com/shirok/items/1da55c23b33c5228049a?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
@ptmcg@fosstodon.org · Reply to Axel Rauschmayer's post
@rauschma Ah! I did something similar in Python - this is valid Python code:
def ℎ𝕖𝐥l𝙤():
    try:
        ℎ𝙚𝕝𝗹𝘰_ = "Hello"
        w𝔬𝓇ˡ𝚍﹎ = "World"
        𝖕𝘳𝒊𝖓𝑡(f"{𝗵𝒆𝘭𝓵𝚘﹍}, {𝑤º𝘳l𝑑︴}!")
    except T𝗒ₚ𝕖E𝗿𝗋𝗈𝓻 as ᵉ𝒙ⅽ:
        𝐩ᵣ𝚒𝖓𝓉("failed: {}".𝕗𝕠r𝑚𝖺𝘵(ⅇ𝔵𝚌))
if _︳n𝗮𝖒𝓮﹍︳ == "__main__":
    h𝙚ⅼ𝐥𝕠()
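Why this parses: Python applies NFKC normalization to identifiers while reading source code (PEP 3131), so the styled letters collapse to their plain ASCII counterparts. The sketch below illustrates that with a shorter, made-up identifier built from escapes. The names `nfkc`, `fancy` and `namespace` are only for this example.

```python
import unicodedata


def nfkc(name: str) -> str:
    """Return the NFKC form, which is how Python compares identifiers."""
    return unicodedata.normalize("NFKC", name)


# U+210E PLANCK CONSTANT + U+1D556 MATHEMATICAL DOUBLE-STRUCK SMALL E + "llo"
fancy = "\u210e\U0001d556llo"

# Define a function under the styled name, then look it up under the plain one.
namespace = {}
exec(f"def {fancy}():\n    return 'Hello'", namespace)

print(nfkc(fancy))
print("hello" in namespace)
print(namespace["hello"]())
```

Only identifiers are normalized this way. String literals keep their exact code points, which is why the `"__main__"` literal in the post has to be written in plain ASCII.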
@vwbusguy@mastodon.online
"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."
Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"
Senior level dev: "Oh, brilliant - a test of tolerance for pain by evaluating various code point chains with emoji, accents, and LTR/RTL markers. I'll start by writing some tests for 8-bit ord and char conversions with lookahead evals."
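To make the joke concrete, here is a hedged Python sketch comparing three notions of "length". The function `approx_grapheme_count` is a deliberately crude approximation invented for this example: it skips combining marks and treats anything glued by ZERO WIDTH JOINER as one unit. It is not the UAX #29 grapheme cluster algorithm and ignores flags, emoji skin-tone modifiers, Hangul jamo and more.

```python
import unicodedata

ZWJ = "\u200d"  # ZERO WIDTH JOINER


def approx_grapheme_count(text: str) -> int:
    """Very rough user-perceived length; NOT a UAX #29 implementation."""
    count = 0
    after_zwj = False
    for ch in text:
        if ch == ZWJ:
            after_zwj = True          # the next base character joins the current cluster
        elif unicodedata.category(ch).startswith("M"):
            continue                  # combining marks attach to the previous character
        elif after_zwj:
            after_zwj = False         # joined character: already counted as part of a cluster
        else:
            count += 1
    return count


samples = {
    "plain": "abc",
    "decomposed e-acute": "e\u0301",
    "family emoji": "\U0001F468\u200d\U0001F469\u200d\U0001F467",
}

for label, s in samples.items():
    print(label, len(s), len(s.encode("utf-8")), approx_grapheme_count(s))
```

`len()` counts code points and the encoded length counts bytes, and neither matches what a reader would call one character. For real work, a library that implements Unicode text segmentation is the safer choice.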
@siljelb@snabelen.no
TIL that a proposal was made in 1997 to add #tengwar to #unicode. I'm disappointed it hasn't been made official yet though. Here's a link to the proposal document: https://www.unicode.org/wg2/docs/n1641.pdf #Tolkien #LordOfTheRings
@omgubuntu@floss.social
Ubuntu LTS users will shortly be able to see and use the 8 new emoji included in Unicode 16.0.
https://www.omgubuntu.co.uk/2024/12/ubuntu-update-support-for-emoji-16-0
@mikaeru@mastodon.social
In the open-source application `Unicopedia Sinica`, both data files used for the `CJK Components` and the `CJK Related` utilities are now in a consistent JSON format with MIT license: `cjk-ids.json` and `cjk-related.json` respectively.
@SnoopJ@hachyderm.io
HUH, #Unicode UAX#31 offers official guidance on hashtag identifiers, and I have somehow managed to miss that completely for several years (introduced along with Unicode 11.0 in 2018).
https://www.unicode.org/reports/tr31/#hashtag_identifiers
It's not like I re-read the whole document regularly or anything but yea huh
@eniko@peoplemaking.games
Btw here's a little #gamedev unicode protip: unicode defines several character ranges as private use areas. You can map code points in these ranges to whatever glyph you want. This can be very handy for custom characters in your game that won't conflict with established unicode characters
In our games we use the PUA for keyboard and controller button glyphs
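A minimal sketch of how that might look in Python. The three private use ranges are the ones defined by Unicode; the mapping `BUTTON_GLYPHS` and the function `render_prompt` are hypothetical and not taken from the games mentioned. The game font would have to supply glyphs at the chosen code points.

```python
import unicodedata

# The three private use ranges defined by the Unicode Standard.
PUA_RANGES = [(0xE000, 0xF8FF), (0xF0000, 0xFFFFD), (0x100000, 0x10FFFD)]


def is_private_use(ch: str) -> bool:
    """True if the single character ch lies in a private use area."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in PUA_RANGES)


# Hypothetical assignment of button names to PUA code points.
BUTTON_GLYPHS = {
    "button_a": "\ue000",
    "button_b": "\ue001",
    "key_escape": "\ue002",
}


def render_prompt(template: str) -> str:
    """Replace {button_a}-style placeholders with the mapped PUA characters."""
    return template.format(**BUTTON_GLYPHS)


prompt = render_prompt("Press {button_a} to jump, {key_escape} to pause")
print([f"U+{ord(ch):04X}" for ch in prompt if is_private_use(ch)])
print(unicodedata.category("\ue000"))  # general category of a private use character
```

Because PUA code points carry no agreed meaning, text containing them only renders as intended with the matching font, so they are best kept out of anything that leaves the game.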
@emnullfuenf@chaos.social
My study "Unicode Spaces" will be published in Slanted Magazine - Experimental Type 3!
@mro@digitalcourage.social · Reply to zirias (on snac)'s post
@zirias @stefano #hashtags are #unicode defined: https://www.unicode.org/reports/tr31/#D2
read 'em like this https://codeberg.org/seppo/seppo/src/commit/87bf300/lib/tag.ml#L31
@Edent@mastodon.social
iOS 14 gets support for the Unicode Power Symbol!
https://shkspr.mobi/blog/2020/09/ios-14-gets-support-for-the-power-symbol/
@jdlh@mstdn.ca · Reply to Jim DeLaHunt's post
A cool change is that the Core Specification of the Unicode Standard is now released as a static HTML subsite, backed up by an archiveable #PDF of 1,140 pages.
https://unicode.org/versions/Unicode16.0.0/core-spec/
You can now link to specific sections and paragraphs, e.g.
"Unicode is about plain text, see: https://unicode.org/versions/Unicode16.0.0/core-spec/chapter-2/#G642" .
I helped out in a small way with the project to produce the core spec as HTML + PDF. I think it is a marvellous improvement.
@liilliil@mastodon.online
Folks, let's push our own Slavic, Cyrillic #fediverseSymbol!
The "three snowflakes" (⁂) are a potential excuse for endless mockery.
The Polish folks (@brie) found a better candidate: ꙮ, the "many-eyed seraphim" (серафим многꙮкий). The symbol, found in 1928 in only one (!) manuscript and added to #Unicode for that reason alone (!), waited several centuries for its moment.
https://ru.wikipedia.org/wiki/Мультиокулярная_О
(English version https://im-in.space/@liilliil/113028392518272881 )
@amyfou@lingo.lol
I am a #linguist (non-tenure track, uni) interested in every single thing about #languages, esp #Indigenous ones, #academics & #teaching Side gig in #ComunityBased #LanguageTech (#webdev #React #postgres #hasura #graphQL #nodeJS #nginx #linux #podman #kubernetes #docker #unicode lol). I love #animals and will ask you too many questions about your #dogs #cats #horses #sheep #goats #chickens #bunnies #piggies #cows etc . Proud #UglyDogs fan. Love #nature #birds #photography #art 👋
@hongminhee@fosstodon.org · Reply to 洪 民憙 (Hong Minhee)'s post
@hongminhee@fosstodon.org
Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.
I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, and @hollo, a fediverse microblog for single users.
I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文/#漢文)!
@chunshek@prettyaweso.me
#Introduction post for my own Mastodon instance!
• I’m a 44-year-old jack-of-all-trades.
• I grew up in #HongKong, lived in the #US. My partner of 14 years and I moved to #Taiwan in 2020.
• We are “parents” to one remaining dog.
• I speak 6 #languages, and have dabbled in many others.
• Things I will nerd out about: #Unicode, #typography, #typhoons.
• I am a person of faith, but not a fan of organized religions.
• I type in #Dvorak.
• I curate pop music at @soniccruise.
@liilliil@im-in.space
Offering a new #FediverseSymbol: ꙮ
The previously suggested symbol ⁂ is good for depicting groups and unity, but has poor associations: "3 snowflakes".
Polish fediusers noticed a fragment of an old Russian manuscript that speaks of the "many-eyed seraphim" (серафим многоокий). An unknown 15th-century monk played with the combination of the letters oo, turning them into a multi-eyed creature. The character is found in only one manuscript, but despite this, it has been added to #Unicode.
Not only does the symbol beautifully reflect the unity of the fediverse, but it also shows an all-seeing open-minded wise and powerful being (Ezekiel 1:18, 10:12 etc)
@xChaos@f.cz
Tired of googling Unicode characters for subscript and superscript? So am I :-)
Key chords for typing superscripts and subscripts (in the Unicode sense) on a Windows keyboard are easy to google. On Linux, though, it's a bit of a science:
1) first, right Alt + right Shift + Backspace + 2 (yes, a four-key chord)
2) then the character that should become the subscript, for example a digit (which, on the Czech layout you're switched to, also needs Shift, so a two-key chord).
H₂O
For a superscript, use the same four-key chord but replace the 2 with a 3:
a² + b² = c²
Decent chords, right? The problem is that if you don't press the four-key chord precisely (?), the Backspace tends to act as a backspace and deletes a character... in short, so far it takes me several attempts every time :-)
I didn't understand this guide at all:
https://www.abclinuxu.cz/blog/kenyho_stesky/2020/8/psani-hornich-a-dolnich-indexu-pres-compose-key
... probably because I don't know which PC key is the "compose key", but in the reader comments I noticed instructions for the Slovak keyboard that also work for me on the Czech layout, so I'm passing them on.
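If the key chords keep misfiring, the same characters can also be produced programmatically. A small Python sketch, with made-up helper names `to_subscript` and `to_superscript`, that maps ASCII digits to the Unicode subscript and superscript digits:

```python
# Subscript digits are contiguous (U+2080..U+2089); superscript 1, 2, 3 live in Latin-1.
SUBSCRIPT = str.maketrans(
    "0123456789",
    "\u2080\u2081\u2082\u2083\u2084\u2085\u2086\u2087\u2088\u2089",
)
SUPERSCRIPT = str.maketrans(
    "0123456789",
    "\u2070\u00b9\u00b2\u00b3\u2074\u2075\u2076\u2077\u2078\u2079",
)


def to_subscript(text: str) -> str:
    """Replace ASCII digits with subscript digits; other characters pass through."""
    return text.translate(SUBSCRIPT)


def to_superscript(text: str) -> str:
    """Replace ASCII digits with superscript digits; other characters pass through."""
    return text.translate(SUPERSCRIPT)


print("H" + to_subscript("2") + "O")
print(" + ".join(v + to_superscript("2") for v in "ab") + " = c" + to_superscript("2"))
```

Only digits are handled here. Unicode has superscript and subscript forms for just a subset of letters, so a general converter would need a larger and incomplete table.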
@SnoopJ@hachyderm.io
the most important part of #Unicode history is when a mouse fell out of a light fixture and got added to the count of members present at a Technical Committee meeting (9 Nov 2016)
@nemobis@mamot.fr
Re-#introduction: recurring topics here.
#Wikimedia #Wikidata #Wikipedia #MediaWiki #OpenStreetMap #Wikimania #Wikisource #WikiCite #OpenRefine #wiki #Wiktionary #WikiLovesMonuments #Wikibase #Wikiquote
#i18n #L10n #translatewiki.net #Unicode #CLDR #languages
#Copyright #PublicDomain #PubblicoDominio #Copyleft #CreativeCommons #OpenData #UploadFilters #LicenzaLibera #DatiAperti
#InternetArchive #books #biblioteche #library #Koha #KohaILS #GLAM
#WikiTeam #digipres #ArchiveTeam #XSLT
1/4
@mikaeru@mastodon.social · Reply to Design Brouhaha's post
I've just picked up the first five issues of Unicode à Gogo! All available at the shop of the Musée de l'Imprimerie et de la Communication graphique.
Excellent! 💮
@mikaeru@mastodon.social
Unicopedia Ægypta is a developer-oriented set of #Unicode utilities related to Egyptian hieroglyphs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta
#characters #codecharts #codepoints #desktopapplication #egyptian #electronjs #glyphs #hieroglyph #hieroglyphs #javascript #localfonts #unicode #unicopedia #unikemet
@mikaeru@mastodon.social
Beautifully crafted BabelStone Han font, by Andrew West 魏安
#BabelStone Han v. 15.1.3 is a free #Unicode #CJK #font with over 57,000 Han characters (#hanzi, #kanji, #hanja), and 62,061 Unicode characters in total. It is a Song/Ming style (宋体/明體) font, with glyphs modelled on the official character forms used in the People's Republic of China, and is primarily intended for writing Modern Standard #Chinese, Classical Chinese, and various Sinitic languages and dialects.
@Edent@mastodon.social
🆕 blog! “Internationalise The Fediverse”
We live in the future now. It is OK to use Unicode everywhere. It seems bizarre to me that modern Internet services sometimes "forget" that there's a world outside the Anglosphere. Some people have the temerity to speak foreign languages! And some of those languages have accents on their letters!! Even worse, some …
👀 Read more: https://shkspr.mobi/blog/2024/02/internationalise-the-fediverse/
⸻
#ActivityPub #fediverse #i18n #mastodon #unicode
@mikaeru@mastodon.social
Unicopedia Plus is a developer-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-plus
#characters #chinese #cjk #codepoints #desktopapplication #electronjs #emoji #ivd #japanese #javascript #kangxi #kangxiradicals #korean #normalization #opensource #regex #segmentation #strokecount #unicode #unicopedia #unihan #unikemet
@mikaeru@mastodon.social
Unicopedia Sinica is a developer-oriented set of #Unicode utilities related to ideographs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica
#characters #chinese #cjk #cjkrelated #cjkv #codecharts #codepoints #components #confusables #desktopapplication #electronjs #glyphs #ideographs #ideographicdescriptionsequences #ids #japanese #javascript #kangxi #kangxiradicals #korean #localfonts #opensource #strokes #tangut #unicode #unicopedia #unihan #vietnamese
@idontlikenames@mastodon.gamedev.place
New 2d numeral system just dropped‽‽‽
It's based on ᚛ᚑᚌᚐᚋ᚜ & ☯ & bijective base 6, & works left→right or left←right
#math #unicode #linguistics #pixelart #ui #blackandwhite #design #inspiration #language
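The arithmetic behind bijective base 6 can be sketched as follows. This is only an illustration of the numbering scheme: it uses the digits 1 to 6 as plain integers and does not attempt to reproduce the post's glyphs or its 2D layout, which the text does not spell out. The function names are hypothetical.

```python
def to_bijective(n: int, base: int = 6) -> list[int]:
    """Return the bijective base-`base` digits of n, most significant first.

    Digits run from 1 to `base`; there is no zero digit, and 0 itself
    is represented by the empty list.
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    digits = []
    while n > 0:
        # Shift by one so that remainders 0..base-1 map to digits 1..base.
        n, r = divmod(n - 1, base)
        digits.append(r + 1)
    return digits[::-1]


def from_bijective(digits: list[int], base: int = 6) -> int:
    """Inverse of to_bijective: fold the digits back into an integer."""
    value = 0
    for d in digits:
        if not 1 <= d <= base:
            raise ValueError(f"digit {d} out of range 1..{base}")
        value = value * base + d
    return value


if __name__ == "__main__":
    for n in (1, 6, 7, 42):
        print(n, to_bijective(n))
```

Because there is no zero digit, every positive integer has exactly one representation, and 6 is written as the single digit 6 rather than "10".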
@amake@mastodon.social
@gimsieke@mastodon.cloud
Formatting people’s names correctly in a given context, for a given purpose, is hard. International linguists recently helped update the #Unicode Common Locale Data Repository (#CLDR). It will help programmers display person names correctly in many settings.
Mike McKenna wrote about it in “A Story Teller’s Case Study: Unlocking the Power of CLDR Person Name Formatting – A Solution for Formatting Names in a Globalized World” https://www.unicode.org/media/CLDR_Person_Name_White_Paper_June%202023.pdf