#Unicode
Qiita - 人気の記事
[謹賀新年] 世界中に配置した Oracle Active Data Guard から新年のご挨拶
https://qiita.com/shirok/items/1da55c23b33c5228049a?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
Paul McGuire
@[email protected] · Reply to Axel Rauschmayer's post
@rauschma Ah! I did something similar in Python - this is valid Python code:
def ℎ𝕖𝐥l𝙤():
try:
ℎ𝙚𝕝𝗹𝘰_ = "Hello"
w𝔬𝓇ˡ𝚍﹎ = "World"
𝖕𝘳𝒊𝖓𝑡(f"{𝗵𝒆𝘭𝓵𝚘﹍}, {𝑤º𝘳l𝑑︴}!")
except T𝗒ₚ𝕖E𝗿𝗋𝗈𝓻 as ᵉ𝒙ⅽ:
𝐩ᵣ𝚒𝖓𝓉("failed: {}".𝕗𝕠r𝑚𝖺𝘵(ⅇ𝔵𝚌))
if _︳n𝗮𝖒𝓮﹍︳ == "__main__":
h𝙚ⅼ𝐥𝕠()
Scott Williams 🐧
"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."
Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"
Senior level dev: "Oh, brilliant - a test of tolerance for pain by evaluating various code point chains with emoji, accents, and LTR/RTL markers. I'll start by writing some tests for 8-bit ord and char conversions with lookahead evals."
Scott Williams 🐧
"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."
Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"
Senior level dev: "Oh, brilliant - a test of tolerance for pain by evaluating various code point chains with emoji, accents, and LTR/RTL markers. I'll start by writing some tests for 8-bit ord and char conversions with lookahead evals."
Scott Williams 🐧
"This coding interview is just going to be determining the human friendly length of a unicode utf-8 string."
Junior level dev: "Oh, this is going to be easy. How do they not know about len()?"
Senior level dev: "Oh, brilliant - a test of tolerance for pain by evaluating various code point chains with emoji, accents, and LTR/RTL markers. I'll start by writing some tests for 8-bit ord and char conversions with lookahead evals."
SiljeLB
TIL that a proposal was made in 1997 to add #tengwar to #unicode. I'm disappointed it hasn't been made official yet though. Here's a link to the proposal document: https://www.unicode.org/wg2/docs/n1641.pdf #Tolkien #LordOfTheRings
omg! ubuntu
Ubuntu LTS users will shortly be able to see and use the 8 new emoji included in Unicode 16.0.
https://www.omgubuntu.co.uk/2024/12/ubuntu-update-support-for-emoji-16-0
Michel Mariani
In the open-source application `Unicopedia Sinica`, both data files used for the `CJK Components` and the `CJK Related` utilities are now in a consistent JSON format with MIT license: `cjk-ids.json` and `cjk-related.json` respectively.
🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica
These data files are still a work in progress, independently available in their own (frequently updated) repository:
🔗 https://codeberg.org/tonton-pixel/cjk-ids-data
🔗 https://codeberg.org/tonton-pixel/cjk-related-data
SnoopJ
HUH, #Unicode UAX#31 offers official guidance on hashtag identifiers, and I have somehow managed to miss that completely for several years (introduced along with Unicode 11.0 in 2018).
https://www.unicode.org/reports/tr31/#hashtag_identifiers
It's not like I re-read the whole document regularly or anything but yea huh
Aaron “#e14n pro” Madlon-Kay
@[email protected] · Reply to Aaron “#e14n pro” Madlon-Kay's post
洪 民憙 (Hong Minhee)
Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.
I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, and @hollo, an ActivityPub-enabled microblogging software for single users.
I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文, #漢文)!
Eniko | Kitsune Tails out now!
Btw here's a little #gamedev unicode protip: unicode defines several character ranges as private use areas. You can map code points in these ranges to whatever glyph you want. This can be very handy for custom characters in your game that won't conflict with established unicode characters
In our games we use the PUA for keyboard and controller button glyphs
Michael Zöllner
My study "Unicode Spaces" will be published in Slanted Magazine - Experimental Type 3!
Marcus Rohrmoser 🌻
@[email protected] · Reply to zirias (on snac)'s post
@zirias @stefano #hashtags are #unicode defined: https://www.unicode.org/reports/tr31/#D2
read 'em like this https://codeberg.org/seppo/seppo/src/commit/87bf300/lib/tag.ml#L31
Terence Eden
iOS 14 gets support for the Unicode Power Symbol!
https://shkspr.mobi/blog/2020/09/ios-14-gets-support-for-the-power-symbol/
Jim DeLaHunt
@[email protected] · Reply to Jim DeLaHunt's post
A cool change is that the Core Specification of the Unicode Standard is now released as a static HTML subsite, backed up by an archiveable #PDF of 1,140 pages.
https://unicode.org/versions/Unicode16.0.0/core-spec/
You can now link to specific sections and paragraphs, e.g.
"Unicode is about plain text, see: https://unicode.org/versions/Unicode16.0.0/core-spec/chapter-2/#G642" .
I helped out in a small way with the project to produce the core spec as HTML + PDF. I think it is a marvellous improvement.
Jim DeLaHunt
liilliil 🇫🇯🇱🇨
Народ, айда форсить наш, славянский, кириллический #fediverseSymbol!
«Три снежинки» — ⁂ — потенциальный повод для многочисленных подъёбок
Польские ребята (@brie) нашли лучшего кандидата — ꙮ, «серафим многꙮкий». Символ, найденный в 1928 году только в одной (!) рукописи, и только из-за этого (!) добавленный в #Unicode несколько веков ждал своего часа
https://ru.wikipedia.org/wiki/Мультиокулярная_О
(English version https://im-in.space/@liilliil/113028392518272881 )
AmyFou 🥥🌴
I am a #linguist (non-tenure track, uni) interested in every single thing about #languages, esp #Indigenous ones, #academics & #teaching Side gig in #ComunityBased #LanguageTech (#webdev #React #postgres #hasura #graphQL #nodeJS #nginx #linux #podman #kubernetes #docker #unicode lol). I love #animals and will ask you too many questions about your #dogs #cats #horses #sheep #goats #chickens #bunnies #piggies #cows etc . Proud #UglyDogs fan. Love #nature #birds #photography #art 👋
洪 民憙 (Hong Minhee)
@[email protected] · Reply to 洪 民憙 (Hong Minhee)'s post
洪 民憙 (Hong Minhee)
Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.
I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, and @hollo, a fediverse microblog for single users.
I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文/#漢文)!
Chunshek
#Introduction post for my own Mastodon instance!
• I’m a 43-year-old jack-of-all-trades.
• I grew up in #HongKong, lived in the #US. My partner of 14 years and I moved to #Taiwan in 2020.
• We are “parents” to one remaining dog.
• I have worked in journalism, finance, L&D, and now EdTech.
• I speak 6 #languages, and have dabbled in many others.
• Things I will nerd out about: #Unicode, #typography, #typhoons.
• I am a person of faith, but not a fan of organized religions.
• I type in #Dvorak.
洪 民憙 (Hong Minhee)
@[email protected] · Reply to 洪 民憙 (Hong Minhee)'s post
洪 民憙 (Hong Minhee)
xChaos
Nebaví vás googlit unicode znaky pro subscript a superscript? Mě už taky ne :-)
Akordy pro psaní horního a dolního indexu (ve smyslu Unicode) na klávesnici Windows se dají snadno vygooglit. Pod Linuxem je to ovšem trochu věda:
1) nejdřív Pravý alt + pravý shift + backspace + 2 (ano, čtyřhmat)
2) potom znak, který má být dolní index, třeba číslovka (což ovšem na české klávesnici, na kterou jste přepnutí, taky s shiftem, takže dvouhmat).
H₂O
Pro horní index ve stejném čtyřhmatu akorát nahradíte tu dvojku trojkou:
a² + b² = c²
Slušné akordy, ne? problém je, že pokud čtyřhmat nedomáčknete přesně (?) tak ten Backspace má tendenci fungovat jako backspace, takže umaže jeden znak... no zkrátka, dělám to pokaždé na několikátý pokus, zatím :-)
Vůbec jsem nepochopil návod
https://www.abclinuxu.cz/blog/kenyho_stesky/2020/8/psani-hornich-a-dolnich-indexu-pres-compose-key
... asi proto, že nevím, která PC klávesa je "compose key", ale v komentářích čtenářů jsem si všiml návodu pro slovenskou klávesnici a funguje mi i pro český layout a tak to předávám dál.
SnoopJ
the most important part of #Unicode history is when a mouse fell out of a light fixture and got added to the count of members present at a Technical Committee meeting (9 Nov 2016)
Nemo_bis 🌈
Re-#introduction: recurring topics here.
#Wikimedia #Wikidata #Wikipedia #MediaWiki #OpenStreetMap #Wikimania #Wikisource #WikiCite #OpenRefine #wiki #Wiktionary #WikiLovesMonuments #Wikibase #Wikiquote
#i18n #L10n #translatewiki.net #Unicode #CLDR #languages
#Copyright #PublicDomain #PubblicoDominio #Copyleft #CreativeCommons #OpenData #UploadFilters #LicenzaLibera #DatiAperti
#InternetArchive #books #biblioteche #library #Koha #KohaILS #GLAM
#WikiTeam #digipres #ArchiveTeam #XSLT
1/4
Michel Mariani
Unicopedia Ægypta is a developer-oriented set of #Unicode utilities related to Egyptian hieroglyphs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-aegypta
#characters #codecharts #codepoints #desktopapplication #egyptian #electronjs #glyphs #hieroglyph #hieroglyphs #javascript #localfonts #unicode #unicopedia #unikemet
Michel Mariani
Unicopedia Plus is a developer-oriented set of Unicode, Unihan, Unikemet & emoji utilities wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-plus
#characters #chinese #cjk #codepoints #desktopapplication #electronjs #emoji #ivd #japanese #javascript #kangxi #kangxiradicals #korean #normalization #opensource #regex #segmentation #strokecount #unicode #unicopedia #unihan #unikemet
Michel Mariani
Unicopedia Sinica is a developer-oriented set of #Unicode utilities related to ideographs, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sinica
#characters #chinese #cjk #cjkrelated #cjkv #codecharts #codepoints #components #confusables #desktopapplication #electronjs #glyphs #ideographs #ideographicdescriptionsequences #ids #japanese #javascript #kangxi #kangxiradicals #korean #localfonts #opensource #strokes #tangut #unicode #unicopedia #unihan #vietnamese
꧁ᐊ𰻞ᵕ̣̣̣̣̣̣́́♛ᵕ̣̣̣̣̣̣́́𰻞ᐅ꧂
New 2d numeral system just dropped‽‽‽
It's based on ᚛ᚑᚌᚐᚋ᚜ & ☯ & bijective base 6, & works left→right or left←right
#math #unicode #linguistics #pixelart #ui #blackandwhite #design #inspiration #language
Aaron “#e14n pro” Madlon-Kay
Gerrit Imsieke
Formatting people’s names correctly in a given context, for a given purpose, is hard. International linguists recently helped update the #Unicode Common Locale Data Repository (#CLDR). It will help programmers display person names correctly in many settings.
Mike McKenna wrote about it in “A Story Teller’s Case Study: Unlocking the Power of CLDR Person Name Formatting – A Solution for Formatting Names in a Globalized World” https://www.unicode.org/media/CLDR_Person_Name_White_Paper_June%202023.pdf