Tuesday, September 10, 2024

Announcing The Unicode® Standard, Version 16.0

Version 16.0 of the Unicode Standard is now available, including the core specification, annexes, and data files. This update continues to build on our commitment to providing comprehensive language and symbol support, enabling consistent representation across the globe. 

Unicode 16.0 adds 5,185 characters, bringing the total to 154,998. Seven new scripts have been added to enhance language representation:

  • Garay: A modern script used in West Africa. 
  • Gurung Khema, Kirat Rai, Ol Onal, Sunuwar: Scripts from Northeast India and Nepal, enhancing support for these linguistic traditions. 
  • Todhri: A historical script used for writing Albanian. 
  • Tulu-Tigalari: A historical script from Southwest India.
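
Whether a given runtime can already classify these new characters depends on which version of the Unicode Character Database it bundles. As a rough sketch in Python, the standard `unicodedata` module exposes that version. The helper names below are illustrative, and U+10D40 is used as a sample code point on the assumption that it lies in the new Garay block; neither is part of the release itself.

```python
import unicodedata


def ucd_version() -> tuple[int, ...]:
    """Return the bundled Unicode Character Database version as a tuple of ints."""
    return tuple(int(part) for part in unicodedata.unidata_version.split("."))


def supports_unicode(major: int, minor: int = 0) -> bool:
    """True if the bundled database is at least the requested version."""
    return ucd_version() >= (major, minor)


def is_assigned(ch: str) -> bool:
    """True if the runtime's database knows this code point (category is not 'Cn')."""
    return unicodedata.category(ch) != "Cn"


# Counts from the announcement; the previous total is implied by subtraction.
NEW_IN_16 = 5_185
TOTAL_IN_16 = 154_998
PREVIOUS_TOTAL = TOTAL_IN_16 - NEW_IN_16  # 149,813

if __name__ == "__main__":
    sample = "\U00010D40"  # assumed to be a Garay code point
    print("Bundled UCD version:", unicodedata.unidata_version)
    print("Supports 16.0:", supports_unicode(16))
    print("Sample code point assigned:", is_assigned(sample))
```

On a runtime that predates Version 16.0, the sample code point would be reported as unassigned, which is a cheap way to detect that an upgrade is needed before relying on the new scripts.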


Among the popular symbol additions are seven new emoji characters, including a shovel, a harp, a fingerprint, and a leafless tree that reflects ongoing climatic changes.

This update also includes 3,995 additional Egyptian Hieroglyphs and over 700 symbols from legacy computing environments, supporting both historical text representation and the technical needs of today's users. In the CJK unified ideograph blocks, new Japanese source references for over 36,000 ideographs improve text representation and usability across East Asian languages.

Two new data files accompany this release:

  • DoNotEmit.txt: A new data file directing input methods and autocorrect applications to alternative character sequences.
  • Unikemet.txt: A new data file providing detailed property and character information for Egyptian hieroglyphs.
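
To illustrate how an input method or autocorrect tool might consume a file like DoNotEmit.txt, here is a minimal Python sketch. It assumes the usual Unicode Character Database conventions (semicolon-separated fields, `#` comments) and a three-field layout of discouraged sequence, preferred replacement, and type. That layout, the sample line, and all function names are assumptions for illustration and should be checked against the header of the actual file.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    avoid: str   # sequence that should not be emitted
    prefer: str  # suggested alternative sequence
    kind: str    # free-form type label from the third field


def _decode(field: str) -> str:
    """Turn space-separated hex code points (e.g. '02BC 006E') into a string."""
    return "".join(chr(int(cp, 16)) for cp in field.split())


def parse_do_not_emit(text: str) -> list[Entry]:
    """Parse UCD-style lines, skipping comments, blanks, and short lines."""
    entries = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comment
        if not line:
            continue
        fields = [f.strip() for f in line.split(";")]
        if len(fields) < 3:
            continue  # not in the assumed three-field layout
        entries.append(Entry(_decode(fields[0]), _decode(fields[1]), fields[2]))
    return entries


def apply_replacements(text: str, entries: list[Entry]) -> str:
    """Replace each discouraged sequence with its preferred alternative."""
    for entry in entries:
        text = text.replace(entry.avoid, entry.prefer)
    return text


# Hypothetical excerpt in the assumed format, not copied from the real file.
SAMPLE = """\
# avoid; prefer; type
0149; 02BC 006E; Deprecated # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
"""

if __name__ == "__main__":
    table = parse_do_not_emit(SAMPLE)
    print([hex(ord(c)) for c in apply_replacements("\u0149", table)])
```

A real integration would more likely offer the replacement as a suggestion than rewrite text silently, since the file is described as guidance for input methods.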

Unicode properties and specifications determine the behavior of text on computers and phones. As part of this release, key Unicode Technical Standards have been synchronized to support Version 16.0, including updates to:

  • UTS #10 (Unicode Collation Algorithm): Enhancements for more precise text sorting. 
  • UTS #39 (Unicode Security Mechanisms): Strategies to mitigate Unicode spoofing. 
  • UTS #46 (Unicode IDNA Compatibility Processing): Improved processing of internationalized (non-ASCII) domain names in URLs.
  • UTS #51 (Unicode Emoji): Updates affecting emoji representation and behavior. 
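
As a rough illustration of the kind of spoofing that UTS #39 addresses, the Python sketch below flags strings that mix letters from more than one script, such as a Cyrillic "а" hidden in an otherwise Latin word. It is not the UTS #39 algorithm: Python's standard library does not expose the Script property, so the hypothetical helper `rough_scripts` guesses a script from the first word of each character name, which is only a crude approximation.

```python
import unicodedata


def rough_scripts(text: str) -> set[str]:
    """Collect the first word of each letter's character name as a script guess."""
    scripts = set()
    for ch in text:
        if not ch.isalpha():
            continue  # ignore digits, punctuation, and spaces
        name = unicodedata.name(ch, "")
        if name:
            scripts.add(name.split()[0])  # e.g. 'LATIN', 'CYRILLIC'
    return scripts


def looks_mixed(text: str) -> bool:
    """True if the letters appear to come from more than one script."""
    return len(rough_scripts(text)) > 1


if __name__ == "__main__":
    genuine = "paypal"
    spoofed = "p\u0430ypal"  # second letter is CYRILLIC SMALL LETTER A
    print(genuine, rough_scripts(genuine), looks_mixed(genuine))
    print(spoofed, rough_scripts(spoofed), looks_mixed(spoofed))
```

A production check would use the Script and Script_Extensions properties and the confusables data that UTS #39 defines, since many legitimate strings mix scripts and many confusable pairs live within a single script.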

This version brings several changes that might require modifications for implementations using earlier versions. For comprehensive migration details and technical specifics, refer to the migration sections in the relevant Unicode Technical Standards documentation.

The changes introduced by Unicode 16.0 reflect our continued commitment to accommodate evolving language and symbolic communication needs worldwide, enriching the way people collaborate and share ideas.


Adopt a Character and Support Unicode's Mission

Looking to give that special someone a special something?
Or maybe something to treat yourself?
🕉️💗🏎️🐨🔥🚀爱₿♜🍀

Adopt a character or emoji to give it the attention it deserves, while also supporting Unicode’s mission to ensure everyone can communicate in their own languages across all devices.

Each adoption includes a digital badge and certificate that you can proudly display!

As Unicode, Inc. is a US-based open source, open standards, non-profit 501(c)(3) organization, your contribution may be eligible for a tax deduction. Please consult with a tax advisor for details.