Profile

Cover photo
198 followers|94,359 views
AboutPostsPhotosVideos

Stream

Unicode

Shared publicly  - 
 
New Unicode Character Property EquivalentUnifiedIdeograph

A new character #property EquivalentUnifiedIdeograph is proposed for addition to #Unicode 10.0. This property associates (where possible) the 365 characters in the CJK Radicals Supplement, #Kangxi Radicals, and #CJK Strokes blocks to an appropriate CJK Unified Ideograph.

For details of the proposal and information about how to provide feedback, please see Public Review Isue #344

http://www.unicode.org/review/pri344/

http://blog.unicode.org/2017/01/new-unicode-character-property.html

1
Add a comment...

Unicode

Shared publicly  - 
 
Support Unicode with an Adopt-a-Character Gift this Holiday Season!

This #holiday season you can give a unique gift by adopting any #emoji, letter, or symbol — and help support the #Unicode Consortium’s mission to enable all languages to be used on computers. ​Three levels of sponsorship​ are available​, starting at $100. With over 128,000 characters to choose from, you are certain to find an appropriate character, for even the most demanding recipient. All sponsors will receive a custom digital badge featuring the adopted character for use on the web and elsewhere. Sponsors at the two highest levels will receive a special thank-you gift engraved with the name you supply and the adopted character.

The program funds work on “digitally disadvantaged” languages, both modern and historic. In 2016 the program awarded a grant to support work on a proposal for the Hanifi Rohingya script. The program has also funded work on Egyptian hieroglyphs and Mayan hieroglyphs.

In its first year, the Adopt-a-Character program has had nearly 400 sponsors. Be part of the next wave, with a worthwhile gift!

For more information on the program, or to adopt a character, see the Adopt-a-Character Page.

http://www.unicode.org/consortium/adopt-a-character.html

1
Add a comment...

Unicode

Shared publicly  - 
 
113 new #emoji are now available in UTR #51 Unicode Emoji, Version 4.0. http://www.unicode.org/reports/tr51/ The main focus of this 4.0 release is further enhancing gender representation and professions. These new emoji are already appearing on smart phones and other devices and platforms that support emoji. See the full list in Emoji Recently Added. http://www.unicode.org/emoji/charts/emoji-released.html

The new emoji will soon be available for adoption, helping fund projects to improve language support. http://blog.unicode.org/2016/06/encoding-mayan-script-your-adopt.html

Unlike the 72 emoji characters added to Unicode 9.0 in June, these are not new Unicode characters. Most of these new emoji are sequences of existing emoji, “glued together” with a special invisible character so that they appear and behave like a single character. This glue character is called a ZWJ, pronounced “zwidge” or /zwɪdʒ/. Three existing Unicode 9.0 characters (gender and medical symbols) were changed to qualify as emoji, for use in those ZWJ sequences.

Two of the new sequences are flags, 10 are family groupings (such as mother with daughter), 32 are new professions/roles (such as man or woman astronaut), and 66 are explicit-gendered variants (such as man or woman running). 99 of these sequences, plus 5 other characters (such as snowboarder), can also now have the 5 skin tone modifiers.
The technical documentation has also been updated, with additional guidelines for implementers and the new versions of the emoji data files for use in programs.


http://blog.unicode.org/2016/11/113-new-unicode-emoji-plus-skin-tones.html

1
Add a comment...

Unicode

Shared publicly  - 
 
CLDR Version 30 Released

Unicode #CLDR 30 provides an update to the key building blocks for software supporting the world’s languages. This data is used by all major software systems for their software internationalization and localization, adapting software to the conventions of different languages for such common software tasks. The following summarizes the main improvements in the release.

* #Unicode support is updated to 9.0, including updated Unihan readings for the pinyin collation and Han-Latin transforms, and support for new script codes and number systems.
* The set of language codes for translation has been updated, with a significant increase in the total number of translated language names.
* Substantial new data has been added for likely subtags (e.g., to get the main script for each language).
* New data items have been added to support relative times such as “3 Fridays ago” or “this hour”.
* New draft format and preference structure has been added to support week designations such as “the week of August 10” or “week 3 of March”.
* New <characterLabels> data can be used to generate labels for groups of related characters in character pickers.
* The structure for emoji annotations has been revised, and the data has been significantly updated. The emoji collation has been updated, and data is added for improved segmentation behavior. Added a specification for synthesizing ZWJ sequence names.
* The CLDR 30 Survey Tool data collection resulted in a net increase in data items of about 9.2%, with an additional 5.9% of items changed.

For further details and links to documentation, see the CLDR Release Notes
http://cldr.unicode.org/index/downloads/cldr-30

http://blog.unicode.org/2016/10/cldr-version-30-released.html


Unicode CLDR 30 provides an update to the key building blocks for software supporting the world’s languages. This data is used by all major software systems for their software internationalization and localization, adapting s...
2
Add a comment...

Unicode

Shared publicly  - 
 
Unicode 9.0 Paperback Available

The Unicode 9.0 core specification is now available in paperback book form with a new, original cover design. This edition consists of a pair of modestly priced print-on-demand volumes containing the complete text of the core specification of Version 9.0 of the #Unicode Standard.

Each of the two volumes is a compact 6×9 inch US trade paperback size. The two volumes may be purchased separately or together, although they are intended as a set. The cost for the pair is US $16.75, plus postage and applicable taxes. Please visit the description page to order. http://www.lulu.com/spotlight/unicode

Note that these volumes do not include the Version 9.0 code charts, nor do they include the Version 9.0 Standard Annexes and Unicode Character Database, which are freely available on the Unicode website.

Purchase The Unicode Standard, Version 9.0 - Core Specification:
http://www.lulu.com/spotlight/unicode

http://blog.unicode.org/2016/09/unicode-90-paperback-available.html

1
Add a comment...

Unicode

Shared publicly  - 
 

Keynote Speaker Announced for IUC 40

My Life as a Higher Level Protocol
John Hudson

After sitting in on a full day of in depth tutorials, join us Wednesday morning as we kick off our 25th year with a #keynote presentation by John Hudson, Co-Founder, Tiro Typeworks. John has spent two decades working at the messy interface between text encoding and typography, much of it making fonts for complex scripts. In his keynote presentation, he reflects on some of the messiest aspects of this work, and why, after twenty years, he's convinced that a holistic overview of text is necessary.

About IUC 40, November 1-3, 2016: For twenty-five years the Internationalization & Unicode® Conference (IUC) has been the preeminent event highlighting the latest innovations and best practices of global and #multilingual software providers. Please join us for our 40th #conference! This year's event is being held on November 1-3, 2016 in Santa Clara, California. Read more: http://www.unicodeconference.org/about-iuc.htm

http://blog.unicode.org/2016/08/keynote-speaker-announced-for-iuc-40.html

http://www.tiro.com/index.html @tirotypeworks #iuc40

1

Unicode

Shared publicly  - 
 
Adopt-A-Character Grant to Support Indic Scripts

The Adopt-a-Character program has awarded a grant to support further development of the following four #Indic scripts in the #Unicode Standard:

* Hanifi Rohingya, a script in current use in Myanmar and Bangladesh

* Nandinagari, a Brahmi-based historic #script formerly used in South India

* Old Sogdian, a group of historic scripts formerly used in Kazakhstan, Pakistan, and Western China

* Sogdian, derived from Old Sogdian, a group of historic scripts formerly used in Central Asia

The goal of this grant is to enable the development of encoding proposals that can be included in the Unicode Standard. The work will be done by Anshuman Pandey under the direction of Deborah Anderson (SEI, UC Berkeley) and Rick McGowan (Unicode Consortium).

http://unicode.org/consortium/adopt-a-character.html

http://blog.unicode.org/2016/12/adopt-character-grant-to-support-indic.html

1
1
Add a comment...

Unicode

Shared publicly  - 
 
Proposed Update UTR #51, Unicode Emoji (Version 5.0)

A proposed update of UTR #51, #Unicode #Emoji (Version 5.0) is available for public review and feedback. http://www.unicode.org/reports/tr51/tr51-10.html This new version adds a mechanism to support regional #flags, such as Scotland or California, though the choice of which of these flags to support is left to vendors.

Associated #charts are available at http://www.unicode.org/emoji/charts-beta/index.html, and associated data files are available at http://www.unicode.org/Public/emoji/5.0/ . This proposed update also has a separate data file for the valid #emoji presentation sequences, and reflects a small change in the ordering of SELFIE. The charts also add the newest Apple and Facebook emoji.

At this time, the proposed update does not add any additional recommended #emoji zwj sequences, nor reclassify any existing Unicode 9.0 characters as emoji. There are proposals for doing so that will be reviewed in the next Unicode Technical Committee meeting.

The #review period for the proposed update ends on January 16, 2017. For further information and instructions on how to provide feedback, please see Public Review Issue #343. http://www.unicode.org/review/pri343/

This holiday season you can give a unique gift by adopting any emoji, letter, or symbol — and help support the Unicode Consortium’s mission to enable all languages to be used on computers. You can now adopt Unicode 9.0 characters and the Emoji 4.0 emoji sequences (such as woman astronaut or rockstar). ​​See the Adopt-a-Character Page. http://www.unicode.org/consortium/adopt-a-character.html

http://blog.unicode.org/2016/12/proposed-update-utr-51-unicode-emoji.html


1
1
Add a comment...

Unicode

Shared publicly  - 
 
Proposed Update UTS #37, Unicode Ideographic Variation Database

The #Unicode Consortium has posted a new issue for public review and comment.

UTS #37, Unicode Ideographic Variation Database, is being updated to broaden the scope of base character, from characters with the Unified_Ideograph property to characters with the Ideographic property, excluding characters that canonically or compatibly decompose. The substantive changes can be found in Section 2, Description. This proposed update is currently under review with a closing date of 2017-01-16. For more information, please see Public Review Issue #337. http://www.unicode.org/review/pri337/ #ivd #ideograph

http://blog.unicode.org/2016/11/proposed-update-uts-37-unicode.html

1
Add a comment...

Unicode

Shared publicly  - 
 
ICU 58 Released

Unicode® ICU version 58 has just been released! #ICU is the main avenue for many software products and libraries to support the world's languages, implementing both the latest version of the #Unicode encoding standard and of the Unicode locale data (#CLDR).

ICU 58 provides full support for the recent Unicode 9.0 release with 7,500 new characters and many property improvements. It covers the Unicode 9.0 emoji characters — plus the latest draft version of Emoji 4.0 — for a total of 2,444 emoji characters and sequences, including the new ZWJ sequences for gendered professions; ICU word & line breaking is updated for Emoji 4.0. ICU 58 incorporates the latest version 30 of Unicode CLDR locale data with a significant increase in data coverage.

There are a number of new APIs, including ones for measurement system unit display names (such as “acre” or “Hektar” in 80 languages), and improvements in performance and robustness. For Java, the unit tests are converted to JUnit, for easier and faster integration into test suites.

For details please see http://site.icu-project.org/download/58

http://blog.unicode.org/2016/10/icu-58-released.html


1
Add a comment...

Unicode

Shared publicly  - 
 
Emoji Deadline

Reminder: Emoji proposals must be submitted by October 1 to be considered for Unicode 10 (2017). See Process and Timeline. http://www.unicode.org/emoji/selection.html#timeline

Also, see the latest #emoji charts. Both the v3.0 and the v4.0 beta have been regenerated with updated images, and with updated sorting order, short names, and keywords (annotations) from the alpha Unicode #CLDR v30 release.

http://www.unicode.org/emoji/charts/
http://www.unicode.org/emoji/charts-beta/
http://cldr.unicode.org/index/downloads/cldr-30

http://blog.unicode.org/2016/09/emoji-deadline.html

1
chuah shen long's profile photo
 
你行不行的
 ·  Translate
Add a comment...

Unicode

Shared publicly  - 
 
New FAQ on Myanmar Scripts and Languages

A new FAQ on #Myanmar Scripts and Languages has been posted on the Unicode website. This FAQ discusses the use of the Myanmar script in Unicode, and covers the challenges of encoding, display, and interoperating with existing non-Unicode encodings such as #Zawgyi.

http://www.unicode.org/faq/myanmar.html

http://blog.unicode.org/2016/09/new-faq-on-myanmar-scripts-and-languages.html
1
Add a comment...
Story
Tagline
The Unicode Consortium enables people around the world to use computers in any language.
Introduction
Our members develop the Unicode Standard, Unicode Locales (CLDR), and other standards. These specifications form the foundation for software internationalization in all major operating systems, search engines, applications, and the Web.