It's been well documented elsewhere (http://www.cl.cam.ac.uk/~mgk25/ucs/utf-8-history.txt) that one Wednesday night, after a phone call from X/Open, Ken Thompson and I were sitting in a New Jersey diner talking about how best to represent Unicode as a byte stream. Given the experience we had accumulated dealing with the original UTF, which had many problems, we knew what we wanted. X/Open had offered us a deal: implement something better than their proposal, called FSS/UTF (File System Safe UTF; the name tells you something on its own), and do so before Monday. In return, they'd push it as existing practice.
UTF was awful. It had modulo-192 arithmetic, if I remember correctly, and was all but impossible to implement efficiently on old SPARCs with no divide hardware. Strings like "/*" could appear in the middle of a Cyrillic character, making your Russian text start a C comment. And more. It simply wasn't practical as an encoding: think what happens to that slash byte inside a Unix file name.
FSS/UTF addressed that problem, which was great. Big improvement though it was, however, FSS/UTF was more intricate than we liked and lacked one property we insisted on: If a byte is corrupted, it should be possible to re-synch the encoded stream without losing more than one character. When we claimed we wanted that property, and sensed we could press for a chance to design something right, X/Open gave us the green light to try.
The diner was the Corner Café in New Providence, New Jersey. We just called it Mom's, to honor the previous proprietor. I don't know if it's still the same, but we went there for dinner often, it being the closest place to the Murray Hill offices. Being a proper diner, it had paper placemats, and it was on one of those placemats that Ken sketched out the bit-packing for UTF-8. It was so easy once we saw it that there was no reason to keep the placemat for notes, and we left it behind. Or maybe we did bring it back to the lab; I'm not sure. But it's gone now.
I'll always regret that.
But that's my only regret in this story. UTF-8 has made the world a better place and I am delighted to have been a facilitator in its creation.
So tonight, please give a toast to a few quickly sketched boxes on a placemat in a New Jersey diner that today define how humans represent their text to be sent across international boundaries (http://googleblog.blogspot.com/2012/02/unicode-over-60-percent-of-web.html), and especially to the X/Open folks for giving us the chance and the Unicode consortium and IETF for pushing it so successfully.
U+263A '☺' white smiling face
If it weren't for all the rampant elitism displayed by the typical Android phone user, then I wouldn't even bat an eye...
- Devops Engineer, 2014 - present
- EHR Total SolutionsWeb Application Developer, 2013 - 2013
- NJI MediaSenior Web Developer, 2011 - 2013
- EvEMRDirector of Development, 2013 - 2013
I'm into solving complex problem solving, logic, and working on my Jeep.
The Humble Indie Bundle #3 (pay what you want for five awesome indie games)
Pay what you want for some awesome games and help support two charities. All of the games are DRM-free and support Mac, Windows, and Linux.
Cheap Domain Names Registration, Domain Transfer, SSL Certificates, Free...
Namecheap provides cheap domain name registration, renewal & transfer. We are an approved ICANN Domain Registrar. We also provide busine
Join the party! We're watching vids together. Get in here!
Plug.dj lets you and your friends throw a music video party. Chat with others in real-time about the music video. Step up and show off your
Battlefield 3 Assault Dog Tag Set - Accessories - Official DICE Store fo...
2 Aluminum 51mm x 29mm tags fitted with rubber silencers on 600mm and 100mm chain
25+ jQuery Plugins that enhance and beautify HTML form elements
I have collected over 25 jQuery plugins that enhance and beautify html form elements. Some of them add more capabilities that able to increa
Sennheiser USA - PC - Professional Gaming Headset - Gamer Headset - PC 3...
PC - Professional Gaming Headset - Gamer Headset - PC 360 - Skype, VoIP equipment, specifications, features - Personal Audio, www.sennheise