Tony Finch - no-longer-simple mail transport protocol
16 September 2006
in links
tagged with
[charactersets]
[smtp]
[utf8]
UTF-8 and other character set excitement in SMTP
Nerdy t-shirt
17 August 2005
in photos
in set interesting things
tagged with
[tshirt]
[utf8]
From PrintWhatYouThink. No, I’m not going to explain it.
using utf-8 in irssi under screen
23 June 2005
in blog
tagged with
[linux]
[screen]
[unicode]
[utf8]
Firstly, tell your local terminal application that you want a utf-8 window. This is left to you, but under macos (which I use), right click the window, select ‘Window settings’, pick the ‘Display’ option from the drop-down, and pick utf-8 under ‘Character set encoding’.
Next, when you start the screen session, pass the ‘-U’ flag. This has to be passed to a new screen session - you can’t connect to an existing one this way.
screen -U
Alternatively, you can turn on the utf-8 flag for a single existing screen window by typing your hotkey (ctrl-a by default), then ‘:utf8 on’. This is good if you don’t want all of your windows to be utf now.
On the remote machine, make sure that the ‘LANG‘ environment variable is set to something UTF-8 like, for instance, I use
export LANG=en_GB.UTF-8
in my .bashrc.
Finally, you need to tell irssi to use UTF-8. Start it up in your new utf-8 window, and type
/set term_type utf-8
Hopefully everything should work now.
More UTF8 pain
15 December 2004
in blog
tagged with
[browser]
[unicode]
[utf8]
Does no-one in the world care about non-ASCII characters? It’s pathetic. I’m trying to make HTML form uploads work for files with non-ASCII characters in their names, and I’m hitting the stupidest problems.
The main bugbear is mozilla - you can’t upload files with wide characters in their names. At all. Piece of shit. Safari seems to be encoding the upload filenames with some made-up encoding that I can’t figure out, so that’s out of luck. At least safari sends the actual contents of the files.
The one browser I’ve tried that works flawlessly is Internet Explorer. Microsoft, at least, seem to care about the non-US market.
UTF8 Openguides
13 December 2004
in blog
tagged with
[perl]
[unicode]
[utf8]
[wiki]
I foolishly offered to make OpenGuides UTF-8 safe. Because I don’t do that enough at work, or something. Anyway, it’s going quite well - because I did all the grunt work in CGI::Wiki a while ago, it’s just a matter of finding all the inputs and outputs and making sure they’re encoded properly. So far, the page contents and names are utf-8 safe, along with the cookie preferences, so your username is good. The search stuff looks scary, and there are various broken plugins, etc, etc, so there’s still stuff to do. I should also do the hooks properly - CGI::Wiki should offer nice functions for this stuff.
Anyway, there’s a demo site here in case you feel like trying to break it. The patch against OG is here, out of my svn repository, of course.
