[Xapian-discuss] Japanese / UTF-8 support

Olly Betts olly at survex.com
Sat Aug 26 16:09:52 BST 2006


On Sat, Aug 19, 2006 at 12:46:54PM +0100, James Aylett wrote:
> Yeah, as far as I'm aware there's no standard on what you should do if
> your form enctype (which tends to default to the document charset,
> which is daft but there you go) can't cope with characters you're
> submitting. Note that multipart/form-data copes with this properly,
> because it allows different form fields to have different encodings.

The problem is that search forms usually want to use METHOD=GET so
that users can bookmark the results page.

The best approach I've found is simply to ensure that the document
containing the search form has an encoding which can handle all Unicode
characters.  UTF-8 is the best choice, since at least unaccented Latin
characters appear in human-readable form in the query URL.
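
As a rough illustration of what that means for the query URL, here is a
minimal Python sketch.  The base URL and the parameter name "P" are
assumptions made up for the example, and the helper names are
hypothetical; it only shows how a UTF-8 form submission ends up
percent-encoded in a GET URL and how the receiving side can decode it.

```python
from urllib.parse import urlencode, parse_qs, urlsplit


def build_search_url(base, query):
    """Build a bookmarkable GET URL with the query encoded as UTF-8.

    'base' and the parameter name 'P' are assumptions for illustration.
    """
    return base + "?" + urlencode({"P": query}, encoding="utf-8")


def read_query(url):
    """Recover the query text, assuming the form was submitted as UTF-8."""
    params = parse_qs(urlsplit(url).query, encoding="utf-8")
    return params["P"][0]


if __name__ == "__main__":
    url = build_search_url("http://example.org/search", "xapian 日本語")
    # Unaccented Latin characters stay readable; everything else is
    # percent-encoded as UTF-8 bytes.
    print(url)
    print(read_query(url))
```

Here "xapian" stays readable in the URL while each Japanese character
becomes three percent-encoded bytes, and decoding as UTF-8 on the
receiving side gives back the original text.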

Cheers,
    Olly


