UTF-8 and TOHTML weirdness

Fairlight fairlite at fairlite.com
Thu Jun 15 14:54:08 PDT 2006


This public service announcement was brought to you by Walter Vaughan:
> Okay, maybe it's doing what it's supposed to do, but when I use the TOHTML 
> command for a current need it's creating a \x94 code which I believe is coming 
> from a double quote (inch marks) in a field.
> 
> The incoming application gets an XML document and seems to very much not like 
> that code.
> 
> I thought the problem was with the application, but if I run xmllint on the file 
> I get the same error "Input is not proper UTF-8".

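Why don't you declare iso-8859-1 as your XML encoding instead of UTF-8?
That may solve your problems, since a lone \x94 byte is legal in a
single-byte encoding but is not valid UTF-8.  (If that \x94 really is a
curly closing quote, the data is probably Windows-1252, where \x94 is that
character; in strict iso-8859-1 it is a control code.)  Unless you're
dealing with multilingual stuff, there are very few reasons to actually
use Unicode.  Yet everyone seems to default their encodings to UTF-8.
I dunno why.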
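
For what it's worth, here is a rough Python sketch of why xmllint complains
and what the two ways out look like.  The sample bytes, the <size> element,
and the helper names are made up for illustration, and the guess that the
field data is Windows-1252 (cp1252) is an assumption, not something
confirmed in this thread.

```python
# Hypothetical field value: a measurement whose closing quote is byte 0x94.
raw = b'<size>12\x94</size>'


def try_decode(data, encoding):
    """Return the decoded text, or None if the bytes are invalid in that encoding."""
    try:
        return data.decode(encoding)
    except UnicodeDecodeError:
        return None


def reencode_as_utf8(data, source_encoding="cp1252"):
    """Decode from the assumed source encoding and re-encode as UTF-8."""
    return data.decode(source_encoding).encode("utf-8")


# A lone 0x94 is a UTF-8 continuation byte with no lead byte, so this fails.
print(try_decode(raw, "utf-8"))

# Single-byte encodings accept it, but they disagree about what it means:
# latin-1 (iso-8859-1) yields the control character U+0094,
# cp1252 yields U+201D, the right double quotation mark.
print(repr(try_decode(raw, "latin-1")))
print(repr(try_decode(raw, "cp1252")))

# Alternative to changing the XML declaration: convert the bytes to real UTF-8.
print(reencode_as_utf8(raw))
```

So either declare the encoding the bytes are really in, or convert the bytes
before the document is handed to the receiving application.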

mark->

