We have a page on our intranet which is just plain/text
encoded in UTF-8
and it contais insert scripts for database. And problem is, when I download this page to a file with wget
or curl
, my downloaded file has UTF-8 encoding but special characters(Czech language chars) are broken.
So where could be a problem?I can convert it to CP-1250 encoding which works fine(I also tried ISO8859-2, it doesn't work either), but I can't use it for db insert via SQL*Plus, because our DB is encoded in UTF-8.
Thanks a lot for answers
The HTTP protocol communicates in US_ASCII.
Therefore, plain/text content CAN NOT be UTF-8: it must be encoded if it contains non-ASCII UTF-8 characters.