How do you progra开发者_如何学Gommatically convert a C-string with encoding Windows-1252 (a.k.a. CP1252) to UTF8?
On Windows? First call MultiByteToWideChar
and then WideCharToMultiByte
.
On a Mac or Linux: call iconv_open
and then iconv
as needed.
In general: incorporate the ICU4C library.
You need to use a code page conversion, which is language/paltform specific.
If you want to implement it: Windows-1252 on Wikipedia (there is a link to the conversion table)
Some solutions:
- Windows: MultiByteToWideChar
- python: decode()
- Linux: libiconv (just one of the many)
etc...
精彩评论