开发者

Encoding issues with Microsoft Word characters in an AJAX request

开发者 https://www.devze.com 2022-12-23 13:38 出处:网络
I\'m writing a function to convert MS Word-styled text into Adobe InDesign-formatted text (it uses a kind of XML to indicate styling). The text is pasted into a TinyMCE rich text editor, which then se

I'm writing a function to convert MS Word-styled text into Adobe InDesign-formatted text (it uses a kind of XML to indicate styling). The text is pasted into a TinyMCE rich text editor, which then sends the HTML-formatted code to a php function.

I've tried this function to clean up the code once it reaches my conversion code:

$text = iconv("windows-1250", "UTF-8", $html);

When I use any 'special' kind of characters, things go wrong. £ signs, é (or any other accents), and a variety of 'curly' apostrophes/quote marks seem to break things. For example, if I try to convert a £ sign, the code returns \u0141, but I get the Ł symbol displayed onscreen when the function returns.

Does anybody kn开发者_C百科ow what I can do to prevent Word's weird characters breaking everything I'm doing?


I seem to have fixed this. I was using escape() to pass the values, but replaced this with encodeURIComponent() instead (and removed the iconv() call in my php code), which seems to have fixed it.

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号