开发者

how can i escape '\xff\xfe' to a readable string

开发者 https://www.devze.com 2022-12-15 05:57 出处:网络
i see a string in this code: data[:2] == \'\\xff\\xfe\' i don\'t know what \'\\xff\\xfe\' is, soi want to escape it ,but not successful

i see a string in this code:

data[:2] == '\xff\xfe'

i don't know what '\xff\xfe' is,

so i want to escape it ,but not successful

import cgi
print cgi.escape('\xff\xfe')#print \xff\xfe

how can i开发者_如何学JAVA get it.

thanks


'\xFF' means the byte with the hex value FF. '\xff\xfe' is a byte-order mark: http://en.wikipedia.org/wiki/Byte_order_mark

You could also represent it as two separate characters but that probably won't tell you anything useful.


>>> print '\xff\xfe'.encode('string-escape')
\xff\xfe


What is the connection between "i don't know what '\xff\xfe' is" and "so i want to escape it"? What is the purpose of "escaping" it?

It would help enormously if you gave a little more context than data[:2] == '\xff\xfe' (say a few line before and after) ... however it looks like it is testing whether the first two bytes of data could possibly represent an UTF-16 littleendian byte order mark. In that case you could do something like:

UTF16_LE_BOM = "\xff\xfe"

# much later
if data[:2] == UTF16_LE_BOM:
    do_something()


You cannot escape or encode an invalid string.

You should understand that you are working with strings and not byte streams and there are some characters you cannot accept in them, first of them being 0x00 - and also your example that is happening to be a BOM sequence.

So if you need to include non-valid strings characters (unicode or ascii) you will have to stop using strings for this.

Take a look at PEP-0358

0

精彩评论

暂无评论...
验证码 换一张
取 消