I see a string in this code:
    data[:2] == '\xff\xfe'
I don't know what '\xff\xfe' is, so I want to escape it, but without success:
    import cgi
    print cgi.escape('\xff\xfe')  # print \xff\xfe
How can I get it?
Thanks
'\xFF' means the byte with the hex value FF. '\xff\xfe' is a byte-order mark: http://en.wikipedia.org/wiki/Byte_order_mark
You could also represent it as two separate characters but that probably won't tell you anything useful.
    >>> print '\xff\xfe'.encode('string-escape')
    \xff\xfe
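To make the "byte with the hex value FF" point concrete, here is a minimal sketch that inspects the two bytes. It assumes Python 3 syntax (the `b'...'` literal and `bytes.hex()`), whereas the code in the question is Python 2, where the same value is written as a plain `'\xff\xfe'` string.

```python
import codecs

# Python 3 byte-string literal; Python 2 would write '\xff\xfe'.
bom = b'\xff\xfe'

# Each \xNN escape stands for one byte, written as two hex digits.
byte_values = list(bom)   # integer value of each byte
hex_text = bom.hex()      # the same bytes as a hex string

# The standard library already provides a named constant for this sequence.
is_utf16_le_bom = (bom == codecs.BOM_UTF16_LE)

print(byte_values, hex_text, is_utf16_le_bom)
```

Seen this way, the string is simply two bytes, 255 and 254, and nothing in it needs escaping.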
What is the connection between "i don't know what '\xff\xfe' is" and "so i want to escape it"? What is the purpose of "escaping" it?
It would help enormously if you gave a little more context than data[:2] == '\xff\xfe'
(say, a few lines before and after). However, it looks like it is testing whether the first two bytes of data
could represent a UTF-16 little-endian byte order mark. In that case you could do something like:
    UTF16_LE_BOM = "\xff\xfe"
    # much later
    if data[:2] == UTF16_LE_BOM:
        do_something()
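As one possible reading of what do_something() might be, the sketch below strips the BOM and decodes the rest of the data. It assumes Python 3; `decode_with_bom` and its `default_encoding` parameter are hypothetical names made up for illustration, not part of the original code. Note that a UTF-32 little-endian BOM also starts with these two bytes, so this check alone is only a hint.

```python
import codecs


def decode_with_bom(data, default_encoding="utf-8"):
    """Hypothetical helper: decode bytes, honoring a UTF-16 LE BOM if present."""
    if data[:2] == codecs.BOM_UTF16_LE:
        # Skip the two BOM bytes and decode the rest as little-endian UTF-16.
        return data[2:].decode("utf-16-le")
    # No BOM found: fall back to the assumed default encoding.
    return data.decode(default_encoding)


# Build a small sample: BOM followed by "hi" encoded as UTF-16 LE.
sample = codecs.BOM_UTF16_LE + "hi".encode("utf-16-le")
print(decode_with_bom(sample))
print(decode_with_bom(b"plain"))
```

Using `codecs.BOM_UTF16_LE` instead of a hand-written constant keeps the intent of the comparison visible at the call site.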
You cannot escape or encode an invalid string.
You should understand that you are working with strings and not byte streams, and there are some characters you cannot accept in them, the first being 0x00,
and also your example, which happens to be a BOM sequence.
So if you need to include characters that are not valid in a string (Unicode or ASCII), you will have to stop using strings for this.
Take a look at PEP 358 (the bytes object).
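A rough sketch of the distinction, assuming Python 3, where the bytes type described in PEP 358 is built in: a bytes object can hold any byte value, while turning it into text only works if the bytes are valid in the chosen encoding.

```python
# A bytes object accepts any values from 0 to 255, including 0x00 and the BOM bytes.
raw = bytes([0x00, 0xFF, 0xFE])
as_list = list(raw)

# Decoding to text is a separate step and can fail: 0xFF is not valid ASCII.
try:
    raw.decode("ascii")
    decoded_ok = True
except UnicodeDecodeError:
    decoded_ok = False

print(as_list, decoded_ok)
```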