开发者

str.strip() strange behavior

开发者 https://www.devze.com 2023-01-02 08:51 出处:网络
>>> t1 = "abcd.org.开发者_Go百科gz" >>> t1 \'abcd.org.gz\' >>> t1.strip("g")
>>> t1 = "abcd.org.开发者_Go百科gz"
>>> t1
'abcd.org.gz'
>>> t1.strip("g")
'abcd.org.gz'
>>> t1.strip("gz")
'abcd.org.'
>>> t1.strip(".gz")
'abcd.or'

Why is the 'g' of '.org' gone?


strip(".gz") removes any of the characters ., g and z from the beginning and end of the string.


x.strip(y) will remove all characters that appear in y from the beginning and end of x.

That means

'foo42'.strip('1234567890') == 'foo'

becuase '4' and '2' both appear in '1234567890'.


Use os.path.splitext if you want to remove the file extension.

>>> import os.path
>>> t1 = "abcd.org.gz"
>>> os.path.splitext(t1)
('abcd.org', '.gz')


In Python 3.9, there are two new string methods .removeprefix() and .removesuffix() to remove the beginning or end of a string, respectively. Thankfully this time, the method names make it aptly clear what these methods are supposed to perform.

>>> print (sys.version)
3.9.0 
>>> t1 = "abcd.org.gz"
>>> t1.removesuffix('gz')
'abcd.org.'
>>> t1
'abcd.org.gz'
>>> t1.removesuffix('gz').removesuffix('.gz')
'abcd.org.'     # No unexpected effect from last removesuffix call
 


The argument given to strip is a set of characters to be removed, not a substring. From the docs:

The chars argument is a string specifying the set of characters to be removed.


as far as I know strip removes from the beginning or end of a string only. If you want to remove from the whole string use replace.

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号