I hit an interesting python bug today in which instantiating a class repeatedly appears to be holding state. In later instantiation calls the variable is already defined.
I boiled down the issue into the following class/shell interaction. I 开发者_StackOverflowrealize that this is not the best way to initialize a class variable, but it sure should not be behaving like this. Is this a true bug or is this a "feature"? :D
tester.py:
class Tester(): def __init__(self): self.mydict = self.test() def test(self,out={}): key = "key" for i in ['a','b','c','d']: if key in out: out[key] += ','+i else: out[key] = i return out
Python prompt:
Python 2.6.6 (r266:84292, Oct 6 2010, 00:44:09) [GCC 4.2.1 (Apple Inc. build 5664)] on darwin >>> import tester >>> t = tester.Tester() >>> print t.mydict {'key': 'a,b,c,d'} >>> t2 = tester.Tester() >>> print t2.mydict {'key': 'a,b,c,d,a,b,c,d'}
It is a feature that pretty much all Python users run into once or twice. The main usage is for caches and the like to avoid repetitive lengthy calculations (simple memoizing, really), although I am sure people have found other uses for it.
The reason for this is that the def
statement only gets executed once, which is when the function is defined. Thus the initializer value only gets created once. For a reference type (as opposed to an immutable type which cannot change) like a list or a dictionary, this ends up as a visible and surprising pitfall, whereas for value types, it goes unnoticed.
Usually, people work around it like this:
def test(a=None):
if a is None:
a = {}
# ... etc.
In general, default method arguments shouldn't be mutable. Instead do:
def test(self, out=None):
out = out or {}
# other code goes here.
See these links for more details on why this is necessary and why it's a "feature" of the python language rather than a bug.
- "Least Astonishment" and the Mutable Default Argument
- http://effbot.org/zone/default-values.htm
You are modifying the value of the function keyword parameter out
in your method.
This blog post explains it succintly:
expressions in default arguments are calculated when the function is defined, not when it’s called.
The function is defined when the class is created, not for each instance. If you modify it like so, the problem goes away:
def test(self,out=None):
if out is None:
out = {}
key = "key"
for i in ['a','b','c','d']:
if key in out:
out[key] += ','+i
else:
out[key] = i
return out
精彩评论