Credit: Alex Martelli
Working with a dictionary D
, you need to use the entry D[k]
if it’s already present, or add a new D[k]
if k
isn’t yet a key in D
.
This is what the setdefault
method of dictionary objects is for. Say we’re building a word-to-page numbers index. A key piece of code might be:
theIndex = {} def addword(word, pagenumber): if theIndex.has_key(word): theIndex[word].append(pagenumber) else: theIndex[word] = [pagenumber]
Good Pythonic instincts suggest substituting this “look before you leap” pattern with an “easier to get permission” pattern (see Recipe 5.4 for a detailed discussion of these phrases):
def addword(word, pagenumber): try: theIndex[word].append(pagenumber) except KeyError: theIndex[word] = [pagenumber]
This is just a minor simplification, but it satisfies the pattern of “use the entry if it is already present; otherwise, add a new entry.” Here’s how using setdefault
simplifies this further:
def addword(word, pagenumber): theIndex.setdefault(word, []).append(pagenumber)
The setdefault
method of a dictionary is a handy shortcut for this task that is especially useful when the new entry you want to add is mutable. Basically, dict.setdefault(k, v)
is much like dict.get(k, v)
, except that if k
is not a key in the dictionary, the setdefault
method assigns dict[k]=v
as a side effect, in addition to returning v
. (get
would just return v
, without affecting dict
in any way.) Therefore, setdefault
is appropriate any time you have get
-like needs but also want to produce this specific side effect on the dictionary.
setdefault
is particularly useful in a dictionary with values that are lists, as detailed in Recipe 1.6. The single most typical usage form for setdefault
is:
somedict.setdefault(somekey, []).append(somevalue)
Note that setdefault
is normally not very useful if the values are immutable. If you just want to count words, for example, something like the following is no use:
theIndex.setdefault(word, 1)
In this case, you want:
theIndex[word] = 1 + theIndex.get(word, 0)
since you will be rebinding the dictionary entry at theIndex[word]
anyway (because numbers are immutable).