2

I'm dealing with JSON data which I load into python dictionaries. A lot of these have optional fields, which then may contain dictionaries, that kind of stuff.

dictionary1 = 
{"required": {"value1": "one", "value2": "two"},
"optional": {"value1": "one"}}

dictionary2 = 
{"required": {"value1": "one", "value2": "two"}}

If I do this,

dictionary1.get("required").get("value1")

this works, obviously, because the field "required" is always present.

However, when I use the same line on dictionary2 (to get the optional field), this will produce an AttributeError

dictionary2.get("optional").get("value1")
AttributeError: 'NoneType' object has no attribute 'get'

Which makes sense, because the first .get() will return None, and the second .get() cannot call get() on the None object.

I can solve this by giving default values in case the optional field is missing, but this will be annoying the more complex the data gets, so I'm calling this a "naive fix":

dictionary2.get("optional", {}).get("value1", " ")

so the first .get() will return an empty dictionary {}, on which the second .get() can be called, and since it obviously contains nothing, it will return the empty string, as defined per the second default. This will no longer produce errors, but I was wondering if there is a better solution for this - especially for more complex cases (value1 containing an array or another dictionary, etc....)

I could also fix this with try - except AttributeError, but this is not my preferred way either.

try:
    value1 = dictionary2.get("optional").get("value1")
except AttributeError:
    value1 = " "

I also dont like checking if optional field exists, this produces garbage code lines like

optional = dictionary2.get("optional")
if optional:
    value1 = optional.get("value1")
else:
    value1 = " "

which seems very non-pythonic...

I was thinking maybe my approach of just chaining .get()s is wrong in the first place?


Edit: Thanks to Ben Grossmann's answer, I figured out this one-liner which covers my use case.

value1 = dictionary2["optional"]["value1"] if "optional" in dictionary2 else " "

The member check as a ternary operator will bypass the evaluation of the statement which causes the error and uses the default value " " without having to provide default values for the individual checks

c8999c 3f964f64
  • 1,204
  • 1
  • 6
  • 20
  • Why don't you like the try-except clause? It's the most pythonic way of approaching this and easiest to read. – Darina Feb 04 '22 at 15:31
  • just creates lines of code... I was looking for a one-liner I guess. (the code would be littered with try-except blocks) – c8999c 3f964f64 Feb 04 '22 at 15:34

4 Answers4

3

In your code here:

try:
    value1 = dictionary2.get("optional").get("value1")
except AttributeError:
    value1 = " "

You can use brackets and except KeyError:

try:
    value1 = dictionary2["optional"]["value1"]
except KeyError:
    value1 = " "

If this is too verbose for the caller, add a helper:

def get_or_default(d, *keys, default=None):
    try:
        for k in keys:
            d = d[k]
    except (KeyError, IndexError):
        return default
    return d

if __name__ == "__main__":
    d = {"a": {"b": {"c": [41, 42]}}}
    print(get_or_default(d, "a", "b", "c", 1)) # => 42
    print(get_or_default(d, "a", "b", "d", default=43)) # => 43

You could also subclass dict and use tuple bracket indexing, like NumPy and Pandas:

class DeepDict(dict):
    def __init__(self, d, default=None):
        self.d = d
        self.default = default

    def __getitem__(self, keys):
        d = self.d
        try:
            for k in keys:
                d = d[k]
        except (KeyError, IndexError):
            return self.default
        return d

    def __setitem__(self, keys, x):
        d = self.d
        for k in keys[:-1]:
            d = d[k]
        d[keys[-1]] = x

if __name__ == "__main__":
    dd = DeepDict({"a": {"b": {"c": [42, 43]}}}, default="foo")
    print(dd["a", "b", "c", 1]) # => 43
    print(dd["a", "b", "c", 11]) # => "foo"
    dd["a", "b", "c", 1] = "banana"
    print(dd["a", "b", "c", 1]) # => "banana"

But there might be an engineering cost to this if it's confusing for other developers, and you'd want to flesh out the other expected methods as described in How to "perfectly" override a dict? (consider this a proof-of-concept sketch). It's best not to be too clever.

ggorlen
  • 33,459
  • 6
  • 59
  • 67
1

First of all, you refer to " " as the empty string. This is incorrect; "" is the empty string.

Second, if you're checking for membership, I don't see a reason to use the get method in the first place. I'd opt for something like the following.

if "optional" in dictionary2:
    value1 = dictionary2["optional"].get("value1")
else:
    value1 = ""

Another alternative to consider (since you're using the get method a lot) is to switch to the defaultdict class. For example,

from collections import defaultdict

dictionary2 = {"required": {"value1": "one", "value2": "two"}}
ddic2 = defaultdict(dict,dictionary2)
value1 = ddic2["optional"].get("value1")
Ben Grossmann
  • 1,931
  • 1
  • 9
  • 11
  • correct on the empty string, my bad. checking if optional is in the dictionary first will create a lot of extra lines of code. This becomes somewhat unmanageable the more checks there are... – c8999c 3f964f64 Feb 04 '22 at 15:37
  • @c See my edit: maybe it would be helpful to start using defaultdicts. – Ben Grossmann Feb 04 '22 at 15:50
  • defaultdict seems great, can you explain why it doesnt work with .get("optional"), but only when I use ["optional"] ? – c8999c 3f964f64 Feb 04 '22 at 16:00
  • @c8999c3f964f64 Yeah, defaultdict doesn't have a "get" method; the alternative that you get when the key is not present is automatically part of the `ddic2["optional"]` call – Ben Grossmann Feb 04 '22 at 16:27
  • I'm actually going with this answer because the check for membership can be used as a ternary operator which bypasses the evaluation of the statement entirely, and can be turned into a slick one-liner, just like I was looking for `value1 = dictionary2["optional"]["value1"] if "optional" in dictionary2 else " "` – c8999c 3f964f64 Feb 04 '22 at 16:58
  • @c I didn’t realize you could use if/else that way, neat! Glad my answer was helpful – Ben Grossmann Feb 04 '22 at 19:01
1

The pythonic way of going about it would be using the try/except block -

dictionary2 = {"required": {"value1": "one", "value2": "two"}}
try:
    value1 = dictionary2["optional"]["value1"]
except (KeyError, AttributeError) as e:
    value1 = ""

KeyError to capture the missing keys, and AttributeError to capture cases where you have a list/str instead of dict object.


If you don't like tons of try/except in your code, you can consider using a helper function -

def get_val(data, keys):
    try:
        for k in keys:
            data = data[k]
        return data
    except (KeyError, AttributeError) as e:
        return ""

dictionary2 = {"required": {"value1": "one", "value2": "two"}}
print(get_val(dictionary2, ("required", "value2")))
print(get_val(dictionary2, ("optional", "value1")))

outputs -

two

Max
  • 596
  • 2
  • 10
  • I will consider the idea of a helper function, but is there really no built-in way for this, something like dictionary2.get("optional").get("value1") or " ", which ignores the chained .get() and returns the "or" value in case the optional field is missing? Thank you for the answer! – c8999c 3f964f64 Feb 04 '22 at 15:51
  • In that case the best pythonic way would be to keep doing as one of your suggestions `dictionary2.get("optional", {}).get("value1", "")` – Max Feb 04 '22 at 15:53
1

You could use toolz.dicttoolz.get_in() for this:

from toolz.dicttoolz import get_in

dictionary1 = {"required": {"value1": "one", "value2": "two"}, "optional": {"value1": "one"}}
dictionary2 = {"required": {"value1": "one", "value2": "two"}}

get_in(("optional", "value1"), dictionary1)
# 'one'

get_in(("optional", "value1"), dictionary2)
# None

derek
  • 171
  • 4