Right way to initialize an OrderedDict using its constructor such that it retains order of initial data?

92,197

Solution 1

The OrderedDict will preserve any order that it has access to. The only way to pass ordered data to the constructor is as a list (or, more generally, an iterable) of key-value pairs, as in your last two examples. As the documentation you linked to says, the OrderedDict has no access to any order when you pass in keyword arguments or a dict argument, since that order is lost before the OrderedDict constructor sees it. (That holds for the Python versions this question concerns; since Python 3.7 a plain dict keeps insertion order, and since Python 3.6 keyword arguments keep theirs, so those forms now arrive in order as well.)
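As a minimal sketch of the point above, using only the standard library: any iterable of key-value pairs keeps its order when handed to the constructor. The sample pairs and variable names are illustrative, not from the question.

```python
from collections import OrderedDict

pairs = [('b', 2), ('a', 1), ('c', 3)]

# A list of tuples, a tuple of lists, and a generator all yield the pairs
# in the same sequence, so each OrderedDict ends up with the same key order.
from_list = OrderedDict(pairs)
from_tuple_of_lists = OrderedDict(tuple([k, v] for k, v in pairs))
from_generator = OrderedDict((k, v) for k, v in pairs)

print(list(from_list))            # ['b', 'a', 'c']
print(list(from_tuple_of_lists))  # ['b', 'a', 'c']
print(list(from_generator))       # ['b', 'a', 'c']
```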

Note that using a list comprehension in your last example doesn't change anything. There's no difference between OrderedDict([(i,i) for i in l]) and OrderedDict([('b', 'b'), ('a', 'a'), ('c', 'c'), ('aa', 'aa')]). The list comprehension is evaluated and creates the list and it is passed in; OrderedDict knows nothing about how it was created.
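A quick way to convince yourself of this, as a small sketch: build both forms and compare them. Equality between two OrderedDict instances is order-sensitive, so the comparison checks key order as well as contents. The names `built` and `literal` are just for illustration.

```python
from collections import OrderedDict

l = ['b', 'a', 'c', 'aa']

built = OrderedDict([(i, i) for i in l])
literal = OrderedDict([('b', 'b'), ('a', 'a'), ('c', 'c'), ('aa', 'aa')])

# OrderedDict-to-OrderedDict equality compares order too.
print(built == literal)   # True
print(list(built) == l)   # True
```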

Solution 2

# An OD is represented by a list of tuples, so would this work?
d = OrderedDict([('b', 2), ('a', 1)])

Yes, that will work. By definition, a list is always ordered the way it is represented. This goes for list comprehensions too: the generated list follows the order in which the source data was provided (i.e. if sourced from a list it will be deterministic; if sourced from a set or dict, not so much).
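To illustrate the difference between sources, here is a small sketch (the variable names are assumptions for the example). The list-sourced comprehension is guaranteed to match the list; the set-sourced one contains the same keys, but nothing can be assumed about their order.

```python
from collections import OrderedDict

keys = ['b', 'a', 'c', 'aa']

# Sourced from a list: the comprehension yields pairs in list order.
from_list = OrderedDict([(k, k) for k in keys])

# Sourced from a set: the comprehension iterates in whatever order the
# set happens to yield, which is not defined by the language.
from_set = OrderedDict([(k, k) for k in set(keys)])

print(list(from_list) == keys)           # True
print(sorted(from_set) == sorted(keys))  # True: same keys, order unspecified
```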

How does one go about verifying if OrderedDict actually maintains an order? Since a dict has an unpredictable order, what if my test vectors luckily have the same initial order as the unpredictable order of a dict? For example, if instead of d = OrderedDict({'b':2, 'a':1}) I write d = OrderedDict({'a':1, 'b':2}), I can wrongly conclude that the order is preserved. In this case, I found out that a dict is ordered alphabetically, but that may not be always true. What's a reliable way to use a counterexample to verify whether a data structure preserves order or not, short of trying test vectors repeatedly until one breaks?

You keep your source list of 2-tuples around for reference, and use it as the test data in your unit tests. Iterate through the resulting OrderedDict and ensure the order matches the source list.
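One possible shape for such a test, as a sketch rather than a definitive method: compare the OrderedDict's items against the source list, and do it for both an ordering and its reverse. A container that ignored insertion order and used some fixed internal order for a given set of keys could match at most one of the two, so the pair acts as a counterexample without relying on luck. The helper name `check_order_preserved`, the key count, and the seed are all assumptions made for the example.

```python
from collections import OrderedDict
import random


def check_order_preserved(pairs):
    """Return True if OrderedDict iterates in the order of the source pairs."""
    return list(OrderedDict(pairs).items()) == list(pairs)


# Many distinct keys, so matching a shuffled order by accident is implausible.
source = [(str(n), n) for n in range(1000)]

rng = random.Random(0)  # fixed seed keeps the test reproducible
for _ in range(5):
    rng.shuffle(source)
    # Check the ordering and its reverse: both must be preserved.
    assert check_order_preserved(source)
    assert check_order_preserved(source[::-1])
```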

Author: click

Updated on July 08, 2022

Comments

  • click
    click almost 2 years

    What's the correct way to initialize an ordered dictionary (OD) so that it retains the order of initial data?

    from collections import OrderedDict
    
    # Obviously wrong because regular dict loses order
    d = OrderedDict({'b':2, 'a':1}) 
    
    # An OD is represented by a list of tuples, so would this work?
    d = OrderedDict([('b',2), ('a', 1)])
    
    # What about using a list comprehension, will 'd' preserve the order of 'l'
    l = ['b', 'a', 'c', 'aa']
    d = OrderedDict([(i,i) for i in l])
    

    Question:

    • Will an OrderedDict preserve the order of a list of tuples, or tuple of tuples or tuple of lists or list of lists etc. passed at the time of initialization (2nd & 3rd example above)?

    • How does one go about verifying if OrderedDict actually maintains an order? Since a dict has an unpredictable order, what if my test vectors luckily have the same initial order as the unpredictable order of a dict? For example, if instead of d = OrderedDict({'b':2, 'a':1}) I write d = OrderedDict({'a':1, 'b':2}), I can wrongly conclude that the order is preserved. In this case, I found out that a dict is ordered alphabetically, but that may not be always true. What's a reliable way to use a counterexample to verify whether a data structure preserves order or not, short of trying test vectors repeatedly until one breaks?

    P.S. I'll just leave this here for reference: "The OrderedDict constructor and update() method both accept keyword arguments, but their order is lost because Python’s function call semantics pass-in keyword arguments using a regular unordered dictionary"

    P.P.S. Hopefully, in the future, OrderedDict will preserve the order of kwargs also (example 1): http://bugs.python.org/issue16991

  • click
    click over 9 years
    About verifying the order: How do I make sure that my 2-tuple WILL break the order of dict if it's unpredictable? This is a generic question about any data structure, perhaps I should split it from this question.
  • metatoaster
    metatoaster over 9 years
    You can't deterministically break something that is non-deterministic in nature.
  • click
    click over 9 years
    So what's the right approach to test such things? You just keep trying indefinitely? The order is unpredictable for programmers, but since it's a hash map, it follows 'some' algorithm & a right test should try to counter that?
  • metatoaster
    metatoaster over 9 years
    See __hash__. Specifically about the str type.
  • Bobort
    Bobort about 7 years
    "By definition, a list is always ordered the way it is represented." This was a key statement for me. I decided to simply use a list of 2-tuples for my basic OrderedDict so that I don't have the overhead of converting a list to an OrderedDict. I just loop through the elements like a list instead of a dictionary.