Why does Python code use len() function instead of a length method?
Solution 1
Strings do have a length method: __len__()
The protocol in Python is to implement this method on objects which have a length and use the built-in len()
function, which calls it for you, similar to the way you would implement __iter__()
and use the built-in iter()
function (or have the method called behind the scenes for you) on objects which are iterable.
See Emulating container types for more information.
Here's a good read on the subject of protocols in Python: Python and the Principle of Least Astonishment
Solution 2
Jim's answer to this question may help; I copy it here. Quoting Guido van Rossum:
First of all, I chose len(x) over x.len() for HCI reasons (def __len__() came much later). There are two intertwined reasons actually, both HCI:
(a) For some operations, prefix notation just reads better than postfix — prefix (and infix!) operations have a long tradition in mathematics which likes notations where the visuals help the mathematician thinking about a problem. Compare the easy with which we rewrite a formula like x*(a+b) into x*a + x*b to the clumsiness of doing the same thing using a raw OO notation.
(b) When I read code that says len(x) I know that it is asking for the length of something. This tells me two things: the result is an integer, and the argument is some kind of container. To the contrary, when I read x.len(), I have to already know that x is some kind of container implementing an interface or inheriting from a class that has a standard len(). Witness the confusion we occasionally have when a class that is not implementing a mapping has a get() or keys() method, or something that isn’t a file has a write() method.
Saying the same thing in another way, I see ‘len‘ as a built-in operation. I’d hate to lose that. /…/
Solution 3
There is a len
method:
>>> a = 'a string of some length'
>>> a.__len__()
23
>>> a.__len__
<method-wrapper '__len__' of str object at 0x02005650>
Solution 4
Python is a pragmatic programming language, and the reasons for len()
being a function and not a method of str
, list
, dict
etc. are pragmatic.
The len()
built-in function deals directly with built-in types: the CPython implementation of len()
actually returns the value of the ob_size
field in the PyVarObject
C struct that represents any variable-sized built-in object in memory. This is much faster than calling a method -- no attribute lookup needs to happen. Getting the number of items in a collection is a common operation and must work efficiently for such basic and diverse types as str
, list
, array.array
etc.
However, to promote consistency, when applying len(o)
to a user-defined type, Python calls o.__len__()
as a fallback. __len__
, __abs__
and all the other special methods documented in the Python Data Model make it easy to create objects that behave like the built-ins, enabling the expressive and highly consistent APIs we call "Pythonic".
By implementing special methods your objects can support iteration, overload infix operators, manage contexts in with
blocks etc. You can think of the Data Model as a way of using the Python language itself as a framework where the objects you create can be integrated seamlessly.
A second reason, supported by quotes from Guido van Rossum like this one, is that it is easier to read and write len(s)
than s.len()
.
The notation len(s)
is consistent with unary operators with prefix notation, like abs(n)
. len()
is used way more often than abs()
, and it deserves to be as easy to write.
There may also be a historical reason: in the ABC language which preceded Python (and was very influential in its design), there was a unary operator written as #s
which meant len(s)
.
Solution 5
met% python -c 'import this' | grep 'only one'
There should be one-- and preferably only one --obvious way to do it.
Related videos on Youtube
Comments
-
fuentesjr almost 3 years
I know that python has a
len()
function that is used to determine the size of a string, but I was wondering why it's not a method of the string object.Update
Ok, I realized I was embarrassingly mistaken.
__len__()
is actually a method of a string object. It just seems weird to see object oriented code in Python using the len function on string objects. Furthermore, it's also weird to see__len__
as the name instead of just len.-
wjandrea almost 4 yearsRelated to your update: Is there any case where
len(someObj)
does not callsomeObj
's__len__
function? tl;dr Yes, so always uselen()
instead of__len__()
.
-
-
alternative over 12 yearsIt astonishes me how moronic the reason for using
len
is. They think that it is easier to force people to implement.__len__
than to force people to implement.len()
. Its the same thing, and one looks much cleaner. If the language is going to have an OOP__len__
, what in the world is the point of makinglen(..)
-
Peter Cooper about 12 yearsThe obvious thing when working with an object being, of course, a method.
-
bug over 11 years@Peter: I'd pay $20 to anyone with photo evidence that they taped your comment to Guido's back. $50 if it's on his forehead.
-
Alex Bitek over 11 yearsYeah, the Python designers adhere to dogma but they themselves don't respect their own dogma.
-
Evicatos over 10 yearslen, str, etc. can be used with higher-order functions like map, reduce, and filter without the need to define a function or lambda just to call a method. Not everything revolves around OOP, even in Python.
-
Piotr Dobrogost over 9 years
len
function was already mentioned in previous answers. What's the point of this "answer", then? -
abarnert over 9 yearsAlso, by using a protocol, they can provide alternative ways of implementing things. For example, you can create an iterable with
__iter__
, or with only__getitem__
, anditer(x)
will work either way. You can create a usable-in-bool-context object with__bool__
or__len__
, andbool(x)
will work either way. And so on. I think Armin explains this reasonably well in the linked post—but even if he didn't, calling Python moronic because of an outside explanation by a guy who's often publicly at odds with the core devs wouldn't exactly be fair… -
abarnert over 9 years@Evicatos: +1 for "Not everything revolves around OOP", but I'd end with "especially in Python", not even. Python (unlike Java, Ruby, or Smalltalk) doesn't try to be a "pure OOP" language; it's explicitly designed to be a "multi-paradigm" language. List comprehensions aren't methods on iterables.
zip
is a top-level function, not azip_with
method. And so on. Just because everything is an object doesn't mean being an object is the most important thing about each thing. -
MirroredFate over 8 yearsThat article is so poorly written I feel confused and angry after trying to read it. "In Python 2.x the Tuple type for instance does not expose any non-special methods and yet you can use it to make a string out of it:" The whole thing is an amalgamation of near-indecipherable sentences.
-
Ed Randall almost 8 years$ python -c 'import this' | grep obvious
-
wjandrea almost 4 yearsYes, but it's not meant to be used directly.
len
validates the output of__len__
. See Is there any case wherelen(someObj)
does not callsomeObj
's__len__
function? Dunder methods in general are not meant to be called directly. -
wjandrea almost 4 yearsIt also checks that the int is not greater than
sys.maxsize
, and thatobj.__len__
exists in the first place. I wrote an answer on another question that lists all the different checks. -
wjandrea almost 4 yearsSure, but it's not meant to be called directly. See Is there any case where
len(someObj)
does not callsomeObj
's__len__
function? Dunder methods in general are not meant to be called directly. -
dionyziz over 3 yearsI do not buy the optimality argument. While a method call could indeed be slower, a sufficiently advanced interpreter or compiler –and Python interpreters do much more advanced things than this– could recognize that one is calling the
len
method of a built-in type and revert to returningob_size
or whatever it needs, at least the vast majority of the time. -
Luciano Ramalho over 3 years@dionyziz: Python's algorithm to look up attributes is very flexible. You have instance attributes, class attributes, data descriptors, not-data descriptors, and multiple-inheritance. All that may play a role in a call like
s.len()
, but does not affectlen(s)
. -
GreenAsJade over 2 years200+ votes says that the lack of str.len() wastes a ton of people's time coming here and reading why it "should be" intuitive that a string does not have a length function :facepalm: By all means retain the built-in len() function, but why not also support sensible methods on basic things like strings...