What issues should be considered when overriding equals and hashCode in Java?
Solution 1
The theory (for the language lawyers and the mathematically inclined):
equals()
(javadoc) must define an equivalence relation (it must be reflexive, symmetric, and transitive). In addition, it must be consistent (if the objects are not modified, then it must keep returning the same value). Furthermore, o.equals(null)
must always return false.
hashCode()
(javadoc) must also be consistent (if the object is not modified in terms of equals()
, it must keep returning the same value).
The relation between the two methods is:
Whenever
a.equals(b)
, thena.hashCode()
must be same asb.hashCode()
.
In practice:
If you override one, then you should override the other.
Use the same set of fields that you use to compute equals()
to compute hashCode()
.
Use the excellent helper classes EqualsBuilder and HashCodeBuilder from the Apache Commons Lang library. An example:
public class Person {
private String name;
private int age;
// ...
@Override
public int hashCode() {
return new HashCodeBuilder(17, 31). // two randomly chosen prime numbers
// if deriving: appendSuper(super.hashCode()).
append(name).
append(age).
toHashCode();
}
@Override
public boolean equals(Object obj) {
if (!(obj instanceof Person))
return false;
if (obj == this)
return true;
Person rhs = (Person) obj;
return new EqualsBuilder().
// if deriving: appendSuper(super.equals(obj)).
append(name, rhs.name).
append(age, rhs.age).
isEquals();
}
}
Also remember:
When using a hash-based Collection or Map such as HashSet, LinkedHashSet, HashMap, Hashtable, or WeakHashMap, make sure that the hashCode() of the key objects that you put into the collection never changes while the object is in the collection. The bulletproof way to ensure this is to make your keys immutable, which has also other benefits.
Solution 2
There are some issues worth noticing if you're dealing with classes that are persisted using an Object-Relationship Mapper (ORM) like Hibernate, if you didn't think this was unreasonably complicated already!
Lazy loaded objects are subclasses
If your objects are persisted using an ORM, in many cases you will be dealing with dynamic proxies to avoid loading object too early from the data store. These proxies are implemented as subclasses of your own class. This means thatthis.getClass() == o.getClass()
will return false
. For example:
Person saved = new Person("John Doe");
Long key = dao.save(saved);
dao.flush();
Person retrieved = dao.retrieve(key);
saved.getClass().equals(retrieved.getClass()); // Will return false if Person is loaded lazy
If you're dealing with an ORM, using o instanceof Person
is the only thing that will behave correctly.
Lazy loaded objects have null-fields
ORMs usually use the getters to force loading of lazy loaded objects. This means that person.name
will be null
if person
is lazy loaded, even if person.getName()
forces loading and returns "John Doe". In my experience, this crops up more often in hashCode()
and equals()
.
If you're dealing with an ORM, make sure to always use getters, and never field references in hashCode()
and equals()
.
Saving an object will change its state
Persistent objects often use a id
field to hold the key of the object. This field will be automatically updated when an object is first saved. Don't use an id field in hashCode()
. But you can use it in equals()
.
A pattern I often use is
if (this.getId() == null) {
return this == other;
}
else {
return this.getId().equals(other.getId());
}
But: you cannot include getId()
in hashCode()
. If you do, when an object is persisted, its hashCode
changes. If the object is in a HashSet
, you'll "never" find it again.
In my Person
example, I probably would use getName()
for hashCode
and getId()
plus getName()
(just for paranoia) for equals()
. It's okay if there are some risk of "collisions" for hashCode()
, but never okay for equals()
.
hashCode()
should use the non-changing subset of properties from equals()
Solution 3
A clarification about the obj.getClass() != getClass()
.
This statement is the result of equals()
being inheritance unfriendly. The JLS (Java language specification) specifies that if A.equals(B) == true
then B.equals(A)
must also return true
. If you omit that statement inheriting classes that override equals()
(and change its behavior) will break this specification.
Consider the following example of what happens when the statement is omitted:
class A {
int field1;
A(int field1) {
this.field1 = field1;
}
public boolean equals(Object other) {
return (other != null && other instanceof A && ((A) other).field1 == field1);
}
}
class B extends A {
int field2;
B(int field1, int field2) {
super(field1);
this.field2 = field2;
}
public boolean equals(Object other) {
return (other != null && other instanceof B && ((B)other).field2 == field2 && super.equals(other));
}
}
Doing new A(1).equals(new A(1))
Also, new B(1,1).equals(new B(1,1))
result give out true, as it should.
This looks all very good, but look what happens if we try to use both classes:
A a = new A(1);
B b = new B(1,1);
a.equals(b) == true;
b.equals(a) == false;
Obviously, this is wrong.
If you want to ensure the symmetric condition. a=b if b=a and the Liskov substitution principle call super.equals(other)
not only in the case of B
instance, but check after for A
instance:
if (other instanceof B )
return (other != null && ((B)other).field2 == field2 && super.equals(other));
if (other instanceof A) return super.equals(other);
else return false;
Which will output:
a.equals(b) == true;
b.equals(a) == true;
Where, if a
is not a reference of B
, then it might be a be a reference of class A
(because you extend it), in this case you call super.equals()
too.
Solution 4
For an inheritance-friendly implementation, check out Tal Cohen's solution, How Do I Correctly Implement the equals() Method?
Summary:
In his book Effective Java Programming Language Guide (Addison-Wesley, 2001), Joshua Bloch claims that "There is simply no way to extend an instantiable class and add an aspect while preserving the equals contract." Tal disagrees.
His solution is to implement equals() by calling another nonsymmetric blindlyEquals() both ways. blindlyEquals() is overridden by subclasses, equals() is inherited, and never overridden.
Example:
class Point {
private int x;
private int y;
protected boolean blindlyEquals(Object o) {
if (!(o instanceof Point))
return false;
Point p = (Point)o;
return (p.x == this.x && p.y == this.y);
}
public boolean equals(Object o) {
return (this.blindlyEquals(o) && o.blindlyEquals(this));
}
}
class ColorPoint extends Point {
private Color c;
protected boolean blindlyEquals(Object o) {
if (!(o instanceof ColorPoint))
return false;
ColorPoint cp = (ColorPoint)o;
return (super.blindlyEquals(cp) &&
cp.color == this.color);
}
}
Note that equals() must work across inheritance hierarchies if the Liskov Substitution Principle is to be satisfied.
Solution 5
Still amazed that none recommended the guava library for this.
//Sample taken from a current working project of mine just to illustrate the idea
@Override
public int hashCode(){
return Objects.hashCode(this.getDate(), this.datePattern);
}
@Override
public boolean equals(Object obj){
if ( ! obj instanceof DateAndPattern ) {
return false;
}
return Objects.equal(((DateAndPattern)obj).getDate(), this.getDate())
&& Objects.equal(((DateAndPattern)obj).getDate(), this.getDatePattern());
}
Comments
-
Matt Sheppard over 4 years
What issues / pitfalls must be considered when overriding
equals
andhashCode
? -
devlearn almost 15 yearsAdditional point about appendSuper(): you should use it in hashCode() and equals() if and only if you want to inherit the equality behavior of the superclass. For instance, if you derive straight from Object, there's no point because all Objects are distinct by default.
-
kkessell about 14 yearsYou can make equals symmetric this way (if comparing a superclass object with subclass object, always use the equals of the subclass) if (obj.getClass() != this.getClass() && obj.getClass().isInstance(this)) return obj.equals(this);
-
Ran Biron over 13 years@pihentagy - then I'd get a stackoverflow when the implementation class doesn't override the equals method. not fun.
-
aviad about 13 yearsSpecial care must be taken of the transient fields.
-
K'' over 12 yearsA great discussion about this question in Effective Java book
-
Raedwald over 12 yearsI think the underlying theory here is to distinguish between the attributes, aggregates and associatinos of an object. The asssociations should not participate in
equals()
. If a mad scientist created a duplicate of me we would be equivalent. But we would not have the same father. -
Rok Strniša over 12 yearsYou can get Eclipse to generate the two methods for you: Source > Generate hashCode() and equals().
-
Blaisorblade over 12 yearsHave a look at the canEqual method explained here - the same principle makes both solutions work, but with canEqual you don't compare the same fields twice (above, p.x == this.x will be tested in both directions): artima.com/lejava/articles/equality.html
-
zod almost 12 yearsdoes it matter if we all randomly chose the same prime numbers?
-
seinecle over 11 yearsSame is true with Netbeans: developmentality.wordpress.com/2010/08/24/…
-
QED over 11 years@Darthenius: yours is the most useful!
-
jimmybondy over 11 years@Johannes Brodwall: i don't understand
Saving an object will change it's state
!hashCode
must returnint
, so how will you usegetName()
? Can you give an example for yourhashCode
-
bestsss about 11 yearsmake sure that the hashCode() of the key objects that you put into the collection never changes while the object is in the collection This is exactly why overriding equals and hashcode together may not be necessary. Having hashCode pretty much precludes any changes to the object state while equals can tolerate fully mutable objects. You can safely put a mutable object into list as long as the equals contract remains intact but it's untrue for hashset/hashtables. Also if you wish a class to be able to server as key in hash structure it has to be designed as such (not just having hashCode)
-
mateusz.fiolka about 11 years@jimmybondy: getName will return a String object which also has a hashCode which can be used
-
Paul Cantrell about 11 years+1 for this. Neither getClass() nor instanceof is a panacea, and this is a good explanation of how to approach both. Don't think there's any reason not to do this.getClass() == that.getClass() instead of using equals().
-
herman almost 11 yearsjava.util.Objects.hash() and java.util.Objects.equals() are part of Java 7 (released in 2011) so you don't need Guava for this.
-
herman almost 11 yearsof course, but you should avoid that since Oracle is not providing public updates anymore for Java 6 (this has been the case since February 2013).
-
Jacob Raihle almost 11 yearsYou won't get a stackoverflow. If the equals method is not overridden, you will call the same code again, but the condition for recursion will always be false!
-
supercat almost 11 years@bestsss: A general rule which is good to follow is that at any given time when a mutable aspect of an object's state might change, the object should have one well-defined owner. If an object overrides
equals
so as to depend upon some mutable state, any Dictionary in which it was stored would "own" that aspect of its state. Since any execution context which tried to alter that state would have to own it, that would imply that the object must be removed from the Dictionary first. Otherwise, if one wants to bend this rule, overridehashCode
to depend only upon immutable properties. -
mike over 10 years@Darthenius or the shortcut:
ALT
Shift
S
and thenh
. -
AndroidGecko over 10 years@Darthenius Eclipse generated equals uses getClass() which might cause problems in some cases (see Effective Java item 8)
-
izaban over 10 yearsThe first null check is not necessary given the fact that
instanceof
returns false if its first operand is null (Effective Java again). -
supercat over 10 years@pihentagy: How does that behave if there are two different derived classes? If a
ThingWithOptionSetA
can be equal to aThing
provided that all the extra options have default values, and likewise for aThingWithOptionSetB
, then it should be possible for aThingWithOptionSetA
to be compare equal to aThingWithOptionSetB
only if all non-base properties of both objects match their defaults, but I don't see how you test for that. -
Rui Marques over 10 years@AndroidGecko maybe it did in the past. Eclipse Kepler did not generate code containing getClass().
-
Brian about 10 yearsAndroid Studio can also generate equals() and hashCode() for you.
-
Mike Housky about 10 yearsThe mathematical term is "equivalence relation", not "equality relation".
-
Steve Kuo about 10 yearsYour
this
inthis.getDate()
means nothing (other than clutter) -
yalkris almost 10 yearsequals is used to add/remove elements from collections like CopyOnWriteArraySet, HashSet if hashCode is equal for two different objects, etc. equals need to be symmetric i.e. if B.equals(C) returns true then C.equals(B) should return the same result. Otherwise your add/remove on those XXXSets behave in a confusing manner. stackoverflow.com/questions/24920563/…
-
Kevin over 9 yearsIn any case, I don't think this is a good idea. It makes the Equals contract unnecessarily confusing - someone who takes two Point parameters, a and b, has to be conscious of the possibility that a.getX() == b.getX() and a.getY() == b.getY() can be true, but a.equals(b) and b.equals(a) both be false (if only one is a ColorPoint).
-
Amit Parashar almost 9 yearsYou are re-calculating hashcode every time hashCode() is called. If the object is immutable, cache the calculated hashcode in the instance. Refer the hashCode impl in String class.
-
AbrahamDaniel almost 9 yearswhen using "instanceof" instead of "getClass()" symmetry would not always be maintained
-
nickgrim almost 9 yearsThe problem with this it that it breaks transitivity. If you add
B b2 = new B(1,99)
, thenb.equals(a) == true
anda.equals(b2) == true
butb.equals(b2) == false
. -
steinybot almost 9 yearsThere is one problem with this. Anonymous classes which do not add any aspects nor override the equals method will fail the getClass check even though they should be equal.
-
steinybot almost 9 yearsThis answer is only half of the story. It fails to mention any of the pitfalls when overriding the equals method in a subclass. It is actually misleading in that simply adding appendSuper in a derived class will solve the problem. It will almost certainly break either symmetry or transitivity.
-
steinybot almost 9 yearsUnfortunately no answer really solves the problem except perhaps stackoverflow.com/a/55736/367796. Even then I would be more inclined to go with @Blaisorblade comment to use the canEqual approach stackoverflow.com/questions/27581/…
-
brady almost 9 years@Steiny It's not clear to me that objects of different types should be equal; I'm thinking of different implementations of an interface as a common anonymous class. Can you give an example to support your premise?
-
steinybot almost 9 yearsMyClass a = new MyClass(123); MyClass b = new MyClass(123) { // Override some method }; // a.equals(b) is false when using this.getClass().equals(that.getClass())
-
brady almost 9 years@Steiny Right. As it should in most cases, especially if a method is overridden instead of added. Consider my example above. If it weren't
final
, and thecompareTo()
method were overridden to reverse the sort order, instances of the subclass and superclass should not be considered equal. When these objects were used together in a tree, keys that were "equal" according to aninstanceof
implementation might not be findable. -
Amos M. Carpenter over 8 yearsYour "not instanceof" expression needs an extra bracket:
if (!(otherObject instanceof DateAndPattern)) {
. Agree with hernan and Steve Kuo (though that one's a matter of personal preference), but +1 nonetheless. -
Manish Nagar over 8 years@AnttiSykäri you can also add some example like- when we want to insert user defined object into SET that time we need to implement hashCode and equals method so it will add only unique object into SET
-
Aleksandr Dubinsky about 8 yearsBasically, this is like
if (this.getClass() != o.getClass()) return false
, but flexible in that it only returns false if the derived class(es) bother to modify equals. Is that right? -
philo about 8 yearsIntelliJ has a wizard too: right-click the class name > Generate.. (or ⌘N) > equals() and hashCode().
-
Asif Mushtaq about 8 yearswhy
instanceof
insteadif((obj == null) || (obj.getClass() != this.getClass()))
? instanceof could create problem for parent and child. -
Naman over 7 yearssurprised by the fact that
enum
is not a part of the answer, shall we not be concerned over such fields? -
David Lavender over 6 years"Use the same set of fields that you use to compute equals() to compute hashCode()" I was taught that hashcode was the "quick look-up", just to narrow the scope of what then needs a full "equals" check. So, in most situations, I would just pick one field (e.g. "age" in this example) rather than always including all the fields. Is that bad practice?
-
Sam over 5 yearsSorry but I don't understand this statement about hashCode method: it is not legal if it uses more variables than equals(). But if I code with more variables, my code compiles. Why is it not legal?