Overview
Occasionally this question pops up in newsgroups and forums : Why does C#
insist on operator overloads being static? The person raising the question
also usually complains how this prevents him or her from implementing virtual
overloaded operators.
This article explains why operator overloads have to be static in C# (or any
other MSIL compiler), and also shows how you can simulate virtual overloaded
operators very easily. It's not a universe-shattering theory (nor a very
original one for that matter) and uses a very simple pattern though it's this
very simplicity that makes it interesting.
So, why do they have to be static?
In C#, GC'd objects are heap-allocated (the managed CLR heap, not the CRT
heap) and thus GC'd objects are always used as references (into the CLR heap).
You cannot have stack-based GC'd objects in C# and what this means that you
never know when an object variable is null
. So, you can
imagine what happens if operator overloads were instance methods and you tried
to use an operator on a null
object! Traditional C++ (as
opposed to the CLI version) never faced this problem because the operators were
always applied on stack objects; and if at all pointers were used, since
pointers followed their own set of operational behavior - you never face a
situation where an overloaded op-overload method is invoked on an invalid
object.
Traditional C++ example
See below some C++ code that uses virtual operator overloads :-
class Base
{
public:
Base(int x):x_value(x){}
virtual bool operator ==(const Base& b)
{
return x_value == b.x_value;
}
protected:
int x_value;
};
class Derived : public Base
{
public:
Derived(int x, int y): Base(x), y_value(y){}
virtual bool operator ==(const Base& b)
{
Derived* pD = (Derived*)&b;
return (pD->y_value == y_value) && (pD->x_value == x_value);
}
private:
int y_value;
};
int _tmain(int argc, _TCHAR* argv[])
{
Base* b1 = new Derived(2,11);
Base* b2 = new Derived(2,11);
cout << (*b1==*b2) << endl;
return 0;
}
When (*b1==*b2)
is evaluated the ==
operator
overload for class Derived
is invoked which can be easily verified
by trying out different values for the Base
constructors for
b1
and b2
or by setting a breakpoint inside the
operator overload.
A port to C#
Taking what we know of C# let's attempt a straight-forward port of the above
example to C#.
public class Base
{
int x;
public Base( int x )
{
this.x = x;
}
public static bool operator==( Base l, Base r )
{
if( object.ReferenceEquals( l, r ) )
return true;
else if( object.ReferenceEquals( l, null ) ||
object.ReferenceEquals( r, null ) )
return false;
return l.x == r.x;
}
public static bool operator!=( Base l, Base r )
{
return !(l == r);
}
public int X { get { return x; } }
}
public class Derived : Base
{
int y;
public Derived( int x, int y ) : base( x )
{
this.y = y;
}
public static bool operator==( Derived l, Derived r )
{
if( object.ReferenceEquals( l, r ) )
return true;
else if( object.ReferenceEquals( l, null ) ||
object.ReferenceEquals( r, null ) )
return false;
return (l.y == r.y) && (l.X == r.X);
}
public static bool operator!=( Derived l, Derived r )
{
return !(l == r);
}
public int Y { get { return y; } }
}
class Program
{
static void Main()
{
Derived d1 = new Derived( 2, 11 );
Derived d2 = new Derived( 2, 11 );
Console.WriteLine( d1 == d2 );
Console.ReadLine();
}
}
If we run the program as it is above everything will work like the C++
version, but if we introduce a slight change to the program things begin to
deviate greatly.
class Program
{
static void Main()
{
Base d1 = new Derived( 2, 11 );
Base d2 = new Derived( 2, 12 );
Console.WriteLine( d1 == d2 );
Console.ReadLine();
}
}
What's going on here? As simple debugging will show us the despite the
objects being compared being instances of the Derived
class
the Base
class ==
operator is being
called. This is because C# (and thus, most other languages) figure out which
==
operator method to call based on the known (i.e.
compile-time) type of the object on the left hand side of the operation.
There are ways around this as you'll see below.
Simulating operator polymorphism with C#
Here, we see how to simulate this in C# :-
class Base
{
protected int x_value = 0;
public Base(int x)
{
x_value = x;
}
public static bool operator==(Base b1, Base b2)
{
if( object.ReferenceEquals( b1, b2 ) )
{
return true;
}
else if( object.ReferenceEquals( b1, null ) ||
object.ReferenceEquals( b2, null ) )
{
return false;
}
return b1.Equals(b2);
}
public static bool operator !=(Base b1, Base b2)
{
return !(b1 == b2);
}
public override bool Equals(object obj)
{
if( obj == null )
return false;
Base o = obj as Base;
if( o != null )
return x_value == o.x_value;
return false;
}
public override int GetHashCode()
{
return x_value.GetHashCode();
}
}
class Derived : Base
{
protected int y_value = 0;
public Derived(int x, int y) : base(x)
{
y_value = y;
}
public override bool Equals(object obj)
{
if( !base.Equals( obj ) )
return false;
Derived o = obj as Derived;
if( o == null )
return false;
return y_value == o.y_value;
}
public override int GetHashCode()
{
return x_value.GetHashCode() ^ y_value.GetHashCode() + x_value;
}
}
class Program
{
static void Main(string[] args)
{
Base b1 = new Derived(10, 12);
Base b2 = new Derived(10, 11);
Console.WriteLine(b1 == b2);
b2 = null;
Console.WriteLine(b1 == b2);
Console.ReadKey(true);
}
}
Rather than rely on the ==
operator overload to do all
of the heavy lifting we push all of the work onto the virtual Equals
method, from there we let polymorphism work its magic.
Points to note
-
The operator overload has to be static, so we have virtual instance methods
that implement the logic for us and we invoke these virtual methods from the
static operators
-
In our example, the method Equals
corresponds to ==
and Equals
is a virtual method (inherited
from System.Object
)
-
Within the static overload we need to check for null
(to
avoid null-reference exceptions)
-
Within each derived class's corresponding operator-logic method (Equals
in our case), we cast the System.Object
argument to the type of the class (e.g. - In the Derived
class we cast to Derived
)
-
While Equals
and ==
already exist
in System.Object
, we can implement similar methods for any
operator in our class hierarchies. Say, we need to implement the ++
operator, we then add a PlusPlus
(or
Increment
) virtual
method to the root
base class in our object hierarchy and in the ++
overload
we invoke the Increment
method on the passed-in object.
-
In our example, we check for null
and return true
or false
depending on whether the
objects are both null
or not. But, if you are implementing
an operator like ++
then
you might want to
check for null
and throw an ArgumentNullException
(to override the inappropriate NullReferenceException
that'd otherwise get
thrown).
History
- Apr 19, 2005 : Article first published
- Apr 20, 2005 : Fixed a bug in
Derived.Equals
and also changed
the GetHashCode
implementations for Base
and
Derived
(thanks to Jeffrey Sax for pointing this out)