digitalmars.D.learn - Final necessary for inline optimization?

IGotD- (36/36) Oct 19 2019 When I'm playing around with different D compilers I get

Johan Engelen (22/24) Oct 19 2019 It is the latter: the optimizer is able to prove that object c

IGotD- <nise nise.com> writes:

When I'm playing around with different D compilers I get 
different results regarding class method inlining. According most 
information I have seen, you must declare a class method final 
for the compiler to be able inline the method otherwise it will 
be a virtual table call.

Take this simple example.

import std.stdio;
import std.random;

class TestClass
{
     static auto rnd = Random(42);
     int tt = 3;

     void doStuff()
     {
         tt = uniform(0, 15, rnd);
     }
}

void main()
{
     auto c = new TestClass;

     c.doStuff();

     writeln("TestClass is ", c.tt);
}


The random generator is used here so that CTFE doesn't kick in.

DMD certainly doesn't inline unless I use final (on class or 
doStuff) but when I use LDC, LDC seems to be able to figure out 
that the class isn't derived and inlines it regardless (-O option 
required). I see that LDC generates the method doStuff so it 
could also be that LDC knows which specific class it is (and not 
a derived one) in main and can inline for this particular case. 
Which one is it, LDC recognizes TestClass isn't derived or is 
sure that the class (c) isn't derived in particular? Would that 
mean that the vtable entry for doStuff is created regardless? Any 
other information why LDC do better?

I have previously seen a failed attempt to make final default 
(https://forum.dlang.org/thread/lfqoan$5qq$1 digitalmars.com?page=1) but would
that be necessary. Could the compiler figure out that a class method is final
and manage that optimization by itself?

Oct 19 2019

Johan Engelen <j j.nl> writes:

Just a brief answer.

On Saturday, 19 October 2019 at 15:58:08 UTC, IGotD- wrote:
 Which one is it, LDC recognizes TestClass isn't derived or is 
 sure that the class (c) isn't derived in particular?

It is the latter: the optimizer is able to prove that object c 
has a vtable that is known exactly. Then indexing into that 
vtable gives a definite function that can then be inlined.

Some more info:
The (suboptimal) trick that LDC employs is that after the call to 
`new`, LDC again sets the vtable of the object. It is 
superfluous, because it is already done by `new`, but `new` is 
opaque to the optimizer whereas the extra vtable write is not. So 
after the object creation, the optimizer knows exactly what 
vtable is used for that object. Now unfortunately, that only 
works for the first virtual call after `new`. Any opaque function 
that is called with `c` as parameter, e.g. calling a virtual 
function of that class will destroy knowledge about what vtable 
is stored in `c`. Per D language spec, the virtual function 
cannot overwrite the vtable pointer, but the optimizer does not 
know that so it assumes it might be overwritten and it no longers 
knows the contents of the vtable pointer.
You can see this happening here:
https://d.godbolt.org/z/8ERNhg

-Johan

Oct 19 2019

D Programming

C/C++ Programming

Other

digitalmars.D.learn - Final necessary for inline optimization?