digitalmars.D - Explicitly unimplemented computed gotos

bearophile (17/17) Nov 25 2010 When I have suggested to add "computed gotos" (similar to the ones of GC...

Jimmy Cao (4/39) Nov 25 2010 Good suggestion for a D3 feature.
Adam D. Ruppe (77/79) Nov 25 2010 because it's already known in C).
Iain Buclaw (25/34) Nov 26 2010 Walter has answered that they need some work to be implemented, and they...

bearophile (5/8) Nov 26 2010 I don't like those :-)

bearophile <bearophileHUGS lycos.com> writes:

When I have suggested to add "computed gotos" (similar to the ones of GCC) to
D, Walter has answered that they need some work to be implemented, and they
have limited usefulness, almost only to optimize interpreters.

But:
- D is a system language, so writing interpreters is an important application
of it.
- I have found GCC computed gotos useful to speed up some of my code. Recently
even the CPython has introduced their usage in the main interpreter loop.
- The GCC implementation of computed gotos is not standard for C and other
compilers may not understand it (to solve this trouble in GNU C code you need
to disable pieces of code, and this is less easy to do in D).

So even if computed gotos are not going to be implemented in DMD I suggest to:
1) Invent a syntax to represent and use them (probably the GCC syntax is good,
because it's already known in C).
2) Make DMD understand this syntax, but refuse it at compile time (because DMD
doesn't support computer gotos).
3) Define a new standard Predefined Version, like "computed_goto" or
"Computed_goto" or something similar, that is defined if a D compiler supports
them (so DMD doesn't define it), that allows to disable the code that contains
the computed goto if a compiler doesn't support them.

This:
- Allows other D implementations, based on LLVM and GCC back-ends that already
support computed gotos, to support such gotos in D code too;
- Allows the programmer to write two versions of a performance-critical routine
with and without computed gotos, in a clean way just like is done for
version(D_InlineAsm_X86){...}else{...}.
- Gives a single standard common syntax that all future D compilers may use,
avoiding troubles caused by nonstandard syntax and implementations.

(Explicitly unimplemented features have a precedent in D, the array operations.
The difference is that D may never implement computed gotos.)
(This an additive feature, so it may be left for D3 too.)

Bye,
bearophile

Nov 25 2010

Jimmy Cao <jcao219 gmail.com> writes:

Good suggestion for a D3 feature.
Wikipedia says this is also called an "Assigned Goto."
It could make writing interpreters more efficient.

On Thu, Nov 25, 2010 at 9:55 PM, bearophile <bearophileHUGS lycos.com>wrote:

 When I have suggested to add "computed gotos" (similar to the ones of GCC)
 to D, Walter has answered that they need some work to be implemented, and
 they have limited usefulness, almost only to optimize interpreters.

 But:
 - D is a system language, so writing interpreters is an important
 application of it.
 - I have found GCC computed gotos useful to speed up some of my code.
 Recently even the CPython has introduced their usage in the main interpreter
 loop.
 - The GCC implementation of computed gotos is not standard for C and other
 compilers may not understand it (to solve this trouble in GNU C code you
 need to disable pieces of code, and this is less easy to do in D).

 So even if computed gotos are not going to be implemented in DMD I suggest
 to:
 1) Invent a syntax to represent and use them (probably the GCC syntax is
 good, because it's already known in C).
 2) Make DMD understand this syntax, but refuse it at compile time (because
 DMD doesn't support computer gotos).
 3) Define a new standard Predefined Version, like "computed_goto" or
 "Computed_goto" or something similar, that is defined if a D compiler
 supports them (so DMD doesn't define it), that allows to disable the code
 that contains the computed goto if a compiler doesn't support them.

 This:
 - Allows other D implementations, based on LLVM and GCC back-ends that
 already support computed gotos, to support such gotos in D code too;
 - Allows the programmer to write two versions of a performance-critical
 routine with and without computed gotos, in a clean way just like is done
 for version(D_InlineAsm_X86){...}else{...}.
 - Gives a single standard common syntax that all future D compilers may
 use, avoiding troubles caused by nonstandard syntax and implementations.

 (Explicitly unimplemented features have a precedent in D, the array
 operations. The difference is that D may never implement computed gotos.)
 (This an additive feature, so it may be left for D3 too.)

 Bye,
 bearophile

Nov 25 2010

Adam D. Ruppe <destructionator gmail.com> writes:

bearophile wrote:

 1) Invent a syntax to represent and use them (probably the GCC syntax is good,

because it's already known in C).

I'd argue that we have a syntax reserved for them already:

switch(x) {
  case 0:
  case 1:
  case 2:
  [......]
}

And this is just a little optimization in the generated code.
(Specifically: if a switch has integer cases that fill in a
whole range from [0..n), it could be rewritten as pre-caching
the addresses in an array of length n and the dispatch simply
be written as a jump to the entry in that array.)

The beauty of using switch is it continues to work even in
compilers that don't implement the optimization, without
having to write the code twice.

 The difference is that D may never implement computed gotos.

I'm actually pretty amazed that we can't really do them right
now with the inline assembler.

Well... actually, we can, but it isn't the most beautiful
implementation.

Check this out:

=======
void main() {
	void* val;

lbl1:
	asm {
		call near ptr $+2; // call this location + 2 bytes - that is, the address of
the
pop instruction
		jmp short lbl2; // FIXME: if the jmp isn't actually short, the whole thing
blows
up! This feels like an assembler bug; it should probably bitch that I'm asking
for
the impossible instead of just ignoring the short
		pop ESI; // it now holds the address pushed by call
		mov EAX, ESI; // load up the address for some work...
		add AL, [ESI - 1]; // ESI is address of the jmp opcode. +1 is the offset of
the
jump... thus adding it gets the absolute address of the label. FIXME: what if al
overflows?
		mov [val], EAX; // store it

		jmp [val]; // and let's go ahead and make the computed goto. The program
should
print "34"
	}

	printf("2");

lbl2:

	printf("3");

lbl3:

	printf("4");
}
=========

(I used printf to make the object dump a bit shorter than with writefln)

If you change to lbl3 it correctly prints 4. My voodoo worked!


If that were wrapped up into a string mixin and fixed those FIXME's - not
terribly
hard in this specific case, but might be hard to generalize - we'd have a 
general
way to getLabelAddress and jumpToPointer (the latter being trivial - that one
jmp
[val]; instruction) implemented right in the library.

It won't be as efficient as a compiler generated jump table, but initialization
is
unlikely to be a big deal here anyway. (What this code does is take the
compiler's
generated code and reads it back at runtime! Obviously skipping that read back
at
runtime step would be a lil faster, but since it is a one time cost it can
probably be ignored.)


I've started the mixin solution but it isn't quite right yet. I'm thinking I
have
an off by one error when populating the array. Been a while since I've done
machine code reading and manipulation like this and it is getting late.


Nevertheless, I'm pretty convinced that the language offers everything we need
to
implement this in the library right now, albeit the initialization won't be 100%
perfect. If I can fix this minor error in my current mixin loop, we can get very
close. Of course, reading the program's own binary code isn't the most elegant
implementation, but the usage is simple enough:

         // first is the name of the array to create, then the names of the
labels
with which to populate it
        mixin(getLabelAddresses!("labels", "lbl1", "lbl2", "lbl3", "lbl4"));
        mixin(gotoPointer!("labels", 3)); // array name and index to jump to



Anyway I spent way too much time on this. Gotta get to bed. But my feeling on
the
proposal again:

1) We have a syntax that works this way: switch(). The optimizer could, in
theory,
recognize this usage and create it.

2) We have an inline assembler that can surely do the job. Let's use it and see
what we can do.

Nov 25 2010

Iain Buclaw <ibuclaw ubuntu.com> writes:

== Quote from bearophile (bearophileHUGS lycos.com)'s article
 When I have suggested to add "computed gotos" (similar to the ones of GCC) to
D,

Walter has answered that they need some work to be implemented, and they have
limited usefulness, almost only to optimize interpreters.
 But:
 - D is a system language, so writing interpreters is an important application
of it.
 - I have found GCC computed gotos useful to speed up some of my code. Recently

even the CPython has introduced their usage in the main interpreter loop.
 - The GCC implementation of computed gotos is not standard for C and other

compilers may not understand it (to solve this trouble in GNU C code you need to
disable pieces of code, and this is less easy to do in D).
 So even if computed gotos are not going to be implemented in DMD I suggest to:
 1) Invent a syntax to represent and use them (probably the GCC syntax is good,

because it's already known in C).
 2) Make DMD understand this syntax, but refuse it at compile time (because DMD

doesn't support computer gotos).
 3) Define a new standard Predefined Version, like "computed_goto" or

"Computed_goto" or something similar, that is defined if a D compiler supports
them (so DMD doesn't define it), that allows to disable the code that contains
the
computed goto if a compiler doesn't support them.

I don't think this feature really warrants a new keyword.

Since we already have:

const var = 42;
switch (x) {
    case var:
        ...
    break;
}

Would only make sense to do the same for goto's.

goto *ptr;


Something that *would* warrant perhaps a new keyword would be non-local gotos,
but
it's usefulness is very negligible...

Regards
Iain

Nov 26 2010

bearophile <bearophileHUGS lycos.com> writes:

Iain Buclaw:

 I don't think this feature really warrants a new keyword.

I have not asked for a new keyword.


 Something that *would* warrant perhaps a new keyword would be non-local gotos,
but
 it's usefulness is very negligible...

I don't like those :-)

Bye,
bearophile

Nov 26 2010

D Programming

C/C++ Programming

Other

digitalmars.D - Explicitly unimplemented computed gotos