digitalmars.D - About some bugs
- bearophile (67/67) Jan 04 2011 I have studied more Linux bugs.
- Vladimir Panteleev (15/31) Jan 04 2011 Doesn't D already solve this?
- bearophile (5/15) Jan 04 2011 This is a different situation. You need a bit of flow analisys to catch ...
- spir (25/38) Jan 04 2011 ));
- Simen kjaeraas (18/35) Jan 04 2011 No. It ensures that the pointer and size are from the same pointer.
- bearophile (4/7) Jan 04 2011 I am not sure that's enough.
- Simen kjaeraas (4/9) Jan 04 2011 Could you please show a case where it isn't?
- Walter Bright (8/34) Jan 04 2011 In D:
- Walter Bright (2/3) Jan 04 2011 Is there a research paper that these came from?
- bearophile (8/12) Jan 04 2011 They are just part of the Linux bugs fixed in those few years.
- bearophile (34/45) Jan 05 2011 We have discussed this a lot in past, I repeat because in this newsgroup...
- Simen kjaeraas (17/70) Jan 05 2011 With the exception of your first example, these require quite a lot of
- bearophile (6/6) Jan 05 2011 What I have written was meant for D3.
- bearophile (7/11) Jan 05 2011 Some "plumbing" is needed even for the first half of the proposal, to co...
- Simen kjaeraas (17/27) Jan 05 2011 ult
I have studied more Linux bugs. ---------------- An example of bug (more than 14 like this fixed in few years): - memset(pp, 0, sizeof(pp)); + memset(pp, 0, sizeof(*pp)); - memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex,sizeof(TstSchedTbl)); + memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex, sizeof(*TstSchedTbl)); Here the type system knows that pp is a pointer. sizeof(pp) is typically a word, while the correct sizeof(*pp) is often larger. A simple way to avoid this bug in D is to use a zerioing template function, something like (untested) (in GNU C there is a way to write a similar macro, I don't know why they don't use it, even if it's a bit less safe and much less nice looking): void zeroit(T)(T* ptr) if (!IsPointer!T) { memset(ptr, 0, (*ptr).sizeof); } Standard safer wrappers for some C functions may help low-level D coding. If you don't want to use a zeroit() then a type system able to catch such bugs needs some nice annotations... ---------------- A common (> 12 cases in few years) example of redundant code: u32 da = new->da_start; - if (!obj || !new || !sgt) {... + if (!obj || !sgt) {... If the first line is correct, then "new" can't be NULL, so there's no need to test "|| !new". In 13 similar cases the code is inside a branch where the NULL test of the pointer is already done: if (ptr != NULL) { if (!ptr) { ... } ---------------- In 7 cases the result of malloc-like function was not tested for NULL: agp = kmalloc(sizeof(*agp), GFP_KERNEL); + if (!agp) + return NULL; ---------------- A very common case (20 cases in few years) are like this, where a pointer is deferenced before the NULL test: block = bdev->bd_disk->private_data; - base = block->base; if (!block) return -ENODEV; + base = block->base; ---------------- 2 cases like this (the pattern (!E && !E->fld) is nonsensical): - if (!slot->hotplug_slot && !slot->hotplug_slot->info) + if (!slot->hotplug_slot || !slot->hotplug_slot->info) continue; ---------------- There are 11 cases like this (alloc_bootmem never returns NULL and it always sets the momory to zero): zero_page = alloc_bootmem_low_pages_node(NODE_DATA(0), PAGE_SIZE); - memset(zero_page, 0, PAGE_SIZE); ---------------- There are a lot of cases (34) of missing free: + kfree(iter); ---------------- Two cases need a wider NULL test: - if (fep != NULL) { + if (fep && fep->ops) { (*fep->ops->free_bd)(ndev); (*fep->ops->cleanup_data)(ndev); } ---------------- In this post I don't see any little rule worth adding to the D compiler. So to bugzilla I will add only the !x&y thing. But from what I have seen many bugs are NULL-related, and many of them are avoidable with two features present at the same time in the language: 1) Nonnull pointer/reference tag (with it you don't miss some null tests, and you avoid some redundant tests); 2) Plus a bit of inter-function (not intra, for that nonnull signatures are enough) flow analysis to avoid bugs like: base = block->base; if (!block) return -ENODEV; if (fep != NULL) { (*fep->ops->free_bd)(ndev); (P.S.: integer overflow bugs happen in Linux kernel too.) Bye, bearophile
Jan 04 2011
On Tue, 04 Jan 2011 14:34:15 +0200, bearophile <bearophileHUGS lycos.com> wrote:void zeroit(T)(T* ptr) if (!IsPointer!T) { memset(ptr, 0, (*ptr).sizeof); } Standard safer wrappers for some C functions may help low-level D coding. If you don't want to use a zeroit() then a type system able to catch such bugs needs some nice annotations...Doesn't D already solve this? For value types: obj = typeof(obj).init; For arrays: arr[] = typeof(arr[0]).init; // or just 0 or null or whatever .init isIf the first line is correct, then "new" can't be NULL, so there's no need to test "|| !new".I think this is something that should be done by the optimizer.In 7 cases the result of malloc-like function was not tested for NULL:This is rather specific. Application programmers would usually want an exception to be thrown on a failed allocation.A very common case (20 cases in few years) are like this, where a pointer is deferenced before the NULL test: block = bdev->bd_disk->private_data; - base = block->base; if (!block) return -ENODEV; + base = block->base;Delphi compilers warn in cases when a condition is always true/false, because Delphi lacks metaprogramming so such cases are usually due to a bug. -- Best regards, Vladimir mailto:vladimir thecybershadow.net
Jan 04 2011
Vladimir Panteleev:Doesn't D already solve this?Yup.This is a different situation. You need a bit of flow analisys to catch this bug (to give a state to a type). Bye, bearophileblock = bdev->bd_disk->private_data; - base = block->base; if (!block) return -ENODEV; + base = block->base;Delphi compilers warn in cases when a condition is always true/false, because Delphi lacks metaprogramming so such cases are usually due to a bug.
Jan 04 2011
On Tue, 04 Jan 2011 07:34:15 -0500 bearophile <bearophileHUGS lycos.com> wrote:An example of bug (more than 14 like this fixed in few years): =20 - memset(pp, 0, sizeof(pp)); + memset(pp, 0, sizeof(*pp)); =20 - memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex,sizeof(TstSchedTbl=));+ memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex, sizeof(*TstSchedT=bl));=20 Here the type system knows that pp is a pointer. sizeof(pp) is typically =a word, while the correct sizeof(*pp) is often larger. A simple way to avoi= d this bug in D is to use a zerioing template function, something like (unt= ested) (in GNU C there is a way to write a similar macro, I don't know why = they don't use it, even if it's a bit less safe and much less nice looking):=20 void zeroit(T)(T* ptr) if (!IsPointer!T) { memset(ptr, 0, (*ptr).sizeof); }Doesn't this in fact hide the error to the programmer (by silently correcti= ng)? Why not instead for instance: void zeroit(T)(T* ptr) if (!IsPointer!T) { throw new Exception("Type error: argument to <funcname> should be a poi= nter."); } (And what if the memory to be actually memset is not ptr's target?) About non-null thingies, I would be all for a mode in which is inserted if (p is null) throw ...; before _every_ implicite or explicite deref of every implicite (pointer) or= implicite (class element) pointer. And even make this the default for non-= release. (With line number in the message ;-) Am I dreaming? Denis -- -- -- -- -- -- -- vit esse estrany =E2=98=A3 spir.wikidot.com
Jan 04 2011
spir <denis.spir gmail.com> wrote:No. It ensures that the pointer and size are from the same pointer. I would say it is somewhat analogous to D's arrays, which are a hell of a lot better than free pointer/length pairs.void zeroit(T)(T* ptr) if (!IsPointer!T) { memset(ptr, 0, (*ptr).sizeof); }Doesn't this in fact hide the error to the programmer (by silently correcting)?Why not instead for instance: void zeroit(T)(T* ptr) if (!IsPointer!T) { throw new Exception("Type error: argument to <funcname> should be a pointer."); }I'm not sure this makes sense. The error message seems to indicate you don't have a pointer, while that is exactly what you do. The template constraint also say that this is not an overload of the above function, making it simply a function that throws.(And what if the memory to be actually memset is not ptr's target?)If people start throwing random pointers into functions, that is a problem that's hard to work around. bearophile's zeroit function takes memset and removes its main weak point, namely that it takes a free pointer/length pair, rather than a structure containing the two.About non-null thingies, I would be all for a mode in which is inserted if (p is null) throw ...; before _every_ implicite or explicite deref of every implicite (pointer) or implicite (class element) pointer. And even make this the default for non-release. (With line number in the message ;-) Am I dreaming?Yeah. :P Much more likely would be the idea that has been discussed, of opt-in non-nullness, by disabling the default constructor for a struct that wraps a pointer. -- Simen
Jan 04 2011
Simen kjaeraas:Much more likely would be the idea that has been discussed, of opt-in non-nullness, by disabling the default constructor for a struct that wraps a pointer.I am not sure that's enough. Bye, bearophile
Jan 04 2011
bearophile <bearophileHUGS lycos.com> wrote:Simen kjaeraas:Could you please show a case where it isn't? -- SimenMuch more likely would be the idea that has been discussed, of opt-in non-nullness, by disabling the default constructor for a struct that wraps a pointer.I am not sure that's enough.
Jan 04 2011
bearophile wrote:I have studied more Linux bugs. ---------------- An example of bug (more than 14 like this fixed in few years): - memset(pp, 0, sizeof(pp)); + memset(pp, 0, sizeof(*pp)); - memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex,sizeof(TstSchedTbl)); + memcpy((caddr_t)TstSchedTbl, (caddr_t)&vcIndex, sizeof(*TstSchedTbl)); Here the type system knows that pp is a pointer. sizeof(pp) is typically a word, while the correct sizeof(*pp) is often larger. A simple way to avoid this bug in D is to use a zerioing template function, something like (untested) (in GNU C there is a way to write a similar macro, I don't know why they don't use it, even if it's a bit less safe and much less nice looking): void zeroit(T)(T* ptr) if (!IsPointer!T) { memset(ptr, 0, (*ptr).sizeof); } Standard safer wrappers for some C functions may help low-level D coding. If you don't want to use a zeroit() then a type system able to catch such bugs needs some nice annotations...In D: pp[] = 0; or: pp = typeof(pp).init; etc.In this post I don't see any little rule worth adding to the D compiler.Many of them are dealt with with D's scope guard, RIAA, and garbage collection support.
Jan 04 2011
bearophile wrote:I have studied more Linux bugs.Is there a research paper that these came from?
Jan 04 2011
Walter:bearophile wrote:They are just part of the Linux bugs fixed in those few years. I am sorry, but what I have written doesn't come from a paper. They partially come from this page: http://coccinelle.lip6.fr/impact_linux.php Plus some exploration of the kernel bug repository. With some more weeks of search and work a paper may be written :o) Bye, bearophileI have studied more Linux bugs.Is there a research paper that these came from?
Jan 04 2011
Simen kjaeraas:bearophile <bearophileHUGS lycos.com> wrote:We have discussed this a lot in past, I repeat because in this newsgroup I have learnt that sometimes you need to repeat things many times before most people start to notice. First of all it's a matter of syntax. Attaching a " " to a reference/pointer type to turn it into a nonnullable is not the same thing as using a special struct: class T {} T nullable_reference; T nonnullable_reference = new T (); struct S {} S nullable_pointer; S nonnullable_pointer = new S (); void foo(T x) { ... } A handy short syntax is able to turn something from a seldom used feature, to an idiom most people in the community use. Python language designers have understood this well. Then, I was talking of two features present at the same time. The first is nonnullable rerefence types, and the other is a bit of flow analysis that essentially gives a possible-null / surely-not-null state to a type: S s = new S; ... if (s != null) { // here s is essentially a S } It avoids bugs like: static int rose_rebuild_header(struct sk_buff *skb) { ... if (!rose_route_frame(skbn, NULL)) { kfree_skb(skbn); stats->tx_errors++; return 1; } stats->tx_packets++; stats->tx_bytes += skbn->len; ... } The bug here is that in some cases skbn the code calls free on skbn, so at the end of the if{} yuou can't perform skbn->len in all cases. The type system has to denote this as a possible bug. Just disallowing standard constructors is useful, but it's not enough. Bye, bearophileSimen kjaeraas:Could you please show a case where it isn't?Much more likely would be the idea that has been discussed, of opt-in non-nullness, by disabling the default constructor for a struct that wraps a pointer.I am not sure that's enough.
Jan 05 2011
bearophile <bearophileHUGS lycos.com> wrote:Simen kjaeraas:With the exception of your first example, these require quite a lot of plumbing in the compiler, and are thus not an alternative for D2 and, I would say, unlikely for D3. The wrapper struct with a disabled default constructor is close to possible in D2. Like with AAs, the sugar for non-null can easily be implemented atop a library solution, if the language is powerful enough. D is, except in the case of type states or the like. Sure, I would love for D to have a perfect type system, but I believe type state does not bring enough goods to the table that it's worth implementing yet. Perhaps in D3. Having a solution that works (your last example shows how it's possible to have invalid pointers in non-null pointers, but not null-pointers), is a lot better than hoping for something in the far future, and I have yet to be made aware of any serious shortcomings of the solution with a disabled default constructor. -- Simenbearophile <bearophileHUGS lycos.com> wrote:We have discussed this a lot in past, I repeat because in this newsgroup I have learnt that sometimes you need to repeat things many times before most people start to notice. First of all it's a matter of syntax. Attaching a " " to a reference/pointer type to turn it into a nonnullable is not the same thing as using a special struct: class T {} T nullable_reference; T nonnullable_reference = new T (); struct S {} S nullable_pointer; S nonnullable_pointer = new S (); void foo(T x) { ... } A handy short syntax is able to turn something from a seldom used feature, to an idiom most people in the community use. Python language designers have understood this well. Then, I was talking of two features present at the same time. The first is nonnullable rerefence types, and the other is a bit of flow analysis that essentially gives a possible-null / surely-not-null state to a type: S s = new S; ... if (s != null) { // here s is essentially a S } It avoids bugs like: static int rose_rebuild_header(struct sk_buff *skb) { ... if (!rose_route_frame(skbn, NULL)) { kfree_skb(skbn); stats->tx_errors++; return 1; } stats->tx_packets++; stats->tx_bytes += skbn->len; ... } The bug here is that in some cases skbn the code calls free on skbn, so at the end of the if{} yuou can't perform skbn->len in all cases. The type system has to denote this as a possible bug. Just disallowing standard constructors is useful, but it's not enough.Simen kjaeraas:Could you please show a case where it isn't?Much more likely would be the idea that has been discussed, of opt-in non-nullness, by disabling the default constructor for a struct that wraps a pointer.I am not sure that's enough.
Jan 05 2011
What I have written was meant for D3. The example I've shown at the end is a bug found in the Linux kernel, they have found many similar bugs, even in old code. Notnull types are usually not enough to catch similar bugs. The second part of my proposal was not a general typestate implementation, it's specific, hard-coded, and only two or three states are present, for just reference types, and only few built-in things change the state (if-then-else branches, nullity assetions, nullable/notnull type annotations, and little more). In a language with gotos and asm your bit of flow analysis will produce some false positives and some false negatives. This currently doesn't go well with D Zen, but things may change. Bye, bearophile
Jan 05 2011
Simen kjaeraas:With the exception of your first example, these require quite a lot of plumbing in the compiler, and are thus not an alternative for D2 and, I would say, unlikely for D3. The wrapper struct with a disabled default constructor is close to possible in D2.Some "plumbing" is needed even for the first half of the proposal, to correctly manage partially initialized objects, it's explained here: http://research.microsoft.com/pubs/67461/non-null.pdf Some partial notes of mine on the topic are here: http://d.puremagic.com/issues/show_bug.cgi?id=4571 Bye, bearophile
Jan 05 2011
bearophile <bearophileHUGS lycos.com> wrote:Simen kjaeraas:fWith the exception of your first example, these require quite a lot o=plumbing in the compiler, and are thus not an alternative for D2 and,=ultI would say, unlikely for D3. The wrapper struct with a disabled defa==constructor is close to possible in D2.Some "plumbing" is needed even for the first half of the proposal, to =correctly manage partially initialized objects, it's explained here: http://research.microsoft.com/pubs/67461/non-null.pdf Some partial notes of mine on the topic are here: http://d.puremagic.com/issues/show_bug.cgi?id=3D4571Now this is what I wanted to see. As D does not have C++'s initializatio= n lists, the problem of half-initialized fields may require flow analysis.= And yes, I remember this was discussed last time, but apparently I forgo= t. I see that this problem can also be thought of in the typestate paradigm= : a class is in the 'raw' state until constructors have finished running, and in this state the invariants of members are not guaranteed to hold. Hmm, the more I think about it, the more I believe you are right that typestates are Awesome=E2=84=A2, and should be included in D3. -- = Simen
Jan 05 2011