
digitalmars.dip.ideas - Deprecate implicit conversion between signed and unsigned integers

reply Atila Neves <atila.neves gmail.com> writes:
https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where 
 they are widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
Feb 03
next sibling parent reply Paul Backus <snarwin gmail.com> writes:
On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

 On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where 
 they are widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
That's why I focused my proposal on the specific conversions that are the most error-prone. I don't think we'll ever convince Walter to get rid of integer promotion in general, but there's a chance we can convince him to get rid of these specific conversions.
Feb 03
parent Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Monday, 3 February 2025 at 19:30:14 UTC, Paul Backus wrote:
 On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

 On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where 
 they are widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
That's why I focused my proposal on the specific conversions that are the most error-prone. I don't think we'll ever convince Walter to get rid of integer promotion in general, but there's a chance we can convince him to get rid of these specific conversions.
Those are annoying, yes. Especially unary operators. If you asked me right now what `~x` returns on a small integer type, I honestly don’t know. D has C’s rules because of one design decision early on: If it looks like C, it acts like C or it’s an error.
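A minimal D snippet shows what actually happens: under the C-style promotion rules mentioned above, unary operators widen small integer operands to `int` first.

```d
void main()
{
    ubyte x = 1;

    // Integer promotion: the operand of unary `~` is widened to int,
    // so the result is an int, not a ubyte.
    static assert(is(typeof(~x) == int));

    assert(~x == -2);             // complement taken at 32 bits
    assert(cast(ubyte)~x == 254); // what an 8-bit complement would give
}
```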
Feb 05
prev sibling next sibling parent reply Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

 On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where 
 they are widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
Any implicit conversions? I’d boldly claim the following conversions are unproblematic:
* `float` → `double` → `real`
* signed integer → bigger signed integer
* unsigned integer → bigger unsigned integer

And it would be really annoying to have to explicitly cast them.
Feb 04
next sibling parent reply Paul Backus <snarwin gmail.com> writes:
On Tuesday, 4 February 2025 at 16:29:22 UTC, Quirin Schroll wrote:
 Any implicit conversions? I’d boldly claim the following 
 conversions are unproblematic:
 * `float` → `double` → `real`
 * signed integer → bigger signed integer
 * unsigned integer → bigger unsigned integer

 And it would be really annoying to have to explicitly cast them.
In general, implicit conversions that preserve the value range of the original type are ok. So, for example:
* `ushort` → `int`
* `long` → `float`

The reason that conversions like `int` → `uint` and `uint` → `int` are problematic is that the value range of the original type does not fit into the value range of the target type.
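A short sketch of the two cases: the value-preserving widening versus the problematic reinterpretation.

```d
void main()
{
    // Range-preserving: every ushort value fits in an int.
    ushort us = 65_535;
    int i = us;
    assert(i == 65_535);

    // Not range-preserving: int → uint compiles in D today,
    // but a negative value is silently reinterpreted.
    int neg = -1;
    uint u = neg;
    assert(u == 4_294_967_295);
}
```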
Feb 05
parent reply Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Wednesday, 5 February 2025 at 16:29:25 UTC, Paul Backus wrote:
 On Tuesday, 4 February 2025 at 16:29:22 UTC, Quirin Schroll 
 wrote:
 Any implicit conversions? I’d boldly claim the following 
 conversions are unproblematic:
 * `float` → `double` → `real`
 * signed integer → bigger signed integer
 * unsigned integer → bigger unsigned integer

 And it would be really annoying to have to explicitly cast 
 them.
In general, implicit conversions that preserve the value range of the original type are ok. So, for example:
* `ushort` → `int`
* `long` → `float`

The reason that conversions like `int` → `uint` and `uint` → `int` are problematic is that the value range of the original type does not fit into the value range of the target type.
I even think that implicit conversions from integral to floating-point types are bad, considering that `int` → `float` and `long` → `double` aren’t lossless in general. Here’s an attempt to classify:

1. Definitely okay implicit conversions:
   * `float` → `double` → `real`
   * `byte` → `short` → `int` → `long`
   * `ubyte` → `ushort` → `uint` → `ulong`
2. Probably okay implicit conversions:
   * `ubyte` → `short` → `int` → `long`
   * `ushort` → `int` → `long`
   * `uint` → `long`
3. Somewhat contentious implicit conversions:
   * `byte`/`ubyte`/`short`/`ushort` → `float` → `double` → `real`
   * `int`/`uint` → `double` → `real`
   * `long`/`ulong` → `real`
4. Micro-lossy narrowing conversions:
   * `int`/`uint` → `float`
   * `long`/`ulong` → `float`/`double`
5. Bit-pattern-preserving value-altering conversions:
   * `byte` ↔ `ubyte`
   * `short` ↔ `ushort`
   * `int` ↔ `uint`
   * `long` ↔ `ulong`
6. Lossy narrowing conversions:
   * The reverse of any “→” above.

It appears to me that you can reasonably draw 7 lines (from before 1 to after 6). Examples of what existing languages do (to my knowledge):
* Haskell draws the line at 0/1. It has no implicit conversions whatsoever.
* D draws the line at 5/6.
* C/C++ draws the line at 6/7.
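For category 4, the loss is easy to demonstrate: `float` has a 24-bit significand, so the first `int` it cannot represent exactly is 2²⁴ + 1.

```d
void main()
{
    int exact  = 1 << 24;        // 16_777_216: representable in float
    int beyond = (1 << 24) + 1;  // 16_777_217: not representable

    float f = exact;
    float g = beyond;

    assert(cast(int) f == 16_777_216); // round trip is exact
    assert(cast(int) g == 16_777_216); // the +1 was silently rounded away
}
```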
Feb 06
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/6/2025 7:18 AM, Quirin Schroll wrote:
 4. Micro-lossy narrowing conversions:
      * `int`/`uint` → `float`
      * `long`/`ulong` → `float`/`double`
We already do VRP checks for cases:

```
float f = 1;           // passes
float g = 0x1234_5678; // fails
```
Feb 06
parent Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Thursday, 6 February 2025 at 20:52:53 UTC, Walter Bright wrote:
 On 2/6/2025 7:18 AM, Quirin Schroll wrote:
 4. Micro-lossy narrowing conversions:
      * `int`/`uint` → `float`
      * `long`/`ulong` → `float`/`double`
We already do VRP checks for cases:

```
float f = 1;           // passes
float g = 0x1234_5678; // fails
```
I didn’t know that, but I hardly ever use floating-point types. However, that’s not exactly VRP, but a useful check that compile-time-known values are representable in the target type. VRP means that while you normally need a cast to assign an `int` to a `ubyte`, you can assign `myInt & 0xFF` to a `ubyte` without a cast. You *can* assign any run-time `int` to a `float`. What you’re pointing out is that “micro-lossy narrowing conversions” are a compile error if they’re *definitely* occurring.
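The distinction in code (a minimal sketch; the masked assignment is the classic VRP case):

```d
void main()
{
    int myInt = 1000;

    // ubyte a = myInt;      // error: a runtime int may not fit in a ubyte
    ubyte b = myInt & 0xFF;  // fine: VRP proves the result is in 0 .. 255
    assert(b == 232);

    float f = myInt;         // fine: any runtime int converts to float
}
```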
Feb 13
prev sibling parent reply Kagamin <spam here.lot> writes:
On Tuesday, 4 February 2025 at 16:29:22 UTC, Quirin Schroll wrote:
 Any implicit conversions? I’d boldly claim the following 
 conversions are unproblematic:
 * `float` → `double` → `real`
 * signed integer → bigger signed integer
 * unsigned integer → bigger unsigned integer
I once ported a 32 bit C++ application to 64 bit. The code was

```
uint32_t found=str1.find(str2);
if(found==string::npos)return;
str3=str1.substr(0,found);
```

One can say the problem is in the narrowing conversion, but there's still the fundamental problem that `npos` of different widths are incompatible.
Feb 06
parent reply Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Thursday, 6 February 2025 at 16:48:27 UTC, Kagamin wrote:
 On Tuesday, 4 February 2025 at 16:29:22 UTC, Quirin Schroll 
 wrote:
 Any implicit conversions? I’d boldly claim the following 
 conversions are unproblematic:
 * `float` → `double` → `real`
 * signed integer → bigger signed integer
 * unsigned integer → bigger unsigned integer
I once ported a 32 bit C++ application to 64 bit. The code was

```
uint32_t found=str1.find(str2);
if(found==string::npos)return;
str3=str1.substr(0,found);
```

One can say the problem is in the narrowing conversion, but there's still the fundamental problem that `npos` of different widths are incompatible.
The fault 100% lies in converting `std::size_t` (which is `std::uint64_t` on all(?) 64-bit platforms) to `std::uint32_t`. You could also say it’s bad that the compiler didn’t warn you about a non-trivial expression that will always be `false` because a `std::uint32_t` simply can’t be `std::string::npos` (which is `~std::uint64_t{}` guaranteed by the C++ Standard). Clang warns on these, GCC doesn’t. You really can’t blame `std::uint32_t` converting to `std::uint64_t`. That is completely reasonable.
Feb 06
parent reply Walter Bright <newshound2 digitalmars.com> writes:
Having a function that searches an array for a value and returns the index of 
the array if found, and -1 if not found, is not a good practice.

An index being returned should be size_t, and the not-found value should be 
size_t.max.

See my other post on recommendations for selecting integral types.
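A sketch of that convention (`indexOf` here is a hypothetical helper written for illustration, not a Phobos function):

```d
// Return the index of the first match as a size_t,
// or size_t.max as the not-found sentinel.
size_t indexOf(T)(const(T)[] a, T value)
{
    foreach (i, x; a)
        if (x == value)
            return i;
    return size_t.max;
}

void main()
{
    assert(indexOf([10, 20, 30], 20) == 1);
    assert(indexOf([10, 20, 30], 99) == size_t.max);
}
```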
Feb 06
next sibling parent reply monkyyy <crazymonkyyy gmail.com> writes:
On Thursday, 6 February 2025 at 20:44:46 UTC, Walter Bright wrote:
 Having a function that searches an array for a value and 
 returns the index of the array if found, and -1 if not found, 
 is not a good practice.
All options suck; -1 should suck least. size_t is insane in the 64-bit era: no one has that much RAM, no one has that much RAM even for compressed bools of 2^63 bits.
Feb 06
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/6/2025 12:59 PM, monkyyy wrote:
 All options suck; -1 should suck least. size_t is insane in the 64-bit era: no one has that much RAM, no one has that much RAM even for compressed bools of 2^63 bits.
As D is also a systems programming language, it provides access to the model the hardware implements.
Feb 06
parent reply monkyyy <crazymonkyyy gmail.com> writes:
On Friday, 7 February 2025 at 01:30:31 UTC, Walter Bright wrote:
 On 2/6/2025 12:59 PM, monkyyy wrote:
 All options suck, -1 should suck least; size_t is insane in 
 the 64bit era no one has that much ram, no one has that much 
 ram for compressed bools of 2^63 bits
As D is also a systems programming language, it provides access to the model the hardware implements.
Do any of the embedded projects have working slices? Is there no way to make size_t signed on 64-bit machines only, or behind a flag? I don't even know what the argument is for when that 64th bit will be used.
Feb 06
next sibling parent Kagamin <spam here.lot> writes:

```
int count(T)(in T[] a)
{
    // Truncate the length to int, checking in debug builds that nothing is lost.
    debug assert(a.length == cast(int) a.length);
    return cast(int) a.length;
}

long lcount(T)(in T[] a)
{
    // Reinterpret the length as signed, checking that it fits in a long.
    debug assert(long(a.length) >= 0);
    return long(a.length);
}
```
Feb 07
prev sibling parent Walter Bright <newshound2 digitalmars.com> writes:
size_t is just an alias declaration. The compiler does not actually know it
exists.
Feb 17
prev sibling parent reply DLearner <bmqazwsx123 gmail.com> writes:
On Thursday, 6 February 2025 at 20:44:46 UTC, Walter Bright wrote:
 Having a function that searches an array for a value and 
 returns the index of the array if found, and -1 if not found, 
 is not a good practice.

 An index being returned should be size_t, and the not-found 
 value should be size_t.max.
[...] Or, keeping size_t, make the first index of an array 1 rather than 0, and return 0 if not found. Like malloc. A first array index of 1 also eliminates a fruitful source of off-by-one errors.
Feb 07
parent Walter Bright <newshound2 digitalmars.com> writes:
On 2/7/2025 1:04 PM, DLearner wrote:
 Or, maintaining size_t, make first index of an array 1 not 0, and return 0 if 
 not found.
 Like malloc.
 
 First array index is 1 also eliminates a fruitful source of off-by-one errors.
That's FORTRAN style. It would break about every piece of D code.
Feb 17
prev sibling next sibling parent Guillaume Piolat <first.nam_e gmail.com> writes:
On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 My bias is to not like any implicit conversions of any kind, 
 but I'm not sure I can convince Walter of that.
Sounds like churn. Even the => syntax prevents using old compilers with new packages and causes churn in the DUB ecosystem.
Feb 05
prev sibling next sibling parent reply Walter Bright <newshound2 digitalmars.com> writes:
[I'm not sure why a new thread was created?]

This comes up now and then. It's an attractive idea, and seems obvious. But I've always been against it for multiple reasons.

1. Pascal solved this issue by not allowing any implicit conversions. The result was casts everywhere, which made the code ugly. I hate ugly code.

2. Java solved this by not having an unsigned type. People went to great lengths to emulate unsigned behavior. Eventually, the Java people gave up and added it.

3. Is `1` a signed int or an unsigned int?

4. What happens with `p[i]`? If p is the beginning of a memory object, we want i to be unsigned. If p points to the middle, we want i to be signed. What should be the type of `p - q`? signed or unsigned?

5. We rely on 2's complement overflow semantics to get the same behavior whether i is signed or unsigned, most of the time.

6. Casts are a blunt instrument that impairs readability and can cause unexpected behavior when changing a type in a refactoring. High quality code avoids the use of explicit casts as much as possible.

7. C behavior on this is extremely well known.

8. The Value Range Propagation feature was a brilliant solution that resolved most issues with implicit signed and unsigned conversions, without causing any problems.

9. Array bounds checking tends to catch the usual bugs from conflating signed with unsigned. Array bounds checking is a total winner of a feature.

Andrei and I went around and around on this, pointing out the contradictions. There was no solution. There is no "correct" answer for integer 2's complement arithmetic.

Here's what I do:

1. use unsigned if the declaration should never be negative.

2. use size_t for all pointer offsets

3. use ptrdiff_t for deltas of size_t that could go negative

4. otherwise, use signed

Stick with those and most of the problems will be avoided.
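The four guidelines, sketched in code (the struct and names here are illustrative only):

```d
struct Reading
{
    uint id;         // 1. can never be negative: unsigned
    int temperature; // 4. otherwise: signed
}

void main()
{
    ubyte[32] buf;
    const(ubyte)* base = buf.ptr;

    size_t offset = 16;        // 2. pointer offsets: size_t
    auto p = base + offset;
    auto q = base + 4;

    ptrdiff_t delta = q - p;   // 3. deltas that can go negative: ptrdiff_t
    assert(delta == -12);
}
```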
Feb 06
next sibling parent reply Walter Bright <newshound2 digitalmars.com> writes:
I forgot to mention:

```
int popcount(int x);
```
```
uint y = ...;
popcount(y);   // do you really want that to fail?
```

Let's fix it:

```
int popcount(uint x);
```
```
int y = ...;
popcount(y);   // now this fails
```
Feb 06
parent reply "Richard (Rikki) Andrew Cattermole" <richard cattermole.co.nz> writes:
On 07/02/2025 9:55 AM, Walter Bright wrote:
 I forgot to mention:
 
 ```
 int popcount(int x);
 ```
 ```
 uint y = ...;
 popcount(y);   // do you really want that to fail?
 ```
 
 Let's fix it:
 
 ```
 int popcount(uint x);
 ```
 ```
 int y = ...;
 popcount(y);   // now this fails
 ```
Within the last couple of days on Twitter, the C community has mentioned that they'd like implicit conversion of numeric types to static arrays. That could resolve this quite nicely.

```d
int popcount(ubyte[4]);

int y = ...;
popcount(y);

uint z = ...;
popcount(z);
```

An explicit cast with ``ref`` could go the other way.
Feb 06
parent Walter Bright <newshound2 digitalmars.com> writes:
On 2/6/2025 8:26 PM, Richard (Rikki) Andrew Cattermole wrote:
 That could resolve this quite nicely.
For popcount, not for anything else. There are a lot of functions with `int` or `uint` parameters where the sign is meaningless to their operation.
Feb 17
prev sibling next sibling parent reply Atila Neves <atila.neves gmail.com> writes:
On Thursday, 6 February 2025 at 09:10:41 UTC, Walter Bright wrote:
 [I'm not sure why a new thread was created?]

 This comes up now and then. It's an attractive idea, and seems 
 obvious. But I've always been against it for multiple reasons.

 1. Pascal solved this issue by not allowing any implicit 
 conversions. The result was casts everywhere, which made the 
 code ugly. I hate ugly code.
I hate ugly code too, but I'd rather have explicit casts.
 3. Is `1` a signed int or an unsigned int?
In Haskell, it could be either; the type would be inferred. Or the programmer chooses: `1 :: Int`.
 4. What happens with `p[i]`? If p is the beginning of a memory 
 object, we want i to be unsigned. If p points to the middle, we 
 want i to be signed. What should be the type of `p - q`? signed 
 or unsigned?
Good questions.
Feb 07
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/7/2025 4:50 AM, Atila Neves wrote:
 I hate ugly code too, but I'd rather have explicit casts.
Pascal required explicit casts. It sounded like a good idea. After a while, I hated it. It was so nice switching to C and leaving that behind. (Did I mention that explicit casts also hide errors introduced by refactoring?)
Feb 17
next sibling parent reply Atila Neves <atila.neves gmail.com> writes:
On Monday, 17 February 2025 at 08:30:44 UTC, Walter Bright wrote:
 On 2/7/2025 4:50 AM, Atila Neves wrote:
 I hate ugly code too, but I'd rather have explicit casts.
Pascal required explicit casts. It sounded like a good idea. After a while, I hated it. It was so nice switching to C and leaving that behind. (Did I mention that explicit casts also hide errors introduced by refactoring?)
`cast(typeof(foo)) bar`?
Feb 17
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/17/2025 1:06 AM, Atila Neves wrote:
 (Did I mention that explicit casts also hide errors introduced by refactoring?)
`cast(typeof(foo)) bar`?
That can work, but when best practices mean adding more code, the result is usually failure. Also, what if `foo` changes to something not anticipated by that cast?
Feb 17
parent Atila Neves <atila.neves gmail.com> writes:
On Monday, 17 February 2025 at 22:24:37 UTC, Walter Bright wrote:
 On 2/17/2025 1:06 AM, Atila Neves wrote:
 (Did I mention that explicit casts also hide errors 
 introduced by refactoring?)
`cast(typeof(foo)) bar`?
That can work, but when best practices mean adding more code, the result is usually failure. Also, what if `foo` changes to something not anticipated by that cast?
Compilation or test failure, probably.
Feb 17
prev sibling parent reply Nick Treleaven <nick geany.org> writes:
On Monday, 17 February 2025 at 08:30:44 UTC, Walter Bright wrote:
 (Did I mention that explicit casts also hide errors introduced 
 by refactoring?)
In this case, we can use these with IFTI instead of explicit casts: https://dlang.org/phobos/std_conv.html#signed https://dlang.org/phobos/std_conv.html#unsigned
Feb 17
parent Walter Bright <newshound2 digitalmars.com> writes:
On 2/17/2025 1:11 PM, Nick Treleaven wrote:
 In this case, we can use these with IFTI instead of explicit casts:
 
 https://dlang.org/phobos/std_conv.html#signed
 https://dlang.org/phobos/std_conv.html#unsigned
Yes (those were Andrei's initiative). Up to a point. An explicit use of a signed template doesn't work if one is refactoring to an unsigned type.
Feb 17
prev sibling parent reply Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Thursday, 6 February 2025 at 09:10:41 UTC, Walter Bright wrote:
 [I'm not sure why a new thread was created?]

 This comes up now and then. It's an attractive idea, and seems 
 obvious. But I've always been against it for multiple reasons.

 1. Pascal solved this issue by not allowing any implicit 
 conversions. The result was casts everywhere, which made the 
 code ugly. I hate ugly code.
Let me guess: Pascal has no value-range propagation?
 2. Java solve this by not having an unsigned type. People went 
 to great lengths to emulate unsigned behavior. Eventually, the 
 Java people gave up and added it.
Java 23 does not have unsigned types, though. There are only operations that essentially reinterpret the bits of signed integer types as unsigned integers and do operations on them. Signed and unsigned multiplication, division and modulo are completely different operations.
 3. Is `1` a signed int or an unsigned int?
Ideally, it has its own type that implicitly converts to anything that can be initialized by the constant. Of course, `typeof()` must return something; there are three options:
- `typeof(1)` is `typeof(1)`, similar to `typeof(null)`
- `typeof(1)` is `__static_integer` (cf. Zig’s `comptime_int`)
- `typeof(1)` is `int`, which makes it indistinguishable from a runtime expression

D chooses the latter. None of those are a bad choice; tradeoffs everywhere.
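In code, D's choice looks like this (a small sketch of the literal rules as I understand them):

```d
void main()
{
    // D: an integer literal is an int...
    static assert(is(typeof(1) == int));

    // ...but VRP still lets a literal that fits initialize
    // smaller types without a cast, which softens the tradeoff.
    ubyte b = 1;
    short s = 1;
    assert(b == 1 && s == 1);
}
```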
 4. What happens with `p[i]`? If `p` is the beginning of a 
 memory object, we want `i` to be unsigned. If `p` points to the 
 middle, we want `i` to be signed. What should be the type of `p 
 - q`? signed or unsigned?
Two questions, two answers.
 What happens with `p[i]`?
That’s a vague question. If `p` is a slice, range error if `i` is signed and negative. If `p` is a pointer, it’s `*(p + i)` and if `i` is signed and negative, so be it. `typeof(p + i)` is `typeof(p)`, so there shouldn’t be a problem.
 What should be the type of `p - q`? signed or unsigned?
Signed. If `p` and `q` are compile-time constants, so is `p - q`, and if it’s nonnegative, it converts to unsigned types. While it would be annoying for sure, it does make sense to use a function for pointer subtraction when one assumes the difference to be positive: `unsignedDifference(p, q)`. It would assert that the result is in fact positive or zero and return a `size_t`. The cool thing about it is that if you expect an unsigned result and happen to be wrong, you’ll find out quicker than otherwise.
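A sketch of the `unsignedDifference` helper described above (the name and contract come from this post, not from an existing library):

```d
size_t unsignedDifference(T)(const(T)* p, const(T)* q)
{
    ptrdiff_t d = p - q;
    assert(d >= 0, "expected a nonnegative difference");
    return cast(size_t) d;
}

void main()
{
    int[8] a;
    assert(unsignedDifference(&a[5], &a[2]) == 3);
    // unsignedDifference(&a[2], &a[5]) would trip the assertion.
}
```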
 5. We rely on 2's complement overflow semantics to get the same 
 behavior if `i` is signed or unsigned, most of the time.
As I see it, 2’s complement for both signed and unsigned arithmetic is a straightforward choice D made to keep `@safe` useful. If D made any of them UB, it would exclude part of basic arithmetic from `@safe` because `@safe` bans every operation that *can* introduce UB. It’s essentially why pointer arithmetic is banned in `@safe`, since `++p` might push `p` outside an array, which is UB. D offers slices as a safe (because checked) alternative to pointers.
 6. Casts are a blunt instrument that impair readability and can 
 cause unexpected behavior when changing a type in a 
 refactoring. High quality code avoids the use of explicit casts 
 as much as possible.
In my experience, when signed and unsigned are mixed, it points to a design issue. I had this experience a couple of times working on an older C++ codebase.
 7. C behavior on this is extremely well known.
Making something valid in C do something it can’t do in C is a bad idea and invites bugs, that is true. Making questionable C things errors *prima facie* isn’t. AFAICT, D for the most part sticks to: If it looks like C, it behaves like C or doesn’t compile. Banning signed-to-unsigned conversions (unless VRP proves it’s okay) simply falls into the latter box.
 8. The Value Range Propagation feature was a brilliant 
 solution, that resolved most issues with implicit signed and 
 unsigned conversions, without causing any problems.
Of course VRP is great. For the most part, it means if an implicit conversion compiles, it’s because nothing weird happens, no data can be lost, etc. Signed to unsigned conversion breaks this expectation that VRP in fact co-created.
 9. Array bounds checking tends to catch the usual bugs with 
 conflating signed with unsigned. Array bounds checking is a 
 total winner of a feature.
It’s generally good. Almost no-one complains about it.
 Andrei and I went around and around on this, pointing out the 
 contradictions. There was no solution. There is no "correct" 
 answer for integer 2's complement arithmetic.
I don’t really know what that means. Integer types in C and most languages derived from it (D included) have this oddity: addition and subtraction are 2’s complement, but multiplication, division, and modulo are not (`cast(uint)(-10 / 3)` and `cast(uint)-10 / 3` are different). Mathematically speaking, integers in D are neither values modulo 2ⁿ nor a section of ℤ.
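The parenthetical spelled out: the cast placement decides whether the division itself is signed or unsigned.

```d
void main()
{
    // Divide as signed ints, then reinterpret the bits:
    assert(cast(uint)(-10 / 3) == 4_294_967_293);  // -3 reinterpreted

    // Reinterpret first, then divide as unsigned:
    assert(cast(uint)-10 / 3 == 1_431_655_762);    // 4_294_967_286 / 3
}
```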
 Here's what I do:

 1. use unsigned if the declaration should never be negative.

 2. use size_t for all pointer offsets

 3. use ptrdiff_t for deltas of size_t that could go negative

 4. otherwise, use signed

 Stick with those and most of the problems will be avoided.
Sounds reasonable.
Feb 13
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/13/2025 4:00 PM, Quirin Schroll wrote:
 Signed and unsigned multiplication, division and 
 modulo are completely different operations.
Signed and unsigned multiplication produce the exact same bit pattern result. Division and modulo are indeed different.
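A quick check of both claims: wrapping multiplication agrees bit-for-bit, while division does not.

```d
void main()
{
    int a = -10, b = 3;
    uint ua = cast(uint) a, ub = cast(uint) b;

    // Multiplication: the low 32 bits are identical either way.
    assert(cast(uint)(a * b) == ua * ub);

    // Division: genuinely different operations.
    assert(cast(uint)(a / b) != ua / ub);
}
```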
 None of those are a bad choice; tradeoffs everywhere.
It's always tradeoffs.
 4. What happens with `p[i]`? If `p` is the beginning of a memory object, we 
 want `i` to be unsigned. If `p` points to the middle, we want `i` to be 
 signed. What should be the type of `p - q`? signed or unsigned?
Two questions, two answers.
 What happens with `p[i]`?
That’s a vague question. If `p` is a slice, range error if `i` is signed and negative. If `p` is a pointer, it’s `*(p + i)` and if `i` is signed and negative, so be it. `typeof(p + i)` is `typeof(p)`, so there shouldn’t be a problem.
Sorry, I meant `p` as a pointer. I use `a` as an array (or slice). A pointer can move forward or backwards, so the index is signed. A slice cannot back up, so the index is unsigned. A slice can be converted to a pointer. So then what, is the index signed or unsigned? There's no answer for that.
 What should be the type of `p - q`? signed or unsigned?
Signed.
That doesn't work if the array is bigger than the int range, or happens to straddle `int.max`. (The garbage collector can run into this.)
 While it would be annoying for sure, it does make sense to use a function for pointer subtraction when one assumes the difference to be positive: `unsignedDifference(p, q)`. It would assert that the result is in fact positive or zero and return a `size_t`. The cool thing about it is that if you expect an unsigned result and happen to be wrong, you’ll find out quicker than otherwise.
I'm sorry, but all this extra baggage and these rules about signed and unsigned make it harder to use, not easier.
 As I see it, 2’s complement for both signed and unsigned arithmetic is a straightforward choice D made to keep `@safe` useful.
D's type system preceded @safe by many years :-/
 If D made any of them UB, it would exclude part of basic arithmetic from `@safe` because `@safe` bans every operation that *can* introduce UB.
@safe only bans memory corruption. 2's complement arithmetic is not UB.
 It’s essentially why pointer arithmetic is banned in `@safe`, since `++p` might push `p` outside an array, which is UB. D offers slices as a safe (because checked) alternative to pointers.
`--p` and `++p` are always unsafe whether the implicit conversions are there or not.
 6. Casts are a blunt instrument that impair readability and can cause 
 unexpected behavior when changing a type in a refactoring. High quality code 
 avoids the use of explicit casts as much as possible.
In my experience, when signed and unsigned are mixed, it points to a design issue. I had this experience a couple of times working on an older C++ codebase.
Hence my suggestions. I look at it this way. D is a systems programming language. A requirement for being successful at it is understanding 2's complement arithmetic, including what wraparound is. It's not that dissimilar to the requirement of some understanding of how floating point code works and its limitations, otherwise grief will be your inevitable companion. Also that a bool is a one bit integer arithmetic type. I know there are languages that attempt to hide all this stuff, but D isn't one of them.
Feb 17
next sibling parent Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Monday, 17 February 2025 at 09:01:45 UTC, Walter Bright wrote:
 On 2/13/2025 4:00 PM, Quirin Schroll wrote:
 Signed and unsigned multiplication, division and modulo are 
 completely different operations.
Signed and unsigned multiplication produce the exact same bit pattern result. Division and modulo are indeed different.
You’re right, I was mistaken. I thought multiplication by −1 had to be different from multiplication by `T.max`, but it’s not.
 None of those are a bad choice; tradeoffs everywhere.
It's always tradeoffs.
Sometimes, there are better things.
 4. What happens with `p[i]`? If `p` is the beginning of a 
 memory object, we want `i` to be unsigned. If `p` points to 
 the middle, we want `i` to be signed. What should be the type 
 of `p - q`? signed or unsigned?
Two questions, two answers.
 What happens with `p[i]`?
That’s a vague question. If `p` is a slice, range error if `i` is signed and negative. If `p` is a pointer, it’s `*(p + i)` and if `i` is signed and negative, so be it. `typeof(p + i)` is `typeof(p)`, so there shouldn’t be a problem.
Sorry, I meant `p` as a pointer. I use `a` as an array (or slice). A pointer can move forward or backwards, so the index is signed. A slice cannot back up, so the index is unsigned. A slice can be converted to a pointer. So then what, is the index signed or unsigned? There's no answer for that.
The index already has a type. The operation `p + i` can support signed and unsigned `i` via overloading. I really don’t see the problem. You’re not inferring the type of the index because of the operation.
 What should be the type of `p - q`? signed or unsigned?
Signed.
That doesn't work if the array is bigger than the int range, or happens to straddle `int.max`. (The garbage collector can run into this.)
Why would the GC use `int`? Unless, of course, it happens to equal `ptrdiff_t`? Those are conceptually different. The general problem is, basically, that differences of n-bit integers require n+1 bits to represent. That problem is not inherent to unsigned values, it’s just more obvious because 2 − 1 can’t be represented. In signed world, `-2` − `int.max` doesn’t fit in an `int` either. Making them signed doesn’t fix differences of indices totally, only differences of non-negative values.
 While it would be annoying for sure, it does make sense to use 
 a function for pointer subtraction when one assumes the 
 difference to be positive: `unsignedDifference(p, q)` It would 
 assert that the result is in fact positive or zero and return 
 a `size_t`. The cool thing about it is that if you expect an 
 unsigned result and happen to be wrong, you’ll find out 
 quicker than otherwise.
I'm sorry, all these extra baggage and rules about signed and unsigned makes it harder to use, not easier.
It’s much harder to write bugs when signed and unsigned are separated.
 As I see it, 2’s complement for both signed and unsigned arithmetic is a straightforward choice D made to keep `@safe` useful.
D's type system preceded @safe by many years :-/
My argument isn’t so much about history, but UB. Java does the same.
 If D made any of them UB, it would exclude part of basic arithmetic from `@safe` because `@safe` bans every operation that *can* introduce UB.
@safe only bans memory corruption.
In the language design space, there’s no difference between UB and memory corruption, because memory corruption is a form of UB and any UB can lead to memory corruption (by definition, really). Therefore, speaking about memory corruption is equivalent to speaking about UB generally. D’s `@safe` bans all UB (by intent at least). If it didn’t, it would allow for memory corruption; it doesn’t matter whether directly or indirectly.
 2's complement arithmetic is not UB.
Of course it’s not. The alternative to 2’s complement is UB (practically speaking). There are some odd platforms with a negative representation that’s not 2’s complement, but D supports none of them. What I’m saying is, when designing a programming language, your choices for integer overflow are: 2’s complement or UB. D chose 2’s complement overall (as did Java), C/C++ chose 2’s complement for unsigned and UB for signed, and Zig chose UB overall. Guaranteeing 2’s complement means the operation is well-defined for all inputs, but the optimizer can do less. Tradeoffs everywhere. Even before `@safe`, having all operations on integers well-defined (maybe ignoring division by zero) has positives that I guess you saw. Historically speaking, had D taken the C/C++ or Zig route, there would be no `@safe`, because if basic operations on integers can be UB, adding a feature like `@safe` makes no sense.
 It’s essentially why pointer arithmetic is banned in `@safe`, 
 since `++p` might push `p` outside an array, which is UB. D 
 offers slices as a safe (because checked) alternative to 
 pointers.
`--p` and `++p` are always unsafe whether the implicit conversions are there or not.
What I find interesting is that:
- For pointers, it’s obvious to almost anyone that slices are a win because of bounds checking, even though that comes with a dual cost: the length has to be stored and indexing operations have to be range-checked.
- For integer operations, people seem hesitant to range-check them, even though that comes only with the cost of doing the check; no bound has to be stored.

It’s not that 2’s complement doesn’t have its place; what I am saying is: the language constructs should be as close to the intuition of the programmer as possible. I for one know when I’m making deliberate use of the bit representation of integers; however, without checks, I’m making use of the bit representation of integers with every operation, most of the time when I don’t intend to. Most of the time, the fact that integers are binary is conceptually irrelevant.
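A small D sketch of that asymmetry, contrasting a checked slice operation with an unchecked integer one:

```d
void main()
{
    int[] arr = [1, 2, 3];
    // arr[3];            // out of bounds: fails fast with a RangeError
    assert(arr.length == 3);

    byte b = byte.max;
    b += 1;               // out of range: wraps silently, no check at all
    assert(b == byte.min);
}
```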
 6. Casts are a blunt instrument that impair readability and 
 can cause unexpected behavior when changing a type in a 
 refactoring. High quality code avoids the use of explicit 
 casts as much as possible.
In my experience, when signed and unsigned are mixed, it points to a design issue. I had this experience a couple of times working on an older C++ codebase.
Hence my suggestions.
One cannot apply suggestions retroactively to a huge codebase that’s >15 years old. One can, however, ban narrowing conversions and discover the problematic spots in compile errors and address them properly.
 I look at it this way. D is a systems programming language. A 
 requirement for being successful at it is understanding 2's 
 complement arithmetic, including what wraparound is.
While I agree that it is true, and that I would exclude anyone who doesn’t understand 2’s complement from being called a competent programmer, I find myself rarely thinking about indices and whatnot as anything other than an integer with a limited range. For hashing and some other algorithms, you do think of those as elements of an ordered [unitary ring](https://en.wikipedia.org/wiki/Ring_(mathematics)) with an operation referred to as “division with remainder.”

D inherited its types from C, and C inherited them from the operations of machines. It wouldn’t have occurred to the creators of C to provide different types for doing boolean logic, integer arithmetic, indexing arithmetic (a.k.a. addressing), and bit operations. All of these happen in the same kinds of registers; to most people, however, a boolean value isn’t an integer (even C added `_Bool` and then `bool`), a number isn’t an index, and an index isn’t a bit-vector. To most people, `size_t` means more than “alias for the unsigned integer type with the same size as an address”; it conceptualizes sizes of memory or indices into arrays (in memory). Nobody would use a `size_t` to model the age of something; age is a number (within some range), not an index.

What’s the difference between `i << 1` and `i * 2`? From the low-level perspective, literally none after optimization. However, in code, those encode very different intents. D is a low-level _and_ a high-level language. From the higher levels, mixing bit-vectors and numbers is usually a mistake. The language requiring you to state that, yes, that’s indeed what you want isn’t exactly bad.
 It's not that dissimilar to the requirement of some 
 understanding of how floating point code works and its 
 limitations, otherwise grief will be your inevitable companion.

 Also that a `bool` is a one bit integer arithmetic type.
I wonder why D has a 1-bit integer type, which is conceptually a boolean value, but no general n-bit integer types. C23 added `_BitInt(n)`, and `_BitInt(1)` is not `bool` (which C23 made a proper type).
 I know there are languages that attempt to hide all this stuff, 
 but D isn't one of them.
There’s a difference between hiding and not needlessly exposing. Making the implicit conversion between `int` and `uint` an error isn’t hiding things akin to Java hiding its pointers. Narrowing implicit conversions warrant a warning in C and C++, and rightly so – they are likely a mistake, and a local fix is available (use an explicit cast); brace-initialization in C++ outright bans them. By the design of D, it should be an error. Alternatives are:
- Redesign so the error doesn’t even come up anymore.
- Assert, then cast. (If you’re “really sure” it can’t fail.)
- Use a throwing narrowing conversion function. (If you’re “mostly sure” it can’t fail.)
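The last two alternatives can be sketched with Phobos; `std.conv.to` is the throwing narrowing conversion (names below come from `std.conv` and `std.exception`):

```d
import std.conv : to, ConvOverflowException;
import std.exception : assertThrown;

void main()
{
    // "Mostly sure": a checked conversion that throws if the value
    // doesn't fit into the target type.
    long fits = 42;
    assert(fits.to!int == 42);
    assertThrown!ConvOverflowException((3_000_000_000L).to!int);

    // "Really sure": assert the assumption, then cast explicitly.
    long index = 7;
    assert(index >= 0 && index <= int.max);
    int i = cast(int) index;
    assert(i == 7);
}
```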
Feb 17
prev sibling parent reply Paul Backus <snarwin gmail.com> writes:
On Monday, 17 February 2025 at 09:01:45 UTC, Walter Bright wrote:
 On 2/13/2025 4:00 PM, Quirin Schroll wrote:
 If D made any of them UB, it would exclude part of basic 
 arithmetic from `@safe` because `@safe` bans every operation 
 that *can* introduce UB.
@safe only bans memory corruption. 2's complement arithmetic is not UB.
Dividing an integer by zero is UB according to the D spec [1], and it is allowed in @safe code. [1] https://dlang.org/spec/expression.html#division
Feb 17
parent reply Walter Bright <newshound2 digitalmars.com> writes:
On 2/17/2025 7:07 AM, Paul Backus wrote:
 On Monday, 17 February 2025 at 09:01:45 UTC, Walter Bright wrote:
 @safe only bans memory corruption. 2's complement arithmetic is not UB.
Dividing an integer by zero is UB according to the D spec [1], and it is allowed in safe code. [1] https://dlang.org/spec/expression.html#division
That's correct. But it's not memory corruption, and requiring casts doesn't address it. The usual result is that a signal is generated. These signals can be intercepted at the user's discretion. The compiler will flag an error if it can statically determine that the divisor is zero. Runtime checks could be added, but since other languages don't do that, it would put D at a competitive disadvantage. As always, there are tradeoffs.
Feb 17
parent Paul Backus <snarwin gmail.com> writes:
On Tuesday, 18 February 2025 at 00:33:27 UTC, Walter Bright wrote:
 On 2/17/2025 7:07 AM, Paul Backus wrote:
 Dividing an integer by zero is UB according to the D spec [1], 
 and it is allowed in @safe code.
 
 [1] https://dlang.org/spec/expression.html#division
That's correct. But it's not memory corruption, and requiring casts doesn't address it. The usual result is a signal is generated. These can be intercepted at the user's discretion.
An optimizing compiler (like LDC or GDC) is allowed to generate code that produces memory corruption if a division by zero would occur. So this is absolutely a hole in @safe. If the compiler could guarantee that a signal would be generated on division by zero, that would be sufficient to close the safety hole.
 The compiler will flag an error if it can statically determine 
 that the divisor is zero. Runtime checks could be added, but 
 since other languages don't do that, it would put D at a 
 competitive disadvantage.
An alternative solution that does not require giving up any runtime performance would be to require @safe code to use std.checkedint for dividing integers.
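A rough sketch of what that could look like with Phobos's `std.checkedint` (using the `Throw` hook here; which hook `@safe` would mandate is exactly the open design question):

```d
import std.checkedint : Checked, Throw;
import std.exception : assertThrown;

void main()
{
    Checked!(int, Throw) a = 10;
    assert(a / 2 == 5); // ordinary arithmetic behaves as usual

    // Operations whose result doesn't fit throw instead of being UB;
    // std.checkedint also hooks division, so divide-by-zero and
    // int.min / -1 are caught rather than corrupting state.
    Checked!(int, Throw) big = int.max;
    assertThrown(big + 1);
}
```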
Feb 17
prev sibling next sibling parent reply Dom DiSc <dominikus scherkl.de> writes:
On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

 On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where 
 they are widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
I think most of the problems with these implicit conversions would be gone if we make this work:

```d
byte a = -5;
ulong b = 1_000_000_000_000;
assert(a < b); // fails today: a is converted to ulong
```

And we already do have a solution for this (see https://issues.dlang.org/show_bug.cgi?id=259), but Walter refuses it, because it would break code that relies on this bug. How much less likely is it to convince him of your proposal?
Feb 06
parent "Richard (Rikki) Andrew Cattermole" <richard cattermole.co.nz> writes:
On 07/02/2025 12:07 AM, Dom DiSc wrote:
 On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org

 On Sunday, 12 May 2024 at 13:32:36 UTC, Paul Backus wrote:
 D inherited these implicit conversions from C and C++, where they are 
 widely regarded as a source of bugs.

 [...]
My bias is to not like any implicit conversions of any kind, but I'm not sure I can convince Walter of that.
 I think most of the problems with these implicit conversions would be gone if we make this work:
 ```d
 byte a = -5;
 ulong b = 1_000_000_000_000;
 assert(a < b); // fails today: a is converted to ulong
 ```
 And we already do have a solution for this (see https://issues.dlang.org/show_bug.cgi?id=259), but Walter refuses it, because it would break code that relies on this bug. How much less likely is it to convince him of your proposal?
We should revisit this once editions are accepted. It sounds reasonable to disable such comparisons, with VRP kicking in to allow them selectively.
Feb 06
prev sibling parent reply Kagamin <spam here.lot> writes:
On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org
I agree with Bjarne: the problem is entirely caused by the abuse of unsigned integers as positive numbers. And deprecation of the implicit conversion is impossible due to this abuse: signed and unsigned integers will be mixed everywhere, because signed integers are proper numbers while unsigned integers are everywhere in almost all interfaces, and it just works.
Feb 06
parent reply Quirin Schroll <qs.il.paperinik gmail.com> writes:
On Thursday, 6 February 2025 at 16:39:26 UTC, Kagamin wrote:
 On Monday, 3 February 2025 at 18:40:20 UTC, Atila Neves wrote:
 https://forum.dlang.org/post/pbhjffbxdqpdwtmcbikh forum.dlang.org
I agree with Bjarne, the problem is entirely caused by abuse of unsigned integers as positive numbers. And deprecation of implicit conversion is impossible due to this abuse: signed and unsigned integers will be mixed everywhere because signed integers are proper numbers and unsigned integers are everywhere due to abuse.
What would be a “proper number”? At best, signed and unsigned types represent various slices of the infinite integers.

 interfaces and it just works.
C#, for example, largely avoids unsigned types. There’s a [`CLSCompliantAttribute`](https://learn.microsoft.com/de-de/dotnet/api/system.clscompliantattribute) that warns you if you expose unsigned integers in your public API. Notably, in C#, `byte` is unsigned and `sbyte` is the signed, non-CLS-compliant variant.
Feb 13
parent Kagamin <spam here.lot> writes:
On Friday, 14 February 2025 at 00:09:14 UTC, Quirin Schroll wrote:
 What would be a “proper number”? At best, signed and unsigned 
 types represent various slices of the infinite integers.
The problem is that they are incompatible slices that you have to mix, due to the abuse of unsigned integers everywhere. At best an unsigned integer gives you an extra bit, but in practice that doesn’t cut it: when you want a bigger integer, you use a much wider integer, not a one-bit-bigger integer.

 unsigned types.
It demonstrates that the problem is due to abuse of unsigned integers.
Feb 15