digitalmars.D.learn - Disjoint slices of an array as reference

data pulverizer (32/32) Aug 19 2020 I have been trying to create a new array from an existing array

data pulverizer (8/13) Aug 19 2020 I've been thinking about this some more and I don't think it is

data pulverizer (5/11) Aug 19 2020 p.s.

=?UTF-8?Q?Ali_=c3=87ehreli?= (14/16) Aug 20 2020 It is the length, followed by the pointer, equivalent of the following

Paul Backus (5/17) Aug 19 2020 double[][] y;

data pulverizer (12/14) Aug 19 2020 Thanks. I might go for a design like this:

=?UTF-8?Q?Ali_=c3=87ehreli?= (19/36) Aug 20 2020 I implemented the same idea recently; it's a fun exercise. :) I didn't=2...

data pulverizer (36/63) Aug 20 2020 Many thanks for confirming the internal structure of D's arrays

data pulverizer <data.pulverizer gmail.com> writes:

I have been trying to create a new array from an existing array 
that is effectively a view on the original array. This can be 
done with slices in D:

```
auto x = [1,  2,  3, 4,  5,
            6,  7,  8, 9,  10,
          11, 12, 13, 14, 15];
auto y = [0..5];
```

y a subset of the x array. If I do:

```
y[0] = 11;
```

x[0] will be modified to 11;

However I would like to have disjoint slices, something like this:

```
auto y = x[0..5, 9..14];
```

Which selects different disjoint parts of the array x, but is not 
allowed on T[]. I have tried something like:

```
double[] y;
y ~= x[0..5];
y ~= x[9..14];
```

But the act of appending results in array copying - breaks the 
reference with the original array. The only other thing I have 
considered is creating an array of references to each of the 
elements of x I would like but that just seems like overkill.

It would be good to know if there is a more straightforward way 
of doing this type of disjoint selection.

Thanks in advance.

Aug 19 2020

data pulverizer <data.pulverizer gmail.com> writes:

On Thursday, 20 August 2020 at 02:21:15 UTC, data pulverizer 
wrote:
 However I would like to have disjoint slices, something like 
 this:

 ```
 auto y = x[0..5, 9..14];
 ```

I've been thinking about this some more and I don't think it is 
possible. An array in D is effectively two pointers either side 
of a memory block. When you create a slice you are creating 
another array two pointers somewhere in the same memory block. A 
disjoint slice array would need more than two pointers - which is 
not possible since an array only has 2.

Aug 19 2020

data pulverizer <data.pulverizer gmail.com> writes:

On Thursday, 20 August 2020 at 02:38:33 UTC, data pulverizer 
wrote:
 I've been thinking about this some more and I don't think it is 
 possible. An array in D is effectively two pointers either side 
 of a memory block. When you create a slice you are creating 
 another array two pointers somewhere in the same memory block. 
 A disjoint slice array would need more than two pointers - 
 which is not possible since an array only has 2.

p.s.

An array in D is either two pointers or one pointer and a length 
(I don't know which) - but the point still stands.

Aug 19 2020

=?UTF-8?Q?Ali_=c3=87ehreli?= <acehreli yahoo.com> writes:

On 8/19/20 7:40 PM, data pulverizer wrote:

 An array in D is either two pointers or one pointer and a length (I
 don't know which)

It is the length, followed by the pointer, equivalent of the following 
struct:

struct A {
   size_t length_;
   void * ptr;

   size_t length() {
     return length_;
   }

   size_t length(size_t newLength) {
     // Modify length_ and ptr as necessary
   }
}

Ali

Aug 20 2020

Paul Backus <snarwin gmail.com> writes:

On Thursday, 20 August 2020 at 02:21:15 UTC, data pulverizer 
wrote:
 ```
 double[] y;
 y ~= x[0..5];
 y ~= x[9..14];
 ```

 But the act of appending results in array copying - breaks the 
 reference with the original array. The only other thing I have 
 considered is creating an array of references to each of the 
 elements of x I would like but that just seems like overkill.

 It would be good to know if there is a more straightforward way 
 of doing this type of disjoint selection.

 Thanks in advance.

double[][] y;
y ~= x[0..5];
y ~= x[9..14];

Aug 19 2020

data pulverizer <data.pulverizer gmail.com> writes:

On Thursday, 20 August 2020 at 03:47:15 UTC, Paul Backus wrote:
 double[][] y;
 y ~= x[0..5];

Thanks. I might go for a design like this:

```
struct View(T){
   T* data;
   long[2][] ranges;
}
```
The ranges are were the slices are stored and T* (maybe even 
immutable(T*)) is a pointer is to the start of the original 
array. I'll use an opIndex that calculates the correct index in 
the original array to obtain the right data.

Aug 19 2020

=?UTF-8?Q?Ali_=c3=87ehreli?= <acehreli yahoo.com> writes:

On 8/19/20 9:11 PM, data pulverizer wrote:
 On Thursday, 20 August 2020 at 03:47:15 UTC, Paul Backus wrote:
 double[][] y;
 y ~=3D x[0..5];

=20
 Thanks. I might go for a design like this:
=20
 ```
 struct View(T){
  =C2=A0 T* data;
  =C2=A0 long[2][] ranges;
 }
 ```
 The ranges are were the slices are stored and T* (maybe even=20
 immutable(T*)) is a pointer is to the start of the original array. I'll=

=20
 use an opIndex that calculates the correct index in the original array =

 to obtain the right data.
=20

I implemented the same idea recently; it's a fun exercise. :) I didn't=20
bother with opIndex because my use case was happy with just the=20
InputRange primitives (and .length I think).

And I had to implement it because std.range.chain works only with=20
statically known number of sub-ranges. :/ If the number of ranges are=20
known, then this works:

import std.stdio;
import std.range;

void main() {
   auto x =3D [1,  2,  3, 4,  5,
             6,  7,  8, 9,  10,
             11, 12, 13, 14, 15];
   auto y =3D chain(x[0..5], x[9..14]);
   writeln(y);
}

[1, 2, 3, 4, 5, 10, 11, 12, 13, 14]

Ali

Aug 20 2020

data pulverizer <data.pulverizer gmail.com> writes:

On Thursday, 20 August 2020 at 08:26:59 UTC, Ali Çehreli wrote:
 On 8/19/20 9:11 PM, data pulverizer wrote:
 Thanks. I might go for a design like this:
 
 ```
 struct View(T){
    T* data;
    long[2][] ranges;
 }
 ```
 [...]

 I implemented the same idea recently; it's a fun exercise. :) I 
 didn't bother with opIndex because my use case was happy with 
 just the InputRange primitives (and .length I think).

 And I had to implement it because std.range.chain works only 
 with statically known number of sub-ranges. :/ If the number of 
 ranges are known, then this works:

 import std.stdio;
 import std.range;

 void main() {
   auto x = [1,  2,  3, 4,  5,
             6,  7,  8, 9,  10,
             11, 12, 13, 14, 15];
   auto y = chain(x[0..5], x[9..14]);
   writeln(y);
 }

 [1, 2, 3, 4, 5, 10, 11, 12, 13, 14]

 Ali

Many thanks for confirming the internal structure of D's arrays 
and for the tip on std.range's chain function. It's exactly what 
I need. In fact the number of sub-ranges are statically known ...

As an aside, the reason for this query is that I have written a 
small module for multidimensional arrays to be included in a GLM 
(generalized linear models) package I am writing in D (in fact 
this will be the second major version of the package in D) - I 
know about Mir but my array doesn't have to be feature rich and 
only requires very few methods, I'm also probably going to write 
an article about it's internal structure and I've learned a lot 
creating it, and it means the GLM library won't have dependencies 
outside D's standard library and BLAS/LAPACK.

The multidimensional array is structured like this:

```
Array(long N, T)
if(isFloatingPoint!T && (N >= 1))
{
   T[] data;
   long[N] dim;
   ...
}
```

Where N is the number of dimensions. The indexing and stuff works 
fine but I wasn't happy that subsetting the array with slices 
returns a copy (for example an 2D array `A[0..2, 1..$]`) and 
since the subsetting is not necessarily contiguous, directly 
slicing from the data array was not feasible. But now N is known 
at compile time (`A[r[0][0]..r[0][1], r[1][0]..r[1][1], ... , 
r[N][0]..r[N][1]]`) so doing a subset using std.range's `chain` 
on `data` is easy. It also means I don't have to implement a 
separate `View` struct. This chain function will also simplify my 
indexing code, at the moment I am using string mixins to generate 
code for for loops over all the different dimensions where the 
function creating string the is recursive ... which was fun to 
write [:laugh:]!

Aug 20 2020

D Programming

C/C++ Programming

Other

digitalmars.D.learn - Disjoint slices of an array as reference