deepakkumar1984 on master
fix unknown kernel_regularizer … Merge pull request #97 from Ben… (compare)
Oceania2018 on master
Add implicitation for Operation… (compare)
Unsafe.Addfor extra speed) and GPU (using this library), and tested it out on an input tensor of size
128*512*512on my notebook (i7 8750H 6C/12T @4Ghz, GTX1050), and this is what I got:
0.3sin this test, which is still over 11x the CPU implementation.
// Allocate a writeable buffer on the GPU, with the contents of the array using ReadWriteBuffer<float> buffer = Gpu.Default.AllocateReadWriteBuffer<float>(1000); // Run the shader Gpu.Default.For(1000, id => buffer[id.X] = id.X); // Get the data back float array = buffer.GetData();
idand why does it have an
Xproperty? is it a point in an ND space?
ThreadIdsvalue that contains the current index along the 3 axis.
ktuple of indices for 3 nested loops
forloop, which just computes the result value for a single target element
System.Mathprovides? (e.g. gpu's sin)
Math.Pow), and they get automatically mapped to HLSL functions