@@ -231,12 +231,12 @@ For optimal layouts, **matrix stride** is ignored.
231
231
232
232
Only non-packed interpretations are valid for matrices.
233
233
234
- The base address of ** matrix resource** and ** matrix offset** must be 128 byte
234
+ The base address of ** matrix resource** and ** matrix offset** must be 128- byte
235
235
aligned. Also note that the size of the underlying allocation is guaranteed to
236
236
be a multiple of 16 bytes ensuring that the 16 bytes access of the last
237
237
row/column of the matrix is valid memory.
238
238
239
- The ** matrix stride** is 16 byte aligned.
239
+ The ** matrix stride** is 16- byte aligned.
240
240
241
241
This operation doesn't perform bounds checking for matrix loads. If any part of
242
242
the matrix load is out of bounds then the entire operation is undefined.
@@ -252,7 +252,7 @@ conversion is performed.
252
252
Only non-packed interpretations are valid for bias vectors.
253
253
254
254
The base address of ** bias vector resource** and ** bias vector offset** must be
255
- 64 byte aligned.
255
+ 64- byte aligned.
256
256
257
257
This operation doesn't perform bounds checking for bias loads. If any part of
258
258
the vector load is out of bounds then the entire operation is undefined.
@@ -315,12 +315,12 @@ resource**, with **matrix offset**, **matrix stride**, **matrix
315
315
interpretation** and ** matrix layout** behaving as described [ above]
316
316
(#matrix-vector-multiply-and-multiply-add-operations).
317
317
318
- The base address of ** matrix resource** and ** matrix offset** must be 128 byte
318
+ The base address of ** matrix resource** and ** matrix offset** must be 128- byte
319
319
aligned. Also note that the size of the underlying allocation is guaranteed to
320
320
be a multiple of 16 bytes ensuring that the 16 bytes access of the last
321
321
row/column of the matrix is valid memory
322
322
323
- The ** matrix stride** is 16 byte aligned.
323
+ The ** matrix stride** is 16- byte aligned.
324
324
325
325
Not all combinations of vector element type and matrix interpretations are
326
326
supported by all implementations. [ CheckFeatureSupport] can be used to
@@ -360,7 +360,7 @@ type `TY`.
360
360
361
361
The output array is accumulated to the writeable raw-buffer resource specified
362
362
by ** output array resource** and ** output array offset** . The base address
363
- and ** output array offset** must be 64 byte aligned.
363
+ and ** output array offset** must be 64- byte aligned.
364
364
365
365
[ CheckFeatureSupport] can be used to determine which vector element types can be
366
366
accumulated. A list of types that are guaranteed to be supported on all devices
@@ -827,7 +827,7 @@ updated with the calculated output size, is then passed to the conversion
827
827
API.
828
828
829
829
The ` DestSize ` and ` DestStride ` must be a multiple of 16 bytes. The ` DestVA `
830
- must be 128B aligned.
830
+ must be 128-byte aligned.
831
831
832
832
``` c++
833
833
0 commit comments