Feature Scope
Data Types
Propagation Direction |
src Data Type |
Weight Data Type |
dst Data Type |
Bias Data Type |
|---|---|---|---|---|
dnnl_forward_training dnnl_forward_inference |
f32 |
f32 |
f32 |
f32 |
f16 |
f16 |
f16 |
f16 |
|
bf16 |
bf16 |
bf16 |
bf16 |
|
f16 |
f16 |
f32 |
f16 |
|
bf16 |
bf16 |
f32 |
bf16 |
Propagation Direction |
src Data Type |
Weight Data Type |
dst Data Type |
Bias Data Type |
|---|---|---|---|---|
dnnl_backward_data |
f32 |
f32 |
f32 |
f32 |
f16 |
f16 |
f16 |
f16 |
|
bf16 |
bf16 |
bf16 |
bf16 |
|
f32 |
f16 |
f16 |
f32 |
|
f32 |
bf16 |
bf16 |
f32 |
Propagation Direction |
src Data Type |
Weight Data Type |
dst Data Type |
Bias Data Type |
|---|---|---|---|---|
dnnl_backward_weights |
f32 |
f32 |
f32 |
f32 |
f16 |
f16 |
f16 |
f16 |
|
bf16 |
bf16 |
bf16 |
bf16 |
|
f16 |
f32 |
f16 |
f16 |
|
bf16 |
f32 |
bf16 |
bf16 |
Data Layout
2D deconvolution is supported. The input and output tensor dimension is 4D. The layout of src, weight, and dst data needs to meet the following requirements:
Tensor Dimension |
src Data Layout |
Weight Data Layout |
dst Data Layout |
|---|---|---|---|
4D Tensor |
abcd |
abcd |
abcd |
Parameter Constraints
Propagation Direction |
Variable Name |
Variable Description |
Constraint |
Remarks |
|---|---|---|---|---|
FWD_B , FWD_D, FWD_I |
mb |
batch |
≥ 1 |
[] represents rounding down to the nearest integer. |
ic |
input channel |
≥ 1 |
||
ih |
input height |
≥ 1 |
||
iw |
input width |
≥ 1 |
||
oc |
output channel |
≥ 1 |
||
kh |
kernel height |
≥ 1 |
||
kw |
kernel width |
≥ 1 |
||
oh |
output height |
≥ 1 |
||
ow |
output width |
≥ 1 |
||
sh |
height-wise stride |
≥ 1 |
||
sw |
width-wise stride |
≥ 1 |
||
dh |
height-wise dilation |
≥ 0 |
||
dw |
width-wise dilation |
≥ 0 |
||
ph |
height padding |
0 ≤ ph ≤ (kh-1) x (dh+1) |
||
pw |
width padding |
0 ≤ pw ≤ (kw-1) x (dw+1) |
||
DKH |
kernel height with dilation |
DKH = 1 + (kh-1)×(dh+1) |
||
DKW |
kernel width with dilation |
DKW = 1 + (kw-1)×(dw+1) |
||
BWD_D |
mb |
batch |
≥ 1 |
[] represents rounding down to the nearest integer. |
ic |
input channel |
≥ 1 |
||
ih |
input height |
≥ 1 |
||
iw |
input width |
≥ 1 |
||
oc |
output channel |
≥ 1 |
||
kh |
kernel height |
≥ 1 |
||
kw |
kernel width |
≥ 1 |
||
oh |
output height |
≥ 1 |
||
ow |
output width |
≥ 1 |
||
sh |
height-wise stride |
≥ 1 |
||
sw |
width-wise stride |
≥ 1 |
||
dh |
height-wise dilation |
≥ 0 |
||
dw |
width-wise dilation |
≥ 0 |
||
ph |
height padding |
0 ≤ ph ≤ (kh-1) x (dh+1) |
||
pw |
width padding |
0 ≤ pw ≤ (kw-1) x (dw+1) |
||
DKH |
kernel height with dilation |
DKH = 1 + (kh-1)×(dh+1) |
||
DKW |
kernel width with dilation |
DKW = 1 + (kw-1)×(dw+1) |
||
BWD_W, BWD_WB |
mb |
batch |
≥ 1 |
[] represents rounding down to the nearest integer. |
ic |
input channel |
≥ 1 |
||
ih |
input height |
≥ 1 |
||
iw |
input width |
≥ 1 |
||
oc |
output channel |
≥ 1 |
||
kh |
kernel height |
≥ 1 |
||
kw |
kernel width |
≥ 1 |
||
oh |
output height |
≥ 1 |
||
ow |
output width |
≥ 1 |
||
sh |
height-wise stride |
≥ 1 |
||
sw |
width-wise stride |
≥ 1 |
||
dh |
height-wise dilation |
≥ 0 |
||
dw |
width-wise dilation |
≥ 0 |
||
ph |
height padding |
0 ≤ ph ≤ (kh-1) x (dh+1) |
||
pw |
width padding |
0 ≤ pw ≤ (kw-1) x (dw+1) |
||
DKH |
kernel height with dilation |
DKH = 1 + (ih-1)×(dh+1) |
||
DKW |
kernel width with dilation |
DKW = 1 + (iw-1)×(dw+1) |


