Write hardware accelerator in C++ using Vitis HLS. Implement tiled convolution, batch normalization,
pooling, and fully-connected layers with fixed-point arithmetic.
void reduced_vgg_inference(
const fm_t input[32*32*3],
fm_t output[10],
const wt_t *W1,
...
) {
#pragma HLS INTERFACE m_axi port=W1 bundle=gmem
#pragma HLS INTERFACE s_axilite port=return
tiled_conv_8x8(input, output, W1, B1);
}
C Simulation
✓ Passed (MSE=0)
Synthesis Time
~15 minutes
Clock Target
15ns (66.7 MHz)