Running on the FPGAs
The FPGAs are hosted as back-end nodes of NEXTGenIO and currently accessible via ssh but the plan is to integrate these with the batch queue system. Therefore to access specific FPGA(s) you should ssh to the associated node, where all the nodes mount the Lustre filesystem and have available all the same tools as the login nodes.
|nextgenio-amd01||Alveo U280 and NVidia A100 GPU|
|nextgenio-amd02||VCK5000 and ADM-PA100|
|nextgenio-amd03||Stratix-10 MX and Alveo U250|
After connecting to the Xilinx nodes issue
module load vitis and this will bring into your environment the nescesary settings to run on the Xilinx FPGAs and you can then run your host-side executable.
You can use the
xbutil utility to manage the FPGAs, for instance if your code locks up then
xbutil reset (likely in another terminal) will forcibly reset the FPGA and quick the code executing on it.
xbutil examine will list available FPGA devices, and
xbutil examine --device 0000:d4:00.5 --report all will report the status of the FPGA (where 0000:d4:00.5 is the device ID obtained by the initial examine command).
After connecting to the Intel node issue
module load quartus (remembering that the module files need to already be available to enable this, i.e. you need to have already run
module use /home/nx08/shared/fpga/modulefiles/) which will bring into your environment the settings to run on the Stratix-10 and you can then run your host-side executable. Currently the HPC OpenCL image is written onto the board, it is possible to load the MAX image which enables the QSP28 network ports and for this then please contact us.
You can use
aocl diagnose to report the status of the FPGA(s) present