Hello all,
I am very curious to know how feasible it is to use a cluster of Versal HBM Series devices (e.g. the VHK158 evaluation board) to train an LLM like LLaMA-2 70B, in terms of performance, power, and cost. Are there any papers comparing a cluster of VHK158 evaluation boards with a cluster of, say, A100s? Thanks.
brucethemoose2|2 years ago
I think MLC-LLM (through TVM) might be able to run inference?
manili|2 years ago