2 points | by TechPreacher 2 hours ago ago
2 comments
I have 1 DGX Spark and running models with vLLM to, out of curiosity why not using Llama.cpp / TensorRT-LLM or any other alternatives?
[flagged]
I have 1 DGX Spark and running models with vLLM to, out of curiosity why not using Llama.cpp / TensorRT-LLM or any other alternatives?
[flagged]