ollama 0.11.9 Introducing A Nice CPU/GPU Performance Optimization
https://www.phoronix.com/news/ollama-0.11.9-More-Performance
I think llama.cpp already merged ROCm support back in 2023. It's called HIP in their README, but I'm not super educated on all the acronyms, compute frameworks, and instruction sets.
ROCm is a software stack which includes a bunch of SDKs and APIs.
HIP is a subset of ROCm which lets you program AMD GPUs, with a focus on portability from Nvidia's CUDA.
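
To give a rough feel for what "portability from CUDA" means in practice: most HIP runtime calls mirror their CUDA counterparts with a `hip` prefix instead of `cuda`, which is why a lot of porting is mechanical renaming. Below is a toy sketch of that idea in Python. The function `toy_hipify` and its small rename table are my own illustration, not the real hipify tooling, and the table only covers a handful of runtime calls.

```python
import re

# Illustrative subset of CUDA -> HIP runtime API renames.
# This is an assumption-level sketch, not the full table used by real porting tools.
CUDA_TO_HIP = {
    "cudaMalloc": "hipMalloc",
    "cudaMemcpy": "hipMemcpy",
    "cudaMemcpyHostToDevice": "hipMemcpyHostToDevice",
    "cudaFree": "hipFree",
    "cudaDeviceSynchronize": "hipDeviceSynchronize",
}

# Match whole identifiers only, trying longer names first.
_PATTERN = re.compile(
    r"\b(" + "|".join(sorted(CUDA_TO_HIP, key=len, reverse=True)) + r")\b"
)


def toy_hipify(source: str) -> str:
    """Rename the known CUDA runtime identifiers in `source` to their HIP names."""
    return _PATTERN.sub(lambda m: CUDA_TO_HIP[m.group(1)], source)


if __name__ == "__main__":
    cuda_snippet = "cudaMalloc(&p, n); cudaMemcpy(p, h, n, cudaMemcpyHostToDevice);"
    print(toy_hipify(cuda_snippet))
    # prints: hipMalloc(&p, n); hipMemcpy(p, h, n, hipMemcpyHostToDevice);
```

Real ports need more than renaming (build system changes, library equivalents, kernel tuning), so treat this purely as a picture of why the two APIs feel so similar.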