ollama 0.11.9 Introducing A Nice CPU/GPU Performance Optimization


https://www.phoronix.com/news/ollama-0.11.9-More-Performance




I think llama.cpp already merged ROCm support in 2023. It's called HIP in their README, but I'm not super educated on all the acronyms, compute frameworks and instruction sets.

ROCm is a software stack that includes a bunch of SDKs and APIs.

HIP is the part of ROCm that lets you program AMD GPUs, with a focus on portability from Nvidia's CUDA.
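
To give a rough feel for what "portability from CUDA" means in practice: most CUDA runtime calls have a HIP counterpart with a near-identical name, so a lot of porting is mechanical renaming. Below is a toy sketch of that idea in Python. The function name `toy_hipify` and the small mapping table are my own illustration, not AMD's actual hipify tooling, which covers far more of the API and handles many more cases.

```python
import re

# Illustrative subset of CUDA -> HIP renames (assumption: a tiny hand-picked
# table for demonstration; the real hipify tools cover far more of the API).
CUDA_TO_HIP = {
    "cuda_runtime.h": "hip/hip_runtime.h",
    "cudaMalloc": "hipMalloc",
    "cudaMemcpyHostToDevice": "hipMemcpyHostToDevice",
    "cudaMemcpy": "hipMemcpy",
    "cudaFree": "hipFree",
    "cudaDeviceSynchronize": "hipDeviceSynchronize",
}

# Try longer names first so cudaMemcpyHostToDevice is not cut short by cudaMemcpy.
_PATTERN = re.compile(
    "|".join(re.escape(name) for name in sorted(CUDA_TO_HIP, key=len, reverse=True))
)


def toy_hipify(source: str) -> str:
    """Rename the known CUDA identifiers in `source` to their HIP equivalents."""
    return _PATTERN.sub(lambda m: CUDA_TO_HIP[m.group(0)], source)


cuda_snippet = """#include <cuda_runtime.h>
cudaMalloc(&d_x, n * sizeof(float));
cudaMemcpy(d_x, h_x, n * sizeof(float), cudaMemcpyHostToDevice);
cudaFree(d_x);
"""

print(toy_hipify(cuda_snippet))
```

The printed result is the same snippet with `hip` names in place of the `cuda` ones, which is roughly why a project with a CUDA backend can often add a HIP backend without rewriting its kernels from scratch.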


