Install deps

```
> sudo pacman -Sy extra/rocminfo radeontop cmake ROCm extra/rocblas extra/hipblas
```

Identify the integrated GPU

```
> rocminfo | grep Name
Name: AMD Ryzen 5 7640U w/ Radeon 760M Graphics
Marketing Name: AMD Ryzen 5 7640U w/ Radeon 760M Graphics
Vendor Name: CPU
Name: gfx1103
Marketing Name: AMD Radeon 760M
Vendor Name: AMD
Name: amdgcn-amd-amdhsa--gfx1103
```

The build and install steps are taken from https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md#hip

Build

```
> HIPCXX="$(hipconfig -l)/clang" HIP_PATH="$(hipconfig -R)" \
    cmake -S . -B build -DGGML_HIP=ON -DAMDGPU_TARGETS=gfx1103 -DCMAKE_BUILD_TYPE=Release \
    && cmake --build build --config Release -- -j 16
```

The library for the GPU arch `gfx1103` isn't available in the ROCm libs on Arch Linux yet (see https://github.com/ROCm/ROCm/discussions/2631). For now, override the GPU version when starting the server:

```
> HSA_OVERRIDE_GFX_VERSION=11.0.0 ./build/bin/llama-server
```

Install the Hugging Face CLI and fetch the model

```
> sudo pacman -Sy extra/python-huggingface-hub
> huggingface-cli download TheBloke/deepseek-coder-33B-base-GGUF deepseek-coder-33b-base.Q4_K_M.gguf --local-dir . --local-dir-use-symlinks False
```
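The GPU identification step above can be scripted so the target does not have to be copied by hand into `-DAMDGPU_TARGETS`. The following is a minimal sketch, not part of the upstream build docs: `gfx_target` is a hypothetical helper name, and it assumes the first `gfx...` token in the `rocminfo` output is the GPU you want to build for (true for the single-iGPU output shown above, possibly not on machines with several GPUs).

```shell
#!/usr/bin/env bash
# Hypothetical helper: read rocminfo-style text on stdin and print the
# first gfx target found (e.g. gfx1103).
gfx_target() {
    # -o prints only the matching token, -E enables extended regex;
    # head keeps the first match and drops later duplicates.
    grep -oE 'gfx[0-9a-f]+' | head -n 1
}

# Usage (assumes rocminfo is installed and on PATH):
#   target="$(rocminfo | gfx_target)"
#   cmake -S . -B build -DGGML_HIP=ON -DAMDGPU_TARGETS="$target" -DCMAKE_BUILD_TYPE=Release
```

On a machine with more than one GPU, check the full `rocminfo` output and pick the target manually instead of relying on the first match.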