LLAMA-CPP-PYTHON on NVIDIA RTX 4060 GPU

We tried the llama-cpp-python library on several operating systems:

Windows 11

Ubuntu 22.04

and Google Colab (which also runs Ubuntu 22.04)



Llama-cpp-python library with an RTX 4060 GPU on Windows 11

Install the NVIDIA GPU driver.

We use driver version 537.24.


Then run the nvidia-smi command to check your GPU.


Install the CUDA Toolkit: https://developer.nvidia.com/cuda-downloads

We use CUDA 12.1.


Make sure CUDA works by running the nvcc --version command.




We use Python 3.11.7.


Check that python and pip work from the command line.


Install PyTorch: https://pytorch.org/get-started/locally/
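The PyTorch page generates the exact install command for your setup; for CUDA 12.1 it is typically the following (check the selector on that page to be sure):

pip install torch --index-url https://download.pytorch.org/whl/cu121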


Check pip list, then check that CUDA is available from a Python shell, as shown below.

If it returns False, PyTorch cannot use the NVIDIA GPU; find and fix the problem before continuing.
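A minimal check from the Python shell (assuming the CUDA build of PyTorch was installed):

import torch

print(torch.__version__)          # e.g. 2.x.x+cu121
print(torch.cuda.is_available())  # must print True for GPU inference
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))  # should show the RTX 4060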



Install llama-cpp-python https://pypi.org/project/llama-cpp-python/


As of Jul 2, 2024 the latest version is 0.2.81, but we use 0.2.75, which works fine (0.2.81 has not been tested on Windows 11 yet), so we pin 0.2.75 below.

With CUDA 12.1, use this command:

pip install llama-cpp-python==0.2.75 \
  --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu121

Run pip list to check your installed libraries.
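You can also confirm the version by importing the package directly (a quick sanity check):

import llama_cpp

print(llama_cpp.__version__)  # expect 0.2.75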


How do you check that your llama-cpp-python build works with CUDA?

Load an LLM model with some test code. We run our test code in a Jupyter notebook inside VS Code.

We load Llama3-Typhoon1.5-8b in 4-bit GGUF format from the Hugging Face repo scb10x/llama-3-typhoon-v1.5-8b-instruct.
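A minimal test sketch (the model_path filename below is an assumption; use the exact 4-bit GGUF filename you downloaded from the model card):

from llama_cpp import Llama

# n_gpu_layers=-1 offloads every layer to the GPU.
# verbose=True prints the load log, where the BLAS line appears.
llm = Llama(
    model_path="./llama-3-typhoon-v1.5-8b-instruct-q4_0.gguf",  # assumed filename
    n_gpu_layers=-1,
    n_ctx=2048,
    verbose=True,
)

# Quick inference test.
output = llm("Q: What is the capital of Thailand? A:", max_tokens=32, stop=["\n"])
print(output["choices"][0]["text"])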






When the model finishes loading, check the load log for BLAS = 1.
If BLAS = 0, llama-cpp-python is not using CUDA yet; find and fix the problem.
You can still run on CPU only, but inference will be very slow.

Now you can use the llama-cpp-python library on Windows 11.

Llama-cpp-python library with an RTX 4060 GPU on Ubuntu 22.04.3 LTS


Install the NVIDIA GPU driver.

We use nvidia-driver version 535.


Then run the nvidia-smi command to check your GPU.

Install the CUDA Toolkit: https://developer.nvidia.com/cuda-downloads

We use CUDA 11.5.





Install Python and PyTorch as in the Windows section, then check pip list and check that CUDA is available from a Python shell (the same torch.cuda.is_available() check shown above).

If it returns False, PyTorch cannot use the NVIDIA GPU; find and fix the problem before continuing.



Install llama-cpp-python: https://pypi.org/project/llama-cpp-python/

As of Jul 2, 2024 the latest version is 0.2.81, but we use 0.2.32, which works fine (0.2.81 has not been tested on Ubuntu yet).

Install with pip:
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python==0.2.32

(Note: releases as old as 0.2.32 enable CUDA with the LLAMA_CUBLAS CMake flag; later releases renamed it to LLAMA_CUDA, and the newest ones to GGML_CUDA.)

Run pip list to check your installed libraries.
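Beyond pip list, you can ask the compiled library itself whether it supports GPU offload. This helper may not exist in builds as old as 0.2.32, so treat it as an optional extra check:

import llama_cpp

print(llama_cpp.__version__)
# Available in recent releases; returns True when the library was
# compiled with CUDA (or another GPU backend).
print(llama_cpp.llama_supports_gpu_offload())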


How do you check that your llama-cpp-python build works with CUDA?

Load an LLM model with some test code. We run our test code in a Jupyter notebook inside VS Code.

As on Windows, we load Llama3-Typhoon1.5-8b in 4-bit GGUF format from the Hugging Face repo scb10x/llama-3-typhoon-v1.5-8b-instruct, using the same test code shown in the Windows section.




When the model finishes loading, check the load log for BLAS = 1.
If BLAS = 0, llama-cpp-python is not using CUDA yet; find and fix the problem.
You can still run on CPU only, but inference will be very slow.

Now you can use the llama-cpp-python library on Ubuntu.

https://huggingface.co/Adun

Adun Nantakaew อดุลย์ นันทะแก้ว

LINE : adunnan
