ทดสอบ OpenThaiGPT

ซึ่งเป็น LLM GPT-3 ภาษาไทย

เอาไปทำ chatGPT ภาษาไทย ใช้เองได้

ทำ chatGPT ตอบคำถาม ข้อมูลของหน่วยงาน และธุรกิจ ของตัวเองได้

more info https://aiotplatform.blogspot.com/2023/07/openthaigpt.html

มาโชว์ผลการทดสอบกันก่อนเลย ดีกว่า

ขั้นตอนการทดสอบ

• Colab Code จาก OpenThaiGPT https://openthaigpt.aieat.or.th/

https://colab.research.google.com/drive/1nZ6Vc2U6rOezsMxarGJY7oDw8RG5E4r_?usp=sharing#scrollTo=lsOjziA3Dppt

• GPU on Colab

ใช้ NVIDIA T4 15 GB ( เป็นตัวเล็กสุดของ GPU บน Google Colab )

• Base Model และ LoRA Weights

Base Model : Facebook LLaMA 7 B-parameter https://huggingface.co/decapoda-research/llama-7b-hf

Weight (Lora Adapter) : openthaigpt-0.1.0-beta https://huggingface.co/kobkrit/openthaigpt-0.1.0-beta

Run Model ใช้ GPU RAM ประมาณ 14 GB ( น่าจะเป็น model FP16 : half precision )

ทดลอง ใช้เป็น 8 bit Model GPU RAM ใช้น้อยลงเป็น ที่ 9 GB

LLaMA 8 bit Model from META ( Facebook )

About LLaMA

Meta have released LLaMA, an LLM comprising 4 foundation models with 7B to 65B parameters, claiming that it outperforms many benchmark models on various tasks. In particular, LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA-65B is competitive with the best models, Chinchilla70B and PaLM-540B.

LLaMA isn’t like ChatGPT or Bing; it’s not a system that anyone can talk to. Rather, it’s a research tool that Meta says it’s sharing in the hope of “democratizing access in this important, fast-changing field.” In other words: to help experts tease out the problems of AI language models, from bias and toxicity to their tendency to simply make up information.