Performance

To gain insight into Typhoon’s performance, we evaluated it using multiple-choice exam:

Language & Knowledge Capabilities: We assessed Typhoon on multiple-choice question answering datasets, including ThaiExam, M3Exam, and MMLU. The ThaiExam dataset was sourced from standard examinations in Thailand, including ONET, TGAT, TPAT, and A-Level. M3Exam is a benchmark for Southeast Asian countries, including Thailand. MMLU is a standard benchmark for language models in English.

Typhoon-1.5X is an eXperimental model designed for application use cases, featuring improved capabilities in Retrieval-Augmented Generation (RAG), constrained generation, and reasoning in order to achieve competitive performance in instruction following and better human alignment in Thai.

Llama3-Typhoon v1.5x 70B 4bit GGUF Demo on Colab

Download for Hugging Face

https://huggingface.co/Adun/llama-3-typhoon-v1.5x-70b-instruct-gguf

Use NVIDIA A100 40GB GPU on Colab

This 70B model has 81 layers and size 42.5GB. But NVIDIA A100 GPU has only 40GB RAM.
So, We cannot load all 81 layers. Then We set model_kwargs={"n_gpu_layers": 70}.