TinyGPT

TinyGPT is a minimal C++11 implementation of GPT-2 inference, built from scratch and mainly inspired by the picoGPT project.

For more details, check out the accompanying blog post: Write a GPT from scratch (TinyGPT)

Features

Fast BPE tokenizer, inspired by tiktoken.
CPU and CUDA inference.
KV cache enabled.

Build and Run

1. Get the code

git clone --recurse-submodules https://github.com/keith2018/TinyGPT.git

2. Download GPT-2 model file

python3 tools/download_gpt2_model.py

if success, you'll see the file model_file.data in directory assets/gpt2

3. Build and Run

mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release

This will generate the executable file and copy assets to directory app/bin, then you can run the demo:

cd app/bin
./TinyGPT_demo
[DEBUG] TIMER TinyGPT::Model::loadModelGPT2: cost: 800 ms
[DEBUG] TIMER TinyGPT::Encoder::getEncoder: cost: 191 ms
INPUT:Alan Turing theorized that computers would one day become
GPT:the most powerful machines on the planet.
INPUT:exit

Dependencies

Tensor
- TinyTorch https://github.com/keith2018/TinyTorch
Json parser
- RapidJSON https://github.com/Tencent/rapidjson
Regex (Tokenizer)
- RE2 https://github.com/google/re2
- Abseil https://github.com/abseil/abseil-cpp
HashMap
- ankerl::unordered_dense https://github.com/martinus/unordered_dense

License

This code is licensed under the MIT License (see LICENSE).

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
app		app
assets		assets
src		src
test		test
third_party		third_party
tools		tools
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TinyGPT

Features

Build and Run

1. Get the code

2. Download GPT-2 model file

3. Build and Run

Dependencies

License

About

Releases

Packages

Languages

License

keith2018/TinyGPT

Folders and files

Latest commit

History

Repository files navigation

TinyGPT

Features

Build and Run

1. Get the code

2. Download GPT-2 model file

3. Build and Run

Dependencies

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages