Integration of Fast-LLM #2913

bigximik · 2025-04-15T15:01:53Z

Fast-LLM Integration (Draft)

This is a basic integration of Fast-LLM and is a work in progress:

Based on the existing HFLM integration.
The HFLM constructor is incompatible, so all components are initialized explicitly in our constructor.
Tests are adapted from sglang since neither sglang nor our implementation supports "EleutherAI/pythia-70m"; we use Qwen2 instead. However, the test coverage is not yet as complete as in the HF integration tests.
Quantized models are not supported in this iteration.

Still to be done:

Integration of distributed inference.
A test case for passing an instantiated Fast-LLM model?

Notes:

Currently works with this branch of the Fast-LLM only: Sandbox for Implementation of generate and integration of lm_eval (evaluation harness) ServiceNow/Fast-LLM#222

CLAassistant · 2025-04-15T15:02:01Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

Toolkit User seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

base integrarion of Fast-LLM

9d55c70

bigximik closed this Apr 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integration of Fast-LLM #2913

Integration of Fast-LLM #2913

bigximik commented Apr 15, 2025

CLAassistant commented Apr 15, 2025

Integration of Fast-LLM #2913

Integration of Fast-LLM #2913

Conversation

bigximik commented Apr 15, 2025

Fast-LLM Integration (Draft)

Still to be done:

Notes:

CLAassistant commented Apr 15, 2025