-
Notifications
You must be signed in to change notification settings - Fork 212
Conversation
Signed-off-by: Clark Chin <[email protected]>
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 NeuralChat Unit Test
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to 🟢 Chat Bot Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
for more information, see https://pre-commit.ci
...ion_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/run_llava.py
Show resolved
Hide resolved
intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/__init__.py
Show resolved
Hide resolved
Signed-off-by: Chen Xi <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Chen Xi <[email protected]>
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no requirements updated? At least optimum-habana shall be added
Signed-off-by: Meng, Hengyu <[email protected]>
Signed-off-by: Chen Xi <[email protected]>
@lkk12014402 kaokao, please take a look on the new commit on measurement of ppl |
Signed-off-by: Chen Xi <[email protected]>
* initial commit Signed-off-by: Yu, Zhentao <[email protected]> * mv example Signed-off-by: Yu Zhentao <[email protected]> * update model dtype Signed-off-by: Yu Zhentao <[email protected]> * fix multi-round generation without streaming_llm Signed-off-by: Yu, Zhentao <[email protected]> * add mem and token num log Signed-off-by: Yu, Zhentao <[email protected]> * rebase Signed-off-by: Yu, Zhentao <[email protected]> * initial fp8 Signed-off-by: Yu, Zhentao <[email protected]> * add ppl eval scripts Signed-off-by: Yu, Zhentao <[email protected]> * typo Signed-off-by: Yu, Zhentao <[email protected]> * add llama2-13b ppl eval script (align paper) Signed-off-by: Yu, Zhentao <[email protected]> * hide kv cache operation inside (v0.1) Signed-off-by: Yu, Zhentao <[email protected]> * hide kv cache operation inside (v0.2) Signed-off-by: Yu, Zhentao <[email protected]> * hide kv cache operation inside (v0.3) Signed-off-by: Yu, Zhentao <[email protected]> * update scripts Signed-off-by: Yu, Zhentao <[email protected]> * add README Signed-off-by: Yu, Zhentao <[email protected]> * update test scripts Signed-off-by: Yu, Zhentao <[email protected]> * remove useless code Signed-off-by: Yu, Zhentao <[email protected]> * update README and rename shell scripts Signed-off-by: Yu, Zhentao <[email protected]> --------- Signed-off-by: Yu, Zhentao <[email protected]> Signed-off-by: Yu Zhentao <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Chen Xi <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <[email protected]>
Signed-off-by: Chen Xi <[email protected]>
Signed-off-by: Clark Chin <[email protected]>
Signed-off-by: Clark Chin <[email protected]>
Signed-off-by: Clark Chin <[email protected]>
Signed-off-by: VincyZhang <[email protected]>
Signed-off-by: Clark Chin <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great start on HPU
Signed-off-by: Clark Chin <[email protected]>
Type of Change
gaudi modeling used in itrex for int4 kv-cache support