-
Notifications
You must be signed in to change notification settings - Fork 212
Add DynamicQuantConfig and QuantAwareTrainingConfig #1505
Conversation
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 NeuralChat Unit Test
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to 🟢 Chat Bot Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
CI failed due to torch 2.3 upgrade, the fix depends on PR: #1508 |
Signed-off-by: changwangss <[email protected]>
Signed-off-by: changwangss <[email protected]>
for more information, see https://pre-commit.ci
d1f9f3b
to
5eb827d
Compare
for more information, see https://pre-commit.ci
Type of Change
support dynamic quantization with DynamicQuantConfig and make usage to transformers-api like with INC 2.x API.
support qat quantization with QuantAwareConfig and make usage to transformers-api like with INC 2.x API.
Description
detail description
JIRA ticket: xxx
dynamic quantization
Quant aware training
Expected Behavior & Potential Risk
the expected behavior that triggered by this PR
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed