Skip to content

Commit da2a7fa

Browse files
authored
[Version] v1.7.2. (#454)
1 parent cc9675f commit da2a7fa

File tree

2 files changed

+12
-2
lines changed

2 files changed

+12
-2
lines changed

CHANGELOG.md

+11-1
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,15 @@
11
# CHANGELOG
2-
# [Version v1.7.0](https://github.com/intel/xFasterTransformer/releases/tag/v1.7.0)
2+
# [Version v1.7.2](https://github.com/intel/xFasterTransformer/releases/tag/v1.7.2)
3+
v1.7.2 - Continuous batching feature supports Qwen 1.0 & hybrid data types.
4+
5+
## Functionality
6+
- Add continuous batching support of Qwen 1.0 models.
7+
- Enable hybrid data types for continuous batching feature, including `BF16_FP16, BF16_INT8, BF16_W8A8, BF16_INT4, BF16_NF4, W8A8_INT8, W8A8_int4, W8A8_NF4`.
8+
9+
## BUG fix
10+
- Fixed the convert fault in Baichuan1 models.
11+
12+
# [Version v1.7.1](https://github.com/intel/xFasterTransformer/releases/tag/v1.7.1)
313
v1.7.1 - Continuous batching feature supports ChatGLM2/3.
414

515
## Functionality

VERSION

+1-1
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
1.7.1
1+
1.7.2

0 commit comments

Comments
 (0)