SSLZip

This repository provides a simple example of inference using the pretrained models of SSLZip.

Example

import onnxruntime as ort
from transformers import HubertModel
import torch

# Load the upstream HuBERT model.
upstream = HubertModel.from_pretrained("facebook/hubert-base-ls960")
upstream.eval()

# Load the autoencoder model.
postprocessor = ort.InferenceSession("sslzip_256.onnx")
node_name = postprocessor.get_inputs()[0].name

# Prepare an input waveform (assuming 16kHz audio).
x = torch.randn(1, 16000)

# Extract the latent representation for downstream tasks.
with torch.inference_mode():
    h = upstream(x, output_hidden_states=True).hidden_states[-1]
    z = postprocessor.run(None, {node_name: h.cpu().numpy()})[0]

# Use z as you like.
print(z.shape)

Pretrained Models

The pretrained models were developed using the LibriSpeech corpus and are distributed under the same license (CC BY 4.0).
Please include credit to Nagoya Institue of Technology and Techno-Speech, Inc. when using these models.

Upstream Model	Dimensions	w/ CLUB	ONNX Model
HuBERT Base	256	✓	link
HuBERT Base	256		link
HuBERT Base	16		link

Citation

@InProceedings{yoshimura2025sslzip,
  author = {Takenori Yoshimura and Shinji Takaki and Kazuhiro Nakamura and Keiichiro Oura and Takato Fujimoto and Kei Hashimoto and Yoshihiko Nankaku and Keiichi Tokuda},
  title = {{SSLZip}: Simple autoencoding for enhancing self-supervised speech representations in speech generation},
  booktitle = {13th ISCA Speech Synthesis Workshop (SSW 2025)},
  pages = {117--122},
  year = {2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
infer.py		infer.py
requirements.txt		requirements.txt
test_requirements.txt		test_requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SSLZip

Example

Pretrained Models

Citation

About

Uh oh!

Languages

License

sp-nitech/SSLZip

Folders and files

Latest commit

History

Repository files navigation

SSLZip

Example

Pretrained Models

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages