Work in progress: this project still needs more time and compute resources.
This is an incomplete PyTorch implementation of *Wavesplit: End-to-End Speech Separation by Speaker Clustering*.
The full model has ~65M parameters, and the original architecture could not fit even a single training sample on a P40 GPU. To work around this, I added an encoder (a Conv1d layer) and a decoder to shorten the feature sequence, and replaced the mapping head with masking. I trained a model on wsj0-2mix, but training was very slow; after 13 epochs the model's SDR was only ~10 dB, so I eventually gave up. I may resume development in the future.
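To make the encoder/masking change concrete, here is a minimal sketch of the idea: a strided `Conv1d` shortens the time axis before the separation network, the network predicts per-source masks over the encoded features (instead of mapping waveforms directly), and a `ConvTranspose1d` decodes each masked feature map back to a waveform. The class name, the stand-in separator, and all layer sizes are illustrative assumptions, not the exact values used in this repo.

```python
import torch
import torch.nn as nn

class EncMaskDec(nn.Module):
    """Sketch of an encoder + masking + decoder wrapper (sizes are assumptions)."""

    def __init__(self, n_src=2, feat=128, kernel=16, stride=8):
        super().__init__()
        self.n_src = n_src
        # Strided encoder: reduces the feature length by ~`stride`x.
        self.encoder = nn.Conv1d(1, feat, kernel, stride=stride)
        # Placeholder for the WaveSplit separation stack; here just a 1x1
        # conv that emits one mask per source.
        self.separator = nn.Conv1d(feat, n_src * feat, 1)
        # Transposed conv decodes masked features back to waveforms.
        self.decoder = nn.ConvTranspose1d(feat, 1, kernel, stride=stride)

    def forward(self, mix):                       # mix: (batch, samples)
        x = self.encoder(mix.unsqueeze(1))        # (B, F, T')
        masks = torch.sigmoid(self.separator(x))  # (B, n_src*F, T')
        masks = masks.view(mix.size(0), self.n_src, -1, x.size(-1))
        est = masks * x.unsqueeze(1)              # masked features per source
        b, s, f, t = est.shape
        wav = self.decoder(est.reshape(b * s, f, t)).reshape(b, s, -1)
        return wav                                # (B, n_src, samples)
```

With `kernel=16, stride=8` the separator operates on a sequence roughly 8x shorter than the raw waveform, which is what makes a single sample fit in GPU memory.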
If you have any questions or suggestions, please open an issue or email [email protected].
Implemented components:
- Main model
- Speaker loss
- k-means clustering
- Gaussian layer (currently unused)
- Dropout layer (currently unused)
- Mixup layer
- Dynamic mixing
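Dynamic mixing creates fresh training mixtures on the fly instead of using a fixed pre-mixed set. The sketch below shows the basic recipe: pick two different speakers' utterances, apply random gains, and sum them. The function name, the gain range, and the trim-to-shortest rule are assumptions for illustration, not the exact recipe used in this repo.

```python
import numpy as np

def dynamic_mix(sources, rng, gain_db_range=(-2.5, 2.5)):
    """Create one 2-speaker mixture on the fly (illustrative sketch).

    sources: list of 1-D float arrays, one utterance per speaker.
    Returns (mixture, stacked clean sources).
    """
    # Pick two distinct utterances at random.
    i, j = rng.choice(len(sources), size=2, replace=False)
    s1, s2 = sources[i], sources[j]
    # Trim both to a common length so they can be summed.
    n = min(len(s1), len(s2))
    s1, s2 = s1[:n].copy(), s2[:n].copy()
    # Random gain per source so relative levels vary between epochs.
    for s in (s1, s2):
        s *= 10.0 ** (rng.uniform(*gain_db_range) / 20.0)
    return s1 + s2, np.stack([s1, s2])
```

Because each epoch sees different pairings and gains, dynamic mixing acts as data augmentation and usually reduces overfitting compared to a fixed mixture list.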