johbac/voice-embedder-base


johbac committed on Jun 27, 2025

Commit 1af6dda (verified) · Parent: 52228da

Update README.md

Files changed (1)

README.md +0 -4

@@ -13,10 +13,6 @@ tags:
 - speech
 ---
-Create README.md
-# Model Card for johbac/voice-embedder-base
 ## Model Description
 Hey there! This is `voice-embedder-base`, a model that generates speaker embeddings—compact vectors that capture unique vocal characteristics for tasks like speaker verification, clustering, or voice similarity retrieval. It’s built by fine-tuning the `openai/whisper-base` encoder with a contrastive learning approach, using a mix of triplet loss and NT-Xent loss to make embeddings robust and speaker-discriminative. Trained on English speech from Common Voice 17 and VoxCeleb2 datasets, it shines in clean studio settings but holds its own in noisier environments too.
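
To make the two objectives named above a bit more concrete, here is a minimal pure-Python sketch of a triplet loss and an NT-Xent loss over speaker embeddings. It is an illustration only, not the training code for this model: the function names, the cosine-distance formulation, the `margin=0.2` and `temperature=0.1` defaults, and the pairing convention (row `i` of `views_a` and `views_b` comes from the same speaker) are all assumptions, and the card does not say how the two losses are weighted.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge loss on cosine distance (margin is an assumed value).

    Zero once the negative is at least `margin` farther from the
    anchor than the positive is.
    """
    d_pos = 1.0 - cosine_similarity(anchor, positive)
    d_neg = 1.0 - cosine_similarity(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


def nt_xent_loss(views_a, views_b, temperature=0.1):
    """NT-Xent over N positive pairs (temperature is an assumed value).

    views_a[i] and views_b[i] are embeddings of two utterances from the
    same speaker; every other embedding in the batch acts as a negative.
    """
    z = views_a + views_b
    n = len(views_a)
    total = 0.0
    for i in range(2 * n):
        j = (i + n) % (2 * n)  # index of the positive partner of i
        pos = cosine_similarity(z[i], z[j]) / temperature
        # Denominator runs over every embedding except i itself.
        logits = [
            cosine_similarity(z[i], z[k]) / temperature
            for k in range(2 * n)
            if k != i
        ]
        total += math.log(sum(math.exp(v) for v in logits)) - pos
    return total / (2 * n)
```

With toy 2-D vectors, a batch whose pairs line up by speaker should score a much lower NT-Xent loss than one whose pairs are mismatched, which is the behaviour that pushes same-speaker embeddings together and different-speaker embeddings apart.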
