Update README.md
README.md
CHANGED
@@ -13,10 +13,6 @@ tags:
 - speech
 ---
 
-
-Create README.md
-# Model Card for johbac/voice-embedder-base
-
 ## Model Description
 Hey there! This is `voice-embedder-base`, a model that generates speaker embeddings—compact vectors that capture unique vocal characteristics for tasks like speaker verification, clustering, or voice similarity retrieval. It’s built by fine-tuning the `openai/whisper-base` encoder with a contrastive learning approach, using a mix of triplet loss and NT-Xent loss to make embeddings robust and speaker-discriminative. Trained on English speech from Common Voice 17 and VoxCeleb2 datasets, it shines in clean studio settings but holds its own in noisier environments too.
 
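
The Model Description in this diff says the embeddings were trained with a mix of triplet loss and NT-Xent loss, but it gives no formula, weighting, or hyperparameters. As a rough sketch only, the two losses could be combined for a single anchor as shown here. The cosine-distance form of the triplet loss, the hardest-negative selection, the single-anchor form of NT-Xent, and the `margin`, `temperature`, and `alpha` values are all assumptions for illustration, not the author's training code.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge loss on cosine distance (margin is an assumed value).

    Zero once the positive is closer to the anchor than the negative
    by at least `margin`.
    """
    d_pos = 1.0 - cosine(anchor, positive)
    d_neg = 1.0 - cosine(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


def nt_xent_loss(anchor, positive, negatives, temperature=0.1):
    """Single-anchor NT-Xent: temperature-scaled cross-entropy of the
    positive against the positive plus all negatives."""
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, n) / temperature for n in negatives]
    peak = max(logits)  # subtract the max for a stable log-sum-exp
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_sum - logits[0]


def combined_loss(anchor, positive, negatives, alpha=0.5):
    """Weighted mix of both losses; `alpha` is a hypothetical weight."""
    # Use the negative most similar to the anchor for the triplet term.
    hardest = max(negatives, key=lambda n: cosine(anchor, n))
    return (alpha * triplet_loss(anchor, positive, hardest)
            + (1.0 - alpha) * nt_xent_loss(anchor, positive, negatives))


if __name__ == "__main__":
    # Toy 2-D "embeddings": same speaker vs. two other speakers.
    anchor = [1.0, 0.0]
    positive = [0.9, 0.1]
    negatives = [[0.0, 1.0], [0.5, 0.5]]
    print(combined_loss(anchor, positive, negatives))
```

In a real training loop the vectors would come from the fine-tuned encoder and the losses would be computed over batches with an autodiff framework. The plain-Python version only makes the arithmetic of each term visible.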