thestage.ai

Team

company

https://thestage.ai

TheStageAI

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

quazim new activity 3 days ago

TheStageAI/thewhisper-large-v3-turbo:Update README.md

quazim updated a model 7 days ago

TheStageAI/thewhisper-large-v3-turbo

hypothetical new activity 10 days ago

TheStageAI/README:Update README.md

View all activity

quazim

in TheStageAI/thewhisper-large-v3-turbo 3 days ago

Update README.md

#4 opened 3 days ago by

quazim

updated a model 7 days ago

TheStageAI/thewhisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated 3 days ago • 1.72k • 14

hypothetical

in TheStageAI/README 10 days ago

Update README.md

#2 opened 10 days ago by

hypothetical

posted an update 11 days ago

Post

2573

We thought it would be easier, but finally we have integrated CuDNN Paged Attention to our models!

Read article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8

Llama-8B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503

psynote123

published a model 11 days ago

TheStageAI/Wan2.2-ComfyUI

Updated 11 days ago

psynote123

updated a model 11 days ago

TheStageAI/Wan2.2-ComfyUI

Updated 11 days ago

hypothetical

published a model 11 days ago

TheStageAI/Elastic-Wan2.2-T2V-A14B-Diffusers

Text-to-Video • Updated Dec 1, 2025 • 4 • 1

hinairo

updated 2 models 12 days ago

TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503

Text Generation • Updated 12 days ago • 27 • 2

TheStageAI/Elastic-Llama-3.1-8B-Instruct

Text Generation • Updated 12 days ago • 41 • 3

hypothetical

in TheStageAI/thewhisper-large-v3-turbo 18 days ago

add languages, base model, update license

#3 opened 18 days ago by

hypothetical

posted an update 18 days ago

Post

2018

We have updated our transcription model: TheStageAI/thewhisper-large-v3-turbo

– 6.00 WER on the English Open ASR Leaderboard
– 4.74 WER on the Multilingual Open ASR Leaderboard
– Beats NVIDIA Parakeet (6.34 WER) and Whisper-large-v3-turbo (7.8 WER)
– Strong improvements in Arabic, Hindi, Chinese
– Maintains quality with background and environmental noise
– Optimized inference engines for NVIDIA and Apple
– Hugging Face Transformers interface for easy use
– Best-in-class speed on NVIDIA GPUs and power efficiency on Apple devices
– NVIDIA Jetson Thor support

2 replies

quazim

in TheStageAI/thewhisper-large-v3-turbo 18 days ago

update-checkpoint-v2

#2 opened 18 days ago by

quazim

hypothetical

updated a collection 19 days ago

Elastic Diffusers

Collection

HuggingFace Diffusers models accelerated by TheStage AI ANNA: Automated NNs Accelerator. • 6 items • Updated 19 days ago • 2

hypothetical

posted an update about 2 months ago

Post

266

Hello guys! Maybe someone want to test our framework for automated model's compression. Here is what can be produced with it. Move the slider - compress/accelerate model, select point which like and compile. I can give an access, we are now improving and collecting comments from users

TheStageAI/ANNA-LLM