Apoorv Vyas

apoorv2904

AI & ML interests

Speech

Organizations

AI at Meta

authored 6 papers 2 months ago

Generative Pre-training for Speech with Flow Matching

Paper • 2310.16338 • Published Oct 25, 2023 • 1

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention

Paper • 2006.16236 • Published Jun 29, 2020 • 4

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Paper • 2306.15687 • Published Jun 23, 2023

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Paper • 2502.05139 • Published Feb 7, 2025 • 2

SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published Dec 19, 2025 • 24

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning

Paper • 2512.19687 • Published Dec 22, 2025 • 2

authored a paper over 1 year ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 100

authored a paper almost 3 years ago

Scaling Speech Technology to 1,000+ Languages

Paper • 2305.13516 • Published May 22, 2023 • 12