thecollabagepatch commited on
Commit
e4cb906
·
1 Parent(s): 6cd5ac5

update documentation.html for 1/24/2025 gradio/massive overhaul

Browse files
Files changed (1) hide show
  1. documentation.html +23 -0
documentation.html CHANGED
@@ -104,6 +104,29 @@
104
  <span class="badge">research project</span>
105
  </div>
106
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
107
  <div class="section">
108
  <h2>what this is</h2>
109
  <p>this API serves google's <a href="https://huggingface.co/google/magenta-realtime" target="_blank">magentaRT</a> in two distinct ways. first, as a backend for our iOS app (the untitled jamming app) where users create initial loops with stability ai's <a href="https://huggingface.co/stabilityai/stable-audio-open-small" target="_blank">stable-audio-open-small</a> and then MagentaRT uses the combined audio as context. second, as a standalone web interface that connects directly to magentaRT via websockets without any audio context.</p>
 
104
  <span class="badge">research project</span>
105
  </div>
106
 
107
+ <div class="info" style="text-align: center; margin: 24px 0;">
108
+ <strong>need help?</strong> if you have issues when duplicating this space (fresh docker builds always surface fun new problems), or would like to play with the iOS app, please reach out in discord: <a href="https://discord.gg/T8HVqwQw6T" target="_blank">https://discord.gg/T8HVqwQw6T</a>
109
+ </div>
110
+
111
+ <div class="section" style="background: linear-gradient(135deg, #f0f9ff 0%, #e0f2fe 100%); border-radius: 12px; padding: 24px; margin: 24px 0;">
112
+ <h2 style="margin-top: 0;">📅 1/24/2025 update</h2>
113
+
114
+ <h3>server-side crossfading</h3>
115
+ <p>overhauled all functionality now that <code>magenta-realtime</code>'s <code>system.py</code> handles crossfading server-side. this simplifies client implementations significantly - no more client-side audio buffer management needed for smooth transitions.</p>
116
+
117
+ <h3>updated endpoints & tester</h3>
118
+ <ul>
119
+ <li>refreshed all HTTP endpoints to work with the new crossfading behavior</li>
120
+ <li>updated the HTML web tester at <code>/tester</code> for the simplified flow</li>
121
+ <li>streamlined the websockets route at <code>/ws/jam</code> (audio injection still planned)</li>
122
+ </ul>
123
+
124
+ <h3>gradio/FastRTC experiment</h3>
125
+ <p>implemented first working test for running magenta-realtime inside a gradio app using <code>fastrtc_magenta.py</code>. this opens the door for potential huggingface spaces integration with native gradio UI. <strong>note:</strong> the web tester is still the recommended interface for now - the gradio integration is experimental and style updates don't affect generation yet.</p>
126
+
127
+ <p class="muted" style="margin-bottom: 0;"><em>TODO: audio injection via websockets and an implementation of that in the html tester/gradio app</em></p>
128
+ </div>
129
+
130
  <div class="section">
131
  <h2>what this is</h2>
132
  <p>this API serves google's <a href="https://huggingface.co/google/magenta-realtime" target="_blank">magentaRT</a> in two distinct ways. first, as a backend for our iOS app (the untitled jamming app) where users create initial loops with stability ai's <a href="https://huggingface.co/stabilityai/stable-audio-open-small" target="_blank">stable-audio-open-small</a> and then MagentaRT uses the combined audio as context. second, as a standalone web interface that connects directly to magentaRT via websockets without any audio context.</p>