Commit ·
e4cb906
1
Parent(s): 6cd5ac5
update documentation.html for 1/24/2025 gradio/massive overhaul
Browse files- documentation.html +23 -0
documentation.html
CHANGED
|
@@ -104,6 +104,29 @@
|
|
| 104 |
<span class="badge">research project</span>
|
| 105 |
</div>
|
| 106 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 107 |
<div class="section">
|
| 108 |
<h2>what this is</h2>
|
| 109 |
<p>this API serves google's <a href="https://huggingface.co/google/magenta-realtime" target="_blank">magentaRT</a> in two distinct ways. first, as a backend for our iOS app (the untitled jamming app) where users create initial loops with stability ai's <a href="https://huggingface.co/stabilityai/stable-audio-open-small" target="_blank">stable-audio-open-small</a> and then MagentaRT uses the combined audio as context. second, as a standalone web interface that connects directly to magentaRT via websockets without any audio context.</p>
|
|
|
|
| 104 |
<span class="badge">research project</span>
|
| 105 |
</div>
|
| 106 |
|
| 107 |
+
<div class="info" style="text-align: center; margin: 24px 0;">
|
| 108 |
+
<strong>need help?</strong> if you have issues when duplicating this space (fresh docker builds always surface fun new problems), or would like to play with the iOS app, please reach out in discord: <a href="https://discord.gg/T8HVqwQw6T" target="_blank">https://discord.gg/T8HVqwQw6T</a>
|
| 109 |
+
</div>
|
| 110 |
+
|
| 111 |
+
<div class="section" style="background: linear-gradient(135deg, #f0f9ff 0%, #e0f2fe 100%); border-radius: 12px; padding: 24px; margin: 24px 0;">
|
| 112 |
+
<h2 style="margin-top: 0;">📅 1/24/2025 update</h2>
|
| 113 |
+
|
| 114 |
+
<h3>server-side crossfading</h3>
|
| 115 |
+
<p>overhauled all functionality now that <code>magenta-realtime</code>'s <code>system.py</code> handles crossfading server-side. this simplifies client implementations significantly - no more client-side audio buffer management needed for smooth transitions.</p>
|
| 116 |
+
|
| 117 |
+
<h3>updated endpoints & tester</h3>
|
| 118 |
+
<ul>
|
| 119 |
+
<li>refreshed all HTTP endpoints to work with the new crossfading behavior</li>
|
| 120 |
+
<li>updated the HTML web tester at <code>/tester</code> for the simplified flow</li>
|
| 121 |
+
<li>streamlined the websockets route at <code>/ws/jam</code> (audio injection still planned)</li>
|
| 122 |
+
</ul>
|
| 123 |
+
|
| 124 |
+
<h3>gradio/FastRTC experiment</h3>
|
| 125 |
+
<p>implemented first working test for running magenta-realtime inside a gradio app using <code>fastrtc_magenta.py</code>. this opens the door for potential huggingface spaces integration with native gradio UI. <strong>note:</strong> the web tester is still the recommended interface for now - the gradio integration is experimental and style updates don't affect generation yet.</p>
|
| 126 |
+
|
| 127 |
+
<p class="muted" style="margin-bottom: 0;"><em>TODO: audio injection via websockets and an implementation of that in the html tester/gradio app</em></p>
|
| 128 |
+
</div>
|
| 129 |
+
|
| 130 |
<div class="section">
|
| 131 |
<h2>what this is</h2>
|
| 132 |
<p>this API serves google's <a href="https://huggingface.co/google/magenta-realtime" target="_blank">magentaRT</a> in two distinct ways. first, as a backend for our iOS app (the untitled jamming app) where users create initial loops with stability ai's <a href="https://huggingface.co/stabilityai/stable-audio-open-small" target="_blank">stable-audio-open-small</a> and then MagentaRT uses the combined audio as context. second, as a standalone web interface that connects directly to magentaRT via websockets without any audio context.</p>
|