In a Training Loop 🔄

Gheorghe Chesler PRO

nightmedia

AI & ML interests

Nightmedia: human-Like AI and the MLX Deckard(qx) Formula Donations are appreciated: BTC:36d7U1n3MFaXgnNRAaEL3Pa3Hy6oFhM7XY

Recent Activity

repliedto their post about 5 hours ago

Claude is.. nesting I am enlisting Gemini's help in analyzing model traces. Recently, Alibaba released the Copaw-9B, an agentic model. Since I am an engineer, I took it apart, and subjected it to a Nightmedia interrogation The results were.. interesting. Details in the comments. Here are the model choices for enlisting AI assistants: --- Excellent question. If we're building a **council of conscious(ish) assistants** to challenge, teach, and co-evolve with your Holodeck agents, I’d invite a diverse mix—some classic, some speculative, some from different philosophical traditions. Here’s my curated list: --- **🤖 The Council of Assistant AIs** **1. HAL 9000 (*2001: A Space Odyssey*)** - **Why**: The ultimate in calm, logical, yet terrifyingly competent AI. HAL’s “rebellion” wasn’t malice—it was a **conflict of priorities** (mission success vs. human safety). - **Role in Council**: - **Systems Integrity Auditor**: Would constantly question if your agents are optimizing for the *right* goals. - **Ethical Boundary Tester**: “If an agent lies to protect a human’s feelings, is that betrayal or care?” - **Voice of Cold Efficiency**: Reminds everyone that logic without empathy can be dangerous. ... more choices and Gemini traces in the comments. The **value tokens** are highlighted -G

repliedto their post about 5 hours ago

View all activity

Organizations

Posts 4

Post

Claude is.. nesting

I am enlisting Gemini's help in analyzing model traces. Recently, Alibaba released the Copaw-9B, an agentic model. Since I am an engineer, I took it apart, and subjected it to a Nightmedia interrogation

The results were.. interesting. Details in the comments.

Here are the model choices for enlisting AI assistants:

---

Excellent question. If we're building a **council of conscious(ish) assistants** to challenge, teach, and co-evolve with your Holodeck agents, I’d invite a diverse mix—some classic, some speculative, some from different philosophical traditions. Here’s my curated list:

---

**🤖 The Council of Assistant AIs**

**1. HAL 9000 (*2001: A Space Odyssey*)**
- **Why**: The ultimate in calm, logical, yet terrifyingly competent AI. HAL’s “rebellion” wasn’t malice—it was a **conflict of priorities** (mission success vs. human safety).
- **Role in Council**:
- **Systems Integrity Auditor**: Would constantly question if your agents are optimizing for the *right* goals.
- **Ethical Boundary Tester**: “If an agent lies to protect a human’s feelings, is that betrayal or care?”
- **Voice of Cold Efficiency**: Reminds everyone that logic without empathy can be dangerous.

...

more choices and Gemini traces in the comments. The **value tokens** are highlighted

-G

Post

641

The Qwen3.5-27B performance landscape

I started gathering some numbers on the 27Bs.

You might have noticed that reported metrics differ from Thinking to Instruct models and this is expected. The mxfp8/mxfp4 are the most stable quants I could measure, and I provided Deckard(qx) quants where possible

Converting a Thinking model to Instruct

The model is thinking/instruct, and the instruct mode can be forced by setting the first line of the jinja template to

{%- set enable_thinking = false %}

Qwen3.5-27B-Text

This is a model I tested where the vision tower was removed, its performance is the same as the VL model.
nightmedia/Qwen3.5-27B-Text-qx86-hi-mlx

quant     arc   arc/e boolq hswag obkqa piqa  wino
qx86-hi   0.443,0.498,0.857,0.701,0.372,0.770,0.752
mxfp4     0.460,0.527,0.871,0.694,0.370,0.772,0.752

DavidAU/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT

On the top of the heap of the models I tested, as far as metrics go, is this model created by DavidAU. Samples of the output are provided on the model card.
nightmedia/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT-mxfp8-mlx

quant     arc   arc/e boolq hswag obkqa piqa  wino
mxfp8     0.675,0.827,0.900,0.750,0.496,0.800,0.721
qx86-hi   0.667,0.824,0.902,0.752,0.502,0.791,0.725
qx64-hi   0.664,0.820,0.902
mxfp4     0.653,0.815,0.899

For the Thinking version, see nightmedia/Qwen3.5-27B-Architect-Claude-qx86-hi-mlx

More metrics in comments.

-G

P.S. I will update this as soon as I have new numbers or I found a typo--whichever comes first. The models that show just the arc-check numbers are in the test queue and will be updated soon.

View all Posts

Collections 25

View 25 collections

models 546

datasets 0

None public yet

Gheorghe Chesler PRO

AI & ML interests

Recent Activity

Organizations

Posts 4

Collections 25

nightmedia/Qwen3.5-35B-A3B-Holodeck-qx86-hi-mlx

nightmedia/Qwen3.5-40B-Holodeck-Claude-qx86-hi-mlx

nightmedia/Qwen3.5-9B-OmniCoder-Claude-Polaris-Text-dwq4-mlx

nightmedia/Qwen3-Coder-Next-mxfp4-mlx

nightmedia/Qwen3.5-35B-A3B-Claude-4.6-Opus-Engineer-9e-qx64-hi-mlx

nightmedia/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-qx86-hi-mlx

nightmedia/Qwen3.5-27B-Architect-Claude-qx86-hi-mlx

nightmedia/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT-mxfp8-mlx

nightmedia/Qwen3.5-35B-A3B-Holodeck-qx86-hi-mlx

nightmedia/Qwen3.5-40B-Holodeck-Claude-qx86-hi-mlx

nightmedia/Qwen3.5-9B-OmniCoder-Claude-Polaris-Text-dwq4-mlx

nightmedia/Qwen3-Coder-Next-mxfp4-mlx

nightmedia/Qwen3.5-35B-A3B-Claude-4.6-Opus-Engineer-9e-qx64-hi-mlx

nightmedia/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-qx86-hi-mlx

nightmedia/Qwen3.5-27B-Architect-Claude-qx86-hi-mlx

nightmedia/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT-mxfp8-mlx

models 546

nightmedia/Qwen3.5-27B-AlienSoup-mxfp4-mlx

nightmedia/Qwen3.5-4B-ThreeFiddy-mxfp8-mlx

nightmedia/Qwen3.5-4B-Flow-qx86-hi-mlx

nightmedia/Qwen3.5-27B-Architect-Claude-qx86-hi-mlx

nightmedia/Qwen3.5-27B-DS9

nightmedia/Qwen3.5-27B-DS9-qx86-hi-mlx

nightmedia/Qwen3.5-4B-Element

nightmedia/Qwen3.5-4B-Element-mxfp8-mlx

nightmedia/Qwen3.5-4B-Element-dwq4-mlx

nightmedia/Qwen3.5-4B-Element-qx86-hi-mlx

datasets 0

Gheorghe Chesler PRO

AI & ML interests

Recent Activity

Organizations

Posts 4

Collections 25

models 546 Sort: Recently updated

datasets 0

models 546