In a Training Loop 🔄

Gheorghe Chesler PRO

nightmedia

AI & ML interests

Nightmedia: human-like AI and the MLX Deckard(qx) formula. Donations are appreciated: BTC: 36d7U1n3MFaXgnNRAaEL3Pa3Hy6oFhM7XY

Recent Activity

replied to their post about 3 hours ago
The Qwen3.5-27B performance landscape

I started gathering some numbers on the 27Bs. You might have noticed that reported metrics differ between Thinking and Instruct models; this is expected. The mxfp8/mxfp4 are the most stable quants I could measure, and I provided Deckard(qx) quants where possible.

Converting a Thinking model to Instruct

The model is thinking/instruct, and instruct mode can be forced by setting the first line of the Jinja template to:

```
{%- set enable_thinking = false %}
```

Qwen3.5-27B-Text

This is a model I tested with the vision tower removed; its performance is the same as the VL model's.
https://huggingface.co/nightmedia/Qwen3.5-27B-Text-qx86-hi-mlx

```
quant    arc    arc/e  boolq  hswag  obkqa  piqa   wino
qx86-hi  0.443  0.498  0.857  0.701  0.372  0.770  0.752
mxfp4    0.460  0.527  0.871  0.694  0.370  0.772  0.752
```

DavidAU/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT

At the top of the heap of the models I tested, as far as metrics go, is this model created by DavidAU. Samples of the output are provided on the model card.
https://huggingface.co/nightmedia/Qwen3.5-27B-Claude-4.6-OS-INSTRUCT-mxfp8-mlx

```
quant    arc    arc/e  boolq  hswag  obkqa  piqa   wino
mxfp8    0.675  0.827  0.900  0.750  0.496  0.800  0.721
qx86-hi  0.667  0.824  0.902  0.752  0.502  0.791  0.725
qx64-hi  0.664  0.820  0.902
mxfp4    0.653  0.815  0.899
```

For the Thinking version, see https://huggingface.co/nightmedia/Qwen3.5-27B-Architect-Claude-qx86-hi-mlx

More metrics in the comments. -G

P.S. I will update this as soon as I have new numbers or I find a typo, whichever comes first. The models that show only the arc-check numbers are in the test queue and will be updated soon.
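The thinking-to-instruct conversion above amounts to prepending one override line to the model's Jinja chat template. A minimal sketch of that edit, assuming the template is available as a plain string (the toy template below is hypothetical, not the model's actual template):

```python
# Minimal sketch: force instruct mode by prepending the enable_thinking
# override as the first line of a Jinja chat template string.
OVERRIDE = "{%- set enable_thinking = false %}"

def force_instruct(chat_template: str) -> str:
    """Return the template with thinking disabled; idempotent if already set."""
    if chat_template.lstrip().startswith(OVERRIDE):
        return chat_template
    return OVERRIDE + "\n" + chat_template

# Toy template for illustration only:
template = "{%- for message in messages %}{{ message.content }}{%- endfor %}"
patched = force_instruct(template)
print(patched.splitlines()[0])  # -> {%- set enable_thinking = false %}
```

In practice you would apply this to the `chat_template` field of the model's `tokenizer_config.json` (or wherever the deployment stores the template) and reload the model.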

Organizations

Nightmedia AI