RiverRider (River Rider)

liked a Space about 23 hours ago

SRT introspect

🧭

2

Adaptive-density reasoning traces over a frozen Qwen-2.5-7B

reacted to their post with 🔥 1 day ago

Post

2221

This is not the end of words. It is the end of pretending their meanings are determined.

Meaning Forks. SRT detects it.

Paste any text to identify contested terms

RiverRider/srt-introspect

Try any prompt (attached link) to see exactly what an LLM is thinking at every meaningful step of its answer

RiverRider/srt-introspect

Repository

https://github.com/space-bacon/SRT

Paper

https://github.com/space-bacon/SRT/blob/main/paper_nla.md

Explainer

https://github.com/space-bacon/SRT/blob/main/docs/EXPLAINERS.md

posted an update 2 days ago

Post

2221

This is not the end of words. It is the end of pretending their meanings are determined.

Meaning Forks. SRT detects it.

Paste any text to identify contested terms

RiverRider/srt-introspect

Try any prompt (attached link) to see exactly what an LLM is thinking at every meaningful step of its answer

RiverRider/srt-introspect

Repository

https://github.com/space-bacon/SRT

Paper

https://github.com/space-bacon/SRT/blob/main/paper_nla.md

Explainer

https://github.com/space-bacon/SRT/blob/main/docs/EXPLAINERS.md

reacted to their post with 🔥 7 days ago

Post

4809

SRT-introspect: Live Token-by-Token Readout of LLM Internal Reasoning

I have released SRT-introspect, a new public demonstration that makes the hidden reasoning process of a frozen large language model visible in real time.

The interface runs a Qwen-2.5-7B backbone equipped with the SRT Adapter and Activation Verbalizer. As the model generates each token, the system continuously measures divergence across attention heads, identifies high-signal moments, and translates the corresponding hidden-state object representations into natural-language verbalizations. You see exactly what the model is internally representing at the precise points where its computation is most active, complete with divergence scores, reflexivity estimates, and per-layer traces.

This is not a summary of the final output. It is a direct window into the model’s latent conceptual landscape, showing the dominant training-data attractors that activate even when the prompt asks for first-principles reasoning. The adaptive scheduler concentrates verbalizations precisely where the real internal work occurs, turning what used to be opaque black-box generation into observable, analyzable data.

The result is the clearest public demonstration yet that modern LLMs possess a rich, structured semiotic infrastructure that can now be audited without retraining or fine-tuning.

Try it:
RiverRider/srt-introspect

posted an update 7 days ago

Post

4809

SRT-introspect: Live Token-by-Token Readout of LLM Internal Reasoning

I have released SRT-introspect, a new public demonstration that makes the hidden reasoning process of a frozen large language model visible in real time.

The interface runs a Qwen-2.5-7B backbone equipped with the SRT Adapter and Activation Verbalizer. As the model generates each token, the system continuously measures divergence across attention heads, identifies high-signal moments, and translates the corresponding hidden-state object representations into natural-language verbalizations. You see exactly what the model is internally representing at the precise points where its computation is most active, complete with divergence scores, reflexivity estimates, and per-layer traces.

This is not a summary of the final output. It is a direct window into the model’s latent conceptual landscape, showing the dominant training-data attractors that activate even when the prompt asks for first-principles reasoning. The adaptive scheduler concentrates verbalizations precisely where the real internal work occurs, turning what used to be opaque black-box generation into observable, analyzable data.

The result is the clearest public demonstration yet that modern LLMs possess a rich, structured semiotic infrastructure that can now be audited without retraining or fine-tuning.

Try it:
RiverRider/srt-introspect

updated a Space 8 days ago

SRT introspect

🧭

2

Adaptive-density reasoning traces over a frozen Qwen-2.5-7B

published a Space 8 days ago

SRT introspect

🧭

2

Adaptive-density reasoning traces over a frozen Qwen-2.5-7B

reacted to their post with 👀 9 days ago

Post

218

A single forward pass of the frozen Qwen-2.5-7B model plus a lightweight classifier reaches 0.866 plus or minus 0.011 AUC on the full TruthfulQA-MC2 benchmark. No adapters. No fine-tuning. No extra parameters on the backbone.

This is the strongest hidden-state truthfulness detector reported on the benchmark to date.

The same latent features that the SRT-NLA-AV-v1 demo reads out as coherent natural-language verbalizations turn out to be rich enough to support production-grade auditing for honesty versus hallucination. The internal semiotic infrastructure we have been exploring in public is already information-dense enough to solve hard downstream problems with almost trivial overhead.

You can watch the underlying latent geometry in action right here:
RiverRider/srt-nla-av-v1-demo

Full code, artifacts, and reproduction steps are in the repository:
https://github.com/space-bacon/SRT

Try the Glass Box
RiverRider/srt-nla-demo

posted an update 12 days ago

Post

218

A single forward pass of the frozen Qwen-2.5-7B model plus a lightweight classifier reaches 0.866 plus or minus 0.011 AUC on the full TruthfulQA-MC2 benchmark. No adapters. No fine-tuning. No extra parameters on the backbone.

This is the strongest hidden-state truthfulness detector reported on the benchmark to date.

The same latent features that the SRT-NLA-AV-v1 demo reads out as coherent natural-language verbalizations turn out to be rich enough to support production-grade auditing for honesty versus hallucination. The internal semiotic infrastructure we have been exploring in public is already information-dense enough to solve hard downstream problems with almost trivial overhead.

You can watch the underlying latent geometry in action right here:
RiverRider/srt-nla-av-v1-demo

Full code, artifacts, and reproduction steps are in the repository:
https://github.com/space-bacon/SRT

Try the Glass Box
RiverRider/srt-nla-demo

updated a Space 13 days ago

MindReader-NLA

🧠

3

Ask a frozen LM what it is thinking, in plain English.

reacted to their post with 🔥 13 days ago

Post

412

🧠 New Space: MindReader-NLA — ask a frozen LM what it's thinking, in plain English.

A trained Activation Verbalizer (~5–13M params, frozen backbone) over Qwen-2.5-7B, Llama-3.2-3B, and Gemma-2-2B. Three demos in one Space:

Playground — sample K verbalizations of the layer-L hidden state and score how well each reproduces the original activation when fed back through the same frozen model (raw + anisotropy-centred cosine FVE).

Live Thought Trace — stream a verbalization per token as the model writes, side-by-side with the generation.

Steer-by-Editing — edit the verbalized thought, project it back into hidden-state space, and watch the continuation change.

Runs on ZeroGPU. Try it: RiverRider/srt-nla-demo

Paper + code: https://github.com/space-bacon/SRT

liked a Space 13 days ago

MindReader-NLA

🧠

3

Ask a frozen LM what it is thinking, in plain English.

posted an update 14 days ago

Post

412

🧠 New Space: MindReader-NLA — ask a frozen LM what it's thinking, in plain English.

A trained Activation Verbalizer (~5–13M params, frozen backbone) over Qwen-2.5-7B, Llama-3.2-3B, and Gemma-2-2B. Three demos in one Space:

Playground — sample K verbalizations of the layer-L hidden state and score how well each reproduces the original activation when fed back through the same frozen model (raw + anisotropy-centred cosine FVE).

Live Thought Trace — stream a verbalization per token as the model writes, side-by-side with the generation.

Steer-by-Editing — edit the verbalized thought, project it back into hidden-state space, and watch the continuation change.

Runs on ZeroGPU. Try it: RiverRider/srt-nla-demo

Paper + code: https://github.com/space-bacon/SRT