Graph Generator | AppPages | Russian fonts demo
Resources | Less Wrong | Action Log
Attempting to influence transformer representations via initialization
Tue, 13 Jan 2026 00:49:47 GMT When does competition lead to recognisable values?
Mon, 12 Jan 2026 23:13:18 GMT Lies, Damned Lies, and Proofs: Formal Methods are not Slopless
Mon, 12 Jan 2026 22:32:18 GMT Dating Roundup #10: Gendered Expectations
Mon, 12 Jan 2026 20:30:20 GMT Automated Interpretability-Driven Model Auditing and Control: A Research Agenda
Mon, 12 Jan 2026 19:55:21 GMT Tensor-Transformer Variants are Surprisingly Performant
Mon, 12 Jan 2026 19:43:15 GMT The Algorithm Rewards Engagement
Mon, 12 Jan 2026 19:38:53 GMT BlackBoxQuery [BBQ]-Bench: Measuring Hypothesis Formation and Experimentation Capabilities in LLMs
Mon, 12 Jan 2026 21:50:16 GMT Understanding Agency through Markov Blankets
Mon, 12 Jan 2026 19:32:59 GMT Model Reduction as Interpretability: What Neuroscience Could Teach Us About Understanding Complex Systems
Mon, 12 Jan 2026 21:06:23 GMT