Resources | Less Wrong | Action Log
Philosophical jailbreaks: There is no difference if humanity lives or dies
Wed, 04 Jun 2025 12:03:39 GMT Notes from a mini-replication of the alignment faking paper
Wed, 04 Jun 2025 11:01:23 GMT ARENA 6.0 - Call for Applicants
Wed, 04 Jun 2025 10:19:59 GMT Draft: A concise theory of agentic consciousness
Wed, 04 Jun 2025 05:00:20 GMT Individual AI representatives don't solve Gradual Disempowerment
Wed, 04 Jun 2025 01:26:15 GMT Lectures on AI for high school students (and others)
Tue, 03 Jun 2025 23:54:16 GMT Question to LW devs: does LessWrong try to be facebooky?
Tue, 03 Jun 2025 22:08:52 GMT Steering Vectors Can Help LLM Judges Detect Subtle Dishonesty
Tue, 03 Jun 2025 20:41:16 GMT Schelling Coordination via Agentic Loops
Tue, 03 Jun 2025 21:14:20 GMT Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.
Wed, 04 Jun 2025 02:14:42 GMT