Main »

Lesser Wrong Feed

 Graph Generator | AppPages | Russian fonts demo 

 The Most Forbidden Technique is not always forbidden
 Sat, 18 Jul 2026 00:59:57 GMT
 Should we benchmark conceptual capabilities using judgment prediction tasks?
 Fri, 17 Jul 2026 23:42:45 GMT
 A list of existing alignment approaches
 Fri, 17 Jul 2026 22:46:16 GMT
 Longtermism is very intuitive.
 Fri, 17 Jul 2026 23:31:29 GMT
 AIs finetune their own leader: A barking simpleton
 Fri, 17 Jul 2026 20:10:44 GMT
 Don't default to nonprofit
 Fri, 17 Jul 2026 19:54:48 GMT
 Studying the role of Sandboxing for AI Control
 Fri, 17 Jul 2026 19:05:10 GMT
 Announcing the Corrigibility Research Fund
 Fri, 17 Jul 2026 18:06:32 GMT
 Would your AI travel agent book a bullfight? Testing whether agents consider animal welfare without being prompted
 Fri, 17 Jul 2026 17:28:25 GMT
 Before values settle
 Fri, 17 Jul 2026 16:26:57 GMT

Categories: AppPages | LessWrong