Tagged "ai-safety"

Claude Code Permissions Hook – Delegate Permission Approval to LLM 20 March 2026
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment 6 March 2026
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions 4 March 2026
AgentLens – Open-Source Observability for AI Agents 1 March 2026
Meta's OpenClaw Release Raises Questions About Open-Source Model Safety and Alignment 24 February 2026
Anthropic Releases Claude Opus 4.6 Sabotage Risk Assessment 11 February 2026