AI Safety
Ensuring AI systems do what you intended — and stop when they shouldn't continue.
AI safety is concerned with ensuring that AI systems behave as intended, avoid causing harm, and remain under meaningful human control. For organizations deploying AI today, it has a practical near-term dimension: outputs should be accurate, systems should resist misuse and manipulation, humans should remain in the loop for consequential decisions, and there should be clear accountability when something goes wrong. It also has a longer-term dimension as AI systems become more autonomous — questions about what actions they can take, how they can be interrupted, and who remains responsible for their behavior.
AI safety is increasingly a regulatory requirement, not just an ethical one. Jurisdictions across the EU, US, and Asia are introducing obligations for safety documentation, impact assessments, and human oversight of high-risk AI systems. Organizations that treat safety as a launch checklist rather than an ongoing practice will find themselves exposed — both to regulatory scrutiny and to the operational failures that safety measures are designed to prevent. Agentic AI systems, which take real-world actions, raise the stakes further: a system that can send emails, execute transactions, or modify data needs an interrupt mechanism before it goes anywhere near production.
Read next
Related concepts
Responsible AI
Responsible AI is the difference between an organization that says it uses AI ethically and one that can actually prove it.
Governance and RiskAI Governance
AI governance is the system that determines who can deploy AI, under what conditions, with what oversight, turning ad hoc experimentation into accountable organizational practice.
Governance and RiskHallucinations
Hallucinations are AI outputs that are confidently stated but factually wrong. The model isn't lying or guessing, it's generating plausible-sounding language that happens to be false.
Optional map
Concept neighborhood
Focused neighborhood
AI Safety
Ensuring AI systems do what you intended — and stop when they shouldn't continue.
In these paths
Selected concept
Directly related
One step further
via Responsible AI
via AI Governance
via Hallucinations
via Human-in-the-Loop