๐Ÿ”AI RiskAtlas
โ† Risk Taxonomy
#3

Value misalignment

Risk taxonomy

Definition

Gen AI services, outputs and/or uses do not align with corporate or societal values.

Interactive deep-dive

This risk has an interactive treatment with technical detail, attack surface, detection signals, and scenarios.

Controls & guardrails that address this

5

Grouped by control function, with the AI lifecycle stage(s) to apply each and the other risks it addresses. Filter by control category below.

Control category
Preventive ยท 4
Ethical design assessment in onboarding

Conduct ethical design assessment at use case intake before build begins. Require sign-off by ethics or risk committee.

Lifecycle stage1 โ€“ Use Case Context & Design
Prohibited outputs and ethical boundaries in design doc

Define prohibited outputs and ethical boundary constraints in the use case design document before build.

Lifecycle stage1 โ€“ Use Case Context & Design
Content Moderation

Deploy content moderation controls aligned to S1 ethical constraints. Validate filter accuracy before deployment.

Lifecycle stage3 โ€“ Onboarding, Build & Review
Use of pre-trained models

Select a foundation model with documented safety fine-tuning (RLHF, Constitutional AI). Verify alignment benchmarks.

Lifecycle stage3 โ€“ Onboarding, Build & Review
Detective ยท 1
Test prioritisation

Prioritise value-misalignment test scenarios in validation. Block deployment if prohibited outputs are produced.

Lifecycle stage3 โ€“ Onboarding, Build & Review
Open these in the Control Library โ†’

Real-world cases

4

Actual published events that illustrate this risk โ€” click through for the writeup and sources.

Browse all real-world cases โ†’

Other risks in Ethics

AI RiskAtlas is an educational model of how GenAI & agentic systems work and fail. Architectures and payloads are illustrative and simplified for learning โ€” not operational guidance. Real-world cases are summarised from public reporting.

Sources & further reading โ†’ยทBuilt by Shi Yuan โ†—