Definition
The challenge of understanding how generative AI modelling techniques influence model behaviour and outputs.
Controls & guardrails that address this
6 controls, grouped by control function, with the AI lifecycle stage(s) at which to apply each and the other risks it addresses.
Define explainability requirements at the design stage, aligned to regulatory obligations and the needs of affected users.
Select model architecture with explainability in mind. Prefer inherently interpretable models where performance requirements permit.
Design system prompts to elicit step-by-step chain-of-thought reasoning. Validate that the stated reasoning is accurate and reflects how the output was actually produced, rather than being a post-hoc rationalisation.
Implement counterfactual explanation generation for each AI decision. Validate the fidelity of these explanations to the model's actual behaviour before deployment.
Provide contextually appropriate explanations of AI decisions to affected users in the production interface.
Surface confidence indicators alongside AI outputs in production. Update indicators when confidence calibration drifts.
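As an illustration of the counterfactual explanation control above, the following is a minimal sketch, not a prescribed method. It assumes the model can be called as a black box on a dictionary of numeric features. The names `find_counterfactual`, `flips_decision`, and `toy_model`, and the single-feature search strategy, are hypothetical choices made for this example. The fidelity check simply re-queries the model to confirm that the proposed change really alters the decision.

```python
from typing import Callable, Dict, Iterable, Optional, Tuple

Features = Dict[str, float]
Model = Callable[[Features], int]


def find_counterfactual(
    model: Model,
    instance: Features,
    candidate_values: Dict[str, Iterable[float]],
) -> Optional[Tuple[str, float, Features]]:
    """Return the smallest single-feature change that flips the model's decision.

    Assumption: distance is the absolute change in one feature, which only
    makes sense when features are on comparable scales.
    """
    original = model(instance)
    best = None  # (distance, feature name, new value, changed instance)
    for name, values in candidate_values.items():
        for value in values:
            changed = dict(instance)
            changed[name] = value
            if model(changed) != original:
                distance = abs(value - instance[name])
                if best is None or distance < best[0]:
                    best = (distance, name, value, changed)
    if best is None:
        return None
    return best[1], best[2], best[3]


def flips_decision(model: Model, instance: Features, counterfactual: Features) -> bool:
    """Fidelity check: the counterfactual must actually change the model output."""
    return model(counterfactual) != model(instance)


def toy_model(features: Features) -> int:
    """Hypothetical stand-in for the deployed model: 1 = approve, 0 = decline."""
    return int(features["income"] >= 50.0 and features["debt"] <= 20.0)


applicant = {"income": 40.0, "debt": 10.0}
result = find_counterfactual(
    toy_model,
    applicant,
    {"income": [45.0, 50.0, 60.0], "debt": [5.0, 0.0]},
)
```

In this toy case the search proposes raising `income` to 50.0, which could be presented to the user as "the decision would have been different if income were 50". A counterfactual produced by any other generator, such as a surrogate model, could be passed through `flips_decision` before it is shown.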
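For the confidence indicator control above, one common way to detect calibration drift is to track expected calibration error (ECE) on labelled production samples and compare it with a baseline measured at deployment. The sketch below assumes that confidences lie in [0, 1] and that ground-truth correctness labels are available. The function names, the bin count, and the 0.05 tolerance are illustrative assumptions, not values taken from this catalogue.

```python
from typing import List, Sequence, Tuple


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Weighted average of |accuracy - mean confidence| over equal-width bins."""
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    if len(confidences) == 0:
        raise ValueError("at least one sample is required")

    bins: List[List[Tuple[float, bool]]] = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        if not 0.0 <= conf <= 1.0:
            raise ValueError("confidence values must lie in [0, 1]")
        # A confidence of exactly 1.0 falls into the last bin.
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, ok))

    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece


def calibration_drifted(
    baseline_ece: float,
    current_ece: float,
    tolerance: float = 0.05,  # assumed threshold; set per use case
) -> bool:
    """Flag drift when ECE has worsened by more than the tolerance."""
    return current_ece - baseline_ece > tolerance
```

When `calibration_drifted` returns true, the displayed indicators would be recalibrated or withdrawn until confidence can be trusted again. The appropriate tolerance depends on how much users rely on the indicator.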