Skip to content
Pillar I: AI Organization · § 08

AI risk management, six steps

Every AI feature at Bizzi passes a six-step risk assessment before it reaches production. This is the operating spine of AI governance. The steps run in order. The artefacts they produce are durable. Nothing ships without all six.

  1. 1
    Classify use case
  2. 2
    Identify threats
  3. 3
    Measure and score
  4. 4
    Set mitigations
  5. 5
    Red-team
  6. 6
    Monitor in production
quarterly re-assessment
Six-step AI risk framework

Rate each use case Low / Medium / High.

  • Low. AI assists document search or simple categorization. No financial impact.
  • Medium. AI summarizes a contract or suggests an accounting code. A mistake annoys but is easy to reverse.
  • High. AI auto-reconciles and issues a payment instruction. A mistake creates direct financial loss.

The classification drives how strict the controls in the later steps need to be.

Map both technical and non-technical threats:

  • Technical. Model drift, hallucination, prompt injection, data poisoning.
  • Non-technical. Reputation (a public AI failure), Legal (Decree 13/2023 violation), Compliance (a customer SLA breach).

Use a 5×5 impact-by-likelihood matrix:

RarePossibleModerateHighCertain
CatastrophicMHHEE
MajorMMHHE
ModerateLMMHH
MinorLLMMH
NegligibleLLLMM

L = Low, M = Medium, H = High, E = Extreme. Any High use case scoring Extreme requires AI Board approval.

Apply guardrails by tier:

  • Low / Medium. Output guardrails, audit logging, confidence threshold in the UI.
  • High / Extreme. All of the above plus maker-checker (HITL), PII redaction, output sandboxing, strict rate limits, and a pre-wired kill-switch.

The internal security team attacks the model before it ships. Scope:

  • Prompt injection (direct and indirect via PDF).
  • Jailbreak attempts.
  • Sensitive-information extraction.
  • Excessive-agency tests.
  • Denial-of-wallet patterns.

Findings are graded by severity. Every High or Extreme finding closes before release.

Once live, track continuously:

  • Latency.
  • Cost per Transaction.
  • Error rate.
  • Drift (PSI).
  • Hallucination rate from LLM-as-a-Judge sampling.

When any metric breaches its warning band, the feature returns to Step 1 for re-classification. This re-assessment runs automatically every quarter for every production AI feature.