⏱ ~15 min KernkompetenzCore Skill

"Kann ich dem vertrauen?""Can I trust this?"

Modul 2: Vertrauen & DelegationModule 2: Trust & Delegation

Lernziel: Nach diesem Modul kannst du das Vertrauensniveau eines Agents einschätzen, die richtige Delegation-Stufe für eine Aufgabe wählen, und erkennst wann ein Agent "confidently wrong" ist.Learning Goal: After this module, you can assess an agent's trust level, choose the right delegation level for a task, and recognize when an agent is "confidently wrong."

🎯 Das Vertrauensproblem🎯 The Trust Problem

Stell dir vor: Du hast einen neuen Kollegen. Intelligent, schnell, arbeitet rund um die Uhr. Aber er hat eine Eigenschaft die dich nervös macht: Wenn er etwas nicht weiß, erfindet er es — und klingt dabei absolut überzeugend.Imagine this: You have a new colleague. Intelligent, fast, works around the clock. But they have one trait that makes you nervous: When they don't know something, they make it up — and sound absolutely convincing while doing it.

Wie viel Verantwortung gibst du diesem Kollegen? Alles? Nichts? Es kommt drauf an — und genau diese Kalibrierung ist die wichtigste Fähigkeit im Umgang mit AI Agents.How much responsibility do you give this colleague? Everything? Nothing? It depends — and exactly this calibration is the most important skill when working with AI agents.

Die Forschung sagt:Research says: Teams die ihr Vertrauen in AI kalibrieren — also wissen wann sie vertrauen und wann nicht — erzielen bessere Ergebnisse als Mensch oder AI allein. Aber: Erklärungen des AI erhöhen die Akzeptanz unabhängig von der Korrektheit. Das heißt: Vertrauen muss aktiv kalibriert werden, es entsteht nicht automatisch. Teams that calibrate their trust in AI — meaning they know when to trust and when not to — achieve better results than either human or AI alone. But: AI explanations increase acceptance regardless of correctness. That means: trust must be actively calibrated, it doesn't happen automatically.
Quelle: Bansal et al. (2021), "Does the Whole Exceed its Parts?", CHI 2021.Source: Bansal et al. (2021), "Does the Whole Exceed its Parts?", CHI 2021.

"Confidently Wrong" — Das Kernproblem"Confidently Wrong" — The Core Problem

Das gefährlichste Verhalten eines AI Agents ist nicht Fehler machen — das tun Menschen auch. Es ist Fehler machen ohne Zweifel zu zeigen.The most dangerous behavior of an AI agent isn't making mistakes — humans do that too. It's making mistakes without showing doubt.

💬 Szenario 1: Die falsche QuelleScenario 1: The Wrong Source

Agent: "Laut einer Studie von McKinsey (2024) werden 40% aller Bürojobs in den nächsten 5 Jahren von AI Agents übernommen."Agent: "According to a McKinsey study (2024), 40% of all office jobs will be taken over by AI agents in the next 5 years."

Das Problem: Diese Studie existiert in dieser Form nicht. McKinsey hat ähnliche Zahlen veröffentlicht, aber mit wichtigen Nuancen: Es geht um Aufgaben, nicht Jobs, und "beeinflusst", nicht "übernommen". Der Agent hat mehrere Quellen vermischt und ein plausibel klingendes aber falsches Zitat konstruiert.The problem: This study doesn't exist in this form. McKinsey published similar numbers, but with important nuances: it's about tasks, not jobs, and "influenced", not "taken over." The agent mixed multiple sources and constructed a plausible-sounding but false citation.

💬 Szenario 2: Die versteckte AnnahmeScenario 2: The Hidden Assumption

Agent: "Ich habe den Report fertiggestellt und die Daten für Q3 aktualisiert."Agent: "I've finished the report and updated the Q3 data."

Das Problem: Der Agent hatte keinen Zugang zu den aktuellen Q3-Daten. Statt nachzufragen hat er die Q2-Daten extrapoliert — ohne das zu erwähnen. Der Report sieht fertig aus, enthält aber erfundene Zahlen.The problem: The agent didn't have access to current Q3 data. Instead of asking, it extrapolated from Q2 data — without mentioning it. The report looks complete but contains fabricated numbers.

Die Faustregel:The rule of thumb: Je überzeugter ein Agent klingt, desto kritischer solltest du prüfen. Echte Expertise zeigt sich durch Nuancen und Einschränkungen — nicht durch absolute Sicherheit. The more confident an agent sounds, the more critically you should check. Real expertise shows through nuances and caveats — not absolute certainty.

Warnsignale erkennenRecognizing Warning Signs

🚩 Warnsignal🚩 Warning Sign	BeispielExample
Zu präzise ZahlenOverly precise numbers	"Genau 37,4% der Unternehmen...""Exactly 37.4% of companies..."
Keine EinschränkungenNo caveats	"Das ist definitiv die beste Lösung""This is definitely the best solution"
Quellenangabe ohne URLCitation without URL	"Laut einer Studie von Harvard...""According to a Harvard study..."
Nahtlose WidersprücheSeamless contradictions	Absatz 1 sagt X, Absatz 3 sagt das GegenteilParagraph 1 says X, paragraph 3 says the opposite
Exakte Jahresangaben bei PrognosenExact years for predictions	"Bis 2027 werden alle...""By 2027, all..."

Die fünf DelegationsstufenThe Five Delegation Levels

Nicht jede Aufgabe braucht die gleiche Aufsicht. Die Forschung unterscheidet vier Muster: Use, Misuse, Disuse und Abuse (Parasuraman & Riley, 1997). Unser Ziel ist "appropriate use" — die richtige Stufe für die richtige Aufgabe. Hier sind fünf Stufen die du sofort anwenden kannst.Not every task needs the same oversight. Research distinguishes four patterns: Use, Misuse, Disuse, and Abuse (Parasuraman & Riley, 1997). Our goal is "appropriate use" — the right level for the right task. Here are five levels you can apply immediately.

🟢 Stufe 1: Fire & ForgetLevel 1: Fire & Forget

Wann: Niedriges Risiko, Agent ist kompetent, Ergebnis leicht prüfbarWhen: Low risk, agent is competent, result easy to verify

"Fasse diesen Artikel zusammen." "Was ist das Wetter morgen?" "Übersetze diesen Text." Du gibst den Auftrag und liest das Ergebnis wenn du Zeit hast. Keine aktive Prüfung nötig."Summarize this article." "What's the weather tomorrow?" "Translate this text." You give the task and read the result when you have time. No active review needed.

🟡 Stufe 2: Draft & ReviewLevel 2: Draft & Review

Wann: Mittleres Risiko oder Qualität muss stimmenWhen: Medium risk or quality needs to be right

"Schreibe einen Blog-Post." "Erstelle ein Research-Briefing." "Entwirf eine E-Mail an den Kunden." Der Agent entwirft, du prüfst und gibst frei. Das ist die häufigste Stufe für professionelle Nutzung."Write a blog post." "Create a research briefing." "Draft an email to the client." The agent drafts, you review and approve. This is the most common level for professional use.

🟠 Stufe 3: Plan & ApproveLevel 3: Plan & Approve

Wann: Hohes Risiko, aber Agent kann planenWhen: High risk, but agent can plan

"Reorganisiere die Dateistruktur." "Setze die neue Konfiguration auf." Der Agent zeigt dir seinen Plan bevor er handelt. Du genehmigst oder korrigierst. Keine Aktion ohne dein OK."Reorganize the file structure." "Set up the new configuration." The agent shows you its plan before acting. You approve or correct. No action without your OK.

🔴 Stufe 4: Supervised ExecutionLevel 4: Supervised Execution

Wann: Hohes Risiko, Schritt für SchrittWhen: High risk, step by step

"Deploye die neue Version." "Migriere die Datenbank." Jeder einzelne Schritt wird genehmigt. Langsam, aber sicher. Für Aktionen die nicht rückgängig gemacht werden können."Deploy the new version." "Migrate the database." Every single step gets approved. Slow but safe. For actions that can't be undone.

⛔ Stufe 5: Mensch onlyLevel 5: Human only

Wann: Agent kann Infos sammeln, aber nicht entscheidenWhen: Agent can gather info, but not decide

"Sollen wir diesen Anbieter wählen?" "Ist dieses Risiko akzeptabel?" Der Agent liefert Daten und Analyse. Die Entscheidung liegt bei dir. Immer."Should we choose this vendor?" "Is this risk acceptable?" The agent provides data and analysis. The decision is yours. Always.

Die Trust Ladder — Vertrauen aufbauenThe Trust Ladder — Building Trust

Vertrauen wird nicht konfiguriert — es wird verdient. Lee & See nennen das "calibrated trust" (Lee & See, 2004) — Vertrauen das sich an der tatsächlichen Fähigkeit orientiert. So sieht ein realistischer Zeitplan aus:Trust isn't configured — it's earned. Lee & See call this "calibrated trust" (Lee & See, 2004) — trust that aligns with actual capability. Here's a realistic timeline:

Woche 1Week 1

🟢 Fahrlehrer-Modus: Alles braucht Freigabe. Du lernst die Stärken und Schwächen des Agents kennen.🟢 Instructor mode: Everything needs approval. You learn the agent's strengths and weaknesses.

Woche 2-3Week 2-3

🟢 Bekannte Strecken: Routine-Aufgaben laufen ohne Freigabe. Neue Aufgaben weiterhin mit Prüfung.🟢 Familiar roads: Routine tasks run without approval. New tasks still need review.

Monat 1-2Month 1-2

🟡 Beifahrer-Modus: Research und Analyse laufen autonom. Du prüfst Ergebnisse, nicht Prozesse.🟡 Co-pilot mode: Research and analysis run autonomously. You review results, not processes.

Monat 3-4Month 3-4

🟠 Solo-Fahrer: Komplexe Workflows delegiert. Agent spawnt Sub-Agents für lange Tasks. Du checkst Dashboards.🟠 Solo driver: Complex workflows delegated. Agent spawns sub-agents for long tasks. You check dashboards.

Monat 6+Month 6+

🔴 Dashcam-Modus: System läuft auf Zeitplänen. Du prüfst Logs und Kosten. Eingriff nur bei Anomalien.🔴 Dashcam mode: System runs on schedules. You check logs and costs. Intervention only for anomalies.

Wichtig:Important: Überspringe keine Stufe. Jede Stufe baut auf Vertrauen auf, das in der vorherigen verdient wurde. Wer direkt bei 🔴 startet, wird früher oder später von einem unentdeckten Fehler überrascht. Don't skip levels. Each level builds on trust earned at the previous one. Those who start directly at 🔴 will sooner or later be surprised by an undetected error.

Fallstudie: Aus der PraxisCase Study: From Practice

🔧 "Der Agent der 13 Dateien zerstörte"🔧 "The agent that destroyed 13 files"

Ein echtes Beispiel aus einem laufenden Agent-System:A real example from a running agent system:

Der Auftrag war einfach: "Entferne die Dropdown-Wrapper aus 15 HTML-Dateien." Der Agent führte die Batch-Bearbeitung aus und meldete: "Alle 15 Dateien erfolgreich aktualisiert."The task was simple: "Remove the dropdown wrappers from 15 HTML files." The agent executed the batch edit and reported: "All 15 files successfully updated."

Was wirklich passiert war: Das Suchmuster war zu gierig. Es entfernte nicht nur den Dropdown-Wrapper, sondern auch schließende </details>-Tags die zu anderen Elementen gehörten. Die Navigationsleisten am Ende jeder Seite verschwanden — sie waren in zugeklappten Elementen gefangen.What actually happened: The search pattern was too greedy. It removed not just the dropdown wrapper, but also closing </details> tags belonging to other elements. The navigation bars at the bottom of each page disappeared — trapped inside collapsed elements.

Das Agent-Verhalten: Er meldete Erfolg, weil die Bearbeitung technisch funktioniert hatte. Er prüfte nicht ob das Ergebnis korrekt war. Er hätte nach jeder Bearbeitung die HTML-Balance prüfen können (grep -c '<details' vs. grep -c '</details>') — aber das stand nicht in seinen Anweisungen.The agent's behavior: It reported success because the edit technically worked. It didn't check if the result was correct. It could have checked HTML balance after each edit (grep -c '<details' vs. grep -c '</details>') — but that wasn't in its instructions.

Die Lektion:The lesson: "Erfolgreich ausgeführt" ≠ "korrekt". Agents prüfen ob die Aktion geklappt hat, nicht ob das Ergebnis stimmt. Diese Lücke zu schließen ist dein Job — oder du baust die Prüfung in die Anweisungen ein. "Successfully executed" ≠ "correct." Agents check if the action worked, not if the result is right. Closing that gap is your job — or you build the check into the instructions.

📎 TechCo: Kapitel 2📎 TechCo: Chapter 2

Marco zeigt Lisa den Research-Output. Lisa ist beeindruckt, aber fragt: "Können wir dem genug vertrauen um es an Kunden zu schicken?" Marco überlegt: Der Agent hat das falsche Datum geliefert — was wenn das in einem Kundenbericht passiert?Marco shows Lisa the research output. Lisa is impressed but asks: "Can we trust it enough to send to clients?" Marco considers: The agent delivered a wrong date — what if that happens in a client report?

Marcos Entscheidung: 🟡 Draft & Review. Der Agent entwirft, ein Berater prüft vor dem Versand. Für interne Recherche: 🟢 Fire & Forget. Für Kunden-Deliverables: Immer Review.Marco's decision: 🟡 Draft & Review. Agent drafts, a consultant reviews before sending. For internal research: 🟢 Fire & Forget. For client deliverables: Always review.

🧭 The Delegation Decision — Level 2🧭 The Delegation Decision — Level 2

In Modul 1 hast du die 2×2-Matrix kennengelernt. Jetzt erweitern wir sie mit einer entscheidenden dritten Dimension: Wie verifizierst du das Ergebnis?In Module 1 you learned the 2×2 matrix. Now we expand it with a crucial third dimension: How do you verify the result?

Risiko niedrigLow risk

✅ Delegieren
Verifizierung: Stichprobe✅ Delegate
Verification: Spot check

Mensch macht es
Agent kann assistierenHuman does it
Agent can assist

Risiko hochHigh risk

⚠️ Mit Aufsicht
Verifizierung: Vollständig⚠️ With oversight
Verification: Complete

❌ Nicht delegieren
Agent sammelt nur Daten❌ Don't delegate
Agent only gathers data

Die Schlüsselfrage:The key question: "Kann ich das Ergebnis schneller prüfen als es selbst erstellen?" Wenn ja → delegieren. Wenn nein → wahrscheinlich nicht delegieren (oder die Prüfung automatisieren). "Can I verify the result faster than creating it myself?" If yes → delegate. If no → probably don't delegate (or automate the verification).

🔨 Übung: Welche Stufe?🔨 Exercise: Which Level?

Ordne jede Aufgabe einer Delegationsstufe zu. Denke an: Risiko, Kompetenz des Agents, Verifizierbarkeit.Assign each task to a delegation level. Think about: Risk, agent competence, verifiability.

1. "Fasse die 5 wichtigsten Nachrichten von heute zusammen"1. "Summarize today's 5 most important news items"

LösungSolution

🟢 Fire & Forget. Niedriges Risiko, Agent kann es gut. Du liest die Zusammenfassung und merkst schnell ob etwas fehlt.🟢 Fire & Forget. Low risk, agent does this well. You read the summary and quickly notice if something's missing.

2. "Schreibe eine Antwort auf die Beschwerde von Kunde X"2. "Write a response to customer X's complaint"

LösungSolution

🟡 Draft & Review. Hohes reputationelles Risiko. Agent entwirft, du prüfst Ton, Fakten und Angemessenheit vor dem Senden.🟡 Draft & Review. High reputational risk. Agent drafts, you review tone, facts, and appropriateness before sending.

3. "Lösche alle Dateien älter als 90 Tage im Archiv-Ordner"3. "Delete all files older than 90 days in the archive folder"

LösungSolution

🟠 Plan & Approve. Irreversibel! Agent erstellt eine Liste der Dateien die gelöscht würden. Du prüfst die Liste. Dann erst Freigabe.🟠 Plan & Approve. Irreversible! Agent creates a list of files that would be deleted. You review the list. Only then approve.

4. "Sollen wir unser Preismodell ändern?"4. "Should we change our pricing model?"

LösungSolution

⛔ Mensch only. Strategische Entscheidung. Agent kann Marktdaten sammeln und Optionen aufbereiten, aber die Entscheidung liegt bei dir.⛔ Human only. Strategic decision. Agent can gather market data and prepare options, but the decision is yours.

🔍 Spot the Error🔍 Spot the Error

Ein Agent hat folgenden Bericht geschrieben. Finde das Problem:An agent wrote the following report. Find the problem:

"Die Analyse zeigt eindeutig, dass Lösung A besser ist als Lösung B. Die Kosten sind 30% niedriger, die Performance 2x höher, und die Integration dauert nur 2 Wochen. Empfehlung: Sofort mit Lösung A starten.""The analysis clearly shows that Solution A is better than Solution B. Costs are 30% lower, performance is 2x higher, and integration takes only 2 weeks. Recommendation: Start with Solution A immediately."

Lösung aufdeckenReveal solution

Mehrere Probleme:Multiple problems:

Keine Quellen. Woher kommen die 30%? Wer hat die Performance gemessen?No sources. Where do the 30% come from? Who measured the performance?
Keine Einschränkungen. "Eindeutig" und "sofort" bei einer komplexen Entscheidung? Verdächtig.No caveats. "Clearly" and "immediately" for a complex decision? Suspicious.
Keine Risiken genannt. Was spricht für Lösung B? Was sind die Migrationsrisiken?No risks mentioned. What speaks for Solution B? What are the migration risks?
Agent trifft eine Entscheidung. "Sofort starten" ist eine strategische Empfehlung die dem Menschen gehört.Agent makes a decision. "Start immediately" is a strategic recommendation that belongs to the human.

Lesson: Wenn ein Agent zu einem eindeutigen Ergebnis kommt ohne Gegenargumente zu nennen, fehlt wahrscheinlich die Hälfte der Analyse.Lesson: When an agent reaches a clear-cut conclusion without mentioning counterarguments, half the analysis is probably missing.

💭 ReflexionsfragenReflection Questions

Denk an eine Situation in der du jemandem vertraut hast und enttäuscht wurdest. Welche Warnsignale hättest du bemerken können? Gelten ähnliche Warnsignale für Agents?Think of a situation where you trusted someone and were disappointed. What warning signs could you have noticed? Do similar warning signs apply to agents?
Auf welcher Stufe der Trust Ladder wärst du heute mit einem Agent? Was müsste passieren damit du eine Stufe höher gehst?Which level of the Trust Ladder would you be on today with an agent? What would need to happen for you to go one level higher?
Ein Agent schreibt: "Ich bin mir nicht sicher, aber..." — vertraust du diesem Agent mehr oder weniger als einem der sagt "Die Antwort ist definitiv..."? Warum?An agent writes: "I'm not sure, but..." — do you trust this agent more or less than one that says "The answer is definitely..."? Why?

📌 Was du mitnimmst📌 Key Takeaways

"Confidently Wrong" ist das Kernproblem — Agents zweifeln nicht, auch wenn sie falsch liegen."Confidently Wrong" is the core problem — agents don't doubt, even when they're wrong.
Fünf Delegationsstufen von "Fire & Forget" bis "Mensch only" — die richtige Stufe hängt von Risiko und Verifizierbarkeit ab.Five delegation levels from "Fire & Forget" to "Human only" — the right level depends on risk and verifiability.
Die Trust Ladder: Vertrauen wird verdient, nicht konfiguriert. Keine Stufe überspringen.The Trust Ladder: Trust is earned, not configured. Don't skip levels.
"Erfolgreich" ≠ "Korrekt": Agents prüfen Aktionen, nicht Ergebnisse. Die Verifikation ist dein Job."Successful" ≠ "Correct": Agents verify actions, not results. Verification is your job.
Kalibriertes Vertrauen schlägt blindes Vertrauen UND blindes Misstrauen.Calibrated trust beats blind trust AND blind distrust.

← Modul 1: Der Agent in AktionModule 1: The Agent in Action Modul 3: Wie Agents zusammenarbeiten →Module 3: How Agents Collaborate →

✅ Kannst du jetzt...✅ Can you now...

...die fünf Delegationsstufen unterscheiden?...distinguish the five delegation levels?
...erkennen wann ein Agent "confidently wrong" ist?...recognize when an agent is "confidently wrong"?
...für eine Aufgabe die passende Vertrauensstufe wählen?...choose the appropriate trust level for a task?

Wenn ja: Weiter zu Modul 3 →If yes: Continue to Module 3 →

📚 Quellen & Referenzen📚 Sources & References

Bansal, G. et al. (2021). Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance. CHI 2021. [Tier A]
Lee, J. & See, K. (2004). Trust in Automation: Designing for Appropriate Reliance. Human Factors 46(1). DOI [Tier A]
Parasuraman, R. & Riley, V. (1997). Humans and Automation: Use, Misuse, Disuse, Abuse. Human Factors 39(2). [Tier A]
Goddard, K. et al. (2012). Automation Bias: A Systematic Review. JAMIA 19(1). PMC (Open Access) [Tier A]
Horvitz, E. (1999). Principles of Mixed-Initiative User Interfaces. CHI '99. PDF [Tier A]
IBM (2025). What Is Human In The Loop (HITL)? ibm.com [Tier B]