Towards Trustworthy LLMs

Understanding Limits, Advancing Capabilities, Ensuring Safety

A deep dive into Cohere Labs’ Nouha Dziri’s
keynote on building trustworthy large language models,
exploring the challenges and strategies for ensuring reliability and safety.
odsc
ai
safety
Author

Oren Bochman

Published

Tuesday, April 28, 2026

Modified

Tuesday, May 19, 2026

Keywords

Trustworthy LLMs, Nouha Dziri, AI Safety, Reasoning, Reliability

The Data-Resistant Mind-The Psychology Every Data Scientist Needs to Make Their Work Matter by Sebastian Wernicke, Oxera Consult

  • Sebastian Wernicke
  • Oxera Consulting LLP
NoteNotes
  • Main claim
    • Data alone rarely changes minds; in many organizational settings, it can even harden existing decisions.
    • Data scientists should treat their role not only as analysis, modeling, and validation, but as participation in decision-making.
  • Opening example
    • The speaker describes a logistics project in Southeast Asia where route optimization could save up to 20% in fuel.
    • Even though the result was valuable and technically sound, stakeholders returned to their old behavior after the presentation.
    • This illustrates the central frustration: excellent analysis often fails to create organizational change.
  • Why data does not change minds by itself
    • Decisions are often already forming before the data science team presents its results.
    • The speaker uses a neuroscience example: decision signals accumulate over time, and late contradictory information has less influence.
    • In organizations, stakeholders may already have informal commitments, expectations, fears, and incentives before the analysis arrives.
    • By the time the data is presented, the decision may already be “in motion.”
  • Mismatch between analysis and decision needs
    • Data scientists are comfortable with uncertainty, ambiguity, multiple interpretations, and probability distributions.
    • Decision-makers often expect clear answers: yes/no, proceed/stop, invest/do not invest.
    • A statement like “65% chance of success” may sound informative to a data scientist but unusable to a decision-maker.
    • The speaker’s metaphor: decision-makers want a flashlight, but data scientists often bring an MRI.
  • Data scientists’ own bias
    • Data scientists are biased toward data and machine-learning solutions because that is their craft and identity.
    • The speaker gives an example of a metals manufacturer that wanted an algorithm to infer which production batch a part came from.
    • A simpler physical marking system may have been better than a complex predictive model.
    • The lesson: the diagnostic moment is often at project inception, not at the final presentation.
  • Reframing the data scientist’s job
    • The job is not merely to provide analysis.
    • The job is to participate in the architecture of decisions.
    • Technical architecture includes ingestion, transformation, modeling, validation, and deployment.
    • Decision architecture asks how decisions form in human organizations and how analysis can enter that process effectively.
  • Practical recommendation 1: start earlier
    • Data scientists should “push left” and get involved before the problem is fully framed.
    • They should interrogate the problem, the stakeholders, the assumptions, and the desired change.
    • A key question is: “What concrete change has to happen for this project to count as resolved?”
    • This can prevent technically successful but organizationally useless projects.
  • Practical recommendation 2: map the real decision room
    • The “room” includes all stakeholders who influence or block the decision, not only those in the kickoff meeting.
    • Data scientists should ask:
      • Who is involved?
      • What are they measured on?
      • What are they afraid of?
      • Who has implementation capacity?
    • Stated concerns about “methodology” may actually reflect incentives, workload, distrust, risk, or fear of losing control.
  • Practical recommendation 3: translate findings into decision structure
    • Do not merely report probabilities or charts.
    • Translate findings into conditions, actions, risks, and monitoring signals.
    • Instead of saying “there is a 65% chance this works,” say:
      • it works in about two-thirds of market conditions;
      • it fails under a specific condition;
      • here is the indicator to monitor.
    • This preserves the statistical truth while making it actionable.
  • Q&A: best question to predict project success
    • The speaker says the best question is: “What do you want to change at the end of this?”
    • Good answers reveal intended action, stakeholders, and implementation path.
    • Bad answers include “we just want results” or “we need to prove another department wrong.”
  • Q&A: career progression
    • Junior data scientists are often judged on technical correctness: cleaning, modeling, statistics.
    • Senior data scientists are judged more by business impact.
    • The speaker’s shift came from frustration: good analysis was not producing enough real-world effect.
  • Q&A: multiple departments
    • When departments have conflicting goals, the data scientist must identify which stakeholder they are effectively serving.
    • Sometimes that is the person funding the work, but not always.
    • The important thing is to make a decision transparently rather than drift between incompatible stakeholder agendas.
  • Q&A: making results survive translation
    • Put the analysis into the language of the business.
    • Translate numbers into decision options, effects on business metrics, career incentives, customer behavior, churn, demand, or operational risk.
    • The point is not to dilute the analysis, but to make it usable in the decision-maker’s frame.
  • Q&A: how to practice
    • The speaker recommends mentorship from experienced decision-makers.
    • Ask them how a finding will land, how to frame it, and what a stakeholder is likely to hear.
    • Practicing with people who actually make decisions is presented as one of the best ways to develop this skill.
  • Overall takeaway
    • Data science impact depends on joining the decision process early, understanding stakeholder incentives, and communicating results in a form that can guide action.
    • These are often dismissed as “soft skills,” but the speaker argues they are part of the real job of data science.

Reflection

Citation

BibTeX citation:
@online{bochman2026,
  author = {Bochman, Oren},
  title = {Towards {Trustworthy} {LLMs}},
  date = {2026-04-28},
  url = {https://orenbochman.github.io/posts/2026/04-30-ODSC-AI-2026-Day-3/talk8.html},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2026. “Towards Trustworthy LLMs.” April 28. https://orenbochman.github.io/posts/2026/04-30-ODSC-AI-2026-Day-3/talk8.html.