LLM — if misapplied
Calling automation or ML "AI-powered" sets expectations of reasoning the system can't meet. Users attempt edge cases it can't handle. Trust erodes faster than it was built.
Using an LLM for something deterministic burns tokens on every call. At scale that's a real line item — you're paying inference cost for a lookup that should be a query.
Primary failure: Expectation gap → trust collapse + unnecessary compute cost
ML — if misapplied
Using an LLM where a trained model is needed introduces unpredictability into decisions that require consistency. Risk scores that vary by phrasing are not risk scores — they're opinions.
You're also paying for variability you didn't want. A trained model runs inference cheaply and consistently. An LLM doing the same job costs more per call and produces less stable output.
Primary failure: Inconsistency → liability + inflated inference cost
Automation — if misapplied
Labeling rule-based logic as AI inflates perceived value short term. When users discover the system is a conditional, not a model, they recalibrate downward — and take other AI claims with them.
The cost here is in over-engineering. Teams build LLM scaffolding, evaluation pipelines, and prompt infrastructure around something that should be an if/then statement. That's weeks of work and ongoing maintenance on a problem that didn't need a model.
Primary failure: Credibility debt + over-engineering cost
UX Debt — if ignored
Layering AI onto unresolved structural problems compounds complexity. Every AI interaction that fails because the underlying IA is broken trains users to distrust AI rather than the design decision that preceded it.
You've also now funded two problems instead of one. The IA still needs fixing, and you've added an AI layer that will need its own maintenance, evaluation, and iteration — on top of a foundation that was never sound.
Primary failure: Compounding confusion + compounding cost