I agree that GenAI reliability entails new concepts and techniques. I also think it has much more in common with existing systems than not. Consistently asserting that every new technology is a discontinuity severely limits our collective ability to learn.
"GenAI cloud services demand a new approach to reliability engineering. Traditional monitoring, triage, and mitigation processes aren’t enough."
https://rdel.substack.com/p/rdel-91-what-makes-production-incidents