“As SREs, our teams are not just responsible for monitoring and toil reduction. We also need to be part of the application architecture discussions to help identify critical dependencies and build mitigation plans, as well as help the software engineers identify potential failures that could bubble up through the system.”
https://medium.com/site-reliability-engineering-leadership/the-domain-of-failure-64bca144c94b