Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Props to them for actually updating their status page as issues are happening rather than hours later. I was working with claude code and hit an API error, checked the status page and sure enough there was an outage.

This should be a given for any service that others rely on, but sadly this is seldom the case.



Thank you! Opening an incident as soon as user impact begins is one of those instincts you develop after handling major incidents for years as an SRE at Google, and now at Anthropic.

I was also fortunate to be using Claude at that exact moment (for personal reasons), which meant I could immediately see the severity of the outage.


It's important for companies to use their own products.


Unless using your own dogfood prevents you from fixing it if it breaks

https://www.theguardian.com/technology/2021/oct/05/facebook-...

I have a memory that Slack fell into this trap too (I could be wrong)


Facebook notoriously had to cut open the doors to one of their data centers.

Google SRE still keeps IRC available in case of an emergency.


Now I’m imagining the folks at Slack gritting their teeth and using MS Teams


Take my condolences, Sunday outages are rough


Sweet. Hopefully it is more than instinct but a codified at Anthropic. I.e. a graduate engineer with little experience can assess and raise incident if needed.


Confusingly, I was trying to debug something with a 529, and this outage really had me going for a minute.


The 529 is coming from inside the house?!


Same as you and I was glad to see the status page - hit subscribe on updates

Claude user base believes in Sunday PM work sessions


As a solo bootstrapped founder, I take my sabbath sundown on Saturday to sundown on Sunday. Sunday evening therefore is generally the start of my work week.


Sunday PM builder, reporting in.


hah I ran out of tokens a bit before it hit I reckon.


same here, and I just got started, Hm..


Sunday? What is that?


Indeed! I checked their status page within 2 minutes of having issues and it was updated to show they had detected it.


Seldom? Most status pages I've seen do eventually get updated, just not within that first critical 3 minutes.


"There's a problem and we already know about it" is so much better than "there's a problem and we don't know about it and/or are hoping it will magically go away and that we won't be embarrassed".


"If we admit to it we may have to compensate per SLAs, so dishonesty it is!"




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: