Site Reliability Engineering (SRE) is a discipline that combines aspects of software engineering and applies them to operations whose goal is to create scalable and reliable software systems. SRE teams are responsible for the reliability, performance, scalability, and monitoring of software systems. The SRE culture originated in the early 2000s […]
Read MoreBlogposts
Highways, Cars, and Navigators: Decoding My Job for the Non-Techie!
I received a message from a friend admitting he’s unclear about what I do professionally. I’ve written this article to simplify and explain the daily responsibilities of an SRE, System Architect, and Infrastructure Engineer.
Read MorePrometheus and Thanos: A Symbiotic Relationship
In my journey with monitoring and alerting tools, I’ve come to deeply appreciate Prometheus. Its real-time monitoring capability feels like having a pulse on your systems. But, just like any good story, our hero, Prometheus, has its Achilles’ heel. I remember the first time I loaded it with a ton […]
Read MoreSecuring the Web, One Site at a Time: Our Project to Stop Bad Actors
As the internet continues to grow, so do the number of bad actors, such as hackers and evildoers, who are determined to exploit websites for their gain. This can be a significant problem, not only for the website owners but also for the end-users who rely on these sites. In […]
Read MoreBest efforts to OSS
As an engineer/systems architect, you must keep up-to-date with the latest trends, technologies, and best practices in your field. One of the most effective ways to do that is by writing technical write-ups and sharing your knowledge with the community. However, sometimes it’s easier said than done, and there might […]
Read MoreHow I brought down production
I figured every SRE / Systems / DevOps / Infrastructure Engineer brings down production system some time or the other. So did I, that too on a monday morning at 11:00 am. I think this might actually be the biggest highlight of my career till now. I’ve solved countless problems […]
Read MoreAWS: DynamoDB + API Gateway – The Correct Way
I went through the hard way so you don’t. This is not a tutorial but more of a post to skip all step to fetch and clean data with DynamoDB + API Gateway.Before I say anything, I want to thank aws for creating this and second scrutinize them to not […]
Read MoreBuilding projects to give it away, the truth about side projects.
I always loved building stuff. Small or big side projects, it didn’t really matter to me. From simple websites to using tensorflow’s inception V3 model to train images to detect fruits and objects for the blind, I did it all. Most of it was for the sheer fun of learning […]
Read MoreSRE: What I think happened to Robinhood trading app when it went down
I was waiting for Robinhood to post a postmortem but they aren’t and have never been transparent to post a public postmortem and shoot themselves in the foot by being in a sector to lose customers on a post. And the post by the founders is the most bizarre thing […]
Read More