Responsibilities:
- Operate in a 24x7, 365-day support model
- Own root cause analysis for complex infrastructure and application issues. Identify and drive strategies to prevent recurrence.
Minimum qualifications:
- 5+ years of experience building complex distributed systems
- Experience working in a high-profile, high-stress environment managing SLAs/SLOs.
- Practical knowledge of scripting languages
- Practical MongoDB query and troubleshooting experience.
- Experience working in a serverless environment.
- Experience with Microsoft Intune
- Experience using Postman
- In-depth understanding of the Google SRE handbook.
- Experience working with New Relic, CloudFront, Kibana, and other monitoring tools.
- Strong written and verbal communication skills are a must.
- Strong understanding of CI/CD tools and real-world experience using them