
Website Netflix
Job Description:
The Critical Operations and Reliability Engineering team’s goal is to drive customer joy by thoughtfully managing risk and minimizing impact across Netflix. We do this through cross-functional engagement with other engineering teams, managing issues when they happen, as well as promoting reliability and resilience practices throughout the organization. Our team is seeking individuals with a broad set of technical skills with an impressive history of unique career and life experiences to bring diverse views to our team. This role is rewarding for people who can collaborate in a complex environment.
Job Responsibilities:
- Robust communication with team members and customers
- Increase our reliability through an automation focused mindset to solving problems
- Collaboration, continuous improvement, and iteration as the path forward
- Develop deeper insights into the quality of experience for our customers
- Improve availability, reliability, and observability of Netflix services and reduce the burden of human toil with tooling and automation
- Engage with product teams to diagnose and correct operational surprises
- Analyze complex systems from a reliability and resilience perspective
- Curiosity about how complex socio-technical systems successfully operate at scale when failure is inevitable
- Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks
- The ability to develop alignment to cultivate relationships and driving impact
- Form and maintain relationships with internal and external partners
- Identify sources of instability in distributed systems and drive operational excellence
Job Requirements:
- Involvement with incident management and response
- Development with Python, Go, Java, or JavaScript/Node.js
- Knowledge of cloud platforms like AWS and microservices architectur
Job Details:
Company: Netflix
Vacancy Type: Full Time
Job Location: Baton Rouge, LA, US
Application Deadline: N/A
vacanciesforyou.net