Zscaler is a pioneer and global leader in zero trust security. The world's largest businesses, critical infrastructure organizations, and government agencies rely on Zscaler to secure users, branches, applications, data & devices, and to accelerate digital transformation initiatives. Distributed across more than 160 data centers globally, the Zscaler Zero Trust Exchange platform combined with advanced AI combats billions of cyber threats and policy violations every day and unlocks productivity gains for modern enterprises by reducing costs and complexity.
Here, impact in your role matters more than title, and trust is built on results. We believe in transparency and value constructive, honest debate: we're focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership, and accountability.
We champion an AI Forward, People First philosophy to help us accelerate and innovate, empowering our people to embrace their potential. If you're driven by purpose, thrive on solving complex challenges and want to make a positive difference on a global scale, we invite you to bring your talents to Zscaler to help shape the future of cybersecurity.
Role
We are looking for a Principal DevOps Engineer to join our team. This role is hybrid and based in our San Jose, CA office three days a week, reporting to the Architect, Software Development Engineering within the Emerging Tech department.
You will architect and manage the global cloud infrastructure for our Zero Trust Networking Services, owning the delivery pipeline and operational health of a massive-scale, multi-region distributed system. Bridging the gap between Network Engineering and Cloud Operations, you will orchestrate highly scalable, distributed control and data plane services in the public cloud.
What You'll Do (Role Expectations)
- Design and implement a multi-region AWS architecture and lead the development of modular Terraform libraries to automate provisioning across diverse geographies
- Architect self-healing infrastructure using advanced cloud load balancing, auto-scaling patterns, and Multi-AZ database topologies to ensure high availability
- Modernize CI/CD pipelines and implement Blue/Green and Canary deployment strategies to ensure zero-downtime upgrades for a continuously running global network service
- Build comprehensive SRE dashboards and implement intelligent alerting frameworks to detect regional outages or capacity exhaustion before they impact customers
- Monitor cloud resource utilization and implement scaling policies that perfectly balance performance requirements with cost-efficiency
Who You Are (Success Profile)
- You thrive in ambiguity. You're comfortable building the path as you walk it. In a dynamic environment, you see ambiguity not as a hindrance but as the raw material to build something meaningful.
- You act like an owner. Your passion for the mission fuels your bias for action, and you operate with integrity because you genuinely care about the outcome. You adapt to what's needed, navigating seamlessly between high-level strategy and hands-on execution.
- You are a problem-solver. You seek out challenges because you are energized by finding solutions, knowing that solving the hard problems delivers the biggest impact.
- You are customer-obsessed. You build deep empathy for the customer, both internal and external, and anchor your decisions in solving their real-world problems. You champion their needs from start to finish, knowing their success is our success.
- You operate with urgency. You understand that in a high-growth environment, speed and quality are not mutually exclusive. You have a relentless focus on execution and a bias for action, delivering high-impact results quickly to win for the customer and the team.
What We're Looking For (Minimum Qualifications)
- 12+ years of overall experience in Software Engineering, DevOps, or Site Reliability Engineering combined with a BS/MS in Computer Science or relevant field
- Deep mastery of AWS services, specifically in Cloud Networking constructs (VPC, Transit Gateway), Global Traffic Management, and managing large-scale elastic compute environments
- Proven expertise in architecting AWS-managed SQL and NoSQL data stores, with a focus on designing scalable local and multi-region deployment strategies
- Advanced expertise in Infrastructure as Code using Terraform and expert proficiency in Python or Go for building automation tooling
- Strong knowledge of Linux/BSD internals, observability stacks (Prometheus, InfluxDB), and security compliance (PKI, IAM, DevSecOps)
What Will Make You Stand Out (Preferred Qualifications)
- Experience managing large-scale distributed systems or IoT-style connectivity platforms
- Background in Network Engineering or working with Data Plane forwarding technologies
- Experience with Chaos Engineering methodologies in a public cloud environment
Base Pay Range: $182,000 - $260,000 USD
Benefits
- Time off plans for vacation and sick time
- Parental leave options
- Retirement options
- In-office perks, and more!
Pay Transparency
Zscaler complies with all applicable federal, state, and local pay transparency rules.
Zscaler is committed to providing equal employment opportunities to all individuals. We strive to create a workplace where employees are treated with respect and have the chance to succeed. All qualified applicants will be considered for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status, or any other characteristic protected by federal, state, or local laws.