Job Information
Confluent Staff Site Reliability Engineer - Incident Management (Remote - Canada) in Ontario, Canada
With Confluent, organizations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We're creating an entirely new category within data infrastructure: data streaming. This technology will allow every organization to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.
One Confluent. One team. One Data Streaming Platform.
Data Connects Us.
About the Role:
Do you have a passion for data that can turn events into outcomes, enabling intelligent, real-time apps and empowering teams and systems to act on data instantly? Have you ever dreamt about the opportunity to work with key agencies of the public sector? Confluent's team of Site Reliability Engineers will allow you to do just that by putting you in the driver's seat to deliver highly performant, reliable systems that enable prominent public sector agencies to make real-time decisions with their data and solve real-time problems through Confluent Cloud. Confluent Cloud delivers a complete end-to-end streaming experience as a Software as a Service (SaaS) model.
What You Will Do:
Partner with our Cloud Architecture and Engineering teams to build upon the operational resiliency of the Confluent Cloud systems
Collaborate broadly across teams to verify and deploy production changes to Confluent Cloud systems and infrastructure
Be an active partner with peer engineering teams, engaging during incidents and driving towards positive outcomes for our customers
Maintain critical monitoring used for triage and escalations in the federal space and improve upon automated recovery
Adhere to established incident and change management processes and help drive continuous improvements
Communicate clearly in writing and verbally, drawing on experience communicating with Enterprise Customers
What You Will Bring:
10+ years of relevant experience
Expertise in Cloud Native technologies with experience operating production services in the cloud
Strong fundamentals of Distributed Systems and their design
Deep knowledge of Kubernetes and containerization
Experience with telemetry tooling to monitor production systems
Confidence with problem-solving and troubleshooting critical services
Proficiency with scripting and automation (e.g., Go, Java, Python, Bash)
Working knowledge of infrastructure as code (e.g., Terraform, CloudFormation, AWS CDK, Pulumi)
Exceptional teamwork and collaboration skills, and the ability to think critically and act with minimal supervision at times in a remote-first environment
Experience with a rotating on-call schedule to provide 24/7 support
BS degree in Computer Science or Engineering, or equivalent experience
Come As You Are
At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.
Review our Candidate Privacy Notice (https://www.confluent.io/legal/confluent-candidate-privacy-notice/), which describes how and when Confluent, Inc. and its group companies collect, use, and share certain personal information of California job applicants and prospective employees.