Experience Inc. Jobs

Job Information

Cribl, Inc Senior Site Reliability Engineer in Jackson, Mississippi

This is a Job Description for a Senior Site Reliability Engineer in Jackson, Mississippi

Summary: What does that mean? It means we are a serious company that doesn't take itself too seriously; and we're looking for people who love to get stuff done and laugh a bit along the way. We're growing rapidly - looking for collaborative, curious, and motivated team members who are passionate about putting customers first. As a remote-first company we believe in empowering our employees to do their best work, wherever they are. As the data engine for IT and Security many of the biggest names in the most demanding industries trust Cribl to solve their most pressing data needs. Ready to do the best work of your career? Join the herd and unlock your opportunity.

Duties & Responsibilities:

Engage with teams and improve service delivery and reliability across their entire lifecycle. Measure and monitor all production systems with an eye towards availability, latency, and overall system health. Seek out the cause of errors and instability in our production cloud services and drive teams towards better operational excellence. Engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability. Help Identify and drive down toil with creative innovation and automation. Participate in On-call responsibilities

 

Requirements

and Qualifications[]{#Hlk142289191}[]{#Hlk142304825}

:

 Extensive experience with enterprise scale continuous delivery environments. 5+ years of experience as a DevOps or SRE. Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment. Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible. Experience with sustainable incident response in a blameless environment. Knowledge of cloud platforms (prefer AWS) and container + orchestration technologies. Experience with APM and Observability and related tools such as, New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry etc. Background in Linux Systems Engineering. Experience with Incident response related tools for instance, PagerDuty, Fire Hydrant, Blameless etc. Comfortable with a high level of autonomy and working with a distributed team

Equal Opportunity/Affirmative Action Employer.

 

DirectEmployers