Experience Inc. Jobs

Job Information

Microsoft Corporation Site Reliability Engineering in Multiple Locations, Costa Rica

With a constantly increasing number of public and sovereign clouds, the IC3 Data Platform team is looking for a curious and passionate SRE that can help us keep up with and improve the levels of automation, reliability, and optimization required by our world-class data processing services.

The IC3 Data Platform team collects telemetry data for calls and meetings of our Intelligent Conversation and Communications Cloud (IC3) services. IC3 services empower products like Skype and Microsoft Teams with over 300 million users world-wide enabling the Modern Workplace and Modern Life initiative. The telemetry data processed is used for call and meeting quality monitoring, troubleshooting, auditing, and product enhancements.

As a Site Reliability Engineer in IC3 Data Platform, you will help in major automation tasks for low-touch and safe deployments of complex architectures to Azure clouds in multiple regions in the world. You will help to improve the reliability of our infrastructure, platforms, and services. You will help put in place the right amount of monitoring and alerting and will be data-driven to identify and implement improvements in the infrastructure’s performance and efficiency. This opportunity will allow you to gain a deep understanding of Microsoft’s clouds infrastructure for public and special clouds and enable millions of users world-wide to have a better Modern Life. This position is fully remote so you can enjoy a healthy work-and-life balance.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

Technical Knowledge and Domain-Specific Expertise

  • Demonstrates end-to-end expertise in distributed systems design, interactions between cloud technology layers and components, functions of physical network devices, and dependencies at scale. Recommendsoptimal configurations of cloud technology solutions and develops or modifies the code base that defines infrastructures to improve the reliability and operability of supported products.Develops end-to-end technical expertise in the architecture, code, features, and operations of specific products as required to implement improvements in product availability, reliability, efficiency, observability, and/or performance.

  • Researches and maintains deep knowledge of industry trends as well as advances in large-scale distributed systems and cloud technologies; identifies opportunities to create, implement, and/or optimally utilize new tools, technologies, and/or processes to solve ambiguous problems and improve product availability, reliability, efficiency, observability, and/or performance.

Contributions to Development and Design

  • Leverages technical expertise in the infrastructure of large scale distributed systems and specific products, as well as objective insights drawn from analyses of production telemetry data to advocate for, or directly contribute to, changes to the code base to improve the availability, reliability, efficiency, observability, and performance of related sets of products developed and supported by teams within the organization.Develops, tests, and implements changes to optimize code and improve the observability, reliability and operability of platforms, systems, and products at scale. Reviews the effect of these changes to document and share development insights within their team.

Driving Operational Excellence

  • Develops code, scripts, systems, or platforms that automate moderately complex but repetitive operations processes at scale.

  • Identifiesoptimal uses for existing tools and/or models to identify contributing factors or points of failure that are affecting the availability, reliability, performance, and/or efficiency of systems, platforms, or products; or that affects the velocity of the team.Develops, maintains, and leveragescapacity planning models and monitoring tools to forecast product capacity and resource demands.Participates together with the team in regular on-call rotations andhelps with identifying the level of impact, troubleshooting complex issues, and deploying appropriate fixes to resolve root cause(s). Alerts product teams, owners, and leadership to issues with major customer/business impact.

Other

  • Embody our culture (https://careers.microsoft.com/us/en/culture) and values (https://www.microsoft.com/en-us/about/corporate-values)

Qualifications

Required Qualifications

  • 6+ years technical experience in software engineering, network engineering, or systems administration

  • OR Bachelor's Degree in Computer Science , Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration

  • OR Master's Degree in Computer Science , Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.

Additional or Preferred Qualifications

7+ years technical experience in software engineering, network engineering, or systems administration

  • OR Bachelor's Degree in Computer Science, Information Technology,

  • OR related field AND 4+ years technical experience in software engineering, network engineering,

  • OR systems administration

  • OR Master's Degree in Computer Science, Information Technology,

  • OR related field AND 3+ years technical experience in software engineering, network engineering

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .

DirectEmployers