SMART Modular Technologies, Inc. Sr. Managed Services Engineer in Any State in the United States/Telecommute, United States
Penguin Computing, a subsidiary of SMART Global Holdings (SGH), specializes in innovative Linux infrastructure, including Open Compute Project (OCP) and EIA-based high-performance computing (HPC) on-premise and in the cloud, AI, software-defined storage (SDS), and networking technologies, coupled with professional and managed services including sys-admin-as-a-service, storage-as-a-service, and hosting, as well as highly rated customer support.
Penguin Computing Managed Services provides dedicated, remote, Linux systems administration for complex, integrated environments involving high-performance computing, cloud, and enterprise systems. This position requires strong technical skills and the ability to understand, document, configure, administer, troubleshoot, and resolve issues in deployed environments. This is a customer-facing position. Successful candidates must have excellent communication skills, a friendly demeanor, the ability to work with others, and to remain calm, focused, and organized.
Essential Duties and Responsibilities
Support a Linux-based, high-performance computing (HPC) and artificial intelligence (AI) environment, featuring a wide range of technologies.
Render professional, timely, and expert user support.
Install, configure, and tune software applications.
Manage and maintain system infrastructure (hardware and software).
Troubleshoot software and hardware issues.
Manage the RMA process and coordinate support escalations for Penguin Computing and third-party hardware and software.
Fully document processes, procedures, and all work performed.
Develop automation and other tools to improve operations.
Participate in growing Penguin Computing's technical capabilities through knowledge-sharing and team activities.
Bachelor’s Degree in Computer Science, Computer/Electrical Engineering, or a related field (or equivalent experience).
7+ years of hands-on experience with UNIX/Linux server environments.
Strong Linux systems administration skills and experience with open source technologies.
Understanding of network technologies, architectures, and protocols.
Practical knowledge of software-defined storage architecture and administration.
Practical knowledge of implementation and administration of High-Performance Computing (HPC) technologies, including cluster resource management, job scheduling, etc.
Proven skills in software application build, installation, and optimization in Linux clusters and other environments.Ability to communicate clearly and effectively.
Red Hat Certified Systems Administrator (RHCSA), SUSE Certified Administrator (SCA), or equivalent.
Deep knowledge of OpenStack, Ceph, and WekaIO.
Strong knowledge of High Performance Computing (HPC) application development.
Strong familiarity with Kubernetes or Red Hat OpenShift.
Familiarity with data science tools, such as Jupyter notebooks.
Familiarity with accelerated computing technologies (e.g., GPUs).
Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
The employee is regularly required to talk, listen, sit, and stand.
Moderate to extensive keyboard, computer terminal activity required.
Lifting and carrying (up to 25 lbs.) is required.
Travel up to 30% may be required.
Interview process includes completing our employment application forms. All offers of employment are contingent upon a successful background check, and proof of legal status to work in the U.S. We utilize E-Verify.
Penguin Computing is an Affirmative Action/Equal Opportunity Employer and is strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to sex, age, national origin, race, ethnicity, creed, sexual orientation, gender identity, veteran status, disability, or any other characteristic protected by law.
Requisition ID: 2020-1115
External Company URL: https://www.penguincomputing.com
Street: Remote- Any State in the United States/Telecommute