22 days old

Infrastructure and DevOps Engineer

Santa Clara, CA 95050
  • Job Code
Job Description

We are looking for a candidate with deep technical experience in creating and maintaining High Performance Engineering Computing at scale (tens of thousands of servers, petabytes of storage and high performing network topology) in multiple secured enclaves.

You will be part of a team responsible for designing, building, maintaining, and supporting this complex highly available and reliable High Performance Engineering Computing (HPC) center of excellence environment for Intel's fast-growing business supporting both internal and external customers.

An ideal candidate will have technological team leadership experience in Systems Administration, Automation and Scripting.

As a Site Reliability Engineer your responsibilities include, but will not be limited to:

  • Exploring and evaluating new technologies and solutions to push the capabilities forward, getting ahead of customers' needs.
  • Communicate concepts at different levels of abstraction to exercise influence across multiple levels of the organization.
  • Share results from incident investigations to a wide Information Technology audience through a blameless postmortem process, with the goal of exposing faults so they are fixed instead of leaving issues unresolved.
  • Execute a change through an enterprise environment with consistency and reliability by applying modern software, operations and quality principles such as progressive rollouts, problem detection and rollbacks if needed.
  • Perform a thorough initial and regular health checks of the Storage and Backup systems and implement necessary fixes.
  • Create, maintain best known configurations for environment, incidents, and their known resolutions.
  • Create daily monitoring and error checklists on the High Performance Engineering Computing environment and train users in best practices.

In addition, the ideal candidate should also have the following skills:

  • Strategic engineering and design acumen with hands-on, technical work and problem-solving skills
  • Passion for quality and automation
  • Skills to handle complex systems
  • Desire for continual improvement and innovation


Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.

Minimum Qualifications:

  • Bachelor's degree in Computer Science, Electrical Engineering or any other related technical field

6+ years of experience with:

  • Large Enterprise NAS environment, including NetApp
  • Storage protocols including one or more of the following: NFS, SMB/CIFS, iSCSI, and/or FCP Automation, troubleshooting and maintaining Python/Bash/PowerShell scripts
  • NetApp products especially FAS /AFF storage devices

This position is not eligible for Intel immigration sponsorship.

Inside this Business Group

Intel's Information Technology Group (IT) designs, deploys and supports the information technology architecture and hardware/software applications for Intel. This includes the LAN, WAN, telephony, data centers, client PCs, backup and restore, and enterprise applications. IT is also responsible for e-Commerce development, data hosting and delivery of Web content and services.

Intel strongly encourages employees to be vaccinated against COVID-19. Intel aligns to federal, state, and local laws and as a contractor to the U.S. Government is subject to government mandates that may be issued. Intel policies for COVID-19 including guidance about testing and vaccination are subject to change over time.

Posting Statement

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site.

Posted: 2022-04-27 Expires: 2022-05-28

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Infrastructure and DevOps Engineer

Santa Clara, CA 95050

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast