25 days old

Site Reliability Engineer - Applied Machine Learning (Search)

Austin, TX
  • Job Code


Posted: May 7, 2019

Weekly Hours: 40

Role Number: 200054952

Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.

Apple's Applied Machine Learning team has built systems for a number of large-scale data science applications. We work on many high-impact projects that serve various Apple lines of business. We use the latest in open source technology and as committers on some of these projects, we are pushing the envelope. Working with multiple lines of business, we manage many streams of Apple-scale data. We bring it all together and extract the value. We do all this with an exceptional group of software engineers, data scientists, dev-ops engineers and managers

Key Qualifications

  • Strong Infra Concepts in Networking ( VIP, GSLB, DNS, DHCP, CDN, Layer 4, Layer 7, for example Linux System Administration ( FSU, monitoring CPU, Memory, etc. )
  • Strong knowledge of Search Technologies, such as Endeca, InQuira, Solr and Lucene
  • Strong knowledge of managing Java, Node.js based applications with respect to deploying, debugging, securing ( TLS, Trust Stores and Key Stores )
  • Strong knowledge of DevOps philosophy and hands-on experience with one or more of technologies including Saltstack, Ansible, Spinnaker, Terraform, Cloud Formation and enabling workflows and pipelines using one or more of Jenkins, Rundeck with absolute emphasis on CI/CD pipeline using technologies such as Docker deploying on hybrid cloud and baremetal simultaneously
  • Strong fundamentals of Hadoop technologies, including HDFS, Hive, Oozie, Spark, PySpark, or similar Apache Kafka based ecosystem


As an engineer on this team, you will participate in the design and architecture of variety of apps the team manages. You will provide the Infra, SRE and DevOps perspective to the design and architecture and steer the dev and DS team to produce applications that are disaster-proof, highly available, run at Apple-scale with absolutely no downtime while constantly exceeding the SLA. You will help them deliver the applications with minimal time-to-market at precisely the resource footprint with elasticity, while ensuring absolutely tight and robust security, privacy and confidentiality.
- Design, develop, unit test, code review, build and produce the deployment artifacts
- Deploy the application and the APIs to several environments, following development to production strategy
- Maintain and support, solicit user feedback, support requirement gathering, evangelize the adoption of the application and the APIs
- Perform manual tasks including data population, backfills, deployments as needed but strive to provide automation
- Provide on-call support for production issues
- Learn and foster the development philosophy and the team culture

Education & Experience

BS in computer science with 7-10 years or MS plus 5-7 years experience or related experience

Additional Requirements

  • - Absolute strength in working on a team with several tasks in play simultaneously, constantly assessing and delivering on the always changing priorities
  • - AWS, Azure, GCP experience
  • - Data Science technologies
  • - Coding experience in Java, React.js, Node.js, Python, Golang
  • - Project Management
  • - Clear and concise documentation skills

Posted: 2019-07-29 Expires: 2019-08-27

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Site Reliability Engineer - Applied Machine Learning (Search)

Apple, Inc.
Austin, TX

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast