29 days old

R&D, Computer Science (Experienced)

Albuquerque, NM 87102

Are you passionate about your work and dream of utilizing state-of-the-art facilities to explore solutions? Do you want to join a dynamic team that seeks to revolutionize the field of High Performance Computing (HPC) analysis and operations?

We are seeking a computer science R&D professional to join a team developing new software and new operational analytics for high performance computing (HPC) Architectures.

You will enjoy innovating and collaborating with a team researching and developing HPC Monitoring, Performance Analysis, and Response solutions in order to provide advanced, data-focused operations and efficient utilization. The team authors the open-source, R&D 100 Award-winning Lightweight Distributed Metric Service (LDMS) which is used for monitoring several of the largest HPC systems in the world.

## on any given day you may be called upon to:

+ Design and develop software for extreme-scale data collection and analysis to assess system and application performance
+ Develop and deploy analysis techniques to detect and classify operational conditions that bottleneck user application performance.
+ Develop data presentations and automated response techniques to enable more efficient computing based on analysis outcomes
+ Work with internal and external organizations operating large-scale HPC systems to deploy monitoring solutions and utilize them for performance understanding
+ Publish and present research results at peer-reviewed conferences


+ MS + 2 years experience or PhD in relevant STEM discipline
+ 5 years of experience programming in C, C++, and/or Python
+ You have experience programming in Unix/Linux environments
+ A record of peer-reviewed publication of results and/or external presentations at scientific conferences
+ Ability to obtain and maintain a DoE Q clearance


+ Experience developing in Jupyter Notebooks and with NumPy
+ Experience using and/or developing statistical data analysis and/or machine learning techniques (e.g. PCA, scikit-learn, TensorFlow) for significantly sized datasets
+ Experience developing large-scale codes in a multi-developer, open-source software environment
+ Experience developing middleware for HPC systems, including consideration of resilience, memory, scalability, and CPU footprint
+ Experience doing performance analysis studies of software and applications on HPC system architectures, particularly for advanced processors and/or networks
+ Familiarity building and running applications in HPC system environments
+ Experience as a system administrator in Unix/Linux Environments
+ Experience with HPC monitoring technologies, such as LDMS, Elastic Search, Kafka, and LogStash.
+ Experience developing unit and regression tests and running such tests within frameworks, such as Jenkins
+ Current DOE Q security clearance

Department Description:

The High Performance Computing (HPC) Development Department develops creative solutions for the operation and efficient utilization of leading and next-generation computing systems. The Heterogeneous Advanced Architecture testbed Platforms (HAAPs) represent small instances of the most currently available technology in computing so that code developers and computer science researchers can test and evaluate advanced processors and accelerators. Our Advanced Technology Systems (ATS) testbeds enable porting of codes within Sandia's network environment in preparation to run production calculations on extreme-scale platforms.. The HPC Monitoring and Analysis team develops monitoring, analysis, and response software and methodologies to enable new insights into the performance and utilization of the platforms and the applications running on them.

About Sandia:

Sandia National Laboratories is the nations premier science and engineering lab for national security and technology innovation, with teams of specialists focused on cutting-edge work in a broad array of areas. Some of the main reasons we love our jobs:

+ Challenging work withamazingimpact that contributes to security, peace, and freedom worldwide
+ Extraordinary co-workers
+ Some of the best tools, equipment, and research facilities in the world
+ Career advancement and enrichment opportunities
+ Flexible schedules, generous vacations,strongmedical and other benefits, competitive 401k, learning opportunities, relocation assistance and amenities aimed at creating a solid work/life balance*

_World-changing technologies. Life-changing careers._ Learn more about Sandia at: http://www.sandia.gov

*These benefits vary by job classification.

Security Clearance:

Position requires a Department of Energy (DOE) Q-level security clearance.

Sandia is required by DOE to conduct a pre-employment drug test and background review that includes checks of personal references, credit, law enforcement records, and employment/education verifications. Applicants for employment must be able to obtain and maintain a DOE Q-level security clearance, which requires U.S. citizenship. If you hold more than one citizenship (i.e., of the U.S. and another country), your ability to obtain a security clearance may be impacted.

Applicants offered employment with Sandia are subject to a federal background investigation to meet the requirements for access to classified information or matter if the duties of the position require a DOE security clearance. Substance abuse or illegal drug use, falsification of information, criminal activity, serious misconduct or other indicators of untrustworthiness can cause a clearance to be denied or terminated by DOE, resulting in the inability to perform the duties assigned and subsequent termination of employment.

EEO Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.


Posted: 2019-11-07 Expires: 2019-12-07

 World-changing technologies. Life-changing careers.

National security is our business. We apply science to help detect, repel, defeat, or mitigate threats.

For more than 60 years, Sandia has delivered essential science and technology to resolve the nation's most challenging security issues.

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

R&D, Computer Science (Experienced)

Sandia National Laboratories
Albuquerque, NM 87102

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast