Urgently hiring Use left and right arrow keys to navigate
Estimated Pay $17 per hour
Hours Full-time, Part-time
Location Oak Ridge, Tennessee

Compare Pay

Estimated Pay
We estimate that this job pays $17.13 per hour based on our data.

$11.28

$17.13

$23.93


About this job

Job Description

Job Description
Kubernetes Platforms Systems Engineer
The Team:
In this role you will work in the Infrastructure team within the HPC Infrastructure and Networking Group to support all activities of our supercomputer center.
Major Duties/Responsibilities:
  • Use advanced knowledge and experience to ensure our Kubernetes platform remains reliable, available, and fast
  • Use advanced knowledge and experience to identify problems and provide solutions to improve the reliability, scalability, performance, and efficiency of our services
  • Respond to, investigate, and fix service issues all the way from bare metal through the OS to the application code, including technically complex problems spanning a diverse set of areas
  • Work as part of a team to define and implement best practices and standards within the organization
  • Coordinate with vendors to resolve hardware and software problems
  • Participate in an on-call rotation providing 24-hour, 7-day support and off-hours maintenance windows
  • Work with scientific and technical users of the NCCS to help them use Kubernetes
Basic Qualifications:
  • Bachelor’s degree or an equivalent combination of education and experience
  • At least five years of relevant technical experience
Preferred Qualifications:
  • 5+ years of experience working as an SRE/Systems Administrator/Systems Engineer
  • Excellent interpersonal/communications skills, and the ability to work as part of a team
  • Experience with Docker or Kubernetes
  • Experiencing using image registries such as Quay or Harbor
  • Understanding of networked computing environment concepts
  • Working knowledge of Unix systems fundamentals and common network protocols
  • Ability to develop and maintain programs and scripts that aid in the operation and automation of tasks using various shell and scripting languages (primarily bash, Python, and Go)
  • Working knowledge of tools such as Prometheus, Nagios, and Grafana to monitor systems, metrics and create dashboards
  • Working knowledge of Infrastructure-as-Code tooling such as Terraform, Helm, and Puppet
  • Working knowledge of CI/CD tooling and GitOps
  • Experience with code review and familiarity with tools like git, GitHub and GitLab

Nearby locations

Posting ID: 984074050 Posted: 2025-02-09 Job Title: Platform Engineer