
Share this Job
About the Organisation
IBM is a globally renowned technology and consulting company, established in 1911. With a focus on hybrid cloud and AI, IBM offers cutting-edge solutions in software, infrastructure, and services. It is recognized as one of the largest and most innovative tech employers, serving Fortune 50 companies around the globe. IBM values diversity, continuous learning, and impactful innovation.
Site Reliability Engineer job at IBM (International Business Machines Corporation) | Apply Now
Remote, OR, USA
Are you looking for Remote Software Engineering jobs in 2025 today? then you might be interested in Site Reliability Engineer job at IBM (International Business Machines Corporation)
Full Time
Deadline:
21 May 2025
Job Title
Site Reliability Engineer job at IBM (International Business Machines Corporation)
IBM (International Business Machines Corporation)
Job Description
The Site Reliability Engineer II will work within the Infrastructure Services team to support IBM's cloud offerings powered by HashiCorp. The role includes automating processes, reducing manual toil, improving observability, and supporting production infrastructure for IBM cloud services. Candidates will work with tools such as Nomad, Consul, Vault, Terraform, and AWS. The position is remote and ideal for engineers looking to grow into senior roles in site reliability engineering.
Key responsibilities include:
Developing and maintaining infrastructure services to ensure high availability and security.
Implementing automation and improving deployment processes.
Debugging infrastructure issues with guidance from senior engineers.
Participating in on-call rotations post-onboarding.
Creating and maintaining documentation.
Collaborating across teams and engaging in hiring activities.
Duties, Roles and Responsibilities
Qualifications, Education and Competencies
See all details of the qualifications, competencies and education for this role under the "How to Apply" section below.
ONLINE APPLICATION ONLY!
Interested candidates are advised that applications for this position must be submitted online. To apply please click the “Apply” button below.
Find application details and links on the AfriCareers Jobs Portal:
-
Click the Apply button below
-
New users: Select Create Profile and complete the Profile Creation Wizard
-
Existing users: Log in and update your profile if needed
-
Go to the "Jobs" tab
-
Read the detailed job description, Roles and Qualifications.
-
Submit your application via the jobs portal
-
Track progress under "My Applications" tab
Important Note: Some employers now hire directly on the AfriCareers New Jobs Portal — keep your profile updated so employers can easily view your CV and hire you instantly.
How to Apply
Build, maintain, and improve core infrastructure systems.
Ensure system reliability, scalability, and security.
Automate operations to minimize manual tasks.
Improve monitoring, alerting, and logging.
Resolve infrastructure issues and support incident response.
Collaborate with product and engineering teams.
Write and maintain technical documentation.
Support interviews and hiring evaluations.
Required:
High School Diploma/GED (Bachelor's Degree preferred).
Experience in site reliability engineering or systems administration.
Familiarity with AWS and Terraform.
Exposure to observability tools (Datadog, Prometheus, Grafana).
Basic scripting skills (Python, Go, Bash).
Strong problem-solving and collaboration skills.
Preferred:
Growth mindset with eagerness to learn and take on increasing responsibilities.
Familiarity with IBM and HashiCorp products.
Interest in progressing into a senior SRE role.


.jpg)
.jpeg)





