Lead Site Reliability Engineer

OpenText

Education
Benefits
Qualifications
Skills

Hiring Manager: Pedro Caba Diaz

Talent Acquisition Advisor: Draun Raval

Job Code Level: IZ-CLD-P4

Refer Your Friends!

AI-First. Future-Driven. Human-Centered.

At OpenText, AI is at the heart of everything we do—powering innovation, transforming work, and empowering digital knowledge workers. We're hiring talent that AI can't replace to help us shape the future of information management. Join us.

YOUR IMPACT

As a Site Reliability Administrator (SRE) at OpenText, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with development and operations teams to design, implement, and maintain systems that are resilient and efficient.

WHAT THE ROLE OFFERS

  • Design, implement, and manage Kubernetes clusters to ensure high availability and scalability.
  • Utilize AWS services to build and maintain cloud infrastructure.
  • Develop and maintain Helm charts for application deployment and management.
  • Use Terraform to automate infrastructure provisioning and management.
  • Monitor and optimize Windows and Linux-based systems to ensure optimal performance and reliability.
  • Collaborate with development teams to ensure smooth deployment and operation of applications.
  • Implement and maintain CI/CD pipelines to streamline development and deployment processes.
  • Troubleshoot and resolve issues related to infrastructure and application performance.
  • Participate in on-call rotations to provide 24/7 support for critical systems.

WHAT YOU NEED TO SUCCEED

  • Strong experience with Kubernetes and container orchestration.
  • Proficiency in AWS services and cloud architecture.
  • Expertise in Helm charts for Kubernetes application management.
  • Solid understanding of Terraform for infrastructure as code.
  • In-depth knowledge of Linux operating systems and system administration.
  • Experience with CI/CD tools and practices.
  • Excellent problem-solving skills and ability to work under pressure.
  • Strong communication and collaboration skills.

Preferred Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • AWS Certifications
  • Previous experience in a Site Reliability Engineer or similar role.
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana or ELK stack).
  • Knowledge of scripting languages (e.g., Python, Bash).

OpenText's efforts to build an inclusive work environment go beyond simply complying with applicable laws. Our Employment Equity and Diversity Policy provides direction on maintaining a working environment that is inclusive of everyone, regardless of culture, national origin, race, color, gender, gender identification, sexual orientation, family status, age, veteran status, disability, religion, or other basis protected by applicable laws.

If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please submit a ticket at Ask HR. Our proactive approach fosters collaboration, innovation, and personal growth, enriching OpenText's vibrant workplace.

Read Full Description
Confirmed 18 hours ago. Posted 2 days ago.

Discover Similar Jobs

Suggested Articles