Come work at a place where innovation and teamwork come together to support the most exciting missions in the world!
We are looking for a highly skilled Senior Site Reliability Engineer (SRE) to join our infrastructure team. As a Sr. SRE, you will be responsible for the reliability, scalability, and performance of our engineering systems . You will collaborate with development, operations, and security teams to design, build, and maintain infrastructure and CI/CD pipelines, and to ensure high availability of services.
We are looking for talented and motivated candidate who are interested in solve complex problems and deliver robust solutions. You will be responsible for design & development learn new technology & cloud services & Kubernetes , DevOps components.
Develop tools and automate the process for achieving large scale provisioning and deployment of cloud platform technologies.
Responsibilities:
Senior Site Reliability Engineer is a professional who acts as a bridge between development and IT operations, taking on operational tasks to ensure the efficient functioning of infrastructure & They are responsible for monitoring, automating, and improving the reliability, performance
5-8 years of experience in cloud infrastructure management, candidate must have Minimum 3 years of hands-on experience in VMware Environment working on VCN / ESXI / patching / Upgrades / DRS / HA / VSAN etc. Should have hands-on experience on Linux OS Infra Minimum 2-3 years
- Hands on with automation tools like Ansible, terraform, Jenkins.
- Candidate should have good knowledge on Linux Operating system.
- Managing the continuous integration/continuous development pipeline (CI/CD). You’ll probably be tasked with building this pipeline from scratch.
- Ability to write a program using one or more high-level languages, such as Python / Golang
- Extensive knowledge of Linux systems including hardware, software and applications.
- Extensive Knowledge of VMware and other virtualization products.
- Work on incidents till resolution in collaboration with all other teams and perform root cause analysis with all possible corrective actions for improvement.
- Should have good knowledge of information security principles and practices, understanding of security protocols and standards.
- Candidate should have basic knowledge about Kubernetes, Jenkins and Grafana
- Candidate should know about VMware / KVM hands-on experience.
- Strong knowledge on Linux, Windows, OEL Operating system.
- Good to have Data base knowledge Oracle / SQL
- A strong passion for front-end architecture as well as a driven attitude towards learning new web technologies
- Strong creativity skills
- preferred experience with cloud formation, terraform, ansible, Jenkins
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
- Ability to analyze problems and create solutions
- Ability to work independently and follow through on projects
- Excellent written and verbal communication skills
- Just as important as your experience and skills will be the following characteristics and competencies:
- Self-motivation and sense of ownership and accountability
- Ability to analyze problems and create solutions
- Ability to work independently and professionally
- Good written and verbal communication skills
- Detailed work and organizational skills
- Responsible for designing, planning, implementing, and database Administration including security, access, and documentation.
- BE /MS degree in Computer Science, & related field.
Read Full Description