Coupa has flagged the Site Reliability Engineer job as unavailable. Let’s keep looking.

About Us (Ensono)

Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and application modernization, public cloud migration and cloud-native development. With certified experts in AWS, Azure and Google Cloud and recognized as Microsoft Datacenter Transformation Partner of the Year, Ensono has over 3500+ associates globally and is headquartered in greater Chicago.

We care about your success, offering comprehensive strategic and managed services for mission-critical applications. Our Advisory and Consulting services can help upfront with an application strategy or find the right places for your applications – whether it’s public, multi or hybrid cloud, or mainframe. And because we span across all mission-critical platforms, we can meet you wherever you are in your digital transformation journey, with 24/7 support when you need it. We are your relentless ally, flexing with you when challenges emerge so you don’t feel stuck in place. With cross-platform certifications and decades of experience, our technology experts have become an extension of your team so you’re continuously innovating – doing more with less while remaining secure. And that’s just the beginning.

About Role

Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed services clients. Ensono has invested time to create templated cloud-native solutions to provide value to our clients. They have loved what we’ve done so far and want us to operate these applications in production on their behalf.

In response to this demand, Ensono is applying Site Reliability Engineering principles to disrupt the traditional Managed Services approach and deliver something that empowers our customers and turns technology into an efficiency, growth and innovation multiplier. The successful candidate will be reporting into the Head of SRE and will start supporting our clients immediately. New projects are in the pipeline, so you will also be working with our pre-sales and delivery teams to ensure operations are considered long before handover.

We are just starting on our journey to Site Reliability Engineering, so we are eager to continue to learn from industry leaders and your experiences in delivering Site Reliability Engineering to build a sustainable workplace that delivers a service which will delight our customers.

What you will be doing: As a Site Reliability Engineer, your overarching responsibility is to ensure we meet our clients’ Service Level Objectives, and we respond to incidents in a timely and professional manner.

Responsibilities:

  • Monitoring our client’s services using modern tools and SRE practices.
  • Responding to incidents originating from 2nd line support within the times set out in the SLA (being on-call).
  • Performing and assisting in root cause analysis and blameless post-mortems to enable incidents to be understood and avoided in the future.
  • Improving the testing and release procedure.
  • Planning for and making changes to capacity to balance the demand vs. cost saving equation better.
  • Undertaking improvements to the infrastructure and product.
  • Making changes to client’s services based upon operational or business needs.
  • Advising and supporting the further development of Ensono Intellectual Property to ensure future projects benefit from what we learn.

Experience level - 5 to 8 yrs

Technical Key Skills (Mandatory Skills)

  • A comprehensive understanding of Site Reliability Engineering
  • Experience working with a cloud service provider (ideally Azure or AWS)
  • Strong examples of implementing automation/solutions by code (preferably Python, C#, Java, or Go, any other language)
  • Commercial experience working with compute technologies (such as Kubernetes (EKS), or Serverless)
  • Designed, implemented, and/or supported solutions in a production environment
  • Strong interpersonal and communication skills to work in a fast-paced and rapidly changing dynamic environment

Good to have skills

  • Experience with CI/CD pipeline tools (such as Azure DevOps, GitHub Actions, Gitlab CI)
  • Experience with monitoring, logging tools (such as Azure Monitor, CloudWatch or Prometheus)
  • Experience with ITSM tools (such as ServiceNow, OpsGenie, or PagerDuty)
  • Working with an Infrastructure as Code tool (Terraform, ARM, CloudFormation or Deployment Manager)
  • Excellent troubleshooting skills that span systems, networks (TCP/IP), and code
  • Expert knowledge of Linux internals and tuning

Shift Timings

Should be comfortable with any shift timings

Read Full Description
Confirmed 18 hours ago. Posted 27 days ago.

Discover Similar Jobs

Suggested Articles