Site Reliability Engineer

ASAPP

ASAPP develops AI-powered solutions that improve the effectiveness and scalability of enterprise-level customer service operations. Our team is dedicated to tackling the most challenging issues in contact centers—such as high interaction volumes and complex customer queries—using AI technology. We design intelligent systems that can autonomously handle interactions, and when necessary, seamlessly escalate to human agents. Suppose you are excited about working in a fast-paced environment where you’ll be helping large organizations achieve measurable results. In that case, you’ll find an opportunity to make a meaningful impact at ASAPP. Join us in building solutions that create value for businesses and their customers by streamlining operations and improving first-contact resolution, all while maintaining cost-effectiveness and security.

Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. SREs design and implement the tools that automate building reliable and performant systems. We emphasize building tools over manual processes. We implement, not administer. We’re obsessed with automation, not repetition. Our job is to focus on building reliable infrastructure and tools for our product teams so that they can solve customer problems and deliver new features, not reinvent platforms.

What you'll do

  • Work with product engineering teams on service architecture and implementation
  • Deliver Infrastructure configuration as code and automate everything
  • Direct and implement monitoring and alerting systems to support rapid problem diagnosis
  • Perform Root Cause Analysis and design and deliver resolutions
  • Work on our Kubernetes / AWS infrastructure to support our product engineers
  • Design secure and performant networking solutions in our production systems

What you'll need

  • +4 years of relevant experience bringing software to production at high scale
  • Participation in on-call rotation, triaging and addressing production issues
  • Obsession with automation and instrumentation
  • Understanding of complex systems and failure scenarios
  • Excellent communication skills
  • Knowledge of AWS services, containers and container management frameworks
  • Familiarity with Message Bus based systems and distributed architectures
  • Proficiency in Terraform , Python and/or Go

What we'd like to see

  • BS or MS degree in the Computer Science field, or equivalent hands-on experience.
  • Experience in product oriented environments
  • Scalable distributed applications experience

Benefits

  • Competitive compensation
  • Stock options
  • Prudent Life Insurance
  • Free Lunch and Dinner
  • Connectivity (mobile phone & internet) stipend
  • Wellness perks
  • Mac equipment
  • Learning & development stipend
  • Parental leave, including 6 weeks of paternity leave

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at jobs@asapp.com to obtain assistance.

Read Full Description
Confirmed 2 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles