Site Reliability Engineer 1

Juniper

Juniper is seeking a full-time SRE to join our talented team and support high-quality technology solutions that revolutionize wireless and wired networks, powered by Artificial Intelligence in the cloud. Juniper provides services through SaaS applications to several enterprises, including Fortune 100 and Fortune 500 customers. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance, and for keeping cloud uptime and reliability high. Your primary responsibilities will be incident management and release management for cloud instances in various regions.

Responsibilities

  • Maintain system availability, health and service levels (SLAs, SLOs) of the large-scale cloud infrastructure, running in AWS and GCP.
  • Support infrastructure components, data streaming frameworks and databases, such as Kubernetes, Flink, Storm, Spark, Kafka, Cassandra, Elasticsearch, Redis, Postgres, ArangoDB, and many others.
  • Monitor, troubleshoot, and analyze failures, and support software engineers in debugging production issues across microservices and distributed platforms. Work with the development team to resolve the issues found.
  • Join the on-call rotation and resolve issues in a 24x7 multi-cloud (AWS/GCP) environment.
  • Monitor metrics and performance of applications and cloud infrastructure.
  • Handle the entire incident management lifecycle, including reporting, analyzing, and handling incidents through closure, and writing root cause analyses (RCAs).
  • Write and update runbooks for knowledge-driven automated processes and bots.
  • Perform capacity planning based on performance, usage, and utilization stats.
  • Follow SRE best practices and procedures.

Required Skills

  • Bachelor's degree in computer science or computer engineering or equivalent.
  • 1+ years of hands-on experience with AWS or GCP, EC2 (GCE), IAM, S3 (GS), Docker, Kubernetes pods, Jenkins, Prometheus, CloudWatch (Stackdriver), Linux, Ansible.
  • 1+ years of experience deploying code and infrastructure in AWS or GCP using continuous integration/continuous delivery (CI/CD) tools in production environments.
  • 1+ years of administration experience with distributed computation and streaming frameworks, like Kafka, Cassandra, Elasticsearch, Flink, Storm, Spark, and cloud services such as EMR, Dataproc, ElastiCache, AWS RDS, GCP SQL or similar.
  • 1+ years of automation using Python, Golang, and/or Rust, plus shell scripting.
  • 1+ years of prior experience developing metrics to monitor the health of infrastructure and applications.
  • Good understanding of Terraform, CloudFormation, or another infrastructure-as-code (IaC) tool is preferred.

Nice to have

  • Any open-source development experience.
  • AIOps/GenAI experience.
  • Automation using workflow services such as GitHub Actions, Google Workflows, Jenkins, GitLab, Slack and Confluence/Jira.
  • Microservices release operations experience.

Minimum Salary: $88,800.00

Maximum Salary: $127,650.00

The pay range for this position is expected to be between $88,800.00 and $127,650.00/year; however, the base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position also includes medical benefits, 401(k) eligibility, vacation, sick time, and parental leave. Additional details of participation in these benefit plans will be provided if an employee receives an offer of employment.

If hired, the employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Juniper’s pay range data is provided in accordance with local state pay transparency regulations. Juniper may post different minimum wage ranges for permanent residency petitions pursuant to US Department of Labor requirements.

Confirmed 22 hours ago. Posted 14 days ago.
