Senior DevOps Engineer

Chegg

Job Description

As a member of our Site Reliability Team, you will help drive collaboration across departments and provide global insights by being the consistent eyes on production. You will work with application, infrastructure, and product teams to ensure smooth launches while meeting proper security gates. You will have the chance to skill up on communicating with global teams and help bring standards to teams with unique SDLCs.

This group runs extremely high on constant learning and shared education to avoid silos. To be most effective, you will want to have a solid grasp of engineering principles, infrastructure design, and a mature background in iterative product delivery.

On the Team you will participate in:

  • Driving Agile teams to support both interrupt and project work. 
  • Estimating and delivering projects on-time and within budget through scope shifting and solid communication.
  • Analysis of trends on Golden Signal KPIs (or other) to provide useful feedback on anomalies.
  • Building ideas that shift conversations from outage/retro or symptom/prevention cycles toward prevention up front.
  • Partnerships with Engineering teams to remove drag from internal processes (knowledge of GitLab, GitHub/Git, Ansible, Terraform, Kubernetes, Docker, Consul).
  • Documenting and training others on your team and providing group training and demos.
  • Constantly improving Change, Release, Incident and Patching processes, with the goal of making them non-events.
  • Optimizing debug procedures for production issues across a variety of technical stacks.
  • Enforcing standards through communication, on design, implementation, and security.

You will be charged with having:

  • 5-8 years in Cloud commerce system delivery.
  • Expertise in at least two areas of application and infrastructure engineering.
  • A strong knowledge of AWS technologies and a willingness to self-teach.
  • Experience with CI/CD automation (our environment: Ansible, GitLab, Git/GitHub, Artifactory, Terraform).
  • Capabilities in design and delivery, bringing in projects on budget.
  • An understanding of capacity planning and how to set appropriate limits to optimize cost and performance.
  • Knowledge of identifying system scale, backoff or other throughput challenges to help prevent incidents or resolve them quickly.
  • Experience performing to metrics (SLIs/SLOs/SLAs) and making meaningful commitments to customers.
  • History with product behavior, edge cases, failure modes, negative boundary behaviors, load mishaps, etc., to stop issues before they enter production.
  • A history of building and supporting multiple versions of Linux and Windows.

Behavioral skills:

  • Team player, as you will be part of a global team distributed across different countries.
  • Problem solver and self-motivator.
  • Good at time management and task prioritization.

Technical Skills you should have:

  • Cloud platforms. AWS (preferably) or Azure, with a strong understanding of services like IAM, EC2, ECS, S3, Route 53, DNS, Lambda, Elastic Beanstalk, ElastiCache, RDS, etc.
  • Containerization. Hands-on experience with Docker or Kubernetes. Be able to create and publish your own container images and deploy with Docker.
  • Infrastructure as code tools like Terraform for defining infrastructure in configuration files, and to create environments.
  • Process automation (CI/CD) tools, preferably GitLab, Jenkins, etc. Strong knowledge of automation tools.
  • Shell scripting (or PowerShell in Windows).
  • Modern programming languages. Programming skills in a preferred language such as Python, Java, or PowerShell.
  • Observability. Good understanding of infrastructure, application, and log monitoring tools (like New Relic, Datadog, Zabbix, Grafana, etc.).

Why do we exist?

Students are working harder than ever before to stabilize their future. Our recent research study called State of the Student shows that nearly 3 out of 4 students are working to support themselves through college and 1 in 3 students feel pressure to spend more than they can afford. We founded our business on providing affordable textbook rental options to address these issues. Since then, we’ve expanded our offerings to supplement many facets of higher educational learning through Chegg Study, Chegg Math, Chegg Writing, Chegg Internships, Thinkful Online Learning, and more to support students beyond their college experience. These offerings lower financial concerns for students by modernizing their learning experience. We exist so students everywhere have a smarter, faster, more affordable way to student.

Video Shorts

Life at Chegg: http://youtu.be/Fwf90zgaOLA

Certified Great Place to Work!: http://reviews.greatplacetowork.com/chegg

Chegg Corporate Career Page: https://jobs.chegg.com/

Chegg India: http://www.cheggindia.com/

Chegg Israel: http://www.chegg.com/about/working-at-chegg/israel/

Thinkful (a Chegg Online Learning Service): https://www.thinkful.com/about/#careers

Chegg out our culture and benefits!

http://www.chegg.com/about/working-at-chegg/benefits/

http://techblog.chegg.com/

Chegg is an equal opportunity employer

Confirmed 15 hours ago. Posted 30+ days ago.
