Senior Site Reliability Engineer

ADP

ADP is hiring an Senior Site Reliability Engineer

  • Are you empathetic to client needs and inspired by transformation and impacting the lives of millions of people every day?
  • Are you looking to join a dynamic, inclusive team environment with a culture of collaboration and belonging?

Well, this may be the role for you. Ready to design what's next?

In this role, You become a part of an energetic and enthusiastic cross-functional team working on next generation of products for ADP. Working closely with development Scrum teams, you will use your skills to provide technical insights during design and implementation phases of product releases, driving the automation of system provisioning and configuration tasks through daily collaboration.

Qualifications you'll need:

Education: Bachelor's degree

Experience:

  • Expert in Windows systems administration – system design, upgrades, migrations, patching, maintenance, application deployment, capacity planning, etc.
  • Extensive hands-on experience in Citrix administration and management
  • Hands on experience in Active Directory administration and management, DHCP, DNS, File Shares, NFS.
  • Good experience with one of the scripting tools like Python, Bash, PowerShell or similar.
  • Exposure to one of the CI/CD tools like Jenkins, TeamCity, etc. for continuous integration, continuous delivery.

Technical skills (preferred – must have at least one of the below)

  • Knowledge on Linux server administration
  • Experience with cloud-based Infrastructure including plan, design, setup, migration to and ongoing support for such environments.
  • AWS services experience – VPC, S3, Lambda, ALB, Route53, API Gateway, EC2, EBS, AWS FSX, Eventbridge, Oracle RDS, AWS Backup, Cloudwatch, etc.
  • Experience in Infrastructure As a Code (IaaC), Cloud formation, Ansible (or similar) platforms to automate infrastructure deployments and configuration.
  • Experience with Docker, and deployment of the Docker applications in Kubernetes, Docker swarm or similar.
  • Administration/operations experience with Oracle, Postgres, Mongo database technologies.
  • Experience in monitoring and logging tools Dynatrace, Prometheus/Grafana, Splunk.
  • Consistent application of good source code and configuration management practices

Other skills

  • Experience with agile product teams developing and supporting solutions in production
  • Experience with leading incident response and conducting post-mortems/root cause analysis.
  • Strong problem solving and troubleshooting skills.

Your contribution will include (but not limited to):

  • Support services before they go live through activities such as system design consulting, automating environment provisioning and configuration management, capacity planning and operational readiness reviews.
  • Drive full automation of deployment pipelines for both platform and application changes
  • Monitor and measure availability, latency and overall system health across infrastructure, database, container and application levels.
  • Apply infrastructure as code practices, via use of tools such as git/Bitbucket, Jenkins, Ansible or similar.
  • Ensure systems are kept up to date and are compliant with ADP security standards
  • Scale systems sustainably through automation to improve reliability and velocity.
  • Lead on-call support and incident management, blameless post-mortems, and continuous improvement activities
  • Proactively ensure the highest levels of systems and infrastructure availability.
Read Full Description
Confirmed 15 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles