Key skills and expertise:
- 5+yrs of experience working with Microsoft Power Platform and Dynamics and working with highly scalable and reliable servers
- Spend considerable time on Production management, including incident and problem management, capacity management, monitoring, event management, change management and plant hygiene.
- Troubleshooting issues across the entire Technology stack: hardware, software, application and network.
- Participating in on-call rotation, periodic maintenance calls with other specialists from other timezones.
- Proactively identifying and addressing system reliability risks.
- Working closely with development teams to design, build and maintain systems from a reliability, stability and resiliency perspective.
- Identifying and driving opportunities to improve automation for our platforms, scope and create automation for deployment, management, and visibility of our services.
- Representing the RPE organization in the design reviews, operations readiness exercises for the new and existing products/services
Technical Skills:
- Enterprise tools like Promethus, Grafana, Splunk and Apica
- UNIX/Linux Support and Cloud based services
- Ansible, Github, or any automation/configuration/release management tools
- Automation experience – scripting languages such as Python, Bash, Perl and Ruby (one of the languages sufficient)
- Awareness of, ability to reason about modern software and systems architecture, including load balancing, databased, queueing, caching, distributed systems failure modes, microservices, cloud etc.
- Experience of Azure networks, ServiceBus, Azure Virtual machines, and AzureSQL will be an advantage.
If you are keen to join us, you will be part of an organization that values your contributions, recognizes your potential, and provides ample opportunities for growth. For more information, visit www.capco.com. Follow us on Twitter, Facebook, LinkedIn, and YouTube.
Read Full Description