Accela has flagged the Senior Site Reliability Engineer job as unavailable. Let’s keep looking.

Our Purpose

We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team – one that makes better decisions, drives innovation and delivers better business results.

Title and Summary

Senior Site Reliability Engineer (storage)

Mastercard’s enterprise storage team is looking for a senior site reliability engineering to help advance our SRE capabilities across our storage platforms with a focus on the Software Defined Storage technology Ceph.

  • This individual will work closely with technology support teams and application teams to build monitoring and automation solutions to improve the application and infrastructure availability.
  • Key questions for viable candidates: 1. Have you resolved a complex application availability issue with monitoring and automation? 2. Are you a self-started with minimal direction and guidance? 3. Have you ever been part of a team with diverse skills and experience located in different geographical locations/time zones?

Role

  • Represent enterprise storage team in project meetings and provide advice, status, training and technical support
  • Work with our customers to understand the monitoring and automation requirements and implement solution using available tool sets and scripting languages
  • Administer, support and maintain enterprise Monitoring tools in a multi-tier environment
  • Build Automation solutions using available tool sets and scripting languages.
  • Maintain design and support documents for all built solutions and processes
  • Troubleshoots networking, Unix/Linux systems, and applications to identify and correct malfunctions and other operational problems utilizing associated Linux and UNIX command line and management tools.
  • As new technologies emerge and impact our environment, learn these technologies very quickly and resolve any problems involved in integrating new technologies.
  • Maintains a broad knowledge of state-of-the-art technology, equipment, and/or systems.
  • Self-Driven and flexible willing to learn in adjacent areas with the initiative to learn more.
  • Thorough, adhering to critical processes even under stress
  • Support business disaster recovery procedures for assigned areas of responsibility.
  • Accurately document duties and procedures to aid the department in cross-training and absentee coverage
  • Work with technical engineering teams to manage and improve processes
  • Ability to solve problems quickly and completely.
  • Ability to identify tasks that should be automated and then develop and implement automation

All About You

  • Advanced user level expertise in UNIX and/or Red Hat Linux
  • Networking experience from basics to advanced, along with security knowledge.
  • Proficient with scripting or programming languages such as Python, bash
  • Experience with enterprise systems management/monitoring tools such as Grafana and Prmetheus
  • Experience using HA tooling like HAProxy and Pacemaker
  • Strong analytical, troubleshooting and problem solving skills
  • Ability to manage multiple projects simultaneously under pressure without direct supervision
  • Ability to manage multiple activities and work with a strong sense of urgency
  • Ability and motivation to learn new technologies quickly and with minimal support and guidance.
  • 3rd line out of hours operational support
  • Strong people skills and the ability to understand business needs and translate them into technical solutions
  • Excellent verbal and written skills, organization, project prioritization, and time management skills.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:

  • Abide by Mastercard’s security policies and practices;
  • Ensure the confidentiality and integrity of the information being accessed;
  • Report any suspected information security violation or breach, and
  • Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
Read Full Description
Confirmed 2 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles