Site Reliability Engineer - SRE

Oracle

The Oracle ERP Cloud Operations team is looking for passionate, innovative, high caliber, team oriented people that seek being a major part of a transformative revolution in the development of modern business cloud based applications. As part of the market leading ERP Cloud, Oracle ERP Cloud Operations offers a broad suite of modules and capabilities designed to empower the development organization with world-class service reliability engineering disciplines and deliver customer success with streamlined process, increased productivity, and improved business decisions.

Oracle, the world leader in Enterprise Cloud, is hiring passionate technologists in the industry as we continue to add customer-centric, world-class, leading edge, secure, hyper-scale based solutions throughout all levels of the cloud stack. Oracle’s cloud eco-system is the only complete business cloud platform on the planet, with market leading and business transforming solutions spanning SaaS, DaaS, PaaS and IaaS. If you are interested in developing solutions that ensure our world class ERP services are fast, secure, reliable and scalable then we invite you to explore the positions we have available in our group.

Career Level - IC3

Key Tasks and Responsibilities

  • Service Ownership – You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services, with our Service Development and Operations SRE partners.
  • Ownership Scope – You will understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the production services you own. In partnership with your Service Development and Operations SRE partners, you will have the responsibility to ensure that services are designed and delivered to be mission critical with focus on monitoring, telemetry, security, resiliency, scale and performance.
  • Service Requirements – You will provide direction and prioritization to service Product Management and Service Development teams to engineer and add premier SRE capabilities to the Oracle SaaS/ERP services.
  • Incident Response – You will be the primary author of technical content for both customer and internal communications used throughout the incident response process, e.g. postmortem/root cause analysis, end-to-end repair item definition, and fixes in production.
  • Prevention – Using data-driven incident findings, you will work on solutions that will ultimately prevent the incident/problem from arising ever again, and develop interim solutions to more quickly resolve the problem next time.
  • Service Performance – You will work with SaaS Operations and Product Development teams to triage performance issues (both reactive and proactive). You will work with central teams to define and drive monitoring tooling and process enhancements, including identification of service metrics to enhance performance issue triage, diagnostics and improvements.
  • Service Health Reviews – You will represent ERP Development in periodic cross-organizational service health reviews. You will help to identify patterns that influence service performance and/or reliability. You will lead efforts to eliminate process deficiencies and drive simplification into processes and procedures.
  • Automation – Our goal is to eliminate human intervention wherever possible. You will be responsible for driving automation into our monitoring and recovery processes, code delivery procedures and issue resolution processes.

Skills and Qualifications (3 or more desired)

  • Minimum of 3 years of software development and demonstrated knowledge of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control, build and release processes, continuous deployment and test suite development and maintenance.
  • Problem solving skills with abilities in analysis, problem identification and resolution.
  • Experience with enterprise system components, architecture and deployments
  • Experience in deploying and running large scale online systems built on Cloud platforms such as Oracle Cloud, AWS, Azure, Google Cloud Platform and/or OpenStack
  • Experience in performance analysis and tuning of enterprise applications
  • Experience with monitoring and alerting using technologies like Prometheus, Sensu, Nagios, Kafka, Wavefront, BigPanda, DataDog, and/or PagerDuty.
  • Experience with Oracle Linux, RedHat Linux, Ubuntu, Centos, CoreOS, and/or Amazon Linux.
  • Experience in designing and building automated tools and solutions, including programming and data model design skills
  • Hands-on with web protocols and Linux/Unix tools and architecture, from kernel to shell, file systems, and client-server protocols.
  • Experience with solutions for platform and application layer telemetry, monitoring, scalability, performance and reliability.
  • Experience with working systems and network administration, application security, DevOps and/or Site Reliability Engineering will be highly preferred
  • Excellent written and verbal technical communications with technical and non-technical peers, customers and at times executive leadership.
  • Proven success in contributing in a collaborative, team-oriented environment, with the ability to establish and nurture relationships at all levels.
  • BS in Computer Science or related field and 3 years relevant experience.

As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Read Full Description
Confirmed 8 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles