Magic Leap has flagged the Senior DevOps Engineer - Contractor job as unavailable. Let’s keep looking.

What you'll do

At Walmart Labs we’re reinventing the world’s leading retail platform, leveraging our unique strengths to deliver the best customer experience wherever our customers shop. Imagine an environment where one line of code, one experiment, or one idea has the power to catapult an entire industry towards a smarter future. Better yet, imagine if that power could be yours, every day. That’s what we do at Walmart Labs.

This position is part of the Walmart Labs Advertising Technology team. This is a high-impact, fast-moving team working on greenfield projects with the backing of a Fortune 1 company. Our mission is to make digital advertisements more effective through low-latency serving, precise targeting, cutting-edge measurement and optimization technologies leveraging the trove of big data within the Walmart ecosystem. We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile groups to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark. The team also operates an end to end advertising platform that includes a scalable ad service that serves hundreds of millions of impressions each day, sophisticated ad matching algorithms, real-time reports, self-service interface for end to end program management etc.

As a Devops/Infrastructure, you will have the opportunity to work closely with our data scientists, business partners and systems engineers to envision and power the tools & infrastructure for this complex advertising ecosystem. The ideal candidate must be able to wear multiple hats (SysEng, NetEng, DBA, Dev, software engineeretc.), must be detail oriented, have superior verbal and written communication skills, strong organizational skills, able to juggle multiple tasks at once, able to work independently and can maintain professionalism under pressure. You must be able to identify problems before they happen with various monitoring and logging solutions to detect and prevent outages. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.

Assist in designing, transitioning and deploying of applications to various clouds

Develop tools and framework to improve operational efficiency and anomaly detection

Design and operate the environment to test application resiliency to infrastructure instabilities

Responsible for application configurations in different cloud environments

Proactively monitor, identify, and escalate issues or root causes of systemic issues

Enable data scientists, business and product partners to fully leverage our platform

Be the consummate team player with a demonstrable ability to learn quickly. Amazing ability to get stuff done.

Follows the industry trends in the online world.

Minimum Qualifications

BS degree in Computer Science or a related technical field and five years experience

Experience in overseeing sites with constant high traffic

Experience in bootstrap production and non-prod environments in public/private clouds

Experience with CI/CD and related tools (e.g. Git, Gerrit, GitHub, Maven, Ant, Jenkins) in Dev, QA, Staging, and Prod environments

Extensive experience with Linux, particularly RedHat/CentOS, SSH, DNS, etc.

Experience with setting up monitoring and alerting tools, e.g. Prometheus, PagerDuty, Splunk, Elasticsearch

Understand the metric requirements of system/application health

Experience with setting up, configuring, and managing RDBMS and data stores such as Solr, Casandra, etc.

Experience writing and maintaining tools and scripts to support automation and operations, e.g. Unix Shell scripts, Ansible, etc.

Solid knowledge of Unix systems with the ability to troubleshoot issues in complex, distributed, multi-tier architectures.

Worked in SaaS/PaaS companies as DevOps/Infrastructure engineer

Passionate about operational excellence and documentation

Good written and verbal communication skills

Preferred Qualifications

Experience with public cloud. In particular Google Cloud and/or Azure.

Experience with troubleshooting and performance tuning in JVM, MySQL, MongoDB, Cassandra

Experience with setting up a whole network infrastructure, configuring and troubleshooting networking issues

Experience in secure, scalable and highly available online services

Experience collaborating with multiple teams

Experience in Big Data applications

Accepting the "infrastructure as a code" and "immutable infrastructure" concepts

About Walmart Labs

Imagine working in an environment where one experiment can catapult an entire industry toward a smarter future. That’s what we do at Walmart Labs. We’re a team of 5,000+ software engineers, data scientists, designers and product managers within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and our enterprise operates.

Read Full Description
Confirmed 21 hours ago. Posted 30+ days ago.

Discover Similar Jobs

Suggested Articles