At Walmart Labs we’re reinventing the world’s leading retail platform, leveraging our unique strengths to deliver the best customer experience wherever our customers shop. Imagine an environment where one line of code, one experiment, or one idea has the power to catapult an entire industry towards a smarter future. Better yet, imagine if that power could be yours, every day. That’s what we do at Walmart Labs.
This position is part of the Walmart Labs Advertising Technology team. This is a high-impact, fast-moving team working on greenfield projects with the backing of a Fortune 1 company. Our mission is to make digital advertisements more effective through low-latency serving, precise targeting, cutting-edge measurement and optimization technologies leveraging the trove of big data within the Walmart ecosystem. We are a highly motivated group of Big Data Geeks, Data Scientists and Applications Engineers, working in small agile groups to solve sophisticated and high impact problems. We are building smart data systems that ingest, model and analyze massive flow of data from online and offline user activity. We use cutting edge machine learning, data mining and optimization algorithms underneath it all to analyze all this data on top of Hadoop and Spark. The team also operates an end to end advertising platform that includes a scalable ad service that serves hundreds of millions of impressions each day, sophisticated ad matching algorithms, real-time reports, self-service interface for end to end program management etc.
As a Devops/Infrastructure, you will have the opportunity to work closely with our data scientists, business partners and systems engineers to envision and power the tools & infrastructure for this complex advertising ecosystem. The ideal candidate must be able to wear multiple hats (SysEng, NetEng, DBA, Dev, software engineeretc.), must be detail oriented, have superior verbal and written communication skills, strong organizational skills, able to juggle multiple tasks at once, able to work independently and can maintain professionalism under pressure. You must be able to identify problems before they happen with various monitoring and logging solutions to detect and prevent outages. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.
Assist in designing, transitioning and deploying of applications to various clouds
Develop tools and framework to improve operational efficiency and anomaly detection
Design and operate the environment to test application resiliency to infrastructure instabilities
Responsible for application configurations in different cloud environments
Proactively monitor, identify, and escalate issues or root causes of systemic issues
Enable data scientists, business and product partners to fully leverage our platform
Be the consummate team player with a demonstrable ability to learn quickly. Amazing ability to get stuff done.
Follows the industry trends in the online world.
BS degree in Computer Science or a related technical field and five years experience
Experience in overseeing sites with constant high traffic
Experience in bootstrap production and non-prod environments in public/private clouds
Experience with CI/CD and related tools (e.g. Git, Gerrit, GitHub, Maven, Ant, Jenkins) in Dev, QA, Staging, and Prod environments
Extensive experience with Linux, particularly RedHat/CentOS, SSH, DNS, etc.
Experience with setting up monitoring and alerting tools, e.g. Prometheus, PagerDuty, Splunk, Elasticsearch
Understand the metric requirements of system/application health
Experience with setting up, configuring, and managing RDBMS and data stores such as Solr, Casandra, etc.
Experience writing and maintaining tools and scripts to support automation and operations, e.g. Unix Shell scripts, Ansible, etc.
Solid knowledge of Unix systems with the ability to troubleshoot issues in complex, distributed, multi-tier architectures.
Worked in SaaS/PaaS companies as DevOps/Infrastructure engineer
Passionate about operational excellence and documentation
Good written and verbal communication skills
Experience with public cloud. In particular Google Cloud and/or Azure.
Experience with troubleshooting and performance tuning in JVM, MySQL, MongoDB, Cassandra
Experience with setting up a whole network infrastructure, configuring and troubleshooting networking issues
Experience in secure, scalable and highly available online services
Experience collaborating with multiple teams
Experience in Big Data applications
Accepting the "infrastructure as a code" and "immutable infrastructure" concepts
Imagine working in an environment where one experiment can catapult an entire industry toward a smarter future. That’s what we do at Walmart Labs. We’re a team of 5,000+ software engineers, data scientists, designers and product managers within Walmart, the world’s largest retailer, delivering innovations that improve how our customers shop and our enterprise operates.
Read Full Description