HPC Systems Engineer (Algo)

DRW

Benefits
Qualifications
Skills

DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we operate using our own capital and trading at our own risk. 

Headquartered in Chicago with offices throughout the U.S., Canada, Europe, and Asia, we trade a variety of asset classes including Fixed Income, ETFs, Equities, FX, Commodities and Energy across all major global markets. We have also leveraged our expertise and technology to expand into three non-traditional strategies: real estate, venture capital and cryptoassets. 

We operate with respect, curiosity and open minds. The people who thrive here share our belief that it’s not just what we do that matters–it's how we do it. DRW is a place of high expectations, integrity, innovation and a willingness to challenge consensus. 

We are looking for an HPC Systems Engineer for our Chicago office. This role will be as partof a small team that is responsible for all aspects of our HPC cluster which drives the research and development of our trading systems. You will monitor the health and utilization of the environment, detect and prevent problems, ensure high availability, and enable future growth and innovation in our research infrastructure. Your work will encompass everything from integration of the newest technologies for high performance computing to rapid development and deployment of custom trading tools.

What you will do:

  • Work as part of a small team to manage a large compute cluster of Linux servers and related hardware
  • Build and manage high performance storage and network components in our infrastructure
  • Implement and manage monitoring of job performance, system stats, and the general health of our HPC infrastructure
  • Architect upgrades to expand the size, scope, and performance of our cluster and continually integrate the latest technologies
  • Coordinate with other teams for the deployment, operation, and maintenance of our data center footprint
  • Assist in optimizing and tuning grid jobs to efficiently utilize compute resources

What you will need:

  • Experience with Linux administration (Ubuntu or Debian preferred)
  • Confidence with configuration management tools like Chef, Puppet, or Ansible
  • Experience in administration of clusters of homogeneous machines
  • Experience building, tuning, and managing large HPC environments with thousands of cpu cores running under Slurm, LSF, GridEngine, etc.
  • Knowledge and comfort with hands on development in scripting languages like Bash, Python, or Ruby
  • Familiarity with containers and orchestration tools like Docker, Kubernetes, Singularity, etc.
  • Competence in networking fundamentals
  • Skill with Linux package management tools like apt/dpkg or yum/rpm
  • At home with compilation and packaging of open source software like Redis, Python, or Ruby; including reading and modifying Makefiles
  • Experience with parallel or distributed file systems is a plus

For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at https://drw.com/privacy-notice.

California residents, please review the California Privacy Notice for information about certain legal rights at https://drw.com/california-privacy-notice.

#LI-BL1

Read Full Description
Confirmed 21 hours ago. Posted 27 days ago.

Discover Similar Jobs

Suggested Articles