Senior Systems Engineer

The Sourcery

(San Francisco, California)
Full Time Travel Required
Job Posting Details
About The Sourcery

We believe recruiting can become a company’s superpower. It can activate growth potential, by attracting individuals who activate each other’s potential, so they can build products that activate the human potential.

Yes, you could say we are passionate about recruiting. In fact, we love it. Our clients love it, too. Yes, really.

Summary

We’re looking for Senior Systems Engineer to join our Ops Team. This role will be responsible for the successful operations and scaling of the infrastructure and software that powers Sauce Labs and launches over 10 million VMs a month.

Responsibilities
  • Some of the things you’ll do in this role:
  • Write tools and scripts to provide automation and self service solutions for ourselves and other teams;
  • Design new systems to support production services;
  • Install, configure and debug hardware and systems in our data center;
  • Creatively solve scale challenges regarding a rapidly expanding cloud environment;
  • Work with real hardware - high density Cisco UCS B-series blades and C-series rack-mount servers, Nexus networking (10Gb+ core network), storage (NAS and SAN), Mac-in-a-datacenter, custom appliances for mobile devices, load balancers, and beyond;
  • Help improve monitoring and identify key performance metrics;
  • Proactive R&D - discovering and implementing new tools, emerging technology, etc.;
  • Disaster recovery design, implementation, and maintenance;
  • Create NOC runbooks, procedures, documentation, and diagrams of the environments you manage;
  • Troubleshooting and resolution of server/network issues;
  • Help maintain hardware in Sauce’s colocation facilities;
  • Help build out new data centers around the globe;
  • Participation in 24x7 on-call rotation.

Here are a couple of examples of the kind of projects you might work on:

  • Optimize hardware and configuration for improving hypervisor performance.
  • Automating and migrating a system configuration to our UCS platform for faster production scaling.
  • Building and optimizing a logstash cluster for our development team to monitor and analyze production system usage.

We have a lot of big projects and decisions that need to be made, and in this role you would be a key part of that process. Sound like fun? Here’s what we’re looking for:

Ideal Candidate

Our Ideal Candidate:

  • 3+ years recent experience working as a Linux administrator/engineer at scale (hundreds of systems) and designing/deploying ‘highly available’ solutions;
  • 2+ years of recent professional experience designing, developing, and operating Configuration Management solutions such as Chef, Puppet, Salt (preferred), or Ansible (preferred) at scale;
  • Solid experience in Linux tuning, profiling, and monitoring; Strong skills in at least one language: Python (preferred), Ruby, Bash.

Bonus points for:

  • Experience deploying/managing KVM-Qemu and LXC
  • Solid understanding of cloud/networking/distributed computing environment concepts; including TCP/ IP connections, firewalls, VLANs, etc.
  • Experience and understanding of contemporary metrics, monitors, and logging solutions especially statsD, Graphite, ELK, Splunk, Nagios, etc.
  • Provisioning and automation with Cisco Unified Computing System Manager
  • Highly organized, able to multi-task, able to work individually, as well as within a team, and across teams;
  • Excellent communication skills, both verbal and written across all user levels;
  • Deployment automation in physical and virtual environments (PXE, Cobbler, MAAS (preferred));
  • Working knowledge of load balancing technologies (hard/soft);
  • Proven experience collaborating in a cross functional team environment;
  • Familiarity with software engineering practices, including n-tier architecture, configuration management, development methodologies (e.g. agile, waterfall, spiral, prototyping), etc.

Notes:

  • Some travel to South Bay or SF is required.
  • Relocation available.
Compensation and Working Conditions
Benefits Benefits included

Questions

There are no answered questions, sign up or login to ask a question

sign up or login to save this job and more
San Francisco, California
This role can be based in either South Bay or in San Francisco.
Skills Desired
Sign up or login to see how your skills match up.
  • Cloud
  • Linux
  • Python
  • Ruby
  • Bash
  • Firewall
  • KVM
  • Load Balancing
  • Nagios
  • Splunk
  • TCP/IP
  • Chef Software
  • SALT
  • Ansible
  • Graphite Software
  • Virtual LAN
  • StatsD
  • Puppet
  • Qemu
  • Cisco Unified Computing System
  • Cobbler
  • LXC
  • ELK
  • Preboot Execution Environment
  • Metal as a Service

Want to see jobs that are matched to you?

DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.