Platform Operations Engineer

DigitalOcean

(Cambridge, Massachusetts)
Full Time
Job Posting Details
About DigitalOcean

DigitalOcean, the cloud for developers, is a dynamic, high-growth technology company that serves a passionate community of technologists around the world. We want to simplify cloud computing for every developer and are working on some of the most challenging and interesting problems in cloud computing.

Summary

As an engineer on the Platform team, you’ll architect the systems, software, and servers to keep our data centers running. You’ll build automation and systems management tools that make it easier to scale our rapidly growing business and deliver cloud infrastructure all around the world. You’ll also work on improving DigitalOcean’s systems performance and reliability. You should have a passion for systems, big data, network, and software engineering and you will work closely with other teams throughout the engineering organization.

Responsibilities
  • Top tier of escalation for infrastructure troubleshooting
  • Supporting / Engineering large mesos cluster
  • Write software to automate hardware provisioning, management, and other automatable tasks
  • Create new tools and improve on existing tools to help automate tasks for the Platform Team and other teams in the organization
  • Maintain and improve service quality and reliability across our complex set of services
  • Work closely with network, software, and cloud operations engineers to build efficient and economical systems
  • Communicate with vendors to debug and fix drivers and firmware
Ideal Candidate

What We'll Expect From You:

  • Expertise with one of the following languages: Go, Ruby, Python, C/C++
  • Understanding of big data technologies such as Mesos, Spark, Kubernetes
  • Experience with configuration management and hardware automation
  • Experience monitoring and debugging large, global, distributed systems
  • Experience in writing tools to manage 10,000+ physical servers
  • Passion for continual incremental improvements on tooling / processes
  • Outstanding written and verbal communication skills

Things We Would Look For In An Ideal Candidate:

  • Languages: Go, Python
  • Tools: Chef, Ansible, Prometheus, MySQL, KVM, libvirt, git
  • Big Data Experience: Mesos, Spark, HDFS, Presto
  • Infrastructure Bootstrapping: PXE, Live Images, Cobbler
  • Commitment to open source tooling and open source contributions
Compensation and Working Conditions
Benefits Benefits included

Additional Notes on Compensation

We offer competitive health, dental, and vision benefits for employees and their dependents, a monthly gym reimbursement to keep you fit, and a monthly commute allowance to make your trips to and from work easier.

Questions

There are no answered questions, sign up or login to ask a question

sign up or login to save this job and more
Cambridge, Massachusetts
Skills Desired
Sign up or login to see how your skills match up.
  • Big Data
  • C++
  • Debugging
  • Hardware
  • MySQL
  • Python
  • Ruby
  • Apache Spark
  • Automation
  • Git
  • Go
  • KVM
  • Software Engineering
  • Chef Software
  • Distributed Systems
  • Firmware
  • Ansible
  • libvirt
  • Cobbler
  • Apache Mesos
  • Preboot Execution Environment
  • Presto
  • Kubernetes
  • Hadoop Distributed File System
  • Systems Management
  • Configuration Management (CM)
  • Open Source
  • Prometheus
  • Bootstrapping

Want to see jobs that are matched to you?

DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.