Lead, Systems Engineering & Operations
Turn
(Redwood City, California)
Turn delivers real-time insights that transform the way leading advertising agencies and enterprises make decisions. Our digital advertising hub enables audience planning, media execution, and real-time analytics from a single login, and provides point-and-click access to more than 150 integrated marketing technology partners.
- Lead by example the operations engineers responsible for production monitoring and support of critical infrastructure
- Work with TechOps management and the team to create high-level roadmaps and team strategy
- Serve as the technical escalation point for critical production issues and drive escalation and resolution of problems. A “Full Stack” perspective and expertise are highly valued
- Drive requirements for automation and tooling needs as well as other cross-functional business priorities, including helping define, develop and maintain monitoring tools and automation systems within the team
- Define server sizing and keep up with the newest server, networking and storage hardware technologies
- Lead collaboration with NOC, Datacenter Operations, Security Operations and Data Infrastructure teams to achieve well-orchestrated infrastructure operations and high reliability of the Turn Platform
- Become proficient in understanding how each software component and the system/Hadoop/database design and configuration are linked together to form an end-to-end solution
- Plan system and network maintenance while minimizing impact on the production environment
- Perform periodic on-call duty as part of the rotation, maintaining the availability and performance of the Turn Platform
- Share ownership of the following infrastructure components: Puppet, Docker, Mesos/Marathon, OpenStack, Nagios/Icinga/Thruk, OpenTSDB, Logstash/Flume, Elasticsearch, Kibana/Grafana, ZooKeeper, Kafka and other core system services
- 7+ years of relevant work experience, or a BA/BS degree in CS, Systems Administration or a related field
- A strong background in internet service deployment, provisioning, IP networking, service infrastructure, and software deployments
- Strong Linux systems administration skills (we use CentOS)
- Experience with configuration management tools such as Puppet or Chef
- Strong organization and multi-tasking abilities, and solid verbal and written communication skills
- Proven ability to quickly learn and implement unfamiliar technologies
- Advanced knowledge of Linux, TCP/IP and web services
- Proficiency in Python or Ruby for developing automation tools
- Troubleshooting skills that range from diagnosing low-level hardware problems to large-scale failures within datacenter clusters
- Solid experience with ITIL v3 methodologies and practical ways of implementing them
Preferred Qualifications
- Experience with medium- to large-scale distributed Unix/Linux systems administration and performance tuning in a latency-sensitive production environment
- Experience with OS hardening, security and compliance processes, and security tools
- Experience with cloud orchestration and private/public cloud management (SaltStack, OpenStack, AWS, Google Cloud)
- Experience with MongoDB, Redis, CouchDB or Elasticsearch is a plus
- Experience with Hadoop is a plus
- Prior Java development experience is a plus
