Senior Site Reliability Engineer
TubeMogul
(Emeryville, California)TubeMogul is the global leader in software used by brands and agencies to plan, buy and measure their brand advertising. By reducing complexity, improving transparency and leveraging real-time data, our platform enables marketers to gain greater control of their videoadvertising spend.
We are seeking a Senior Site Reliability Engineer to help with the building and operation of our Real Time Analytics Platform. The operations team leverages some of the most cutting edge technology to simplify an otherwise complex environment.Candidates should have a passion for building infrastructure for high-performance, "Big Data" systems. In this role, you will leverage open-source tools like Zookeeper, Hadoop, HBase, Hive, and Couchbase. Candidates will at least be familiar with the technologies we are using but may not have had the opportunity to acquire deep experience in previous job settings. However, you should have a true passion for systems engineering that is apparent in your past work.
- Build tools to ease provisioning and scaling of TubeMogul Analytics infrastructure
- Monitor and improve service performance and stability
- Continuously extend and improve infrastructure components to handle growth
- Investigate failures and offer suggestions for future improvement
- Work closely with development teams to ensure that platforms are designed with "operability" in mind
- Assist our software engineering team to ensure proper monitoring and metrics are being built into the applications before going to production
Required Skills and Expertise:
- Must have a solid understanding of information technology and information security
- Desire to work in a fast paced environment
- Experience troubleshooting and deploying applications on Linux
- Experience in large scale monitoring and alerting tools such as Nagios, Ganglia, Graphite, Statsd, Skyline, Sensu
- Fluent with Configuration Management Tools like Puppet, Chef or Ansible
- At least one of : Perl, Python, Ruby
- Knowledge of TCP/IP, HTTP, DNS, LDAP, SSL, SSH, OpenVPN, SQL, IDS, IPS
Bonus skills:
- Java Programming Experience
- Background in building and operating a Real Time Analytics infrastructure based on technology like Kafka, Storm, Hadoop, HBase, Amazon EMR, Couchbase, Aerospike, Vertica.
- Experience with Amazon AWS (EC2, S3, EBS, EIP, VPC)
- Server Virtualization using Eucalyptus, OpenStack or CloudStack
Benefits | Benefits included |
---|
Additional Notes on Compensation
competitive compensation package including an equity component and excellent benefits.Benefits include: medical, dental, vision, 401K matching, company events and an extraordinary culture.
Questions
There are no answered questions, sign up or login to ask a question
- Big Data
- Hadoop
- HBase
- Information Technology
- Java
- Linux
- Perl
- Python
- Ruby
- SQL
- Troubleshooting
- Virtualization
- Amazon EC2
- Amazon S3
- Amazon Web Services
- Apache CloudStack
- Apache Kafka
- DNS
- LDAP
- Nagios
- OpenStack
- OpenVPN
- SSH
- SSL
- TCP/IP
- Vertica
- HTTP
- Chef Software
- Systems Engineering
- Ansible
- Ganglia Monitoring System
- Virtual Private Cloud
- Sensu
- StatsD
- Amazon Elastic Block Store
- Amazon Elastic MapReduce
- Puppet
- Information Security
- Storm
- Apache Zookeeper
- Skyline
- Aerospike
- Couchbase Server
- IPS
- Graphite
- Configuration Management Tools
- Eucalyptus

Want to see jobs that are matched to you?
DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.