Senior Python Data Engineer

Radius

(San Francisco, California)
Full Time
Job Posting Details
About Radius

Radius is a fast-growing, venture-backed startup in the heart of San Francisco. Radius applies advanced data science to deliver the freshest, most accurate, and most comprehensive view on 20M+ US companies—from small businesses to the largest enterprises. We build cutting-edge machine learning solutions that help our customers discover markets, acquire customers, and measure performance through an app that’s intuitive, secure, and enterprise-ready.

Summary

We're looking for an experienced Data Software Engineer to work on our Aggregation team. You'll develop highly scalable and robust code to process large data sets and will collaborate on a team of people with diverse expertise in designing and deploying distributed systems for information retrieval and large-scale data processing.

Responsibilities
  • Partner with the technical team to develop and improve robust applications to automatically extract, parse, and ensure the quality of data is consistent from many different sources
  • Leverage Big Data technologies such as Spark to process large data sets
  • Assess the root cause of problem reports, replicate the situation in a test environment, repair the code, and push it to production
  • Write testable, defensive, and production level code that can participate in a continuous deployment environment
  • Provide feedback and mentoring to other team members through code reviews, pair programming, etc.
  • Develop code to ease the creation of metric dashboards or reports, enabling non-technical users to monitor the data ingestion process
Ideal Candidate

Requirements:

  • 5+ years of software development experience
  • 2+ years of Python experience
  • Solid software engineering skills and experience, so the code is easy to reason about and easy to test
  • Detail-oriented mindset
  • Comfortable working in a remote Linux environment and has experience developing basic Shell scripts
  • Experience with Git or a similarly distributed revision control system

Bonus Qualifications:

  • Experience with Spark or Pyspark
  • Experience with Scrapy or a similar structured crawling framework
  • Familiarity with Natural Language Processing and Machine Learning

Questions

There are no answered questions, sign up or login to ask a question

sign up or login to save this job and more
San Francisco, California
Skills Desired
Sign up or login to see how your skills match up.
  • Big Data
  • Linux
  • Python
  • Revision Control
  • Software Development
  • Writing Shell Scripts
  • Apache Spark
  • Data Ingestion
  • Git
  • Machine Learning
  • Natural Language Processing
  • Scrapy
  • Web Crawling
  • Software Engineering
  • Distributed Systems
  • Pair Programming
  • Continuous Deployment
  • Information Retrieval
  • Pyspark
  • Dashboard

Want to see jobs that are matched to you?

DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.