Senior Data Engineer

Menlo Park, California, United States · Engineering expand job description ↓


The Senior Data Engineer, focused on Software and Analytics, will report to the VP of Engineering and be part of a team responsible for the development of our analytical platform. You will help drive the architecture and implementation of a robust, highly performant, and scalable analytical system. The role will require close collaboration with our Bioinformatics team, and involves building data pipelines and relevant backend services. This is a hands-on position, providing the opportunity to build an intelligent, next generation genetic processing and analytical platform that will reimagine how Food Safety is done across the industry!

We offer great opportunities to grow your career! Be part of an exiting, fast growing startup in the biotech sector! You can learn more about us at



  • Design and develop a highly performant and highly scalable analytical system to collect, store, process and analyze genomic and food sample metadata, utilizing modern technologies.
  • Use GCP's features and apply AI/ML functionality to enhance our analytical capabilities.
  • Utilize DDD, Event Sourcing and CQRS to build Microservices and API's for the analytical system.
  • Suggest, assess and translate system requirements into implementation designs and data models.
  • Ensure excellent quality and test coverage, as well as effective performance.
  • Work on timely resolution of issues and other tasks relevant to the position.

Desired Skills and Experience Requirements:

  • MS in Computer Sciences
  • 5+ years of experience in a relevant software engineering position
  • 5+ years hands-on experience with Java/J2EE, Relational databases and NoSQL data stores
  • In-depth experience building scalable, distributed Analytical Systems
  • Proficient with Predictive Analytics, AI and Machine Learning
  • Experience using AWS or GCP
  • Comfortable with Linux environments


  • Experience building Microservices and API's
  • Experience with LIMS or related software systems
  • Understanding of Bioinfomatic Pipelines
  • Exposure to Genomics and NGS


Clear Labs offers solid Health Care, a 401k Plan, Food Catering, Massages, and an easy-going, open work environment.

Personal information
Your Profile
Application Details
5+ years Java/J2EE development experience
5+ years hands-on work with Relational databases, ORM concepts, as well as NoSQL data stores
3+ years hands-on experience with Big Data, e.g. High Volume, ETL and Data Pipelines, Streaming Data, Horizontal Scaling, Lambda Architecture, CAP Theorem, etc.
3+ years hands-on experience working with Data Analytics
1+ years hands-on experience using AWS or GCP
Implemented Java-based Microservices
Worked with Bioinformatic Pipelines
CQRS/Event-Driven framework experience
Implemented AI or Machine Learning features, e.g. using Torch, TensorFlow, Caffe, Airflow, etc.
Implemented ML or Predictive Algorithms, e.g. Naive Bayes, Random Forest, RNN, etc.
Implemented Message-based systems (RabbitMQ/PubSub)