We are looking for a passionate Data Engineer to join our Analytics team and own our analytics data architecture. You will collaborate with the rest of the Analytics team and company stakeholders (such as product, UX, sales, marketing, medical, etc) to identify data needs and solve them to support insights that inform key business decisions.
As part of the Analytics team, you will:
- Maintain and evolve our analytics data architecture
- Handle our data pipeline from Ingestion to Storage
- Align with our Data Analysts and other stakeholders throughout the organization on data needs and implement them
- Work with our analysts and Product teams to strive for impact for our users
- Own our event tracking platform (Snowplow in Spark-EMR setup)
- Be the guardian of data quality
- Work with engineering teams to define and implement new tracking events across all platforms
- Ensure interpretability of core data elements
- Own data modelling and data lineage across all data stores and tools
- Ensure data from different sources are connected in the right way to ensure tracking across media breaks correctly and allow for attribution
We are looking for somebody who has:
- 3+ years experience in a Data Engineer or comparable role
- Experience in setting up and maintaining (cloud-based) data collections/DWHs including data modeling
- Experience in building and optimizing (cloud-based) data pipelines
- Advanced working SQL knowledge and experience working with relational databases
- Experience with granular event tracking including the necessary cross-company alignment
- Excellent communication skills in English
- A passion for solving puzzles without knowing the picture
Ideally you also:
- Have a strong interest in healthcare and health technology
- Possess previous experience using AWS Redshift, AWS Lambda, AWS S3, AWS EMR and/or Snowplow
- Have experience with Python and/or JavaScript
- Are familiar with privacy and security standards like HIPAA or GDPR
- Have experience working in a DevOps environment