About the team:
We are responsible for building the data platform, with a strong focus on simplicity and ease of use for all data-oriented users. We are a key enabler of our data strategy: make the right data easily available to everybody for high-quality decisions.
Our tech stack: Scala, Java, everything with Kafka (Streams, Connect, KSQL, …), Akka, Spark, Flink, Docker, K8s, Play, Slick, AWS services (e.g. EMR, S3), NewRelic, JUnit, ScalaTest, ScalaSpec
To support our ambitious growth, we are now looking for a Data Engineer (m/f/d) to join our team in Berlin, Germany, starting as soon as possible.
Your Tasks – Paint the world green
- You will be responsible for building a data platform for running big data workloads at scale, collecting and combining data from various sources, and helping data consumers use the data in our data lake.
- You manage the Customer Journey: full integration of web-tracking data with master data to understand customer and user behavior
- You are responsible for establishing a Data Catalogue: registering and documenting ingested, stored and processed data
- You improve the tool layer on top of the existing Messaging and Ingesting Service (e.g. generic loading of data into and offloading of data from Kafka; Kafka topic admin tools)
Your Profile – Ready to hop on board
- 5+ years of experience as a Data Engineer or Backend Developer
- Solid expertise in at least one programming language (preferably Scala or Java)
- Excellent SQL and data management knowledge
- Experience with big data technologies (Spark, Redshift, Hadoop)
- Practical experience with Amazon Web Services
Nice to have:
- Previous experience in the design and implementation of complex data pipelines (Luigi, Airflow, AWS Data Pipeline)
- Knowledge of the Python data science stack (Pandas, Scikit-learn, NumPy, Keras, etc.)
- Familiarity with machine learning
- Experience with Docker
Link: https://www.flixbus.com/company/jobs