(Senior) Data Engineer - Data Science Team at idealo internet GmbH (Berlin, Germany)
The position offers the opportunity to work on a wide range of interesting topics, from operationalizing deep learning models to training recommender systems on petabytes of data. As part of the data science team, you will also be given a lot of responsibility to shape the direction of the team. If you would like to become part of this success story, please send your application.

About your new role

- You will be part of the data science team and work closely with our data scientists to operationalize machine learning pipelines
- You will develop and implement effective data processing architectures
- You will also collaborate closely with the data warehouse and data platform teams
- You will participate in meetups, conferences and the research community, and apply what you've learned in your daily work

Skills & Requirements

- A deep understanding of distributed computing frameworks such as Spark (particularly SparkML, SparkSQL, and tuning, optimizing and debugging Spark jobs), Hadoop and/or Flink
- Experience with big data on AWS, in particular using EMR and S3
- Experience with Docker and container orchestration like Kubernetes, Swarm or similar
- Experience with pipeline management tools like Airflow, Luigi or NiFi
- Experience with programming languages such as Python, Go and/or Scala
- Good knowledge of SQL/RDBMS
- Experience with the command line, shell scripting and version control (Git)
- Excellent communication skills in English, both oral and written; German is nice to have
- Preferably experience with automated configuration management tools like Terraform and Puppet
- Preferably experience with modern agile software development practices like microservices, test-driven development, pair programming, CI/CD etc.