Main Duties and Requirements
- Data capture, data cleansing, and data enrichment as a preparation step for bioinformatician/image analysis activities
- Track the flow of data within ETL (extract, transform, load) and analysis pipelines, ensuring successful processing and data validity
- Generation consolidation and re-formatting of image analysis readout tables resulting from different formats (csv., JSON etc.)
- Work together with bioinformaticians/ data scientists/image analysis specialists to identify optimal ways to prepare, curate and navigate their datasets
- Automating the way we currently capture data to save time, reduce errors and create high-quality datasets
- Developing and applying FAIR data principles
- Work with bioinformaticians/data scientist and data management teams to develop best practices in data wrangling/curation and storage experimental metadata
- Work with the software and information technology teams to specify, design, and implement the infrastructure for storing, searching, visualizing and integrating experimental datasets and results
Requirements
- Computer Science, Engineering, or Bioinformatics (Master level) plus ~3-5 years relevant experience
- Excellent programming skills (Python, R); flexibility and curiosity regarding working with different platforms and programming languages
- Strong expertise (~3 years) in data management, keying and linking, data handling and data wrangling
- Experience of working with metadata models, controlled vocabularies and ontologies
- Familiarity with data quality, cleaning and masking techniques
- An ability to interact with various data sources, both structured and unstructured and database systems (e.g. HDFS, SQL, noSQL)
- Curiosity for data science, biology, digital pathology and new data types
- Cross-functional mindset and strong stakeholder (and also conflict) mgmt. skills to connect different departments and drive setup of best practices and change within current data mgmt. activities
- Expertise with biological/health data, especially imaging data with focus on digital pathology is a plus
- Know-how of machine learning/data analytics is a plus
We are looking forward to your application!
Link: https://jobs.definiens.com/Definiens/search/