Designing Big Data pipelines in line with good practices like IaaC, High Availability and Security in mind
Data collection, curation and maintenance for existing data stack and novel databases.
Analysis, benchmarking of the existing models in analytics stack.
Development of novel computational models on antibody drug discovery.
Research into antibody biology and their therapeutic context.
Liaising with clients from the industry.
Requirements
Expertise in handling large datasets, preferably (e.g. Next Generation Sequencing, Proteomics, Protein Structures).
Programming skills in Python and tools designed for Big Data processing (terabytes of data) like Spark, Apache Airflow
Experience in designing cost efficient data pipelines using AWS tools like AWS EMR, AWS Glue, Step Functions etc.
Knowledge of IaaC tools like Terraform or CloudFormation
A high level of self-discipline - as a Data Engineer you will be responsible for making meaningful decisions about project’s course based on your insights
Full proficiency of English is mandatory
Nice to have:
A Master level degree in computer science, statistics, datascience, bioinformatics or similar. PhD would be a strong plus.
Prior work in Immunoinformatics is a strong plus.
Hands-on expertise in applied statistical methods – knowledge of machine learning is a plus.
Ta rekrutacja prowadzona jest w serwisie zewnętrznym. Po kliknięciu powyższego przycisku zostanie wczytana strona rekrutera na której można kontynuować proces rekrutacji.