Senior Bioinformatics Data Engineer
Utrecht, Netherlands
Job Description
As Senior Bioinformatics Data Engineer you will contribute to the mission of the global data engineering function and be responsible for many aspects of data including architecture, access, classification, standards, integration, pipelines and visualization. Although your role will involve a diverse set of data-related responsibilities, your expertise will be on automated processing of mostly biological research data for the Discovery Department and, particularly, for the Discovery Data Scientists. You will leverage your expertise in pipeline development with scientific data objects to model and catalog large amounts of data with corresponding metadata layers.
You will work closely with data scientists to determine what metadata will be required to retrieve data and how to capture the information in an automated way. Your ultimate goal will be to place data at the fingertips of stakeholders and enable science to go faster. You will join an enthusiastic, fast-paced and explorative global data engineering team.
The Data Products team supports Genmab's mission by helping researchers use data at its full potential! Particularly, the Utrecht team supports the Discovery department with the ingestion, flow, and processing of biological and operational data. We work closely with researchers, managers, IT staff and data scientists to find solutions together that fit Genmab’s data needs.
The Data Products team is spread between Princeton (USA), Copenhagen (Denmark) and Utrecht (The Netherlands). This position would be joining the eight data engineers currently working in Utrecht (2-3 days onsite expected).
Responsibilities
- Design, develop and deploy reproducible data pipelines using cloud-native tools. All our pipelines use infrastructure as code, have automated tests and are as re-usable and reproducible as possible.
- Help design, maintain and advice on the use of graph databases.
- Connect with collaborators (scientists, project managers, etc.) to translate their needs and questions into technical requirements. We then use the requirements to build data pipelines and visualizations that are meaningful, comprehensible, and practical for them.
- Lead and propose solutions for assigned projects. Contributions to other projects is also expected.
- Generate comprehensive documentation of the data products developed, both for technical and non-technical users.
- Promote good (coding/data) practices and lead by example.
Requirements
- MSc in Computer Science, Bioinformatics, or related field and 6+ years of demonstrated working experience as a data engineer or, alternatively, a PhD in a relevant area plus 3+ years of experience.
- Solid experience with graph database design and querying. Knowledge about ontologies is advantageous.
- Experience with data pipeline design and creation. The pipelines should use good coding practices and the right tool for the job. Experience with ETL jobs (e.g. AWS Glue, Databricks jobs, AWS Lambda) and orchestrators (e.g. AWS StepFunctions) is desirable.
- Experience in database design (partitions, schemas, choosing database type, etc.) and querying languages (SQL, pyspark or similar) is a requirement. Experience with delta lake (delta tables) is a plus.
- Strong experience writing Python code (including OOP, automated testing, etc.). Experience using R is a plus.
- Knowledge of FAIR principles and GXP rules for data handling is also advantageous but not rigorously required.
- Although understanding biological data (experimental and clinical data) is not a strong requirement, it could make the candidate more efficient in the job.
- Experience using version control system (git) in collaborative projects is required. Knowledge in CI/CD pipelines is an advantage.
- Needs good communication skills in the English language, which is the primary language spoken at Genmab.
About You
- You are passionate about our purpose and genuinely care about our mission to transform the lives of patients through innovative cancer treatment
- You bring rigor and excellence to all that you do. You are a fierce believer in our rooted-in-science approach to problem-solving
- You are a generous collaborator who can work in teams with diverse backgrounds
- You are determined to do and be your best and take pride in enabling the best work of others on the team
- You are not afraid to grapple with the unknown and be innovative
- You have experience working in a fast-growing, dynamic company (or a strong desire to)
- You work hard and are not afraid to have a little fun while you do so
Locations
Genmab leverages the effectiveness of an agile working environment, when possible, for the betterment of employee work-life balance. Our offices are designed as open, community-based spaces that work to connect employees while being immersed in our state-of-the-art laboratories. Whether you’re in one of our collaboratively designed office spaces or working remotely, we thrive on connecting with each other to innovate.
About Genmab
Genmab is an international biotechnology company with a core purpose guiding its unstoppable team to strive towards improving the lives of patients through innovative and differentiated antibody therapeutics. For more than 20 years, its passionate, innovative and collaborative team has invented next-generation antibody technology platforms and leveraged translational research and data sciences, which has resulted in a proprietary pipeline including bispecific T-cell engagers, next-generation immune checkpoint modulators, effector function enhanced antibodies and antibody-drug conjugates. To help develop and deliver novel antibody therapies to patients, Genmab has formed 20+ strategic partnerships with biotechnology and pharmaceutical companies. By 2030, Genmab’s vision is to transform the lives of people with cancer and other serious diseases with Knock-Your-Socks-Off (KYSO™) antibody medicines.
Established in 1999, Genmab is headquartered in Copenhagen, Denmark with locations in Utrecht, the Netherlands, Princeton, New Jersey, U.S. and Tokyo, Japan.
Apply
Career Focus: Biostatistics/Bioinformatics, Data Management/Data Science, Engineer
Similar Jobs
Director, Clinical Data and Systems
The Director, Clinical Data and Systems is responsible for overseeing Da...
Principal Biostatistician
Novocure is a global publicly-traded commercial-stage oncology company d...
Sr Bioinformatician
CareDx is seeking a talented bioinformatician to join the Data and AI gr...
Director, Biostatistics
At Ultragenyx, we fundamentally believe that taking real impactful actio...