Senior Data Engineer
Princeton, United States
Job Description
- The Senior Data Engineer will collaborate with business analysts and data scientists to implement data-driven solutions for business challenges.
- Design, implement and manage ETL data pipelines that ingest vast amounts of commercial and scientific data from public, internal and partner sources into various repositories on a cloud platform (AWS).
- Enhance end-to-end workflows with automation that rapidly accelerate data flow with pipeline management tools.
- Implement and maintain databases for raw and processed commercial and scientific data.
- Innovate and advise on the latest technologies and standard methodologies in Data Engineering and identify software solutions that can address hurdles in data enablement.
- Develop and maintain accurate and reliable data pipelines using Python and SQL.
- Define and contribute to data engineering practices for the group, establish templates and frameworks, and determine the best usage of specific cloud services and tools.
- Transform data using Python on challenges varying from complex aggregations, wrangling, quality control, and calculations.
- Collaborate with data scientist leads to determine best-suited data enablement methods to optimize the interpretation of the data, including creating presentations and leading tutorials on data usage.
- Participate in code reviews and contribute to best practices in data engineering. Apply value-balanced approaches to the development of the data ecosystem and pipeline initiatives.
- Proactively communicate data ecosystem and pipeline value propositions to partnering scientific collaborators.
- Share expertise of Python and related libraries throughout the group.
- This position requires only little domestic travel.
- Position allows working from home within commuting distance of worksite location.
Job Requirements
- Requires at least a Bachelor’s degree or foreign equivalent in Information Technology, Bioinformatics, or a closely related field.
- Employer will accept candidates with any suitable combination of education and experience.
- Must possess at least 5 years of experience with each of the following: (a) developing and managing large-scale ETL data pipelines on AWS; (b) developing and maintaining data pipelines using Python and SQL; (c) designing and implementing ETL solutions for processing data using Kinesis; (d) utilizing all of the following: AWS Glue; AWS Step Functions; CI/CD; Redshift; Lambda; Docker; Linux Shell Scripting; Pandas; Pyspark; Numpy; software development methodologies; and Batch and Elastic Load Balancing.
- This position requires only little domestic travel. Position allows working from home within commuting distance of worksite location.
For US based candidates, the proposed salary band for this position is as follows:
$0.00---$0.00
The actual salary offer will carefully consider a wide range of factors, including your skills, qualifications, experience, and location. Also, certain positions are eligible for additional forms of compensation, such as bonuses.
About You
- You are passionate about our purpose and genuinely care about our mission to transform the lives of patients through innovative cancer treatment
- You bring rigor and excellence to all that you do. You are a fierce believer in our rooted-in-science approach to problem-solving
- You are a generous collaborator who can work in teams with diverse backgrounds
- You are determined to do and be your best and take pride in enabling the best work of others on the team
- You are not afraid to grapple with the unknown and be innovative
- You have experience working in a fast-growing, dynamic company (or a strong desire to)
- You work hard and are not afraid to have a little fun while you do so
Locations
Genmab leverages the effectiveness of an agile working environment, when possible, for the betterment of employee work-life balance. Our offices are designed as open, community-based spaces that work to connect employees while being immersed in our state-of-the-art laboratories. Whether you’re in one of our collaboratively designed office spaces or working remotely, we thrive on connecting with each other to innovate.
Apply
Career Focus: Biostatistics/Bioinformatics, Data Management/Data Science, Engineer
Similar Jobs
Director, Clinical Data and Systems
The Director, Clinical Data and Systems is responsible for overseeing Da...
Principal Biostatistician
Novocure is a global publicly-traded commercial-stage oncology company d...
Sr Bioinformatician
CareDx is seeking a talented bioinformatician to join the Data and AI gr...
Director, Biostatistics
At Ultragenyx, we fundamentally believe that taking real impactful actio...