Data Engineer at Owkin | Torre
warning

Heads-up

The job you’re trying to post already exists in Torre:

Data Engineer

You'll engineer scalable data pipelines to power AI breakthroughs in biology and healthcare.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for United Kingdom residents)
Remote (for Germany residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
2 months ago

Requirements and responsibilities


About usOwkin is an AI company on a mission to solve the complexity of biology. It is building the first Biology Super Intelligence (BASI) by combining powerful biological large language models, multimodal patient data, and agentic software. At the heart of this system is Owkin K, an AI copilot and its new LLM fine-tuned on biology called Owkin Zero, used by researchers, clinicians, and drug developers to better understand biology, validate scientific hypotheses, and deliver better diagnostics and therapies faster.This position is remote and based in the UK or Germany.Please submit your CV in EnglishAbout the role:You will be part of the Engineering team. This role involves designing, building, and optimizing scalable ETL/ELT pipelines with Airflow to process complex datasets efficiently while ensuring reliability and performance. You will organize and structure data systems, aligning them with business objectives, and demonstrate expertise in scientific and healthcare information systems to deliver data products tailored for machine learning and AI research. Clear reporting and meticulous attention to detail are essential, as is the ability to manage high-volume, complex workstreams while prioritizing multiple deadlines. The role requires professional interpersonal skills to collaborate with diverse stakeholders in biotechnology and the ability to streamline production workflows for scientific processing and quality assurance.Organize and structure data systems at both macro and micro levels, designing and implementing data architectures that support business goalsOptimize data pipelines for performance, reliability, and scalabilityDesign, build, and maintain scalable ETL/ELT pipelines with Airflow to process large-scale, complex datasetsDemonstrate ability to delivery of of  data products  useful for machine learning and AI research and development (data models, metadata and semantics)Strong organizational skills to effectively manage high-volume, complex workstreams while prioritizing multiple deadlineDemonstrate knowledge of scientific and healthcare information systems and data sources and relevant software toolsDemonstrate ability to handle a variety of activities across operational delivery and development and initiativesDemonstrate professional interpersonal skills with ability to work both independently and collaboratively with a variety of stakeholders on complex biotechnology areas.Streamline the process of taking scientific processing and quality check in production, ensuring proper monitoring of the production workflows.In particular, you will:Design and optimizing data pipelines using AirflowDevelop robust solutions in Python and SQLDesign, develop, and operate scalable ETL/ELT pipelines to process and transform datasets.Work with cross-functional teams, including data scientists, business developers, software engineers and bio medical researchers to deliver high-quality data solutions.Manage and monitor containerized data infrastructures with Docker and Kubernetes and other cloud platforms.Implement and enforce best practices for data governance, security, and compliance.Build, optimize and maintain data architectures, including data lakes, data warehouses, and analytical InsightsProductionize the data processing pipelines, setting and enforcing standards and best practices across scientific teams to deliver high quality data in an efficient and scalable way.What we offerFlexible work organizationFriendly and informal working environmentOpportunity to work with an international team with high technical and scientific backgroundsRecruitment Process & SecurityPlease complete the form and submit your CV.Owkin is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, sex, gender, sexual orientation, age, color, religion, national origin, protected veteran status or on the basis of disability.Owkin is a great place to work. As a coveted workplace we are, unfortunately, vulnerable to recruitment phishing scams. We urge all job seekers and candidates to be wary of potential scams. Most of these have individuals posing as representatives of prominent companies, including Owkin, with the aim of obtaining personal, sensitive, or financial information from applicants. These scams prey upon an individual’s desire to obtain a job and can sometimes “feel” like a genuine recruitment process. Some red flags are identified below. Should you encounter a recruitment process that claims to be for Owkin but is not consistent with the below, please do not provide any personal or financial information:Legitimate Owkin recruitment processes include communication with candidates through recognized professional networks, such as LinkedIn.Communication is always through an official Owkin email address (from the @owkin.com domain), over the phone or through our applicant tracking system (Greenhouse).The Owkin talent team do use platforms such as LinkedIn and Job Teaser, however if you have any concern or doubt about this contact, please ask for them to send an email from @Owkin.com.The Owkin talent team will not solicit personal data from candidates during the application phase including, but not limited to, date of birth, social security numbers, or bank account information;Legitimate Owkin interviews may be conducted over the phone, in person, or via an approved enterprise videoconferencing service (Google Meets). They will not occur via Signal, Telegram or MessengerOwkin offers of employment are based on merit and only extended once a candidate has interviewed with members of the talent and hiring team. Offers will be extended both verbally and in written format.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.