AI Data Engineer--Peptides and Biologics
Pfizer
Position Summary
Our cross-functional Data Ecosystem Team is looking to identify a forward-deployed data and AI engineer to help build and scale an AI-ready data architecture supporting biologics labs. You will leverage your expertise to design innovative software solutions that extract valuable insights from Pfizer's proprietary data and external datasets, enabling the generation of testable hypotheses across the entire drug discovery value chain.
Position Responsibilities
- Development, support and implementation of a modern data platform to enable efficient and scalable correlation and analysis of data for biological drug modalities.
- Development of innovative data products and machine learning methods for biologics data together with machine learning experts within Pfizer.
- Processing, analysis and integration of internal in vivo pharmacodynamics and toxicology data sets.
- Curation and integration of relevant datasets from the public domain.
- Development of analysis pipelines.
- Development and roll out of data products to meet specific needs through data integration.
- Implementation, testing and validation of new methods for data analysis and visualization techniques.
- Drive collaborations with external companies and academic institutions.
- Develop Pfizer biologics data capture, metadata tagging and storage strategy along with Pfizer’s Digital organization.
- Onboarding of Pfizer colleagues to the data platform and organization of workshops, hackathons, trainings and scientific talks.
- Strengthen external visibility and scientific excellence by publishing and presenting work in reputable journals and at conference and workshop venues, and by engaging with the scientific community.
Basic Qualifications
- PhD in Biology, Chemistry, Physics, Statistics or a related technical discipline OR Master’s degree and 2+ years of experience building AI-powered research applications.
- Strong background in data handling, integration and analysis.
- Thorough understanding of drug discovery and biology with a particular focus on large molecule therapeutics such as peptides, siRNA, antisense, mRNA and antibodies.
- Research experience developing data products and data integration solutions, as well as a sincere interest in computational life sciences.
- Experience solving complex analyses/problems in a timely fashion.
- Exceptional programming skills in Python.
- Strong experience as a full-stack developer with a focus on Python, in-depth database expertise with a focus on PostgreSQL, and experience with ETL frameworks.
- Strong communication skills—verbal, written and presentation.
Preferred Qualifications
- Nextflow pipeline development.
- Proficiency in front-end technologies such as TypeScript and React, and in browser-based visualization techniques, is a plus.
- Proficiency in utilizing AI/ML libraries including PyTorch and Lightning.
- Experience with LLMs/RAG systems.
- Expertise in software engineering, package development, cloud architectures, CI/CD and software engineering tooling.
- Familiarity with pertinent libraries within the Python scientific stack.
- Hands-on experience handling, processing, integrating, and analyzing large heterogeneous data sets in a drug discovery research environment.
- Experience with Claude Code or equivalent and vibe coding paradigms.
- Strong publication record and demonstrated contributions to the field.
- Experience taking ideas from prototype to production.
Work Environment and Compensation
- Hybrid Role: Requires living within commuting distance and working on-site an average of 2.5 days per week or more as needed.
- Annual Base Salary: $106,000.00 to $171,500.00.
- Bonus & Incentives: Eligible for Pfizer’s Global Performance Plan (15.0% bonus target) and share-based long-term incentive program.
- Benefits: 401(k) plan with Pfizer Matching Contributions and Retirement Savings Contribution; paid vacation, holiday, personal days, caregiver/parental and medical leave; health benefits (medical, prescription drug, dental, and vision).
- Note: Salary range does not apply to Tampa, FL or locations outside the U.S. Relocation assistance may be available.
Sunshine Act
Pfizer reports payments and other transfers of value to health care providers as required by federal and state transparency laws. Certain recruiting expenses for licensed physicians may constitute a reportable transfer of value under the federal Sunshine Act. If you are a licensed physician, your name, address, and the amount of payments made will be reported to the government.
EEO & Employment Eligibility
Pfizer is committed to equal opportunity regardless of race, color, religion, sex, sexual orientation, age, gender identity/expression, national origin, disability or veteran status. Pfizer complies with all laws governing nondiscrimination and work authorization (E-Verify). Permanent work authorization in the United States is required.
For accessibility assistance or accommodation requests, email disabilityrecruitment@pfizer.com.
Information & Business Tech
A career at Pfizer offers opportunity, ownership and impact. Our colleagues work together to positively impact health for everyone, everywhere, within an ownership culture that values diversity and innovation to bring therapies to patients that significantly improve their lives.
Responsibilities:
The role involves developing and implementing a modern data platform for biologics labs and creating innovative data products and machine learning methods. Additionally, it includes collaborating with external companies and academic institutions to enhance data analysis and visualization techniques.
Requirements:
Candidates must have a PhD or a Master's degree with relevant experience in AI-powered research applications and a strong background in data handling and drug discovery. Exceptional programming skills in Python and experience with large molecule therapeutics are also required.
Education
- postgraduate degree
Source: workday