Senior Research Scientist, Machine Learning (BioFM)
Software Engineering, Data Science
Toronto, ON, Canada
CAD 175k-200k / year + Equity
About Us
Deep Genomics is at the forefront of using artificial intelligence to transform drug discovery. Our proprietary AI platform decodes the complexity of RNA biology to identify novel drug targets, mechanisms, and therapeutics inaccessible through traditional methods. With expertise spanning machine learning, bioinformatics, data science, engineering, and drug development, our multidisciplinary team in Toronto and Cambridge, MA is revolutionizing how new medicines are created.
Opportunity
We are seeking an exceptional and creative Senior/Staff Machine Learning Scientist to lead and innovate within our core AI research team, specifically focusing on the creative building of Biological Foundation Models (BioFMs). You will pioneer novel deep learning architectures and pre-training paradigms that learn the fundamental language of the genome and cellular biology. Rather than just applying out-of-the-box ML to biological datasets, you will design the next generation of BioFMs from tackling complex -omics data at scale. If you are a first-principles thinker excited to bridge advanced ML with genome biology to solve high-impact, frontier problems in human health and drug discovery, this is a unique opportunity.
Key Responsibilities
- Lead the creative research, architecture design, and training of Biological Foundation Models (BioFMs), on massive-scale genomic, transcriptomic, and single-cell datasets.
- Collaborate closely with computational biologists and drug developers to integrate deep biological priors directly into model architectures and training objectives, ensuring our BioFMs capture fundamental and scientifically meaningful representations.
- Rigorously implement, train, debug, and evaluate large-scale models to demonstrate scientific validity and drive progress on frontier problems in human health and genetic medicines.
- Stay current with advancements in machine learning and computational biology research, identifying cross-disciplinary applications to solve real-world challenges.
- Mentor junior scientists and engineers, fostering a culture of technical excellence and scientific curiosity through leadership and high-quality code review.
- Share research findings through internal presentations and contribute to the scientific community via publications in top-tier venues.
Basic Qualifications
- PhD (or evidence of equivalent level of expertise) with a strongly distinguished research focus in Computational Biology, Machine Learning, Computer Science, or a related quantitative field.
- Deep understanding of modern deep learning and the creative building of foundation models, including CNNs, Transformers, and related sequence models (e.g., state-space models) specifically tailored for biological or genomic sequence data.
- A demonstrated track record of building and scaling AI models for complex biological datasets (e.g., single-cell genomics, DNA/RNA sequences) from initial conception to production.
- Proven ability to implement, train, and debug highly-performant deep learning models using frameworks like PyTorch.
- Experience working with massive datasets and a deep understanding of the engineering and algorithmic challenges associated with scale.
- Excellent communication skills, capable of discussing complex ideas seamlessly with both ML engineers and biological domain experts.
Preferred Qualifications
- A strong track record of impactful research demonstrated through first-author publications in high-impact scientific journals (e.g., Nature, Science, Cell) or top-tier ML/CompBio conferences (e.g., NeurIPS, ICML, ICLR, ISMB, RECOMB).
- 2+ years of relevant post-graduate experience at a leading industrial R&D lab or in a highly competitive academic environment building genomics AI.
- Experience technically leading projects or mentoring junior researchers/engineers.
- Proficiency with cloud computing platforms (e.g., GCP) for large-scale model training and experimentation.
- Contributions to open-source projects demonstrating the ability to solve complex research problems in ML or computational biology.
What We Offer
- A collaborative and innovative environment at the frontier of computational biology, machine learning, and drug discovery.
- Highly competitive compensation, including meaningful stock ownership.
- Comprehensive benefits - including health, vision, and dental coverage for employees and families, employee and family assistance program.
- Flexible work environment - including flexible hours, extended long weekends, holiday shutdown, unlimited personal days.
- Maternity and parental leave top-up coverage, as well as new parent paid time off.
- Focus on learning and growth for all employees - learning and development budget & lunch and learns.
- Facilities located in the heart of Toronto - the epicenter of machine learning and AI research and development, and in Kendall Square, Cambridge, Mass. - a global center of biotechnology and life sciences.