Collaborating with NVIDIA on biological foundation models

Authored by
Tahoe Team
Released on
January 13, 2025
Authored by
Tahoe Team
Released on
January 13, 2025
Summary

A month ago we, at Tahoe (formerly Vevo) announced that we have generated the Tahoe-100M dataset, a drug-perturbed, single-cell atlas larger than all public data combined, in collaboration with our friends and partners Parse Biosciences and Ultima Genomics.

Today we are announcing that we will fully open-source Tahoe-100M in February and that we are starting a collaboration with NVIDIA Healthcare to train disease-relevant foundation models of human cell on this and other Tahoe-generated data

Open sourcing a dataset of this magnitude is a momentous step towards creating a more open and collaborative community in biological research, which can ultimately help us design better therapeutics for patients.

Thank you NVIDIA for supporting this historic step. We hope others will follow suit.