“Arc Institute today launched the Arc Virtual Cell Atlas, a growing resource for computation-ready single-cell measurements, starting with data from over 300 million cells. The initial release of the Atlas is Arc’s first step toward assembling, curating, and generating large-scale cellular data to fuel new insights from AI-driven biological discovery.
The Atlas debuts with two foundational datasets, both of which will be publicly available starting February 25, 2025. The first is a new, open source, perturbation dataset called Tahoe-100M, created by Vevo Therapeutics, comprising 100 million cells and mapping 60,000 drug-cell interactions across 50 cancer cell lines. The second dataset, scBaseCamp, is the first single-cell RNA sequencing dataset from public data to be curated and reprocessed at scale using AI agents. Arc mined observational data from more than 200 million cells representing 21 different species sourced from public repositories, and processed them to a standardized form.”
From Arc Institute.