Screening the Enamine Real Database at Recursion


  Public Announcement
  Blog Post

Technologies

  • HPC / SLURM
  • PyTorch

Summary

After acquiring Cyclica in 2023, Recursion set out to perform the largest structure-based virtual screen ever attempted: evaluating 36 billion virtual compounds from the Enamine Real database against roughly 100 protein targets. I was part of Cyclica at the time of the acquisition and later joined the team at Recursion that helped turn this vision into reality. Using the MatchMaker model, Recursion generated over 2.8 quadrillion protein–ligand predictions, ultimately producing AI-curated, synthesis-ready libraries. This project brought together deep learning, structural biology, and hyperscale compute to push the boundaries of virtual drug discovery.

What did I do?

  • Tested configurations across 63 NVIDIA DGX H100s to maximize throughput and minimize latency.
  • Tuned GPU workloads on BioHive-1, Recursion’s on-prem super computer.
  • Ensured that billions of compound–target pairs could be screened efficiently.
  • Celebrated in a LinkedIn Post! 🎉
Back to Top ↑