IBM's MAMMAL model puts open AI drug discovery in sharper focus

IBM Research's MAMMAL is not a miracle cure machine. It is a serious sign that biotech AI is moving toward unified, open infrastructure for early drug discovery.

The important part of IBM Research's new MAMMAL model is not that it suddenly solves medicine. It does not. The important part is that it tries to bring several messy parts of biology into one model, then makes that model available for researchers to use.

That matters because drug discovery is still too fragmented. A team may use one system to think about protein sequences, another to screen small molecules, another to interpret gene-expression data and still another to assess antibody behavior. Each tool can be powerful, but the workflow is slow, expensive and full of translation gaps. MAMMAL is IBM's attempt to reduce some of that friction by treating these biological inputs as parts of the same computational language.

According to the npj Drug Discovery paper published on May 4, MAMMAL stands for Molecular Aligned Multi-Modal Architecture and Language. The model was trained on roughly 2 billion biological samples spanning protein and antibody sequences, small molecules and gene-expression profiles. Across 11 drug-discovery benchmarks, the researchers reported state-of-the-art performance on nine and competitive results on the remaining two.

The antibody result is the one that will attract the most attention. In antibody-antigen binding tests, fine-tuned MAMMAL prediction scores outperformed AlphaFold 3 confidence-score proxies on five of seven antigen targets. That does not mean MAMMAL is broadly better than AlphaFold 3, which is built around structural biology and has its own strengths. It means a model trained to align multiple biological modalities may be useful in places where binding likelihood, sequence context and task-specific fine-tuning matter.

For biotech founders, the bigger question is not whether MAMMAL wins a leaderboard. Benchmarks are useful, but they are not clinical validation. A model can perform well on curated tasks and still fail when biology becomes noisy, human and expensive. That is exactly why this story should be treated as infrastructure, not a cure announcement.

The promise is workflow compression. Early discovery involves a long chain of decisions: which target looks biologically meaningful, which molecules may interact with it, which candidates could be toxic, which antibodies are worth producing and which experiments should be run first. If a unified model can narrow those choices earlier, even imperfectly, it can change the cost structure of small biotech teams.

That is where AI-native startups may find an opening. Large pharmaceutical companies have data, lab capacity, regulatory experience and capital. Startups usually do not. What they can have is speed, focus and a willingness to build around new model architectures from the beginning. An open biomedical foundation model gives those teams a base layer they do not have to create from scratch.

IBM has made the pretrained MAMMAL model available for research use on Hugging Face, under the IBM biomedical model family. That matters because open access changes who can experiment. A small team working on antibody design or toxicity screening can test ideas without waiting for a proprietary platform vendor to expose the right feature or price the right contract.

Open models challenge the pharma platform playbook

The pharma AI market has been shaped by proprietary platforms. Companies raise large rounds, build private datasets, promise faster discovery and sign partnerships with drugmakers that want access without rebuilding the stack themselves. There is nothing wrong with that model. In a regulated industry, defensible data and repeatable lab execution are still serious advantages.

But open biomedical models put pressure on that logic. If more of the modeling layer becomes available to researchers, the scarce asset shifts. It is less about simply having a model and more about the quality of the data, the sharpness of the biological question, the assay design and the ability to close the loop between prediction and experiment.

That could be healthy for the market. It may reduce the tendency to treat every AI drug discovery platform as a black box. Investors and pharma partners can ask more direct questions: what does the model actually improve, where has it been tested, how does it perform outside benchmark data and what lab evidence supports the claim?

MAMMAL also shows why multimodal biology is becoming a practical frontier. A drug is not just a molecule in isolation. It interacts with proteins, cells, tissues and pathways. Gene-expression data can show how cells respond. Protein and antibody sequences carry different signals. Small molecules bring chemistry into the picture. The more these inputs can be aligned, the better the model can support decisions that resemble real discovery work.

The caveat is simple. No benchmark removes the need for wet-lab validation, clinical testing or regulatory proof. Drug candidates still fail at high rates because human biology is unforgiving. A model can help choose better starting points, but it cannot make uncertainty disappear.

Still, this is the kind of AI progress that should get attention. Not because it promises instant cures, but because it may make the early stages of discovery less wasteful. Watch whether startups begin building focused workflows on top of models like MAMMAL, especially in antibody design, toxicity prediction and target prioritization. The next advantage in biotech may come from teams that combine open foundation models with proprietary experimental feedback faster than incumbents expect.

Also read: AI layoffs are leaving founders with an operating debt problem • Anthropic says the AI race now turns on chips and enforcement • Calif showed how AI can compress Apple exploit work into days