ml-guided-excipient-selection

De-Risk Biologic Formulation with ML-Guided Excipient Selection

De-Risk Biologic Formulation with ML-Guided Excipient Selection

De-Risk Biologic Formulation with ML-Guided Excipient Selection

18.07.2025

6

Minutes

Leukocare Editorial Team

18.07.2025

6

Minutes

Leukocare Editorial Team

Struggling with complex biologic formulation and aggressive timelines? Traditional trial-and-error wastes valuable resources. Discover how ML-guided excipient selection offers a smarter, data-driven path to stability.

Menu

1. Current Situation

2. Typical Market Trends

4. How Leukocare Can Support These Challenges

5. Value Provided to Customers

6. FAQ

The Smart Path to Stability: De-Risking Biologic Formulation with ML-Guided Excipient Selection

For any Director of CMC or Drug Product Development, the path to a stable, effective, and commercially viable biologic has a lot of challenges. The pressure is immense: timelines are aggressive, budgets are tight, and the molecule itself is often a complex puzzle. The traditional formulation development process, often relying on extensive trial-and-error, can feel like a bottleneck, eating up precious time and valuable drug substance. What if we could approach this critical stage with more foresight and precision? What if we could use data to guide our choices, making the process smarter, not just harder?

1. Current Situation

The current reality for many in drug development is a high-stakes balancing act. Biologics are unstable. They can degrade through a range of physical and chemical pathways, from aggregation and precipitation to deamidation. Finding the right combination of excipients to stabilize the active pharmaceutical ingredient (API) is a key to successful formulation. [1, 2]

This process often begins with pre-formulation studies to understand the molecule's specific weaknesses. [1] This can take a lot of resources, especially with limited material available in the early phases. The pressure from the boardroom to move quickly towards a Biologics License Application (BLA) means there is little room for missteps. [4]

2. Typical Market Trends

The biopharmaceutical market is not standing still. Several key trends are adding new layers of complexity to formulation development:

  • High-Concentration Formulations: There's a big push to develop high-concentration protein formulations, often over 100 mg/mL, so they can be given subcutaneously. This is more convenient for patients and can lower healthcare costs, but it also makes stability challenges like aggregation and high viscosity worse. [6]

  • New Modalities: The industry is moving beyond traditional monoclonal antibodies to more complex structures like viral vectors, antibody-drug conjugates (ADCs), and RNA-based therapies. These new modalities bring unique stability issues and often need new formulation strategies. [5, 7, 8]

  • Accelerated Timelines: The demand for faster drug development is relentless. Companies are always looking for ways to shorten the journey from lab to market, which puts huge pressure on CMC teams to deliver a strong formulation quickly. The cost of development is a big factor, with estimates for an originator biologic reaching up to $800 million over 8 to 10 years. [9, 10]

3. Current Challenges and How They Are Solved

These market trends create specific, real challenges for formulation teams. The high viscosity of concentrated solutions can make injections difficult and painful. New modalities may not respond to traditional stabilization techniques, forcing teams back to the drawing board. And the sheer number of potential excipient combinations makes extensive experimental screening impractical. [6]

To cope, teams often use platform approaches, relying on established formulation recipes that worked for similar molecules before. While this can save time, it's not always the best solution and might not handle a new molecule's unique issues. Design of Experiments (DoE) is a more systematic approach, but even this can be limited by how much drug substance is available for testing. [4]

The big question is: how do you smartly narrow down all the possibilities to find the most promising formulation candidates? This is where a change in thinking, and technology, is needed. [12, 5]

4. How Leukocare Can Support These Challenges

This is exactly where data-driven, machine learning (ML)-guided approaches can really make a difference. Instead of just relying on intuition or platform recipes, ML models can analyze huge amounts of data to predict which excipients are most likely to stabilize a specific molecule.

At Leukocare, we've built a smart formulation platform that brings AI and predictive modeling into the development process. Here’s how it works: [4, 13]

  • Data-Driven Predictions: Our algorithms learn from vast datasets from past formulation projects. By analyzing a new molecule's properties, the system can predict optimal excipient combinations, tailored to that specific protein and its target product profile. This allows us to move beyond generic solutions and design formulations with a higher chance of success from the start. [4, 13]

  • Targeted Experimentation: The ML-guided approach doesn't get rid of the need for wet-lab experiments, but it makes them much more efficient. Instead of wide, speculative screening, we can do smaller, focused studies to check the model's predictions. This saves a lot of time and, importantly, saves valuable drug substance. A Bayesian optimization algorithm, for instance, has been shown to identify an optimal formulation in less than a third of the experiments required by a classical DoE method.

  • Strategic Partnership: We see ourselves as a strategic partner, not just someone who executes tasks. For a virtual biotech feeling the pressure, this means a proactive partner who offers strong opinions and deep scientific expertise. For a mid-size biotech facing a new challenge, we can step in to solve a specific problem, like lyostability, without messing up existing workflows. We provide the data-backed insights you need to make confident decisions and build a strong CMC story for investors and regulators. [12, 5]

5. Value Provided to Customers

The value of an ML-guided approach is about making the development process less risky and getting your product to the clinic and market faster. For our customers, this gives you several key benefits:

  • Faster Timelines: By cutting down on time spent on formulation screening, we help you get to a stable, BLA-ready formulation faster.

  • Reduced Material Consumption: Our targeted approach means less of your valuable drug substance is used in formulation development, which can save a lot of money, especially in early phases.

  • Data-Informed Confidence: Our claims are backed by data. We provide the structure, speed, and substance needed for a reliable development process, giving you confidence in your formulation strategy and a stronger position in discussions with investors and regulators. We don't just deliver a formulation; we deliver the data and the rationale behind it. [14]

Biologic formulation has real challenges, but there are also great ways to solve them with smarter, more predictive tools. By using an ML-guided approach, we can move away from the limits of traditional methods and build a more efficient, reliable, and successful path forward for the next generation of therapies.

6. FAQ

Q1: How does an ML-based approach to excipient selection actually work?

Our machine learning algorithms are trained on a large dataset of protein formulations with proven long-term stability data. The model learns the complex relationships between a protein's properties (like its sequence and structure) and the types of excipients that best stabilize it. When we have a new molecule, we input its specific characteristics, and the model generates predictions on the likelihood that certain excipients will create a stable formulation. This gives us a highly informed starting point for experimental work. [4, 13]

Q2: Will this replace the need for experimental formulation work?

No, it complements and focuses it. Predictive modeling helps us to drastically narrow the experimental design space. Instead of conducting large, open-ended screening studies, we can design smaller, more targeted experiments to confirm and refine the model's predictions. This makes the entire process faster and requires much less material. [12, 5]

Q3: We are a small biotech with very limited drug substance. How can this approach help? [12, 5]

This is an ideal scenario for an ML-guided approach. The high cost and limited availability of drug substance is a major constraint in early development. By using predictive models to guide excipient selection, we minimize the amount of material needed for formulation screening, ensuring that your valuable API is used as efficiently as possible. [4]

Q4: Our molecule is a new modality, not a standard mAb. Can your system handle that?

Yes. Our approach is designed to be adaptable. While many existing datasets are based on monoclonal antibodies, the principles of protein stabilization are transferable. As we work with more diverse modalities, such as viral vectors or ADCs, the models continuously learn and improve. We can address specific challenges associated with new modalities, providing tailored formulation designs rather than relying on standard templates.

Q5: How does this help us with regulatory submissions?

A data-driven approach strengthens your entire CMC package. We provide not just the final formulation but also a clear, rational justification for why specific excipients were chosen, all supported by predictive modeling and targeted experimental data. This demonstrates a deep understanding of your product and its stability characteristics, which is exactly what regulators want to see. [10]

Literature

  1. europeanpharmaceuticalreview.com

  2. evidentic.com

  3. nih.gov

  4. pharmaexcipients.com

  5. youtube.com

  6. bioprocessingsummit.com

  7. nih.gov

  8. pharmasalmanac.com

  9. drug-dev.com

  10. bioprocessonline.com

  11. gabionline.net

  12. pharmaexcipients.com

  13. biorxiv.org

  14. probiocdmo.cn

  15. acs.org

Further Articles

Further Articles

Further Articles