Implementation Details
Target Identification with PandaOmics
Insilico's implementation began with PandaOmics, a deep learning platform analyzing vast datasets including genomics, transcriptomics, and literature to identify TNIK (TRAF2 and NCK-interacting kinase) as a novel IPF target. Traditional methods miss such targets; AI scored it highly for expression in fibroblasts and druggability, reducing target validation time from years to months.[1][4]
The model integrated transfer learning from protein-protein interactions and pathology data, achieving superior prioritization over rule-based approaches.
Generative Molecule Design via Chemistry42
Next, Chemistry42 generated de novo small molecules using generative adversarial networks (GANs), variational autoencoders, and reinforcement learning. Starting from TNIK structure, it produced over 400,000 unique scaffolds optimized for potency, selectivity, and ADMET properties like solubility and liver toxicity—properties predicted with deep neural networks trained on millions of compounds.[3][5]
Lead candidate ISM001-055 (Rentosertib) emerged after multi-objective optimization, scoring high in virtual assays simulating fibrosis models. This phase took weeks, versus years for medicinal chemistry iterations.
Preclinical Simulation and Validation with InClinico
InClinico employed graph neural networks and transformers to simulate pharmacokinetics, efficacy in humanized models, and safety endpoints. Predictions guided synthesis of top 100 candidates, with 80% matching in vitro results, minimizing lab costs.[2] Wet-lab confirmation showed potent TNIK inhibition (IC50 <1nM), anti-fibrotic effects in IPF organoids, and favorable PK in rodents/non-human primates.
Clinical Trial Acceleration
IND filed in November 2021, just 21 months post-project start (Feb 2020). Phase I (2022) confirmed safety in 48 healthy volunteers; Phase II launched July 2023 in China/US for IPF patients, evaluating efficacy via FVC and imaging.[1][6] By 2025, Rentosertib received USAN naming, signaling progress. NVIDIA GPUs powered training, enabling scalability.[5]
Challenges like AI hallucination in molecule feasibility were overcome via hybrid loops: AI proposals synthesized and fed back for retraining. Regulatory buy-in came from transparent data packages and partnerships (e.g., Sanofi). This end-to-end AI pipeline compressed timelines by 80%, positioning Insilico with 30+ programs.[7]