Data Scientist / ML Engineer
Topia Life Sciences, India
- Designed and maintained large‑scale data pipelines to ingest and process biomedical data from clinical trials, PubMed, and research repositories.
- Engineered a production‑grade knowledge graph with 88K+ nodes and 824K+ edges to model drug relationships, improving link‑prediction accuracy from 65% to 88%.
- Implemented graph algorithms and predictive models that identified 9 viable drug‑repurposing candidates over a 10‑month period.
- Built an automated Retrieval‑Augmented Generation (RAG) system processing 4M+ PubMed papers and 100K+ clinical trials, reducing expert research time from weeks to hours and to ~20 hours per query.
- Enhanced explainability of model outputs using classical graph traversal techniques, enabling transparent, research‑grade decision support.