Computational Biology & Machine Learning
👋 Welcome! I’m Mahbuba Tasmin, a Ph.D. candidate in Computer Science at the University of Massachusetts Amherst, advised by Prof. Anna Green. I work in the SAGE Lab (Statistical and Genomic Evidence Lab), where my research bridges machine learning, computational biology, and antibiotic resistance genomics.
My current focus is on building interpretable, biology-grounded models for predicting drug resistance in Mycobacterium tuberculosis — combining sequence-based deep learning, evolutionary augmentation, and causal variant discovery.
I’m particularly interested in:
- Genomic and protein-based ML for resistance prediction
- Causal interpretability and structural biology
- Cross-species learning and data augmentation
- Benchmark dataset design for biological ML models
🔬 Research Highlights
- BIG-TB Benchmark (17K isolates, 11 drugs): Developing a unified dataset and evaluation framework for resistance prediction across modalities.
- Resistance Forecast Project: Integrating structural, evolutionary, and machine-learning features to predict variant impact.
- Evolutionary Augmentation: Leveraging multi-species homologs to enhance sparse training data for protein-level models.
You can read more about these in my publications and projects.
Year in Review (2025)
This year marked a series of important academic and research milestones:
- Published a research article in eLife on predicting antibiotic resistance using protein structural context
- Presented work at ICLR (MLGenX Workshop) on structure-aware machine learning models
- Delivered a lightning/spotlight talk at the Machine Learning for Computational Biology (MLCB) workshop
- Completed my M.S. in Computer Science, with thesis research aligned with my PhD work
- Advanced to PhD candidacy
- Currently preparing two journal manuscripts focused on resistance forecasting and interpretable biological ML
Overall, this year represented a transition from exploratory experimentation to defining a focused research agenda at the intersection of machine learning, structural bioinformatics, and infectious disease genomics.
📚 Teaching & Mentorship
I serve as a Teaching Assistant for CS520 (Software Testing) at UMass, where I help students design and evaluate test coverage, mutation analysis, and automated testing frameworks in Java. I also mentor undergraduate and master’s students in research on ML for biological sequences.
Beyond Research
Outside the lab, I enjoy clay crafts, photography, and event organization— from handmade air-dry clay bowls to campus community events. I also contribute to graduate student initiatives at UMass through organizing academic and social programs.
📄 Quick Links
This site is built with the Academic Pages template, powered by Jekyll and hosted freely on GitHub Pages.