cv
Basics
Name | Minh-Anh To |
Label | Master's in Statistical Science Student |
minhanh.to@duke.edu | |
Url | https://minhanhto09.github.io |
Work
-
2024.08 - Present Remote, NC, USA
AI Developer
ChessMind AI
- Developed an AI Chess Coach using Generative AI and expertise from a chess grandmaster to enhance player skill acquisition and strategic decision-making.
- Designed an agentic AI workflow using LangChain and LangGraph to retrieve chess concepts from an expert-curated dataset, increasing model retrieval accuracy by 36%.
- Integrated an interactive LLM-based grading module into the AI Chess Coach product, providing personalized feedback that enhanced user experience and increased engagement by 25%.
- Optimized Large Language Model (LLM) responses through few-shot learning and chain-of-thought prompting, minimizing hallucinations and achieving over 99% semantic similarity accuracy.
- Documented and maintained a comprehensive GitHub codebase, including unit testing and automation scripts, reducing pipeline debugging and deployment time by 40%.
-
2024.05 - 2024.08 Durham, NC, USA
Data Science Intern
Duke University School of Medicine
- Transformed 10 years of unstructured biomedical data into a structured database (5K+ mRNA sequences) using SQL and Pandas, enabling scalable predictive modeling for antibody production.
- Collaborated with a cross-functional team to engineer 35 mRNA structural features by devising statistical formulas and implementing custom Python functions, improving model performance by 75%.
- Developed, fine-tuned, and validated tree-based models (XGBoost, Random Forest) with scikit-learn, achieving 90% predictive accuracy and reducing runtime by 48%.
- Conducted feature importance analysis to identify five key mRNA properties that improved yield predictions, driving cost savings of tens of thousands of dollars.
- Authored a technical report on model performance and business impact, selected from 50 candidates to present to 200+ attendees at the Duke CFAR Retreat 2024.
-
2022.06 - 2023.08 Hanoi, Vietnam
Assistant Researcher
Vietnam Academy of Science and Technology
- Proved the practicality of the bootstrap method for goodness-of-fit testing using estimator properties; presented findings to 25 researchers and secured a significant grant from VinIF Vietnam.
- Led a weekly Bayesian Statistics seminar for 30+ students on simulation & optimization, including live R coding on topics such as Bayesian Regression, Gibbs Sampling, and Hierarchical Modeling.
Education
-
2023.08 - 2025.05 Durham, NC, USA
Master's in Statistical Science
Duke University
- Predictive Modeling, Deep Learning, Causal Inference, Statistical Computation, R Programming, SQL, Bayesian Statistics, Theory of Inference, Hierarchical Models, Applied Machine Learning.
-
2014.08 - 2018.05 Hanoi, Vietnam
Bachelor's in Mathematics
Hanoi National University of Education
- One and Multivariable Analysis, Functional Analysis, Linear Algebra, Complex Analysis, Measure Theory, Optimization Theory, Introduction to Probability Theory, Statistics, Ordinary and Partial Differential Equations, Numerical Analysis.
Awards
- 2023.08
Duke Graduate Full Tuition Remission and Teaching Assistantship
Duke University
Full Graduate Tuition Remission and Teaching Assistantship.
- 2023.05
The Vingroup Science & Technology Scholarship Program for Master's and Doctoral Degrees
VinGroup Vietnam
Top 1% of applicants nationwide to receive the scholarship for the academic years 2023-2025.
- 2018.05
HNUE Graduation: Summa Cum Laude
Hanoi National University of Education
Top 5% of undergraduate students in the class of 2018. Dean's List with Distinction (4 years).
- 2016.08
Vietnam National Mathematics Scholarship
Vietnam Ministry of Education and Training
Top 1% of undergraduate mathematics majors nationwide for the academic years 2016-2017.
Certificates
Machine Learning Specialization | ||
DeepLearning.AI | 2025-04 |
Data Analysis with R Specialization | ||
Coursera - Duke University | 2023-07 |
Python for Data Science, AI & Development | ||
Coursera - IBM | 2023-05 |
Skills
Programming | |
Python (NumPy, pandas, scikit-learn, PyTorch, TensorFlow) | |
R (dplyr, ggplot2, tidyr) | |
MySQL | |
Git | |
Rest APIs |
AI/ML | |
Generative AI (LangChain, OpenAI APIs, VectorDBs) | |
Deep Learning (CNNs, GANs) | |
Machine Learning (XGBoost, Random Forest, Logistic Regression, SVM) |
Statistics | |
Hypothesis Testing | |
Hierachical Modeling | |
Bayesian Modeling | |
Causal Inference |
Tools & Platforms | |
Jupyter | |
Cloud Services (AWS, GCP) | |
HTML, YAML |
Languages
Vietnamese | |
Native speaker |
English | |
Fluent |
French | |
Beginner |