cv
Basics
| Name | Minh-Anh To |
| Label | Master's in Statistical Science Student |
| minhanh.to@duke.edu | |
| Url | https://minhanhto09.github.io |
Work
-
2024.08 - Present Remote, NC, USA
AI Developer
ChessMind AI
- Developed an AI Chess Coach using Generative AI and expertise from a chess grandmaster to enhance player skill acquisition and strategic decision-making.
- Designed an agentic AI workflow using LangChain and LangGraph to retrieve chess concepts from an expert-curated dataset, increasing model retrieval accuracy by 36%.
- Integrated an interactive LLM-based grading module into the AI Chess Coach product, providing personalized feedback that enhanced user experience and increased engagement by 25%.
- Optimized Large Language Model (LLM) responses through few-shot learning and chain-of-thought prompting, minimizing hallucinations and achieving over 99% semantic similarity accuracy.
- Documented and maintained a comprehensive GitHub codebase, including unit testing and automation scripts, reducing pipeline debugging and deployment time by 40%.
-
2024.05 - 2024.08 Durham, NC, USA
Data Science Intern
Duke University School of Medicine
- Transformed 10 years of unstructured biomedical data into a structured database (5K+ mRNA sequences) using SQL and Pandas, enabling scalable predictive modeling for antibody production.
- Collaborated with a cross-functional team to engineer 35 mRNA structural features by devising statistical formulas and implementing custom Python functions, improving model performance by 75%.
- Developed, fine-tuned, and validated tree-based models (XGBoost, Random Forest) with scikit-learn, achieving 90% predictive accuracy and reducing runtime by 48%.
- Conducted feature importance analysis to identify five key mRNA properties that improved yield predictions, driving cost savings of tens of thousands of dollars.
- Authored a technical report on model performance and business impact, selected from 50 candidates to present to 200+ attendees at the Duke CFAR Retreat 2024.
-
2022.06 - 2023.08 Hanoi, Vietnam
Assistant Researcher
Vietnam Academy of Science and Technology
- Proved the practicality of the bootstrap method for goodness-of-fit testing using estimator properties; presented findings to 25 researchers and secured a significant grant from VinIF Vietnam.
- Led a weekly Bayesian Statistics seminar for 30+ students on simulation & optimization, including live R coding on topics such as Bayesian Regression, Gibbs Sampling, and Hierarchical Modeling.
Education
-
2023.08 - 2025.05 Durham, NC, USA
Master's in Statistical Science
Duke University
- Predictive Modeling, Deep Learning, Causal Inference, Statistical Computation, R Programming, SQL, Bayesian Statistics, Theory of Inference, Hierarchical Models, Applied Machine Learning.
-
2014.08 - 2018.05 Hanoi, Vietnam
Bachelor's in Mathematics
Hanoi National University of Education
- One and Multivariable Analysis, Functional Analysis, Linear Algebra, Complex Analysis, Measure Theory, Optimization Theory, Introduction to Probability Theory, Statistics, Ordinary and Partial Differential Equations, Numerical Analysis.
Awards
- 2023.08
Duke Graduate Full Tuition Remission and Teaching Assistantship
Duke University
Full Graduate Tuition Remission and Teaching Assistantship.
- 2023.05
The Vingroup Science & Technology Scholarship Program for Master's and Doctoral Degrees
VinGroup Vietnam
Top 1% of applicants nationwide to receive the scholarship for the academic years 2023-2025.
- 2018.05
HNUE Graduation: Summa Cum Laude
Hanoi National University of Education
Top 5% of undergraduate students in the class of 2018. Dean's List with Distinction (4 years).
- 2016.08
Vietnam National Mathematics Scholarship
Vietnam Ministry of Education and Training
Top 1% of undergraduate mathematics majors nationwide for the academic years 2016-2017.
Certificates
| Machine Learning Specialization | ||
| DeepLearning.AI | 2025-04 |
| Data Analysis with R Specialization | ||
| Coursera - Duke University | 2023-07 |
| Python for Data Science, AI & Development | ||
| Coursera - IBM | 2023-05 |
Skills
| Programming | |
| Python (NumPy, pandas, scikit-learn, PyTorch, TensorFlow) | |
| R (dplyr, ggplot2, tidyr) | |
| MySQL | |
| Git | |
| Rest APIs |
| AI/ML | |
| Generative AI (LangChain, OpenAI APIs, VectorDBs) | |
| Deep Learning (CNNs, GANs) | |
| Machine Learning (XGBoost, Random Forest, Logistic Regression, SVM) |
| Statistics | |
| Hypothesis Testing | |
| Hierachical Modeling | |
| Bayesian Modeling | |
| Causal Inference |
| Tools & Platforms | |
| Jupyter | |
| Cloud Services (AWS, GCP) | |
| HTML, YAML |
Languages
| Vietnamese | |
| Native speaker |
| English | |
| Fluent |
| French | |
| Beginner |