cv

Basics

Name Minh-Anh To
Label Master's in Statistical Science Student
Email minhanh.to@duke.edu
Url https://minhanhto09.github.io

Work

  • 2024.08 - Present

    Remote, NC, USA

    AI Developer
    ChessMind AI
    • Developed an AI Chess Coach using Generative AI and expertise from a chess grandmaster to enhance player skill acquisition and strategic decision-making.
    • Designed an agentic AI workflow using LangChain and LangGraph to retrieve chess concepts from an expert-curated dataset, increasing model retrieval accuracy by 36%.
    • Integrated an interactive LLM-based grading module into the AI Chess Coach product, providing personalized feedback that enhanced user experience and increased engagement by 25%.
    • Optimized Large Language Model (LLM) responses through few-shot learning and chain-of-thought prompting, minimizing hallucinations and achieving over 99% semantic similarity accuracy.
    • Documented and maintained a comprehensive GitHub codebase, including unit testing and automation scripts, reducing pipeline debugging and deployment time by 40%.
  • 2024.05 - 2024.08

    Durham, NC, USA

    Data Science Intern
    Duke University School of Medicine
    • Transformed 10 years of unstructured biomedical data into a structured database (5K+ mRNA sequences) using SQL and Pandas, enabling scalable predictive modeling for antibody production.
    • Collaborated with a cross-functional team to engineer 35 mRNA structural features by devising statistical formulas and implementing custom Python functions, improving model performance by 75%.
    • Developed, fine-tuned, and validated tree-based models (XGBoost, Random Forest) with scikit-learn, achieving 90% predictive accuracy and reducing runtime by 48%.
    • Conducted feature importance analysis to identify five key mRNA properties that improved yield predictions, driving cost savings of tens of thousands of dollars.
    • Authored a technical report on model performance and business impact, selected from 50 candidates to present to 200+ attendees at the Duke CFAR Retreat 2024.
  • 2022.06 - 2023.08

    Hanoi, Vietnam

    Assistant Researcher
    Vietnam Academy of Science and Technology
    • Proved the practicality of the bootstrap method for goodness-of-fit testing using estimator properties; presented findings to 25 researchers and secured a significant grant from VinIF Vietnam.
    • Led a weekly Bayesian Statistics seminar for 30+ students on simulation & optimization, including live R coding on topics such as Bayesian Regression, Gibbs Sampling, and Hierarchical Modeling.

Education

  • 2023.08 - 2025.05

    Durham, NC, USA

    Master's in Statistical Science
    Duke University
    • Predictive Modeling, Deep Learning, Causal Inference, Statistical Computation, R Programming, SQL, Bayesian Statistics, Theory of Inference, Hierarchical Models, Applied Machine Learning.
  • 2014.08 - 2018.05

    Hanoi, Vietnam

    Bachelor's in Mathematics
    Hanoi National University of Education
    • One and Multivariable Analysis, Functional Analysis, Linear Algebra, Complex Analysis, Measure Theory, Optimization Theory, Introduction to Probability Theory, Statistics, Ordinary and Partial Differential Equations, Numerical Analysis.

Awards

Certificates

Machine Learning Specialization
DeepLearning.AI 2025-04
Data Analysis with R Specialization
Coursera - Duke University 2023-07

Skills

Programming
Python (NumPy, pandas, scikit-learn, PyTorch, TensorFlow)
R (dplyr, ggplot2, tidyr)
MySQL
Git
Rest APIs
AI/ML
Generative AI (LangChain, OpenAI APIs, VectorDBs)
Deep Learning (CNNs, GANs)
Machine Learning (XGBoost, Random Forest, Logistic Regression, SVM)
Statistics
Hypothesis Testing
Hierachical Modeling
Bayesian Modeling
Causal Inference
Tools & Platforms
Jupyter
Cloud Services (AWS, GCP)
HTML, YAML

Languages

Vietnamese
Native speaker
English
Fluent
French
Beginner