Khoi Duc Nguyen

Ph.D Student, Computer Science, University of Wisconsin - Madison

kdnguyen7 [at] wisc [dot] edu GitHub LinkedIn Google Scholar

Education
University of Wisconsin-Madison Ph.D. in Computer Science
Madison, US Sep 2023 - Present
Ho Chi Minh City University of Technology (HCMUT) B.Eng. in Computer Engineering
Ho Chi Minh, Vietnam Sep 2017 - Dec 2021

Experience
Meta Research Scientist Intern
  • Worked with Dr. Wei Shi
  • MLLM post-training.
Menlo Park, US Summer 2026
Dolby Laboratories PhD Research Intern
  • Worked with Dr. Trisha Mittal
  • Vector graphics generation.
Sunnyvale, US May 2025 - Aug 2025
National University of Singapore AI Research Intern
  • Worked with Dr. Chen Li and Prof. Gim Hee Lee
  • 2D pose estimation.
Singapore Jun 2022 - May 2023
Qualcomm AI Research (formerly VinAI Research) AI Research Resident
  • Worked with Dr. Rang Nguyen, Dr. Quoc-Huy Tran and Prof. Binh-Son Hua
  • Image, video few-shot learning.
Ho Chi Minh, Vietnam Nov 2020 - Jun 2022

Publications

(*) denotes equal contribution

  1. Learning to Inference Adaptively for Multimodal Large Language Models Zhuoyan Xu*, Khoi Duc Nguyen*, Preeti Mukherjee, Saurabh Bagchi, Somali Chaterji, Yingyu Liang, and Yin Li ICCV 2025, International Conference on Computer Vision. [code]
  2. PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu, Yiquan Li, Khoi Duc Nguyen, Yiwu Zhong, and Yin Li CVPR 2025, Conference on Computer Vision and Pattern Recognition. [code]
  3. Adainf: Adaptive inference for resource-constrained foundation models Zhuoyan Xu, Khoi Duc Nguyen, Preeti Mukherjee, Somali Chaterji, Yingyu Liang, and Yin Li ICML 2024 Workshop, Workshop on Efficient Systems for Foundation Models II @ ICML2024.
  4. ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation Khoi Duc Nguyen, Chen Li, and Gim Hee Lee CVPR 2024, Conference on Computer Vision and Pattern Recognition. [code]
  5. Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments Khoi Duc Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, and Rang Nguyen ECCV 2022, European Conference on Computer Vision. [code]
  6. POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples Duong H. Le*, Khoi Duc Nguyen*, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, and Binh-Son Hua NeurIPS 2021, Neural Information Processing Systems. [code]

Services

Conference Reviewer: ICCV (2025), CVPR (2025, 2023), ECCV (2024), NeurIPS (2025, 2023).


Teaching
University of Wisconsin-Madison
  • CS240 Introduction to Discrete Mathematics, Fall 2023
  • CS354 Machine Organization and Programming, Spring 2024
  • CS240 Introduction to Discrete Mathematics, Fall 2024
  • CS354 Machine Organization and Programming, Spring 2025

Skills

Programming: Python, C/C++, MATLAB

Frameworks & Tools: PyTorch, JAX, OpenCV, NumPy, Git, LaTeX, Markdown, Slurm

Operating Systems: Linux, macOS, Windows