I am a PhD student at the University of Wisconsin-Madison, advised by Prof. Yin Li. My research focuses on machine learning and computer vision, with an emphasis on developing highly efficient models that make effective use of data and computational resources.
This summer, I will join Meta in Menlo Park as a Research Scientist Intern, working on MLLMs post-training.
Email: kdnguyen7 [at] wisc [dot] edu
Publications
(*) denotes equal contribution
-
Learning to Inference Adaptively for Multimodal Large Language Models
ICCV 2025, International Conference on Computer Vision.[code] -
PAVE: Patching and Adapting Video Large Language Models
CVPR 2025, Conference on Computer Vision and Pattern Recognition.[code] -
Adainf: Adaptive inference for resource-constrained foundation models
ICML 2024 Workshop, Workshop on Efficient Systems for Foundation Models II @ ICML2024. -
ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation
CVPR 2024, Conference on Computer Vision and Pattern Recognition.[code] -
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments
ECCV 2022, European Conference on Computer Vision.[code] -
POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples
NeurIPS 2021, Neural Information Processing Systems.[code]