LLM Fine-Tuning
18-3845 TCX · Blue Iris
CIS6930 · University of Florida
Fall 2025

Led dataset integration and preprocessing for a parameter-efficient fine-tuning pipeline adapting Mistral-7B-Instruct into a course-specific AI tutor for UF's CIS6930 Large Language Models course. Used LoRA/QLoRA with a blended instruction corpus of UltraChat 200k, Infinity-Instruct, and Symbolic IT, trained on UF's HiPerGator cluster. The tuned model achieved a ~10% relative improvement in token-level F1 over the base model and demonstrated low perplexity on course slide reconstruction.

PythonPyTorchHugging Face TransformersPEFTLoRA/QLoRATRLBitsAndBytesMistral-7BHiPerGator