Software Engineer Intern
1. Learned LLM fundamentals and tested Qwen and Baichuan inference on single-GPU setups (RTX 5000, T4, and A100) using the Transformers and vLLM frameworks to verify correct functionality.
2. Extended testing to dual-GPU environments, assessing parallel inference performance under different precision and quantization settings.