An introduction to using LLMs for inference using multiple GPUs. Basic knowledge of using LLMs for inference is required. Example scripts will be provided.
For UVA users only. An HPC account is recommended. Familiarity with Python and LLMs is also required.