Fine-Tuning Llama 3 for Sentence Classification
I’ve been curious to see just how well these enormous LLMs perform at traditional NLP tasks such as classifying text.
I’ve been curious to see just how well these enormous LLMs perform at traditional NLP tasks such as classifying text.
An in-depth tutorial on the algorithm and paper, including a pseudo-implementation in Python.
Recently I’ve been researching the topic of fine-tuning Large Language Models (LLMs) like GPT on a single GPU in Colab (a challenging feat!), comparing both the free (Tesla T4) and paid options.
I’ve studied the samplers a bit and done some of my own experiments with them, and I’ve arrived at some tentative conclusions for what to do with them.