May 9, 2023 · 7 MIN READ

How To Create A Custom Fine-Tuned Prediction Model Using Base GPT-3 models



COBUS GREYLING

LLMs can be divided into two categories: generative and predictive. The generative capabilities of LLMs have been the subject of much attention and discussion, and rightly so: they are incredibly impressive and often require only zero- or few-shot learning.

The increasing popularity of Prompt Engineering has further highlighted the importance of generative tasks.

The image below shows the most common generative tasks from a Conversational AI Development Framework perspective, along with the predictive tasks.

The importance of correctly predicting an intent with a Large Language Model (LLM) is paramount, as the actions taken by a chatbot are based on this result.

To achieve this, both generative and predictive LLMs can be fine-tuned to create a custom model. OpenAI's GPT-3 Ada is an example of an LLM that can be fine-tuned to classify text into one of two classes, as seen in the image below.

As fine-tuning of LLMs becomes more commonplace, it will pave the way for mass adoption of LLMs in more formal, enterprise settings.

We are ready to begin!

The code below allows us to access the training data via scikit-learn. The command shown displays the various categories of data archived from the original 20 Newsgroups website.
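The original code listing isn't reproduced in this version of the post; a minimal sketch, assuming scikit-learn's `fetch_20newsgroups` loader, might look like this:

```python
from sklearn.datasets import fetch_20newsgroups

# Download the training split of the 20 Newsgroups dataset and
# list the names of the twenty categories it contains.
newsgroups_train = fetch_20newsgroups(subset="train")

for category in newsgroups_train.target_names:
    print(category)
```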

These are the 20 categories available; of these, we will make use of rec.autos and rec.motorcycles.

Next is the code to fetch the two categories we are interested in and assign the data to vehicles_dataset.
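Again as a sketch, restricting the download to the two categories and peeking at one raw record might look like this:

```python
from sklearn.datasets import fetch_20newsgroups

# Fetch only the two categories of interest from the training split.
categories = ["rec.autos", "rec.motorcycles"]
vehicles_dataset = fetch_20newsgroups(subset="train", categories=categories)

# Peek at the first raw record to inspect the data.
print(vehicles_dataset.data[0][:200])
```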

Below, a record from the dataset is printed:

The result shows that the data is disorganised, and each entry is likely to contain ambiguity or inaccuracy.

We can now determine how many records and examples we have for autos and motorcycles.
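One way to get those counts, assuming the same scikit-learn dataset object, is to map each numeric label back to its category name:

```python
from collections import Counter
from sklearn.datasets import fetch_20newsgroups

vehicles_dataset = fetch_20newsgroups(
    subset="train", categories=["rec.autos", "rec.motorcycles"]
)

# Map each numeric label back to its category name and count the examples.
counts = Counter(
    vehicles_dataset.target_names[label] for label in vehicles_dataset.target
)
print(counts)
```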

The printed result:

The next step is converting the data into the JSONL format OpenAI defines for fine-tuning. Below is an example of the format.
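The exact records aren't shown here, so the two lines below are a representative sketch of OpenAI's prompt/completion JSONL format; the `\n\n###\n\n` separator at the end of each prompt and the leading space on each completion follow OpenAI's data-preparation guidelines for the legacy fine-tuning endpoint:

```json
{"prompt": "My carburetor keeps flooding when the engine is cold.\n\n###\n\n", "completion": " autos"}
{"prompt": "How hard should I countersteer going into a corner?\n\n###\n\n", "completion": " motorcycles"}
```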

The code to convert that data…

Lastly, we convert the data frame to a JSONL file named vehicles.jsonl:
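A sketch of both steps, using a couple of hypothetical sample documents in place of the full dataset (the separator and leading space follow OpenAI's preparation guidelines):

```python
import pandas as pd

# Hypothetical stand-ins for the real dataset: two sample documents
# with labels 0 (rec.autos) and 1 (rec.motorcycles).
texts = ["My carburetor keeps flooding.", "How hard should I countersteer?"]
labels = [0, 1]
target_names = ["rec.autos", "rec.motorcycles"]

df = pd.DataFrame(
    {
        # End each prompt with a fixed separator...
        "prompt": [t.strip() + "\n\n###\n\n" for t in texts],
        # ...and start each completion with a space: " autos" / " motorcycles".
        "completion": [" " + target_names[l].split(".")[-1] for l in labels],
    }
)

# Write one JSON object per line (JSONL).
df.to_json("vehicles.jsonl", orient="records", lines=True)
```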

Now the OpenAI utility can be used to analyse the JSONL file.
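The exact invocation isn't shown; assuming the legacy (pre-1.0) openai Python package, which bundled a data-preparation tool in its CLI, it would be:

```shell
openai tools fine_tunes.prepare_data -f vehicles.jsonl
```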

With the result of the analysis displayed below…

Now we can start the training process; from this point on, an OpenAI API key is required.

The command to start the fine-tuning is a single line, with the base GPT-3 model specified at the end; in this case it is ada. I wanted to use davinci, but it is far more expensive than ada, one of the original base GPT-3 models.
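With the same legacy CLI, that single line would look like the following; the training file name is an assumption here, since the preparation step names its output file:

```shell
# Requires OPENAI_API_KEY to be set in the environment.
openai api fine_tunes.create -t vehicles_prepared.jsonl -m ada
```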

The output from the training process.

And lastly, the model is queried with an arbitrary sentence: “So how do I steer when my hands aren't on the bars?”
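The querying code isn't shown; a sketch against the legacy (pre-1.0) openai Python package, with a placeholder fine-tuned model name, could look like this (a valid API key is required):

```python
import openai  # legacy, pre-1.0 openai package assumed

openai.api_key = "sk-..."  # your OpenAI API key

# The model name is a placeholder; use the one returned by your fine-tune job.
response = openai.Completion.create(
    model="ada:ft-personal-2023-05-09-12-00-00",
    prompt="So how do I steer when my hands aren't on the bars?\n\n###\n\n",
    max_tokens=1,
    temperature=0,
)
print(response["choices"][0]["text"])
```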

The correct answer, motorcycles, is given.

Another example, with the sentence: “Is countersteering like benchracing only with a taller seat, so your feet aren't on the floor?”

And again the correct result, motorcycles, is given.

As production implementations of LLMs become more widespread, more emphasis will be placed on fine-tuning them to maximise performance.

Even so, the importance of fine-tuning LLMs is not yet fully recognised.

I’m currently the Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces and more.
