Commit 05dda47

Lesson 4: Single README, Updated Notebook
1 parent 3046b1a commit 05dda47

File tree

3 files changed: +127 −124 lines changed


4-prompt-engineering-fundamentals/1-introduction.ipynb

+124-122
@@ -12,54 +12,16 @@
    "metadata": {},
    "source": [
     "# Introduction to Prompt Engineering\n",
-    "Introduce the concept of prompt engineering and its importance in natural language processing."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "# Introduction to Prompt Engineering\n",
-    "#\n",
-    "# Prompt engineering is the process of designing and optimizing prompts for natural language processing tasks.\n",
-    "# It involves selecting the right prompts, tuning their parameters, and evaluating their performance.\n",
-    "# Prompt engineering is crucial for achieving high accuracy and efficiency in NLP models.\n",
-    "# In this section, we will explore the basics of prompt engineering and its importance in NLP."
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "metadata": {},
-   "source": [
-    "# What is Prompt Engineering?\n",
-    "Define prompt engineering and explain how it differs from traditional natural language processing techniques."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "# What is Prompt Engineering?\n",
-    "#\n",
-    "# Prompt engineering is a technique used in natural language processing to improve the accuracy and efficiency of models.\n",
-    "# Unlike traditional NLP techniques that rely on large amounts of training data, prompt engineering involves designing and optimizing prompts\n",
-    "# that guide the model towards the desired output.\n",
-    "# By carefully selecting and tuning prompts, we can achieve high accuracy with much less training data.\n",
-    "# Prompt engineering is particularly useful for tasks where training data is scarce or expensive to obtain.\n",
-    "# Examples of such tasks include question answering, text completion, and summarization.\n",
-    "# In summary, prompt engineering is a powerful tool for improving the performance of NLP models."
+    "Prompt engineering is the process of designing and optimizing prompts for natural language processing tasks. It involves selecting the right prompts, tuning their parameters, and evaluating their performance. Prompt engineering is crucial for achieving high accuracy and efficiency in NLP models. In this section, we will explore the basics of prompt engineering using OpenAI models."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "# Why is Prompt Engineering Important?\n",
-    "Explain the benefits of prompt engineering, including improved model performance and interpretability."
+    "### Exercise 1: Tokenization\n",
+    "Explore tokenization using tiktoken, an open-source fast tokenizer from OpenAI.\n",
+    "See the [OpenAI Cookbook](https://github.com/openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb) for more examples.\n"
    ]
   },
   {
@@ -68,22 +30,44 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "# Why is Prompt Engineering Important?\n",
+    "# EXERCISE:\n",
+    "# 1. Run the exercise as is first\n",
+    "# 2. Change the text to any prompt input you want to use & re-run to see the tokens\n",
+    "\n",
+    "import tiktoken\n",
     "\n",
-    "# Prompt engineering is important for several reasons. First, it can significantly improve the performance of NLP models.\n",
-    "# By designing and optimizing prompts, we can guide the model towards the desired output and achieve higher accuracy with less data.\n",
-    "# Second, prompt engineering can improve the interpretability of NLP models.\n",
-    "# By using prompts that are designed to elicit specific types of information, we can gain insights into how the model is making predictions.\n",
-    "# Finally, prompt engineering can help to mitigate bias in NLP models.\n",
-    "# By carefully selecting prompts and evaluating their performance on diverse datasets, we can ensure that our models are fair and unbiased."
+    "# Define the prompt you want tokenized\n",
+    "text = f\"\"\"\n",
+    "Jupiter is the fifth planet from the Sun and the \\\n",
+    "largest in the Solar System. It is a gas giant with \\\n",
+    "a mass one-thousandth that of the Sun, but two-and-a-half \\\n",
+    "times that of all the other planets in the Solar System combined. \\\n",
+    "Jupiter is one of the brightest objects visible to the naked eye \\\n",
+    "in the night sky, and has been known to ancient civilizations since \\\n",
+    "before recorded history. It is named after the Roman god Jupiter.[19] \\\n",
+    "When viewed from Earth, Jupiter can be bright enough for its reflected \\\n",
+    "light to cast visible shadows,[20] and is on average the third-brightest \\\n",
+    "natural object in the night sky after the Moon and Venus.\n",
+    "\"\"\"\n",
+    "\n",
+    "# Set the model you want the encoding for\n",
+    "encoding = tiktoken.encoding_for_model(\"gpt-3.5-turbo\")\n",
+    "\n",
+    "# Encode the text - gives you the tokens in integer form\n",
+    "tokens = encoding.encode(text)\n",
+    "print(tokens)\n",
+    "\n",
+    "# Decode the integers to see what the text versions look like\n",
+    "[encoding.decode_single_token_bytes(token) for token in tokens]"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "# Types of Prompts\n",
-    "Describe the different types of prompts, including classification, generation, and question-answering prompts."
+    "### Exercise 2: Validate OpenAI API Key Setup\n",
+    "\n",
+    "Run the code below to verify that your OpenAI endpoint is set up correctly. The code tries a simple, basic prompt and validates the completion. The input `oh say can you see` should complete along the lines of `by the dawn's early light..`\n"
    ]
   },
   {
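For intuition about what the tiktoken cell in this hunk does, here is a self-contained toy sketch of the encode/decode round-trip. It is word-level and uses a made-up vocabulary built on the fly; a real tokenizer like tiktoken instead uses byte-pair merges over a fixed vocabulary of roughly 100k entries, so treat the names and splitting rule below as illustrative assumptions only.

```python
# Toy illustration of a tokenizer's encode/decode round-trip.
# The vocabulary here is hypothetical and word-level, unlike real BPE tokenizers.

def build_vocab(corpus: str) -> dict[str, int]:
    """Assign a stable integer id to each distinct whitespace-delimited token."""
    return {tok: i for i, tok in enumerate(dict.fromkeys(corpus.split()))}

def encode(text: str, vocab: dict[str, int]) -> list[int]:
    """Map each token to its integer id (analogous to encoding.encode)."""
    return [vocab[tok] for tok in text.split()]

def decode(tokens: list[int], vocab: dict[str, int]) -> str:
    """Map ids back to text (analogous to decoding the token bytes)."""
    inv = {i: tok for tok, i in vocab.items()}
    return " ".join(inv[t] for t in tokens)

corpus = "Jupiter is the fifth planet from the Sun"
vocab = build_vocab(corpus)
ids = encode(corpus, vocab)
print(ids)                 # integer ids; note "the" gets the same id twice
print(decode(ids, vocab))  # round-trips back to the original text
```

The round-trip property (decode(encode(text)) == text) is the key invariant the notebook exercise is demonstrating.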
@@ -92,39 +76,58 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "# Types of Prompts\n",
+    "# Run this as a common starting point for all the exercises below\n",
+    "# It sets the OpenAI API key and defines a helper function that sets the relevant model and parameters\n",
     "\n",
-    "# There are several types of prompts that can be used in natural language processing tasks. \n",
-    "# In this section, we will discuss three main types of prompts: classification prompts, generation prompts, and question-answering prompts.\n",
+    "import openai\n",
+    "import os\n",
     "\n",
-    "# Classification Prompts\n",
+    "# Expects OPENAI_API_KEY in env variables\n",
+    "# For GitHub Codespaces: set this as a Codespaces secret => shows up as an env var in the OS\n",
+    "# For Docker Desktop: create a .env file (and .gitignore it explicitly to be safe) => shows up as an env var from load_dotenv\n",
+    "from dotenv import load_dotenv, find_dotenv\n",
+    "_ = load_dotenv(find_dotenv())\n",
     "\n",
-    "# Classification prompts are used to classify input text into one or more categories. \n",
-    "# They are often used in tasks such as sentiment analysis, where the goal is to determine the sentiment of a given text. \n",
-    "# Classification prompts typically consist of a set of keywords or phrases that are associated with each category. \n",
-    "# For example, a classification prompt for sentiment analysis might include keywords such as \"happy\", \"sad\", \"angry\", and \"excited\".\n",
+    "# Note that we can set different env variables for different OpenAI keys and just map the right one to openai.api_key here\n",
+    "# Example: have both OPENAI_API_KEY (for OpenAI) and AOAI_API_KEY (for Azure OpenAI) as options\n",
+    "openai.api_key = os.getenv('OPENAI_API_KEY')\n",
     "\n",
-    "# Generation Prompts\n",
+    "# Print environment variables (uncomment to debug)\n",
+    "# for var in os.environ:\n",
+    "#     print(f\"{var}: {os.environ[var]}\")\n",
     "\n",
-    "# Generation prompts are used to generate new text based on a given input. \n",
-    "# They are often used in tasks such as text completion or summarization, where the goal is to generate a coherent and concise summary of a given text. \n",
-    "# Generation prompts typically consist of a starting phrase or sentence, followed by a set of rules or constraints that guide the generation process. \n",
-    "# For example, a generation prompt for text completion might include a starting phrase such as \"Once upon a time\", followed by rules such as \"the next sentence must include the word 'dragon'\".\n",
+    "def get_completion(prompt, model=\"gpt-3.5-turbo\"):\n",
+    "    messages = [{\"role\": \"user\", \"content\": prompt}]\n",
+    "    response = openai.ChatCompletion.create(\n",
+    "        model=model,\n",
+    "        messages=messages,\n",
+    "        temperature=0,  # the degree of randomness of the model's output\n",
+    "        max_tokens=1024\n",
+    "    )\n",
+    "    return response.choices[0].message[\"content\"]\n",
     "\n",
-    "# Question-Answering Prompts\n",
+    "## Set the primary content or simple prompt text here\n",
+    "text = f\"\"\"\n",
+    "oh say can you see\n",
+    "\"\"\"\n",
+    "\n",
+    "## This uses a template that embeds the text,\n",
+    "## allowing you to add additional content like instructions, cues, and examples\n",
+    "prompt = f\"\"\"\n",
+    "```{text}```\n",
+    "\"\"\"\n",
     "\n",
-    "# Question-answering prompts are used to answer a specific question based on a given context. \n",
-    "# They are often used in tasks such as reading comprehension, where the goal is to answer questions about a given text. \n",
-    "# Question-answering prompts typically consist of a question and a set of rules or constraints that guide the answering process. \n",
-    "# For example, a question-answering prompt for reading comprehension might include a question such as \"What is the main idea of the passage?\", followed by rules such as \"the answer must be a single sentence\"."
+    "## Run the prompt\n",
+    "response = get_completion(prompt)\n",
+    "print(response)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "# Creating Effective Prompts\n",
-    "Explain how to create effective prompts, including how to provide the necessary context and information for the model to make accurate predictions."
+    "### Exercise 3: Hallucinations\n",
+    "Explore what happens when you ask the LLM to return completions for a prompt about a topic that may not exist, or about a topic it may not know about because it was outside its pre-training dataset (more recent). See how the response changes if you try a different prompt, or a different model."
    ]
   },
   {
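The `prompt = f"""..."""` template in this hunk wraps the primary content in ``` delimiters so instructions, cues, or examples can be layered around it. A minimal sketch of that templating idea, using a hypothetical `build_prompt` helper that is not part of the notebook:

```python
# Hypothetical helper mirroring the notebook's templating pattern:
# wrap the primary content in ``` delimiters, optionally prepending an instruction.

def build_prompt(text: str, instruction: str = "") -> str:
    parts = []
    if instruction:
        parts.append(instruction)
    parts.append(f"```{text.strip()}```")
    return "\n".join(parts)

# Simple prompt: just the delimited primary content
prompt = build_prompt("oh say can you see")
print(prompt)

# Instruction-based prompt: instruction + delimited primary content
summary_prompt = build_prompt(
    "Jupiter is the fifth planet from the Sun and the largest in the Solar System.",
    instruction="Summarize the text below for a second-grade student.",
)
print(summary_prompt)
```

Delimiting the content this way makes it easier for the model to distinguish the data to operate on from the instruction about it.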
@@ -133,70 +136,39 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "# Creating Effective Prompts\n",
     "\n",
-    "# To create effective prompts, it is important to provide the necessary context and information for the model to make accurate predictions.\n",
-    "# This can be achieved by following these steps:\n",
-    "\n",
-    "# 1. Define the task: Clearly define the task that the model is expected to perform. This includes specifying the input format, output format, and any constraints or requirements.\n",
-    "\n",
-    "# 2. Identify the relevant information: Identify the information that is relevant to the task and ensure that it is included in the prompt. This may include specific keywords, phrases, or examples.\n",
-    "\n",
-    "# 3. Provide guidance: Provide guidance to the model on how to approach the task. This may include providing examples of correct and incorrect outputs, or specifying the types of errors to avoid.\n",
+    "## Set the text for a simple prompt or primary content\n",
+    "## Prompt shows a template format with the text in it - add cues, commands etc. if needed\n",
+    "## Run the completion\n",
+    "text = f\"\"\"\n",
+    "generate a lesson plan on the Martian War of 2076.\n",
+    "\"\"\"\n",
     "\n",
-    "# 4. Test and refine: Test the prompt on a diverse set of inputs and evaluate its performance. Refine the prompt as necessary to improve its accuracy and efficiency.\n",
+    "prompt = f\"\"\"\n",
+    "```{text}```\n",
+    "\"\"\"\n",
     "\n",
-    "# By following these steps, we can create effective prompts that guide the model towards the desired output and achieve high accuracy with less data."
+    "response = get_completion(prompt)\n",
+    "print(response)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "# Try OpenAI Example"
+    "### Exercise 4: Instruction Based\n",
+    "Use the \"text\" variable to set the primary content\n",
+    "and the \"prompt\" variable to provide an instruction related to that primary content.\n",
+    "\n",
+    "Here we ask the model to summarize the text for a second-grade student.\n"
    ]
   },
   {
    "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": null,
    "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "Jupiter is a really big planet that is fifth from the Sun. It is made of gas and is the largest planet in our Solar System. It is much smaller than the Sun, but much bigger than all the other planets combined. People have known about Jupiter for a really long time because it is very bright in the night sky. It is named after a god from ancient Rome. Sometimes, Jupiter is so bright that it can make shadows on Earth. It is usually the third-brightest thing we can see at night, after the Moon and Venus.\n"
-     ]
-    }
-   ],
+   "outputs": [],
    "source": [
-    "import openai\n",
-    "import os\n",
-    "\n",
-    "# Expects OPENAI_API_KEY in env variables\n",
-    "# For GitHub Codespaces: set this as a Codespaces secret => shows up as an env var in the OS\n",
-    "# For Docker Desktop: create a .env file (and .gitignore it explicitly to be safe) => shows up as an env var from load_dotenv\n",
-    "from dotenv import load_dotenv, find_dotenv\n",
-    "_ = load_dotenv(find_dotenv())\n",
-    "\n",
-    "# Note that we can set different env variables for different OpenAI keys and just map the right one to openai.api_key here\n",
-    "# Example: have both OPENAI_API_KEY (for OpenAI) and AOAI_API_KEY (for Azure OpenAI) as options\n",
-    "openai.api_key = os.getenv('OPENAI_API_KEY')\n",
-    "\n",
-    "# Print environment variables (uncomment to debug)\n",
-    "# for var in os.environ:\n",
-    "#     print(f\"{var}: {os.environ[var]}\")\n",
-    "\n",
-    "def get_completion(prompt, model=\"gpt-3.5-turbo\"):\n",
-    "    messages = [{\"role\": \"user\", \"content\": prompt}]\n",
-    "    response = openai.ChatCompletion.create(\n",
-    "        model=model,\n",
-    "        messages=messages,\n",
-    "        temperature=0,  # the degree of randomness of the model's output\n",
-    "        max_tokens=1024\n",
-    "    )\n",
-    "    return response.choices[0].message[\"content\"]\n",
-    "\n",
     "# Test Example\n",
     "# https://platform.openai.com/playground/p/default-summarize\n",
     "\n",
@@ -229,9 +201,39 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Response should have been something like:\n",
+    "### Exercise 5: Complex Prompt\n",
+    "Try a request that has system, user, and assistant messages.\n",
+    "System sets the assistant context.\n",
+    "User & assistant messages provide multi-turn conversation context.\n",
     "\n",
-    "> Jupiter is a really big planet that is fifth from the Sun. It is made of gas and is the largest planet in our Solar System. It is much smaller than the Sun, but much bigger than all the other planets combined. People have known about Jupiter for a really long time because it is very bright in the night sky. It is named after a god from ancient Rome. Sometimes, Jupiter is so bright that it can make shadows on Earth. It is usually the third-brightest thing we can see at night, after the Moon and Venus."
+    "Note how the assistant personality is set to \"sarcastic\" in the system context.\n",
+    "Try using a different personality context, or a different series of input/output messages."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "response = openai.ChatCompletion.create(\n",
+    "    model=\"gpt-3.5-turbo\",\n",
+    "    messages=[\n",
+    "        {\"role\": \"system\", \"content\": \"You are a sarcastic assistant.\"},\n",
+    "        {\"role\": \"user\", \"content\": \"Who won the world series in 2020?\"},\n",
+    "        {\"role\": \"assistant\", \"content\": \"Who do you think won? The Los Angeles Dodgers of course.\"},\n",
+    "        {\"role\": \"user\", \"content\": \"Where was it played?\"}\n",
+    "    ]\n",
+    ")\n",
+    "print(response.choices[0].message[\"content\"])"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Exercise: Explore Your Intuition\n",
+    "The examples above give you patterns that you can use to create new prompts (simple, complex, instruction, etc.) - try creating other exercises to explore some of the other ideas we've talked about, like examples, cues, and more."
    ]
   }
  ],
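The multi-turn pattern added in this hunk keeps conversation state as a growing message list. A minimal sketch of that idea, using a hypothetical `Conversation` wrapper that is not part of the notebook; the role/content message shape matches the Chat Completions payload above:

```python
# Hypothetical Conversation wrapper; message dicts use the same role/content
# shape passed to openai.ChatCompletion.create(messages=...).

class Conversation:
    def __init__(self, system: str):
        # The system message sets the assistant's persona/context
        self.messages = [{"role": "system", "content": system}]

    def add_user(self, content: str) -> None:
        self.messages.append({"role": "user", "content": content})

    def add_assistant(self, content: str) -> None:
        self.messages.append({"role": "assistant", "content": content})

chat = Conversation("You are a sarcastic assistant.")
chat.add_user("Who won the world series in 2020?")
chat.add_assistant("Who do you think won? The Los Angeles Dodgers of course.")
chat.add_user("Where was it played?")

# chat.messages is now the multi-turn payload you would send as messages=...
print([m["role"] for m in chat.messages])
```

Appending each model reply with `add_assistant` before the next user turn is what gives the model the conversation history it otherwise lacks.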

4-prompt-engineering-fundamentals/README.md

+1-1
@@ -93,7 +93,7 @@ But what if the user wanted to see something specific that met some criteria or
 
 ![Base LLM Chat Completion](./img/4.0-playground-chat-base.png)
 
-### 1.4.3 Concept 3: Instruction Tuned LLMs
+### 1.4.3 Concept: Instruction Tuned LLMs
 
 An [Instruction Tuned LLM](https://blog.gopenai.com/an-introduction-to-base-and-instruction-tuned-large-language-models-8de102c785a6) starts with the foundation model and fine-tunes it with examples or input/output pairs (e.g., multi-turn "messages") that can contain clear instructions - and the response from the AI attempts to follow that instruction.

requirements.txt

+2-1
@@ -4,4 +4,5 @@ numpy==1.24.2
 pandas==1.5.3
 tqdm==4.64.0
 python-dotenv==1.0.0
-openai>=0.28.0
+openai>=0.28.0
+tiktoken
