Question about course (Top_p)
Hi Jay, I replied to your email 4 days ago.
Now, when the computer (or language model) plays this guessing game, the Top_P parameter is like a magic net it uses to catch the glowing balls. But this net is special. It scoops up the brightest balls first and stops as soon as it holds a set share of all the glow in the bucket. Top_P is that share.
If Top_P is really small, like 0.1, the net fills up after only the very brightest balls (it always catches at least one). But if it's larger, like 0.9, the net also catches balls that aren't glowing as brightly, giving the computer more options for the next word.
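For anyone who prefers code to glowing balls, here is a rough Python sketch of the net. The function name `top_p_filter` and the probabilities in `glow` are made up for illustration; they are not from the course or from any real model, and real libraries may handle edge cases differently.

```python
def top_p_filter(probs, top_p):
    """Keep the most probable words until their combined probability reaches top_p."""
    # Sort the balls from brightest to dimmest.
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)
    kept = {}
    total = 0.0
    for word, p in ranked:
        kept[word] = p
        total += p
        # Stop once the net holds the requested share of the glow.
        if total >= top_p:
            break
    return kept

# Hypothetical next-word probabilities (the "glow" of each ball).
glow = {"cat": 0.5, "dog": 0.3, "fish": 0.15, "rock": 0.05}

print(top_p_filter(glow, 0.1))  # small net: only the brightest ball
print(top_p_filter(glow, 0.9))  # bigger net: dimmer balls get caught too
```

With these made-up numbers, a Top_P of 0.1 keeps only "cat", while 0.9 keeps "cat", "dog" and "fish" and still leaves out "rock".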
The important part is that the computer doesn't always just pick the brightest ball. It picks at random from the balls in the net, and brighter balls are more likely to be chosen, so sometimes a dimmer ball wins. That makes things more surprising and interesting. This is why when we talk to a computer, sometimes it says things we didn't expect!
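The picking step can be sketched in a few lines of Python as well. This assumes the net has already caught some balls; the `caught` dictionary and the `pick_word` helper are hypothetical, and the glow values are invented for the example.

```python
import random

# Hypothetical balls already caught by the net, with made-up glow values.
caught = {"cat": 0.5, "dog": 0.3, "fish": 0.15}

def pick_word(caught, rng):
    """Pick one word at random; brighter balls are proportionally more likely."""
    words = list(caught)
    weights = list(caught.values())
    # random.choices treats weights as relative, so they need not sum to 1.
    return rng.choices(words, weights=weights, k=1)[0]

rng = random.Random(0)  # fixed seed so repeated runs give the same picks
picks = [pick_word(caught, rng) for _ in range(1000)]
counts = {word: picks.count(word) for word in caught}
print(counts)
```

Over many picks the brightest ball should come up most often, but the dimmer ones still get chosen now and then, which is where the surprise comes from.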