ChatGPT: The Next Level in Conversational AI

►Twitter:
►Try it:
►OpenAI’s blog post:
►What is GPT-3:
►What is Reinforcement Learning:
►Join our Discord community:
►Support me on Patreon:

#ChatGPT #GPT3 #OpenAI


You’ve seen it everywhere Chad GPT has Taken on Twitter and pretty much the Whole internet thanks to its power and The meme potential it provides we all Know being able to generate memes is the Best way to conquer the internet and so It worked since you’ve seen numerous Examples you might already know that Chatgpt is an AI recently released to The public by openai allowing you to Chat with it it’s also called a chatbot Meaning you can interact with it Conversionally imitating a one-on-one Human discussion what you might not know Is what it is and how it works Chadjupiti is a model based on Reinforcement learning and the GPT Series of models from openai I will Refer you to a video about reinforcement Learning we recently published with my Friend Elias to learn more about the Subfield of AI but quickly reinforcement Learning is a way to train algorithms by Trial and error aiming for rewards just Like humans would do by learning with Positive feedback more specifically chat GPT was built following three steps the First was to take an already powerful Model and fine tune it with supervised Learning what does this mean it means That they took a model specifically its GPT 3.5 and improved and up-to-date Version of gpd3 which they trained once More on conversation examples

Specifically instead of being trained on Pretty much their whole internet as gpt3 Was this means they are trying to narrow Its potentials strictly to conversations Making it theoretically better at Conversing compared to gpt3 since a Specialist is almost always better than A generalist at a specific task if you Are still not familiar with the GPT Series of models I would suggest Watching the short introduction video I Made covering gpt3 when it came out the Second step is to add our reinforcement Learning magic which will allow the Model to practice and get better as you Know practice makes perfect more Precisely in this step we will use the Model to chat with humans directly have It provide multiple possible answers and Ask the human to rate the answers from Best to worst this data will then be Used to train another model called our Reward model learning to replicate our Human annotators this leads to our last Step where our new reward model will Give feedback to the chat GPT model’s Answers as a reward function to help it Converge toward the best answers over Time this final step is to further train Our algorithm after the initial fine Tuning step we explained this is why it Is companies like open AI that release Those kinds of amazingly powerful models It will be unfeasible for universities

Or individuals as it requires way too Much Computing and time for training Still what they achieve is quite Remarkable and I believe they are worth Doing and worth sharing to Advent Science and voila after coupling the Already powerful and most recent GPT Based language model fine-tuning it to Conversations and finally using Reinforcement learning to make it Practice its conversation skills you Obtain chat GPD as you have seen before The model is quite promising but also Sometimes a very dumb and doesn’t seem To have any logic whatsoever it is still Just an algorithm and far from being Either intelligent or conscious though It will depend on how we Define both it Definitely has its limitations Nonetheless the outputs it gives are Often surprisingly interesting and Pertinent chatgpt is definitely a step Forward in conversational Ai and quite Promising especially working on the Prompt engineering side of the model to Leverage its true potential and limit Failure cases I hope you’ve enjoyed this Video and I’d love to see your Experiments please tag me on Twitter at What’s AI if you share them or join our Discord Community where we created a Channel specifically for it I will see You next week with another amazing AI Research

Foreign [Music]

You May Also Like

About the Author: admin

Leave a Reply

Your email address will not be published. Required fields are marked *