Motivation
There were two main motivations fuelling this project: one, to see how reliable ChatGPT is when giving code suggestions from english prompts, and two, to see if Leetcode problems will become an antiquated means of judging ability during job interviews if ChatGPT can answer any Leetcode problem instantly.
Prompt
The exact prompt that was fed to ChatGPT is as follows:
"Using <language>, code a solution for the following prompt:
<problem_content>
Begin your solution with:
<problem_header>"
Model
It must be noted that this uses GPT-3-DaVinci-003 model as opposed to the actual ChatGPT model. ChatGPT does not offer an API and it is against the terms of service to scrape ChatGPT results as it will rate limit actual users from being able to access ChatGPT. That being said, GPT-3-DaVinci-003 is stated by OpenAI to be much more powerful than ChatGPT. According to this site, "GPT-3 protocol is much larger than ChatGPT. The former has a whopping 175 billion parameters making it one of the largest and most powerful AI language processing models to date, while the latter has 20 billion parameters. This is because ChatGPT has been specifically designed to generate human-like text in the context of chatbot conversations." This means that the model that is being used, GPT-3-DaVinci-003, is likely even better to be using for code generation that ChatGPT. For the sake of this project, there may be places where "ChatGPT" is used synonymously to represent GPT-3-DaVinci-003.
General Algorithm
Manual Clean Ups
After scanning the data a few problems returned errors that did not seem fair to blame on ChatGPT, so they were fixed and re-ran.
Please keep in mind, some of these issues may still be present as well as others that were not detected.
Findings