Code testing articles

  • How I test an AI chatbot's coding ability

    Since ChatGPT and generative artificial intelligence (AI) hit the public consciousness in 2022, I've been exploring how well AI chatbots can write code. At first, the technology was a novelty, akin to encouraging a puppy to perform a new trick. But since seeing how AI chatbots can be effective productivity tools and programming partners, I've been subjecting the tools to more in-depth testing. Over time, I've compiled a set of four real-world tests that we've used to evaluate the performance of...
  • Llama 3 reasoning and coding performance tested

    Following on from the launch of the new Llama 3 large language model by Meta and Mark Zuckerberg. WorldofAI has been testing out the performance and capabilities of Llama 3 when reasoning and coding. Llama 3 has already emerged as a true catalyst in the artificial intelligence (AI) space, setting new benchmarks in AI performance […]
  • Yikes! Microsoft Copilot failed every single one of my coding tests

    Recently, my ZDNET colleague and fellow AI explorer Sabrina Ortiz wrote an article entitled, 7 reasons I use Copilot instead of ChatGPT. I had never been terribly impressed with Copilot, especially since it failed some fact-checking tests I ran against it last year. But Sabrina made some really good points about the benefits of Microsoft's offering, so I thought I'd give it another try.Also: What is Copilot (formerly Bing Chat)? Here's everything you need to knowTo be clear, because Microsoft...