LMSYS’ Chatbot Arena is perhaps the most popular AI benchmark today, and an industry obsession. But it is far from a perfect measure. A March 2023 paper tested ChatGPT's application in scientific toxicology. The authors found that the AI "fared perfectly" in answering a "pretty uncomplicated [medical situation example],