We benchmarked 19 LLMs on analytical SQL and the internet had thoughts. Here's a breakdown of your feedback, what we got wrong, what we got right (but didn’t explain), and how we’re improving the benchmark for round two.

We benchmarked 19 LLMs on analytical SQL and the internet had thoughts. Here's a breakdown of your feedback, what we got wrong, what we got right (but didn’t explain), and how we’re improving the benchmark for round two.