We benchmarked 19 LLMs on analytical SQL and the internet had thoughts. Here's a breakdown of your feedback, what we got wrong, what we got right (but didn’t explain), and how we’re improving the benchmark for round two.
_OG.png)
We benchmarked 19 LLMs on analytical SQL and the internet had thoughts. Here's a breakdown of your feedback, what we got wrong, what we got right (but didn’t explain), and how we’re improving the benchmark for round two.