We benchmarked 19 LLMs on analytical SQL and the internet had thoughts. Here's a breakdown of your feedback, what we got wrong, what we got right (but didn’t explain), and how we’re improving the benchmark for round two.

Victor RamirezSource