All Tags
Browse through all available tags to find articles on topics that interest you.
Browse through all available tags to find articles on topics that interest you.
Showing 3 results for this tag.
Pervasive Annotation Errors Break Text-to-SQL Benchmarks and Leaderboards
This paper exposes widespread annotation errors in leading text-to-SQL benchmarks, BIRD and Spider 2.0-Snow, and demonstrates how these inaccuracies severely distort model performance evaluations and leaderboard rankings. It also introduces SAR-Agent and SAPAR, an AI-powered toolkit designed to effectively detect and correct these pervasive errors, advocating for higher quality benchmark development.
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
This paper introduces LLMSQL, a systematically revised and transformed version of the WikiSQL dataset specifically designed for the LLM era of Text-to-SQL. It addresses and rectifies various structural and annotation issues in the original WikiSQL, providing a clean and reliable benchmark for evaluating large language models.
CoddLLM: Empowering Large Language Models for Data Analytics
This paper introduces CoddLLM, a 12-billion-parameter foundation model specifically designed for data analytics applications. It leverages a novel data recipe and post-training approach, enabling superior performance in tasks like data discovery, schema creation, and natural language to SQL conversion, outperforming existing state-of-the-art models on various benchmarks.