I am a second-year PhD student at the Hong Kong University of Science and Technology (Guangzhou), honored to be supervised by Professor Yuyu Luo. Currently, I am working as a research intern in the Database Group led by Professor Guoliang Li at Tsinghua University. Previously, I obtained my bachelor’s degree from the Southern University of Science and Technology. My current research interests focus on Text-to-SQL and Data Agent, which are intelligent systems that can autonomously handle data-related tasks such as query generation, data analysis, and visualization through natural language interaction.

News

  • 2025.10 We proposed A Survey of Data Agents: Emerging Paradigm or Overstated Hype?, introducing the first systematic hierarchical taxonomy for data agents with six levels (L0-L5) that delineate progressive shifts in autonomy, from manual operations to fully autonomous data agents. This survey clarifies capability boundaries and responsibility allocation, offering a structured review of existing research and a forward-looking roadmap. Ranked Top-3 in Hugging Face Daily Papers!
  • 2025.10 We proposed DeeyEye-SQL, a software-engineering-inspired Text-to-SQL framework, achieving 73.5% EX on BIRD-Dev and 89.8% EX on Spider-Test using a ~30B LLM without any fine-tuning!
  • 2025.09 Our nvBench 2.0 paper has been accepted by NIPS'25, a new benchmark designed to evaluate Text2VIS systems in scenarios involving ambiguous queries.
  • 2025.07 Our DeepVIS paper has been accepted by VIS'25, an interactive visual interface that tightly integrates with the CoT reasoning process, allowing users to inspect reasoning steps, identify errors, and make targeted adjustments to improve visualization outcomes.
  • 2025.07 Our NL2SQL-Survey paper has been accepted by TKDE'25! For a comprehensive overview of the latest Text-to-SQL techniques and practical guidance, we warmly invite you to read our continuously updated NL2SQL Handbook.
  • 2025.07 Our EllieSQL paper has been accepted by COLM'25, a complexity-aware routing framework that assigns queries to suitable SQL generation pipelines based on estimated complexity.
  • 2025.05 Our NL2SQL-BUGs paper has been accepted by KDD'25, the first benchmark specifically designed to detect and categorize semantic errors in NL2SQL translation.
  • 2025.05 Our Alpha-SQL paper has been accepted by ICML'25, see you in Vancouver, Canada!
  • 2025.04 We proposed Advances and Challenges in Foundation Agents, a survey covers the design, evaluation, and improvement of intelligent agents based on modular, brain-inspired architectures, focusing on self-enhancement, multi-agent collaboration, and safety in AI systems.
  • 2025.04 We proposed EllieSQL, a complexity-aware routing framework that assigns queries to suitable SQL generation methods based on estimated complexity.
  • 2025.03 We proposed NL2SQL-BUGs, a new benchmark dedicated to detecting and categorizing semantic errors in NL2SQL translation.
  • 2025.03 We proposed nvBench 2.0, a new benchmark designed to evaluate NL2VIS systems in scenarios involving ambiguous queries.
  • 2025.01 We proposed Alpha-SQL, the o1 moment for NL2SQL!
  • 2025.01 Paper Augmenting Realistic Charts with Virtual Overlays has been accepted by CHI'25.
  • 2025.01 I was awarded the Merit Prize for the 2024 DSA Excellent Research Award!
  • 2024.09 Paper Are Large Language Models Good Statisticians? has been accepted by NIPS'24.
  • 2024.06 Paper The Dawn of Natural Language to SQL: Are We Fully Ready? has been accepted by VLDB'24.
  • 2024.04 Paper Efficient Deep Spiking Multilayer Perceptrons With Multiplication-Free Inference has been accepted by TNNLS'24.

Publications

-
Total Citations
Google Scholar
-
h-index
Google Scholar
-
i10-index
Google Scholar
14
Total Papers
Published & Accepted
Arxiv 2025
sym

DeepEye-SQL: A Software-Engineering-Inspired Text-to-SQL Framework
Boyan Li, Chong Chen, Zhujun Xue, Yinan Mei, Yuyu Luo

ICML 2025
sym

Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
Boyan Li, Jiayi Zhang, Ju Fan, Yanwei Xu, Chong Chen, Nan Tang, Yuyu Luo

Homepage | Slides | PDF |

VIS 2025
sym

DeepVIS: Bridging Natural Language and Data Visualization Through Step-wise Reasoning
Zhihao Shuai*, Boyan Li*, Siyu Yan, Yuyu Luo, Weikai Yang
*Equal contribution

KDD 2025
sym

NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation
Xinyu Liu, Shuyu Shen, Boyan Li, Nan Tang, Yuyu Luo

Homepage |

NIPS 2025
sym

nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity
Tianqi Luo, Chuhan Huang, Leixian Shen, Boyan Li, Shuyu Shen, Wei Zeng, Nan Tang, Yuyu Luo

Homepage |

COLM 2025
sym

EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing
Yizhang Zhu, Runzhi Jiang, Boyan Li, Nan Tang, Yuyu Luo

Homepage |

TKDE 2025
sym

A Survey of NL2SQL with Large Language Models: Where are we, and where are we going?
Xinyu Liu, Shuyu Shen, Boyan Li, Peixian Ma, Runzhi Jiang, Yuyu Luo, Yuxin Zhang, Ju Fan, Guoliang Li, Nan Tang

Homepage |

CHI 2025
sym

Augmenting Realistic Charts with Virtual Overlays
Yao Shi, Boyan Li, Yuyu Luo, Lei Chen, Nan Tang

ARXIV 2024
sym

A Plug-and-Play Natural Language Rewriter for Natural Language to SQL
Peixian Ma, Boyan Li, Runzhi Jiang, Ju Fan, Nan Tang, Yuyu Luo

NIPS 2024
sym

Are Large Language Models Good Statisticians?
Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang

Homepage |

TNNLS 2024
sym

Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference
Boyan Li, Luziwei Leng, Shuaijie Shen, Kaixuan Zhang, Jianguo Zhang, Jianxing Liao, Ran Cheng

Code |

Honors and Awards

2025.10
AI Agent 2025 (Best Open-source Project Award)
Led the DeepEye Data Agent System team to win the AI Agent 2025 Best Open-source Project Award.
2025.01
Merit Prize for the 2024 DSA Excellent Research Award
Recognized for outstanding research contributions in Data Science and Analytics.
2023.07
Highest Honors in Computer Science and Engineering
Southern University of Science and Technology
2023.07
Outstanding Graduates
Southern University of Science and Technology

Education

2024.09 - Present
PhD Student
Pursuing doctoral studies in Data Science and Analytics
2019.09 - 2023.07
Bachelor of Computer Science and Technology
GPA 3.91/4.0, Ranking 2/183

Experience

2025.06 - Present
Exchange Student
Beijing, China
Working as a research intern in the Database Group led by Professor Guoliang Li
2023.07 - 2024.09
Research Assistant
Guangzhou, China
Conducted research in Text-to-SQL and Data Agent systems
2022.07 - 2022.09
Research Intern
Shenzhen, China

Services

Conference Reviewer
ICLR 2026