AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

jasonchou9877@gmail.com; {nickaliu,wigginzhou,faxonlian}@tencent.com
Hunyuan Team, Tencent
*Equal contributions. Corresponding authors.
HumanEval Overfitting