When I run the code repair task, the dataset actually contains 656 problems,
but the paper states:
“We manually add a bug to each of the 164 HumanEval solutions across all 6 languages (984 total bugs).”
Which number is correct? Does the same discrepancy also appear in the iterative increase?
