The main difference could be that you have an existing code base (probably quite extensive and a bit legacy?). If the LLM can start from scratch, it will write code “in its own way”, which it can probably grasp and extend better than what is already there. I even have the impression that Claude sometimes struggles with code that GPT-5 wrote.