Can you go into more details of why it is bad to have the main CPU (either ARM or something else) to baby the co-processor? I can see that having an interrupt on the main CPU is not ideal but usually the interrput is very short since it is only responsible for starting the job, so it should have only very limited effects on performance?