Depends on what you want to do. If each iteration must only start after all previous iterations have already finished then you can not use several threads.
On a single core you cant get much faster...