Hallo,
Normally just splitting your code into two parallel-executed loops would be enough for optimization for dual processor. If you have already available dual-processor computer, then try to make simple experiment: Put on your block diagram while-loop (without delay), then run it and looking for CPU usage. You will see, that only one processor occupied with 100% and second not (average CPU usage is 50%). Now put second while-loop (also without delay). Now both processors are busy with 100% (but both loops executed with approximately same speed). In most cases "two-loops" splitting of your analysing algorithm will be enough for increasing performance (not twice, of course, but significally).