Why is a mod (%
) operation more expensive than a multiplication (*
) by a bit more than a factor of 2?
Please be more specific about how CPU performs division operation and returns 开发者_开发技巧the result for MOD operation.
In the following example the threads each run for a second. The test was performed on a SPARC
processor.
// multiplication
void someThread() {
int a = 10234;
while (true) {
opers++;
a = a * a;
a++;
}
// opers ~ 26 * 10^6 in a sec.
}
// MOD
void someThread() {
int a = 10234;
while (true) {
opers++;
a = a % 10000007;
a++;
}
// opers ~ 12 * 10^6 in a sec.
}
MOD is a division operation, not a multiplication operation. Division is more expensive than multiplication.
More information about the MOD operation here: http://en.wikipedia.org/wiki/Modulo_operation
Instruction latencies and throughput for AMD and Intel x86 processors
One operation is just inherently slower at the CPU :)
Algorithms (processors execute the division and the multiplication by algorithms implemented in gates) for division are more costly than for multiplication. As a matter of fact, some algorithms for division which have a good complexity are using the multiplication as a basic step.
Even if you use the naive algorithms that are learned in school. They both have the same asymptotic complexity, but the constant for the division is greater (you have to find out the digit and that is not trivial, so you can mess up and have to fix the mess).
mod is essentially the same process as division (some systems provide a "divmod" for this reason).
The big difference between binary long mulitplication and binary long division is that long division requires you to perform an overflow test after each subtraction, while long mutiplication performs the addition unconditionally after the initial masking process.
That means you can easilly rearrange and paralleise the addditions in long multiplication, but you can't do the same for long division. I wrote a longer answer about this at https://stackoverflow.com/a/53346554/5083516
Yes, mod is more expensive than multiplication, as it is implemented through division. (CPUs generally return both quotient and remainder on division.) But both of your threads use multiplication. copy/paste error?
精彩评论