I've done all those things. Here is what I get:
Platform: Mac OS X x86 (32-bit)
Compiler: GCC 4.0.1 (Apple Inc. build 5493)
Operating System: Mac OS X 10.6.2 (Build 10C540)
Model: MacPro4,1
Motherboard: Apple Computer, Inc. Mac-F221BEC8 x.x
Processor: Intel(R) Core(TM) i5 CPU 750 @ 2.67GHz
Processor ID: GenuineIntel Family 6 Model 30 Stepping 5
Logical Processors: 4
Physical Processors: 1
Processor Frequency: 2.72 GHz
L1 Instruction Cache: 32.0 KB
L1 Data Cache: 32.0 KB
L2 Cache: 256 KB
L3 Cache: 8.00 MB
Bus Frequency: 640 MHz
Memory: 4.00 GB
Memory Type: 1600 MHz DDR3
SIMD: 1
BIOS: Apple Inc. MP41.88Z.0081.B07.0903051113
Processor Model: Intel(R) Core(TM) i5 CPU 750 @ 2.67GHz
Processor Cores: 4
Integer (Score: 2515)
Blowfish single-threaded scalar -- 874, , 38.4 MB/sec
Blowfish multi-threaded scalar -- 3704, , 151.8 MB/sec
Text Compress single-threaded scalar -- 937, , 3.00 MB/sec
Text Compress multi-threaded scalar -- 3572, , 11.7 MB/sec
Text Decompress single-threaded scalar -- 1024, , 4.21 MB/sec
Text Decompress multi-threaded scalar -- 4136, , 16.5 MB/sec
Image Compress single-threaded scalar -- 946, , 7.82 Mpixels/sec
Image Compress multi-threaded scalar -- 3643, , 30.7 Mpixels/sec
Image Decompress single-threaded scalar -- 835, , 14.0 Mpixels/sec
Image Decompress multi-threaded scalar -- 3261, , 53.2 Mpixels/sec
Lua single-threaded scalar -- 1540, , 593.0 Knodes/sec
Lua multi-threaded scalar -- 5712, , 2.20 Mnodes/sec
Floating Point (Score: 5231)
Mandelbrot single-threaded scalar -- 1212, , 806.3 Mflops
Mandelbrot multi-threaded scalar -- 4848, , 3.17 Gflops
Dot Product single-threaded scalar -- 1973, , 953.4 Mflops
Dot Product multi-threaded scalar -- 8251, , 3.76 Gflops
Dot Product single-threaded vector -- 2354, , 2.82 Gflops
Dot Product multi-threaded vector -- 10581, , 11.0 Gflops
LU Decomposition single-threaded scalar -- 315, , 280.6 Mflops
LU Decomposition multi-threaded scalar -- 1263, , 1.11 Gflops
Primality Test single-threaded scalar -- 2346, , 350.5 Mflops
Primality Test multi-threaded scalar -- 7445, , 1.38 Gflops
Sharpen Image single-threaded scalar -- 2856, , 6.66 Mpixels/sec
Sharpen Image multi-threaded scalar -- 11231, , 25.9 Mpixels/sec
Blur Image single-threaded scalar -- 3746, , 2.96 Mpixels/sec
Blur Image multi-threaded scalar -- 14820, , 11.7 Mpixels/sec
Memory (Score: 2148)
Read Sequential single-threaded scalar -- 2572, , 3.15 GB/sec
Write Sequential single-threaded scalar -- 1298, , 909.2 MB/sec
Stdlib Allocate single-threaded scalar -- 1630, , 6.09 Mallocs/sec
Stdlib Write single-threaded scalar -- 2201, , 4.56 GB/sec
Stdlib Copy single-threaded scalar -- 3041, , 3.14 GB/sec
Stream (Score: 2626)
Stream Copy single-threaded scalar -- 2280, , 3.12 GB/sec
Stream Copy single-threaded vector -- 3894, , 5.05 GB/sec
Stream Scale single-threaded scalar -- 2508, , 3.26 GB/sec
Stream Scale single-threaded vector -- 3957, , 5.34 GB/sec
Stream Add single-threaded scalar -- 743, , 1.12 GB/sec
Stream Add single-threaded vector -- 3958, , 5.51 GB/sec
Stream Triad single-threaded scalar -- 797, , 1.10 GB/sec
Stream Triad single-threaded vector -- 2874, , 5.38 GB/sec