I guess it's about time I made my first post. I'm the author of cl-6502[0].
@Klaus: Your test suite is fantastic. Thank you so much for making it. It made my life much easier as someone new to hardware emulation.
I've successfully run the tests (with decimal mode disabled) on lib6502[1] after a small patch to the PHP opcode[2]. It completed in about 0.1 seconds...roughly 100x faster than cl-6502. -_-
py65 also completed the tests as reported before by @BigEd. I took the liberty of timing it with a small script[3]. py65 completes the test in about 126 seconds... roughly 10x slower than cl-6502.

Obviously, these benchmarks are both sloppy and just for fun. FWIW, I have some optimizing left to do.
Cheers,
B
[0]:
http://github.com/redline6561/cl-6502
[1]:
http://piumarta.com/software/lib6502/ Command used was 'time ./run6502 -l a test.bin -R 1000 -X 3c37'
[2]: Told ya it was small.
-#define php(ticks, adrmode) phR(ticks, adrmode, P)
+#define php(ticks, adrmode) phR(ticks, adrmode, P | flagX | flagB)
[3]: bench.sh
#!/usr/bin/python
import py65.monitor
m = py65.monitor.Monitor()
m.do_load('test.bin a')
m.do_goto('1000')
exit()