IMHO when comparing NMOS 6502 and 65C02, RDY works a bit different, and we probably had underestimated this effect.
OurCPU aims to be cycle exact to the NMOS 6502, except for RDY.
If it would be cycle exact to the NMOS 6502 _including_ the response to RDY,
another idea (not sure if it's a good idea) for identifying "compatibility problems"
might be running OurCPU and a 6510 "in parallel".
Means to have a little PCB with a 6510 plugged between the C64 6510 socket and OurCPU,
then to wire /RES, NMI, /IRQ, RDY, AEC and the clock input together for both CPUs.
Then to connect the 6510 address and data bus and I\O port to the C64,
while leaving the address bus and the I\O port of OurCPU "open",
and to place a 74245 (or 74541) in the data bus between 6510 and OurCPU to make sure
that OurCPU can only read data, but not write it to the C64.
_If_ the CPUs would be doing the same, when the CPUs own the bus the output on the address bus
and the I\O port are supposed to be identical, same thing for the data during write cycles.
So we could compare the output of both CPUs with some 74688 comparators and latch the result
into a 7474 at the falling edge of PHI2.
When triggering the 'scope with the first "comparison error" pulse output of the 7474,
we would have the exact point where both CPUs are acting differently to the code.
Both CPUs probably might need a diffrent amount of cycles to come out of the RESET sequence,
but this could be fixed when delaying the /RES input of one of the CPUs with a 74164
clocked with PHI2 or such.
Hmm... better try if this "comparator idea" works out as intended with two 6510 chips first.