Since your primer directly links to this forum thread:
To be honest Garth, I personally don't think it's very fair to say "don't run the buses off the board" and use serial interfacing and the VIA, and call it a day. Unless you're using wire wrap (time consuming, and high initial cost for sockets and tools, but easier to make changes) instead of making a PCB (defer the time cost for a monetary cost + sockets), the ability to add new
high speed parallel I/O is very difficult without having an expansion bus.
I can only speak for myself, but one goal of me making a 65xx computer is to have something which I can interface parallel I/O with, just by adding a device to the 65xx bus. This is possible with near-zero prop-delay thanks to your creative incomplete address-decoding scheme that you placed on the primer. I'm not saying I2C, SPI, or interfacing using the VIA/PIA's output doesn't have it's place, but there's at least three things that come to mind:
- If I wanted purely serial I/O besides a VIA, I would've gone with a microcontroller instead of learning how to interface to the 6502.
- On a similar token, some devices are parallel by nature, and sometimes interfacing through serial I/O expander becomes difficult timing-wise, especially if the parallel device has control signals that must be asserted beforehand.
I'm not the analog designer by trade, you are

. I think it would be nice if an example parallel bus that could work at high speeds be added to the primer, while still exercising caution. One doesn't even have to bring all the address lines out- just enough for I/O-device select, CLK, data, and power. I've never taken an RF course, but I'm certainly willing to take a chance on making my own 65xx/65816 parallel bus- or use an existing generic bus such as Wishbone and create a workable physical layout. PC-104's may also be a candidate. The 65SIB is giving me ideas on how to design such a bus as well.
I'm happy to discuss my reasoning further in DM (too far ahead in the future for me to make public promises now).
Of course, the alternative COULD be to just have the PCB remade, but...

. I personally think it's better to add incrementally while still deferring the time cost of wire-wrap.
EDIT: Yes, I see Jeff Laughton's single cycle I/O trick, so there's no penalty for accessing data through the VIAs. It doesn't solve the problem of adding more fast parallel I/O. VIA also doesn't help interface 8-bit devices which require more than two control signals, or an 8-bit device with registers, such as a 6845 (example)

.