EEyE:
Perhaps you might consider a low pin count bus like Intel's LPC bus for inspiration. Graphics transactions tend to be characterized as sequences of data starting from an initial address and proceeding linearly. Transmitting the address for each data is somewhat redundant. Like the command / address cycle of an SPI Flash/RAM, the target of your read/write operations can capture the initial address and increment the address internally as the data is read from / written to your graphics slave device.
AH I see your point. The data is redundant. I think the key is going to be in the state machines and how to send the address and data efficiently. Speed is always key in graphics.
Considering this in a graphics type system here, where the CPU is not technically 'sharing' video memory but rather sending commands to a hardware plotter it's sort of a different beast. I should make that clear here. Not every pixel is plotted linearly like when doing characters, sprites and such. Lines or circles require jumping around the map. However, plotting characters obviously will be needed... But I must resist the temptation to get into the graphics side of this in too much detail else the main goal might be lost. I just need to design the possibility into the hardware.
I think the hardware design idea is almost solidified, there are the final 19 pins to be used for the Comm port. Heck, it could be 18 data and a R/W line. The possibilities are there. I just need to solidify the hardware side which I believe is very close. I just have to recheck, recheck, recheck,.....