barnacle wrote:
You could consider charlieplexing... of course, just seven OR gates will do it, if they have up to sixteen inputs each 
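
To make the "seven OR gates" idea from the quote concrete, here is a minimal sketch in Python. It assumes the nibble is first decoded to sixteen one-hot lines, and that each segment is the OR of the digit lines that light it. The font table (lowercase b and d, segment names a to g) and the names `FONT`, `OR_INPUTS`, `decode_1_of_16` and `segments_for` are my own assumptions for illustration, not the actual matrix.

```python
# Assumed hex font: which segments (a-g) are lit for each digit 0-F.
# This is a common convention with lowercase b and d, not the poster's matrix.
FONT = {
    0x0: "abcdef", 0x1: "bc",     0x2: "abdeg",   0x3: "abcdg",
    0x4: "bcfg",   0x5: "acdfg",  0x6: "acdefg",  0x7: "abc",
    0x8: "abcdefg", 0x9: "abcdfg", 0xA: "abcefg", 0xB: "cdefg",
    0xC: "adef",   0xD: "bcdeg",  0xE: "adefg",   0xF: "aefg",
}
SEGMENTS = "abcdefg"

# One "OR gate" per segment: the list of decoded digit lines wired to it.
# In a diode matrix, each entry corresponds to one diode.
OR_INPUTS = {s: [d for d in range(16) if s in FONT[d]] for s in SEGMENTS}


def decode_1_of_16(nibble):
    """Model a 4-to-16 decoder: exactly one of sixteen lines is high."""
    return [int(i == nibble) for i in range(16)]


def segments_for(nibble):
    """Return the lit segments by OR-ing the decoded lines per segment."""
    lines = decode_1_of_16(nibble)
    return "".join(s for s in SEGMENTS if any(lines[d] for d in OR_INPUTS[s]))


if __name__ == "__main__":
    for s in SEGMENTS:
        print(f"segment {s}: {len(OR_INPUTS[s])} inputs -> {OR_INPUTS[s]}")
    for digit in range(16):
        print(f"{digit:X}: {segments_for(digit)}")
```

With this font no gate needs all sixteen inputs. If the diode count matters, one could instead OR the digits where a segment is off and invert the result.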
The matrix could be something like this:
The logic around the matrix is a bit like this:
This is for four byte address displays, two byte data displays and the last two special characters that display Modes like this:
This last one gets a slightly different matrix, the whole thing being part of a cycle/step 65x02 tracer.
Comments are very welcome