What's the speed of the SRAM part, enso? Add a bit for getting on and off the FPGA, what kind of cycle time would that give you ideally?
You have 9k of block RAM - I suppose much of that would be ROM. But 1k you could place at the bottom of memory and have fast zero page and stack. Then you could add a waitstate for external SRAM, without too much performance hit. Or, to put it another way, no performance hit, because two cycles at double speed is the same performance, but now you gain by having a fast 8k ROM, fast zp and stack, and a little more fast RAM for small programs.
Edit: oops, the 72kbit of block RAM will be 8k x 9bits or related combinations. That's not so obvious how to make 9k x 8bits which is what I was thinking of. Especially with only 4 blocks... maybe it can't be done.
Ref:
http://www.xilinx.com/support/documenta ... app463.pdfOr maybe the 12kbits of distributed RAM could be used? 2k for zp, 2k for stack.