6502.org • View topic - Speeding up the 65C02

View unanswered posts | View active topics

Board index » 6502.org Users Forum » Hardware

All times are UTC

Speeding up the 65C02

Page 3 of 7

[ 103 posts ]

Go to page Previous 1, 2, 3, 4, 5, 6, 7 Next

Previous topic | Next topic

Author

Message

ElEctric_EyE

Post subject:

Posted: Mon Jan 04, 2010 10:01 pm

Joined: Mon Mar 02, 2009 7:27 pm
Posts: 3258
Location: NC, USA

Gonna have to think about this one guys. I'm intrigued. Gonna try to figure this out. Thanks? heh...

OK, I am understanding. Drawing it now.

_________________
65Org16:https://github.com/ElEctric-EyE/verilog-6502

Top

ElEctric_EyE

Post subject:

Posted: Tue Jan 05, 2010 12:14 am

Joined: Mon Mar 02, 2009 7:27 pm
Posts: 3258
Location: NC, USA

I misunderstood leee's post. I thought he was talking 1 NAND gate. But after drawing Garth's and Tony's ideas out on paper, I get it now. He was talking 1 IC, i.e. 4 gates... Jeez you guys are good.

I'll post an updated schematic soon. See if we've got it right

_________________
65Org16:https://github.com/ElEctric-EyE/verilog-6502

Top

ElEctric_EyE

Post subject:

Posted: Tue Jan 05, 2010 1:55 am

Joined: Mon Mar 02, 2009 7:27 pm
Posts: 3258
Location: NC, USA

Sorry if the .jpg is too big. It's past my bedtime. Not had time today to check the new logic out, but I think this is what you guys meant?

_________________
65Org16:https://github.com/ElEctric-EyE/verilog-6502

Top

BigDumbDinosaur

Post subject:

Posted: Tue Jan 05, 2010 3:00 am

Joined: Thu May 28, 2009 9:46 pm
Posts: 8505
Location: Midwestern USA

ElEctric_EyE wrote:

BigDumbDinosaur wrote:

... instead use Ø2 to qualify R/W and hook A15 directly to the SRAM's /CS. SRAM is going to be in anytime the address is less than $8000, so no need for any additional logic in that department. With A15 directly driving /CS, he'd have almost all of Ø2 low to get the address setup done. As soon as Ø2 went high, the SRAM would be ready for read/write. Just an opinion, of course.

Would this really work? I've never used phase2 together with R/W to get a good address decode signal.

Qualifying R/W with Ø2 works fine. I use it in my POC design to control writes to the SRAM, WDT and DUART. Works like a charm. However...

You should not be using that method to control address decode. If you do, you lose part of the valid read/write cycle to address setup time, which will set a hard limit on the maximum clock speed. Your decoding methodology should react to A0-A15 as soon as possible and R/W should only get into the picture if a write to a given address is intended to touch a different device than a read to the same address.

If you examine the 65C02 timing diagram, you'll see that R/W becomes valid before the rise of Ø2. If you want to read from a device at address X but not write to it, you'd qualify the chip select (/CS) for that device with R/W:

Code:

/CS = X & !R/W

where !R/W means R/W is high.

A write to a different device at the same address would be:

Code:

/CS = X & R/W

This sort of logic will select the desired device during the low half of Ø2, shortly after A0-A15 become valid. The device will remain selected through Ø2 high. You wouldn't use Ø2 to qualify any chip selects.

_________________
x86? We ain't got no x86. We don't NEED no stinking x86!

Last edited by BigDumbDinosaur on Wed Jan 06, 2010 5:51 am, edited 1 time in total.

Top

Nightmaretony

Post subject:

Posted: Tue Jan 05, 2010 3:57 pm

Joined: Fri Jun 27, 2003 8:12 am
Posts: 618
Location: Meadowbrook

also, all your enable pins pulled up high, you want to have them go through a 1k or 4.7k pullup resistor. we had that mistake in an arcade game and it blew out the sprite control on a fairly constant basis. (Sky shark, if you wonder which)....

_________________
"My biggest dream in life? Building black plywood Habitrails"

Top

ElEctric_EyE

Post subject:

Posted: Wed Jan 06, 2010 3:46 am

Joined: Mon Mar 02, 2009 7:27 pm
Posts: 3258
Location: NC, USA

I don't think the logical AND'ing of 2 different frequencies will yield a viable phase 2, since phase 2 has to be close to a 50/50 duty cycle.

_________________
65Org16:https://github.com/ElEctric-EyE/verilog-6502

Top

GARTHWILSON

Post subject:

Posted: Wed Jan 06, 2010 3:51 am

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8543
Location: Southern California

It's not to AND the two frequencies together, but you use the AND gates so when the gate taking in the unwanted signal has one of its inputs low, there's no AC output from that gate. The output is at a DC high however so the final AND gate passes the frequency through from the other one. IOW, you're using it as a selector switch.

Top

BigDumbDinosaur

Post subject:

Posted: Wed Jan 06, 2010 6:42 am

Joined: Thu May 28, 2009 9:46 pm
Posts: 8505
Location: Midwestern USA

ElEctric_EyE wrote:

I am really confused now... Did I draw the last schematic wrong?..... PM^

I don't think it's a matter of "wrong" so much as confusing two separate circuit features.

Forget about the clock frequency for a minute and focus on the selection of RAM, ROM, etc. ROM is a read-only device, therefore the only time when it would make sense to select it would be when the 65C02's R/W line is high (read mode). So, to select the ROM your logic would be the inverted logical AND of R/W and the address bits on which the ROM is visible. That is, your decoding circuitry would include a NAND of R/W so the ROM's /CS (chip select) input is low only when R/W is high and the ROM has been addressed by the MPU.

Conversely, the RAM that appears at the same address as the ROM would be selected only when a valid address is on the bus AND R/W is low.

Now, here's where it gets sticky. Your goal, as I understand it, is to at reset copy the ROM to RAM at the same address, completely map out the ROM and run from RAM. So the circuit at reset must obey the logic I described above: ROM access on read cycles, RAM access on write cycles. Simple enough.

However, once the ROM dump into RAM has finished, your circuit has to completely disable ROM and write-protect the RAM by guaranteeing that the RAM's /WE line cannot ever be brought low following the ROM dump. Otherwise, a wild write due to a program bug will scribble on the RAM, boogering up your code and most likely crashing the machine. You circuit has the /WE input of the SRAM at $E000 hooked directly to the MPU's R/W line. Therefore, the RAM is not write-protected, as the device you are using ignores /OE when /WE is asserted.

Another possible error I see in your circuit is in the use of Ø2 to qualify device selection. In my opinion, that's incorrect. Device selection should be a function of what the MPU places on A0-A15, subject to the R/W qualification I described before. Getting Ø2 mixed in with device selection means the devices only have the high side of Ø2 to react to A0-A15 and get ready for access. That will not work correctly as you crank up the clock speed. You want to select a device without regard to Ø2 low. The MPU will place a valid address on A0-A15 shortly after Ø2 goes low. R/W will go valid about 60 percent of the way through Ø2 low and remain valid until shortly after the next fall of Ø2. Therefore, if you qualify an address with R/W, your device should be set up for access by the time Ø2 goes high. In the case of the SRAM, it'll be ready in plenty of time, even with the MPU running at 14 MHz or better.

Here's the chip select circuitry for my POC computer. Note how Ø2 has nothing to do with device selection. Also, note that ROM is only selected when RWB (the '816 R/W output) is high. This arrangement has been tested up to 8 MHz.

While on the subject of R/W, all writes should be qualified by Ø2. You should avoid a write operation unless Ø2 is high, as D0-D7 cannot be considered valid unless Ø2 is high. Your circuit uses R/W without regard to the Ø2 timing, which will assuredly cause random errors.

_________________
x86? We ain't got no x86. We don't NEED no stinking x86!

Top

GARTHWILSON

Post subject:

Posted: Wed Jan 06, 2010 8:39 am

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8543
Location: Southern California

Quote:

However, once the ROM dump into RAM has finished, your circuit has to completely disable ROM and write-protect the RAM by guaranteeing that the RAM's /WE line cannot ever be brought low following the ROM dump. Otherwise, a wild write due to a program bug will scribble on the RAM, boogering up your code and most likely crashing the machine. You circuit has the /WE input of the SRAM at $E000 hooked directly to the MPU's R/W line. Therefore, the RAM is not write-protected, as the device you are using ignores /OE when /WE is asserted.

That's a valid consideration, although you might still want to be able to write to high RAM at various times, like to change an interrupt vector, or if you want some variable space mixed in with the code, or if you want self-modifying code (which is justifiable only on rare occasion). After pre-loading RAM, I plan to leave it writable for these reasons. When I make a programming mistake and crash my workbench computer, it virtually never messes up RAM-resident code, as it's usually just stuck in a loop where the exit condition is never met.

Quote:

Another possible error I see in your circuit is in the use of Ø2 to qualify device selection. In my opinion, that's incorrect. Device selection should be a function of what the MPU places on A0-A15, subject to the R/W qualification I described before. Getting Ø2 mixed in with device selection means the devices only have the high side of Ø2 to react to A0-A15 and get ready for access. That will not work correctly as you crank up the clock speed. You want to select a device without regard to Ø2 low. The MPU will place a valid address on A0-A15 shortly after Ø2 goes low.

Now I'm wondering if there's another big booboo in the data sheet. I just put a couple of oscilloscope probes on my workbench computer which has a 65802, and the address does come out soon after phase 2 falls. But according to the data sheet for the 65c02 (I can't find my data sheet for the '802), it's quite different. It shows that as you close in on the processor's maximum speed, the soonest time that the address is guaranteed to be valid and stable gets mighty close to the rising edge of phase 2. If that's true, I suspect that a major cause of crashing when you go too fast is that the address is not out in time before phase 2 rises. Daryl, can you check this on your '816 since you just stepped its speed way up? (Or anyone else with any WDC processor running in the teens of MHz.) At lower clock rates, yes, you would have more valid address time before phase 2 rises; but at the lower speeds the RAM access time is not difficult anyway. Common 55ns RAM will work fine with the selects qualified by phase 2.

Quote:

While on the subject of R/W, all writes should be qualified by Ø2. You should avoid a write operation unless Ø2 is high, as D0-D7 cannot be considered valid unless Ø2 is high. Your circuit uses R/W without regard to the Ø2 timing, which will assuredly cause random errors.

Not if he has the selects qualified by phase 2. The RAM can never be written to when a low phase 2 keeps the select false. If the select is qualified by phase 2, then the R/W does not need to be. The RAM address inputs are there full-time anyway, not just when phase 2 is high, so it won't get confused as to what address to write to in the scramble when the select goes true after the rising edge of phase 2. My workbench computer's RAM's select does not go true before phase 2 rises, and with 4MHz parts and RAM marked 55ns (obviously it's less), it runs up to 7MHz, and over many years of use, has never had a problem.

_________________
http://WilsonMinesCo.com/ lots of 6502 resources
The "second front page" is http://wilsonminesco.com/links.html .
What's an additional VIA among friends, anyhow?

Top

bogax

Post subject:

Posted: Wed Jan 06, 2010 7:29 pm

Joined: Tue Nov 18, 2003 8:41 pm
Posts: 250

Maybe I'm not following you here

BigDumbDinosaur wrote:

But that's what the chip select does. (no?)
So if you're talking about making sure that there's enough delay
in R/W to track the chip select logic delays, I don't see how just
gating R/W with phi2 guarantees that.

Top

ElEctric_EyE

Post subject:

Posted: Thu Jan 07, 2010 4:07 am

Joined: Mon Mar 02, 2009 7:27 pm
Posts: 3258
Location: NC, USA

bogax wrote:

...And assuming you're not going to write protect SRAM once it's written (since you don't show that).

GARTHWILSON wrote:

... or if you want self-modifying code (which is justifiable only on rare occasion).

I definately do not want to write protect the OS once it's in the SRAM. I want to try the challenge of a self-modifying OS, which is why I mentioned the possibly using FRAM a couple posts ago. FRAM holds its pattern like an EEPROM after power down, but can be written to like an ordinary SRAM, (I've not investigated the access times yet though). In the simplest example of self-modifying code, I imagine values in a lookup table averaged in over time, a long time, even when powered down. In a more advanced self-modifying code, I can imagine full subroutines being replaced based on a type of "fuzzy logic", sorta like software routines being multiplexed.

BigDumbDinosaur wrote:

...This sort of logic will select the desired device during the low half of Ø2, shortly after A0-A15 become valid. The device will remain selected through Ø2 high. You wouldn't use Ø2 to qualify any chip selects.

Your R/W-Phase2 logic will be the first I try when my circuit stops working at higher speeds. But the old school 6502 MOS manual says that A0-A15 was only valid during phase2 high. The adress decoding part of the schematic is the same as what I have already working of the Fuel injector pulse width analyzer thread, except the TTL family I use there is "F" not "AC", but it works up to 7Mhz with the 70ns EEPROM...

Your qualifying the Read and Writes with Phase2 reminds me of an 8080-style interface I tried with the TFT display, which didn't work (at sub 7MHz speeds, I later found out why and explain later on in the thread). Keep in mind though, I'm not using the 65816. The timing diag's look abit different to me, compared to the 6502.

It's here: (4th post down) viewtopic.php?t=1370&postdays=0&postorder=asc&start=75.

Or here, for the direct link: http://www.atarimagazines.com/computeii ... page17.php

_________________
65Org16:https://github.com/ElEctric-EyE/verilog-6502

Top

8BIT

Post subject:

Posted: Fri Jan 08, 2010 6:20 am

Joined: Fri Aug 30, 2002 9:02 pm
Posts: 1748
Location: Sacramento, CA

GARTHWILSON wrote:

...I just put a couple of oscilloscope probes on my workbench computer which has a 65802, and the address does come out soon after phase 2 falls. But according to the data sheet for the 65c02 (I can't find my data sheet for the '802), it's quite different. It shows that as you close in on the processor's maximum speed, the soonest time that the address is guaranteed to be valid and stable gets mighty close to the rising edge of phase 2. If that's true, I suspect that a major cause of crashing when you go too fast is that the address is not out in time before phase 2 rises. Daryl, can you check this on your '816 since you just stepped its speed way up? (Or anyone else with any WDC processor running in the teens of MHz.)

I'll try to get out my scope this weekend and check that.

Daryl

Top

BigDumbDinosaur

Post subject:

Posted: Fri Jan 08, 2010 7:20 am

Joined: Thu May 28, 2009 9:46 pm
Posts: 8505
Location: Midwestern USA

ElEctric_EyE wrote:

...But the old school 6502 MOS manual says that A0-A15 was only valid during phase2 high...The timing diag's look a bit different to me, compared to the 6502.

Except your circuit is not using the NMOS part and is expected to run much faster than any of the NMOS parts. Of course, it is possible to plug the CMOS part into most boards designed for the NMOS MPU, as the low speeds are less timing-critical. However, WDC's timing model on the 'C02 has more in common with the '816 than might seem to be the case and it's best to comply with it for a trouble-free circuit. As Garth (I think) said, "You can get away with murder at 1 MHz." You won't at 10 or 12 MHz.

My opinion, for what it's worth, is that your address decoding methodology is improper. Chips selects should not be qualified by Ø2 at all. Although bus states are dependent on the phase of Ø2, the bus changes do not coincide with the rise and fall of the clock, and that is where I think your circuit is going to have trouble.

The 65xx family has, for all practical purposes, two operating states. One is present while Ø2 is low, which is when the MPU fiddles with A0-A15 and conditions control outputs, such as R/W. During that time, the data bus is invalid. The other state is when Ø2 is high, which is when the data bus becomes valid and a read or write operation can be carried out. Your design is not acknowledging these characteristics of the 'C02 (also true of the '816) and, I believe, is headed toward a timing train wreck.

Consider your SRAM, for example. It has three control inputs: /CS, /OE and /WE. The idealized (read "correct") way to address this part is for /CS to be asserted as soon as any address to which the SRAM is to respond is placed on A0-A15 by the MPU. A0-A15 is guaranteed to be stable no more than tADS nanoseconds after the fall of Ø2, an interval of 30ns maximum on a 'C02 running at 14 MHz. During the time period denoted by tADS, A0-A15 are changing and the address bus is not valid. Add on the gate delays in your glue logic and just about all of Ø2 low will be used up in address setup. Keep in mind that the SRAM itself will have some delay after selected before its internal logic produces the correct row/column address.

Of course, the SRAM has to know whether it is to be read or written. A read operation can be qualified by inverting the MPU's R/W output and applying the output of the inverter to /OE. According to the timing diagram, R/W also becomes valid at tADS. Therefore, assuming a clock rate of 14 MHz, the needed SRAM setup will occur no more than 30ns plus gate delays after the fall of Ø2 and, if you are using 12-15ns SRAM, the part should be ready for business by the time the MPU gets around to accessing D0-D7.

Depending on the SRAM, you may be able to dispense with qualifying /OE and may be able to simply tie it to /CS. In other cases, that arrangement could be ambiguous during a write cycle, causing difficult to diagnose conditions. You need to carefully read the narrative for the SRAM you are using to verify that asserting /OE during a write operation is an acceptable procedure. If not, you must qualify it with an inverted R/W signal.

Now, here comes the important part. The MPU does not access D0-D7 until well after the rise of Ø2. In a read cycle, the MPU doesn't sample D0-D7 until tDSR nanoseconds before the next fall of Ø2. At 14 MHz, this will be 10ns. Additionally, the data must remain on D0-D7 for at least 10ns after the fall of Ø2, referred to as tDHR on the timing diagram. Therefore, the addressed device ***MUST*** stay open for access until tDHR has expired. There's no sure way to guarantee this if addressing is qualified by Ø2 as you are doing. In your design, as soon as Ø2 falls, your chip decode will become invalid, chip select will be deasserted following gate delays (very short with 74AC logic), and in the case of the SRAM, it is not likely you will meet the tDHR timing requirement. Ergo there won't be any data to read.

During a write cycle, you will have another circuit bug. As I said above, the data bus is never valid when Ø2 is low. If you directly connect the SRAM's /WE input to the MPU's R/W output, you will be telling the SRAM to accept data (assuming a write cycle) while Ø2 is low, a period during which the MPU is fiddling with the address lines. At higher clock speeds, it's very likely the SRAM may react to /WE while the address bus is in transition. The result is the invalid data on the data bus will be written to an invalid address in the SRAM.

To avoid this sort of write glitch you need to qualify R/W with Ø2 so that the only time the SRAM's /WE can be asserted is when Ø2 is high. Again referring to the timing diagram, the MPU will place data on D0-D7 at tDMS nanoseconds after the rise of Ø2, a period guaranteed to be no more than 25ns at 14 MHz. By that time, your SRAM will have already been selected and as soon as Ø2 rises, /WE will see a low input and the SRAM will accept data from D0-D7. The MPU will continue to hold the data on D0-D7 for tDHW nanoseconds after the next fall of Ø2, which would be at least 10ns.

The write glitch can occur with any hardware that can accept data, such as a 6522, a UART, etc. You must qualify R/W with Ø2 to assure that data cannot be written to a chip until after the address inputs have stabilized.

Lastly, in situations where all you need is an inverter, it's best to use an inverter, not a NAND gate with all inputs tied together. Inverters are generally faster that NANDs, and can be had six to a package (hex inverter), thus using less board real estate.

_________________
x86? We ain't got no x86. We don't NEED no stinking x86!

Top

GARTHWILSON

Post subject:

Posted: Fri Jan 08, 2010 9:36 am

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8543
Location: Southern California

I hate to persist, but there are too many problems with BDD's post to just let it go.

Quote:

Therefore, the addressed device ***MUST*** stay open for access until tDHR has expired. There's no sure way to guarantee this if addressing is qualified by Ø2 as you are doing. In your design, as soon as Ø2 falls, your chip decode will become invalid, chip select will be deasserted following gate delays (very short with 74AC logic), and in the case of the SRAM, it is not likely you will meet the tDHR timing requirement. Ergo there won't be any data to read.

False, for two reasons. For one, his glue logic takes most of that 10ns, and for another, bus capacitance will easily hold the data for the rest of the time, as nothing else is driving the data bus yet. The small CMOS leakage along with the capacitive loading on the lines would hold the logic state for a good dozen microseconds (and possibly much longer) if you were to stop the clock immediately after phase 2 goes down.

Quote:

During a write cycle, you will have another circuit bug. As I said above, the data bus is never valid when Ø2 is low. If you directly connect the SRAM's /WE input to the MPU's R/W output, you will be telling the SRAM to accept data (assuming a write cycle) while Ø2 is low, a period during which the MPU is fiddling with the address lines. At higher clock speeds, it's very likely the SRAM may react to /WE while the address bus is in transition. The result is the invalid data on the data bus will be written to an invalid address in the SRAM.

No again, because the select logic does not have the RAM selected during the phase-2-low period. You're saying it cannot work, but the fact is that it has worked perfectly 100% of the time for all 16 years in the field in our products where the R/W\ line goes directly to the SRAM with no phase-2 qualification. Of the very few computer problems we've had, they've all been due to bad connections.

Quote:

The write glitch can occur with any hardware that can accept data, such as a 6522, a UART, etc. You must qualify R/W with Ø2 to assure that data cannot be written to a chip until after the address inputs have stabilized.

As I said before, the 6522 needs the R/W\ and chip selects valid before phase 2 rises, or it won't work. I found that out the hard way 23 years ago. I could have saved myself some time if I had caught it in the data book. And even though its chip selects are true, it won't try to write until phase 2 goes up. I have a lot of experience with the 6522; and after that time 23 years ago, my 6502 computer designs have always worked on first try.

Quote:

Lastly, in situations where all you need is an inverter, it's best to use an inverter, not a NAND gate with all inputs tied together. Inverters are generally faster that NANDs, and can be had six to a package (hex inverter), thus using less board real estate

The 74AC00 NAND and 74AC04 inverter are within half a nanosecond of each other at the max prop delay. Not enough difference to worry about. I will say that the '14 is slower than the '04 though, so if you need speed and not the Schmitt-trigger input, then use the '04 (or even the '00), not the '14. Sometimes you'll use fewer parts by using otherwise unused sections of the '00 for example when going to a 04 would mean adding another IC if there aren't any unused 04 sections available. In Electric_Eye's case though, it might be good to go ahead and add that extra IC so he'll have faster inverters where he doesn't need the Schmitt-trigger inputs.

Top

8BIT

Post subject:

Posted: Sat Jan 09, 2010 8:51 pm

Joined: Fri Aug 30, 2002 9:02 pm
Posts: 1748
Location: Sacramento, CA

GARTHWILSON wrote:

OK, got my ancient scope out and this is what I saw.

CH A connected to A0, CH B on the TTL Oscillator output (trigger)

Pulse width confirmed to be 69.8 ns on the clk (14.318 MHz).

A0 begins to change states at 12.5ns from falling edge of clk. It reaches the new state at approximately 26ns. This would infer that the max clock speed for my SBC-3 would be about 19.2 MHz.

The datasheet gives max setup time, tADS, as 30ns.

For those interested in the BANK address setup time, the tBAS spec has a max of 33ns. I measured approx 26ns on that line as well.

Daryl

Top

Page 3 of 7

[ 103 posts ]

Go to page Previous 1, 2, 3, 4, 5, 6, 7 Next

Board index » 6502.org Users Forum » Hardware

All times are UTC

Who is online

Users browsing this forum: No registered users and 41 guests

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum