Do you mean that the RTX2000 was significantly faster than the MSP430, or vice-versa?
I suspect the RTX2000 was the hotter performer, as far as processing power goes, since it routinely did 16 MIPS, and Forth MIPS to boot, and 60MIPS peak. The MSP430 can supposedly do 16MIPs, but not Forth MIPS, and the 16MIPS is not taking off for the instructions that take more than one cycle.
What is speed needed for?
In the old days (up through the early 1990s), speed was an issue. For example, Testra's motion-control board for the laser-etcher based on the Dallas 80c320 was too slow, so Testra introduced the MiniForth processor that was less expensive and higher speed than the competitor's MC68000 board. Now the MiniForth has been ported to an FPGA and is much faster (maybe 20 or 40 times faster; quite a lot). It would smoke the RTX2000 or MSP430 easily. It seems like it would literally smoke the machine too --- there are physical limitations to how fast a mechanical contraption using gears and belts can move before smoke starts coming out.
With the 65VM02 my goal is to support high-level language programming (with the byte-code VM) and support a multi-tasking OS (with the D register and EXA instruction, etc.). These are conveniences for the programmer --- these features would allow programs to be written more quickly than they could be written using C on an MSP430, and execute more quickly too --- I'm not trying for super high-speed though, as I can't think of any application that needs this.
The MSP430X extension with 20-bit addressing adds additional instructions that can require up to 10 clock cycles.
What is 1MB+ memory needed for?
In the old days, memory was an issue. Most of the processors were limited to 64KB and this was a problem in some programs, and those bank-switching schemes were terribly inconvenient.
On the 65VM02, the byte-code is pretty compact compared to the machine-code on almost any processor, so programs should be smaller. A 128KB system should support very big programs --- I doubt that anybody is going to complain that their program is too big to fit.
The far-memory can be as much as 16MB, which is a lot more than the 1MB that the MSP430 addresses, but it is less convenient because the only instructions that access it are FLDA and FSTA, both of which use (direct), Y addressing with a three-byte pointer in the direct-page. I'm primarily thinking of the far-memory being used as a RAM-disk to hold files that go to or from a desktop computer via a UART.
One application that uses a lot of memory is TCP/IP. The eZ80 was provided with 16MB total memory for the purpose of supporting TCP/IP. I don't know anything about TCP/IP though. What would the 65VM02 need to support this?
Another thing that I don't know anything about, is USP. This requires hardware support. USP is a lot faster than a basic UART --- but I don't know why transferring files at high-speed is needed --- my 65c02 cross-compiler had the Apple-IIc target and MS-DOS host talking at 19,200 baud, which was adequate for my purposes.
I was told that all cars sold in America and Europe (except Toyota) use a PowerPC variant with an eTPU coprocessor for I/O. The 65VM02 isn't going to be as powerful as that. I don't know why such a powerful system is needed though. Car engines still run at about 5000 rpm max, the same as they did in the 1980s. For an eight-cylinder car, that is 80,000 times per second that you have to turn a spark on or off, which is once every 12.5 microseconds. I would expect that a 25 Mhz. 65VM02 with a coprocessor for I/O could be fast enough to do this --- the multi-tasking OS should work well for keeping track of all the inputs from the various sensors --- this is the kind of application that I would hope the 65VM02 could be used for.
Who would care though? Saving a few dollars on the cost of the computer isn't going to have a significant effect on the profit-margin of a car that is sold for tens of thousands of dollars. The car manufacturers would likely stick with the PowerPC/eTPU system because they already have the code written, and it is more than powerful enough, whereas a marginal system might prove to be too slow.