6502.org Forum  Projects  Code  Documents  Tools  Forum
It is currently Thu Nov 14, 2024 6:50 am

All times are UTC




Post new topic Reply to topic  [ 23 posts ]  Go to page 1, 2  Next
Author Message
PostPosted: Sun Jun 02, 2024 8:59 pm 
Offline

Joined: Tue Jul 05, 2005 7:08 pm
Posts: 1043
Location: near Heidelberg, Germany
I've recently noticed that the links in the datasheet archive section of 6502.org seem to point to the internet archiv web.archive.org.

I understand that this could be a measure to reduce bandwidth, but is this true, is this permanent?

(the performance - time for loading the doc - of the internet archive is way worse than what I remember from 6502.org...)

Thanks
André

_________________
Author of the GeckOS multitasking operating system, the usb65 stack, designer of the Micro-PET and many more 6502 content: http://6502.org/users/andre/


Top
 Profile  
Reply with quote  
PostPosted: Sun Jun 02, 2024 9:28 pm 
Offline
User avatar

Joined: Thu Dec 11, 2008 1:28 pm
Posts: 10977
Location: England
(I think it's been that way for a while, and like you I imagine it's to save bandwidth. But at present (May 2024) the internet archive is under some denial of service attack, which might explain why it's not always as fast as we're accustomed to seeing.)


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 12:35 am 
Offline
User avatar

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8540
Location: Southern California
Andre, where did you see this?  I was surprised by what you said, so I went to that section, and went down the list on every brand, mousing over all the links to see what shows at the bottom of the screen, and every single one of them started with "http://6502.org/documents/datasheets/".

_________________
http://WilsonMinesCo.com/ lots of 6502 resources
The "second front page" is http://wilsonminesco.com/links.html .
What's an additional VIA among friends, anyhow?


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 5:34 am 
Offline

Joined: Tue Jul 05, 2005 7:08 pm
Posts: 1043
Location: near Heidelberg, Germany
But if you click on them they end up in the internet archive.
Not sure if all of them but many.

The one I tried yesterday was the Rockwell 6545-1, but I've seen others before.

André

_________________
Author of the GeckOS multitasking operating system, the usb65 stack, designer of the Micro-PET and many more 6502 content: http://6502.org/users/andre/


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 6:04 am 
Offline
User avatar

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8540
Location: Southern California
fachat wrote:
But if you click on them they end up in the internet archive.
Not sure if all of them but many.

The one I tried yesterday was the Rockwell 6545-1, but I've seen others before.

I clicked on it, and the .pdf came up, without going to the internet archive.  So then I did a <Ctrl>U to see the page source, and took this screenshot of the part about the Rockwell 6545-1:
Attachment:
datasheets6545img.gif
datasheets6545img.gif [ 17.95 KiB | Viewed 819 times ]

As you can see, there's nothing there to refer to archive.org, or to anything outside 6502.org.

_________________
http://WilsonMinesCo.com/ lots of 6502 resources
The "second front page" is http://wilsonminesco.com/links.html .
What's an additional VIA among friends, anyhow?


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 6:49 am 
Offline

Joined: Tue Jul 05, 2005 7:08 pm
Posts: 1043
Location: near Heidelberg, Germany
Strange. Just tried again, and the browser displayed the PDF with a URL shown (in the address bar) from the internet archives. I use Firefox if that matters. Will try other browsers today

_________________
Author of the GeckOS multitasking operating system, the usb65 stack, designer of the Micro-PET and many more 6502 content: http://6502.org/users/andre/


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 6:51 am 
Offline
User avatar

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8540
Location: Southern California
I use firefox too.  You're outside the US, right?  I wonder if that has anything to do with it.

_________________
http://WilsonMinesCo.com/ lots of 6502 resources
The "second front page" is http://wilsonminesco.com/links.html .
What's an additional VIA among friends, anyhow?


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 7:36 am 
Offline

Joined: Tue Jul 05, 2005 7:08 pm
Posts: 1043
Location: near Heidelberg, Germany
Ok, I tried it on the PC again, tracing the web access. See screenshot.


Attachments:
File comment: Screenshot of list of web requests following the click on the link to the Rockwell 6545-1 datasheet in the documents/datasheets/rockwell section
Screenshot_20240603_093353.png
Screenshot_20240603_093353.png [ 322.93 KiB | Viewed 808 times ]

_________________
Author of the GeckOS multitasking operating system, the usb65 stack, designer of the Micro-PET and many more 6502 content: http://6502.org/users/andre/
Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 7:39 am 
Offline

Joined: Tue Jul 05, 2005 7:08 pm
Posts: 1043
Location: near Heidelberg, Germany
P.S.: interestingly the actual PDF seems to be included in the original response, and even two responses from the internet archive. So my browser loads the actual file three times...?

The transferred number of bytes indicate this, as well as clicking on the "response" tab in the browser console, where all three requests with >5MB get a long encoded (base64?) response string.

Oh, the wonders of the modern web...

_________________
Author of the GeckOS multitasking operating system, the usb65 stack, designer of the Micro-PET and many more 6502 content: http://6502.org/users/andre/


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 7:48 am 
Offline
User avatar

Joined: Thu Dec 11, 2008 1:28 pm
Posts: 10977
Location: England
Two of those accesses are 302 - I think they are redirects. The size of the PDF is in the header, but I don't think the PDF is downloaded each time. Might be worth looking a little deeper.

I think Mike has put in the redirects in some straightforward way. If anyone were scraping the document archive, they wouldn't cost him bandwidth (or slow down our accesses to the forum!)


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 8:05 am 
Offline
User avatar

Joined: Wed Feb 14, 2018 2:33 pm
Posts: 1485
Location: Scotland
I tried this. I tried to get the link pointing to the Rockwell 6502, 2nd link on:

http://archive.6502.org/datasheets/rock ... essors.pdf

This sends me a redirect to

http://6502.org/documents/datasheets/ro ... essors.pdf

and that sends me a redirect to

https://web.archive.org/web/20221112220 ... essors.pdf

So someone/thing has intentionally created this archive of 6502.org over on archive.org and altered the server to redirect requests to archive.6502.org which forwards to archive.org....

This also may be a geo-fencing thing. I can't be bothered to check right now as it would mean finding a US based VPN outlet.

Probably OK, however, as of late archive.org is slow and also more frustratingly, archive.org is BLOCKED by default by many ISPs - especially mobiles ones as, as well as being full of technical stuff it's also full of porn and other potentially contentious material.

So I'm blocked from accessing it when out and about which for me right now is 2-3 days a week.

I could go through the shenanigans of enabling adult content for the mobile ISP but it's not worth the hassle.

-Gordon

_________________
--
Gordon Henderson.
See my Ruby 6502 and 65816 SBC projects here: https://projects.drogon.net/ruby/


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 8:14 am 
Offline
User avatar

Joined: Fri Aug 30, 2002 1:09 am
Posts: 8540
Location: Southern California
He must have some sort of redirect set up then.  I just looked in my history, and archive.org does show up.

_________________
http://WilsonMinesCo.com/ lots of 6502 resources
The "second front page" is http://wilsonminesco.com/links.html .
What's an additional VIA among friends, anyhow?


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 10:22 am 
Offline
User avatar

Joined: Thu Dec 11, 2008 1:28 pm
Posts: 10977
Location: England
It's not a geo thing - browserling has the same result. (Also a way to access if your ISP isn't happy.)

Another possible workaround/solution is archive.today although again it's a bit subject to ISP blocking (which is why it has a presence on many top level domains.)


Top
 Profile  
Reply with quote  
PostPosted: Mon Jun 03, 2024 3:39 pm 
Offline
User avatar

Joined: Sun Jun 30, 2013 10:26 pm
Posts: 1949
Location: Sacramento, CA, USA
I'm getting redirected too, but archive.org is currently quite responsive for me, so I wouldn't even have noticed under normal circumstances.

_________________
Got a kilobyte lying fallow in your 65xx's memory map? Sprinkle some VTL02C on it and see how it grows on you!

Mike B. (about me) (learning how to github)


Top
 Profile  
Reply with quote  
PostPosted: Tue Jun 25, 2024 6:06 pm 
Offline
Site Admin
User avatar

Joined: Fri Aug 30, 2002 1:08 am
Posts: 281
Location: Northern California
fachat wrote:
I've recently noticed that the links in the datasheet archive section of 6502.org seem to point to the internet archiv web.archive.org.

I understand that this could be a measure to reduce bandwidth, but is this true, is this permanent?

No, this is not intended to be permanent.

In the Git repository for the website, I have a SQLite database file which maps all of the PDF files in the documents archive to known-good copies on archive.org. I originally did this so that the website can be run locally or can be recreated if something happens to me.

A few months ago, we were having problems where the entire documents archive was being downloaded frequently. The downloaders would sometimes put so much load on the server that the forum became unusably slow. Blocking IPs didn't help because whenever I blocked some, the bulk downloading would start again from new ones.

I started redirecting the document files to archive.org to help deal with this. I was hoping that the people doing the bulk downloading had good intentions and might stop if they realized everything was already backed up on archive.org. However, whenever I look in the logs, they don't seem to have let up. I'd like to restore serving directly from 6502.org and I think the long-term solution will be some kind of rate limiting where an IP address will be redirected to archive.org if it exceeds some reasonable number of downloads within a time period.

_________________
- Mike Naberezny (mike@naberezny.com) http://6502.org


Top
 Profile  
Reply with quote  
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 23 posts ]  Go to page 1, 2  Next

All times are UTC


Who is online

Users browsing this forum: Google [Bot] and 30 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to: