LowEndTalk

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Sign In Register

BMail.ag - Secure Email Service

Server.net

CPLicense.net

VPS Server

Buy VPN

Vultr

VMs for AI

HostDare

ReliableSite White-Label Dedicated Hosting for Resellers

25% Recurring Discount on NVMe VPS

Try EnsoVPN - Reliable VPN - 1-Day Free Trial

InterServer VPS

BMail.ag - Secure Email Service

Best VPN

High-Performance Bare Metal Server Solutions

Karvl.com

Server Mania Cloud Hosting

DataWagon Hosting

AlphaVPS Hosting

Evoxt.com

Clouvider

VPS Hosting with NVMe

Residential IPs in the US & 4G Mobile Proxies in EU & US with Unlimited Bandwidth

ReliableSite White-Label Dedicated Hosting for Resellers

Rabisu - Hosting Solutions

CloudLinux

Try EnsoVPN - Fast & Private VPN - 1-Day Free Trial

Categories

In this Discussion

New on LowEndTalk? Please Register and read our Community Rules.

All new Registrations are manually reviewed and approved, so a short delay after registration may occur before your account becomes active.

Wayback machine?

n1kko

n1kko Member

April 2014 in General

Is there anyway to save/download an old site from waybackmachine.org?

Comments

Mark_R Member

April 2014

www.httrack.com
n1kko Member

April 2014

Great thanks, I'm on Mac but can run parallels
srvrpro Member

April 2014

httrack could do the thing but it would also copy the code snippets waybackmachine.org adds, so you'll have to remove that from each page
n1kko Member

April 2014

Only seems to save index.html and goes no deeper. No limits set in settings either
jeffreywinters Member

April 2014

@n1kko said:
Great thanks, I'm on Mac but can run parallels

Sitesucker
srvrpro Member

April 2014

@n1kko set permission to disallow robots.txt file

It should not download the robots.txt file in order to go deeper. This works for most websites, not sure about waybackmachine
n1kko Member

April 2014

Tried sitesucker and just saves robots.txt with this

robots.txt web.archive.org 2013-10-02

User-agent: *
Disallow: /

User-agent: ia_archiver
Allow: /
n1kko Member

April 2014

Sitesucker allows "ignore robot exclusions" in settings working now

or Register to comment.

2008-2026 © LowEndBox & LowEndTalk. Privacy Policy. Powered by Vanilla.

Back to Top | LowEndTalk | LowEndBox | Dark Theme Config | Advertise