Skip to content

How-To: Import raw files out of a wget dump or similar website dump #1387

Answered by pirate
Juliaria08 asked this question in Q&A
Discussion options

You must be logged in to vote

You can technically accomplish this but it's a very labor intensive process right now, as it's not one of our main use-cases. You have to create a Snapshot for each URL, and it's recommended to add an ArchiveResult entry for each item you want to import.

  1. Go to /add/ and add each URL to create a Snapshot entry (you can deselect all the archive methods except title to avoid it trying to archive the page on its own)
  2. Go to /admin/core/archiveresult/ and create a new ArchiveResult for each file you want to store for that URL, and point it to your Snapshot created earlier for that URL
  3. Drag your existing files into the snapshot folder e.g. you can put mp4 files into ./archive/<timestamp>/media/,…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Juliaria08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants