Does lemmy have any communities dedicated to archiving/hoarding data?
Be smart and keep it all on thumb drives.
Welcome to datahoarders.
We’ve been here for decades.
Also, follow 3-2-1, people: 3 copies of your data, on 2 different storage media, with 1 kept offsite.
“Backups”? Pray tell, fine sir and/or madam, what is that?
You know there are only two kinds of people: those who do backups and those who haven’t lost a hard drive/data yet. Also: RAID is not a backup.
Still remember the PSU blast taking out my main drive plus my backup drive in like 2001. I thought I was so good because I at least had a backup 😑. Those were the days 🤷🏻‍♀️
That sounds like an adventure!
Yeah, that was me learning that a dinky PSU is your worst enemy. I upgraded my SO’s old Duron to an Athlon for work, which drew more power…
My condolences! That said, Athlons (late ’90s?) were cool.
I downloaded wikipedia a month or two ago, I recommend it.
How big is Wikipedia?
If you don’t care about edit history and only want English, there are ZIM files with images at under 150 GiB.
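If you want to poke at one of those dumps programmatically rather than through Kiwix, here’s a minimal sketch, assuming the python-libzim bindings; the file name and article path are examples, and the path layout varies between dumps:

```python
# Minimal sketch of reading a Wikipedia ZIM dump offline, assuming the
# python-libzim bindings (pip install libzim). File name is an example.
from libzim.reader import Archive

zim = Archive("wikipedia_en_all_maxi.zim")
print(f"{zim.entry_count} entries, main page at {zim.main_entry.get_item().path}")

# Articles are addressed by path; older dumps prefix an "A/" namespace.
entry = zim.get_entry_by_path("A/Linux")
html = bytes(entry.get_item().content).decode("utf-8")
print(html[:200])
```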
Okay so where do I find some cheap hard drives? Europe if possible :-)
Look for DVRs; they have huge HDDs in them, and you can find them at thrift stores for cheap.
I kind of want that hackerman’s DIY PC that runs on 18650 cells.
I still have a copy of wikipedia from 2021 somewhere on my NAS.
This is just minor datahoarding. I do it on an extreme level.
Yeah, not gonna lie, I heard someone in a YouTube video a while back talk about how the entirety of Wikipedia takes up like 200 gigs or so, and it got me seriously considering actually making that offline backup. Shit is scary when countries like the UK are basically blocking you from having easy access to knowledge.
UKGOV haven’t started on things like Wikipedia yet. They know kids use it for school, and, blinded by ideology though they are, even they can see there’d be an enormous backlash if they blocked it any time soon.
If that’s going to happen at all, I doubt it would be before the next election. That’s whether Labour get re-elected or the Tories make an unexpected comeback. You can tell how far Labour have fallen in the eyes of their party faithful when they’ve taken a Tory-drafted policy and made it their own.
Ironically, the up-and-coming third-option fascist party have said they’re going to repeal the Online Safety Act. They have other fish to fry if they get in, and they’ll want to keep their preferred demographic(s) happy while they do it.
I assume that eventually something like the OSA would come back to “protect the children”. They love the current US President.
None of this is hopeful. Take this as more of a rant.
I’m certain that when the UK forces digital ID upon the nation, it will be a requirement for access to every website.
Every day it seems the entire West is gonna be a fascist hellhole in a decade.
Yeah, it’s surprisingly small when it’s compressed if you exclude things like images and media. It’s just text, after all. But the high level of compression requires special software to actually read without uncompressing the entire archive. There are dedicated devices you can get, which pretty much only do that. Like there are literal Wikipedia readers, where you just give it an archive file and it’ll allow you to search for and read articles.
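Kiwix is the usual reader for these, but the same lookup works programmatically too. A sketch of full-text search inside the compressed archive, again assuming the python-libzim bindings (file name and query are examples):

```python
# Sketch of searching inside a compressed ZIM archive without unpacking it,
# assuming python-libzim (pip install libzim). File name is an example.
from libzim.reader import Archive
from libzim.search import Query, Searcher

zim = Archive("wikipedia_en_all_maxi.zim")
search = Searcher(zim).search(Query().set_query("data hoarding"))
print(search.getEstimatedMatches(), "matches")
for path in search.getResults(0, 10):  # paths of the first 10 hits
    print(path)
```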
If you remove topics you’re not interested in, it can shrink even more.
Sure, but removing knowledge kind of goes against what creating a Wikipedia backup is about…
Well, I doubt I will ever need to know anything about a football player or a car.
“Fellow survivors, oh my God! What are your names?”
“I’m OJ Simpson. This is my friend Aaron Hernandez. And this is his car, Christine.”
If my experience with mashing the random article button is any indicator, you could reduce the size by 30% just by removing articles on sports players. I doubt I’ll need those
I also recommend downloading Flashpoint Archive to have Flash games and animations to stay entertained.
There is a 4 GB version and a 2.3 TB version.
Is that Flash exclusive or do they accept other games from that era?
I’m not sure, but I do think it’s just Flash.
That’s quite the range
When I downloaded it years ago it was 1.8TB. It’s crazy how big the archive is. The smaller one is just so it’s accessible to most people.
When the Arch wiki was getting DDoSed a few weeks ago, I got a local copy from the AUR that was pretty handy.
We need all repos stored offline, and documentation to troubleshoot with.
For the first, I have no idea how much space we’d need. Most Linux packages are pretty light, no? But there are A LOT of them… (a rough way to estimate is sketched below).
The second is easy. Heard someone say the entirety of Wikipedia is 200 GB, so that should be doable. Don’t forget the technical wikis too: Debian, Gentoo, Arch.
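For the package-repo half, one rough way to size it up is to sum the Size fields in a mirror’s package index. A sketch for Debian’s layout; the mirror URL and suite here are just examples:

```python
# Rough size estimate for one Debian suite: sum the Size: fields in its
# package index. Mirror URL/suite are examples; any standard mirror works.
import gzip
import urllib.request

URL = "https://deb.debian.org/debian/dists/trixie/main/binary-amd64/Packages.gz"

with urllib.request.urlopen(URL) as resp:
    index = gzip.decompress(resp.read()).decode("utf-8", errors="replace")

sizes = [int(line.split()[1]) for line in index.splitlines() if line.startswith("Size:")]
print(f"{len(sizes)} packages, {sum(sizes) / 1e9:.0f} GB of .debs (main/amd64 only)")
```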
Can’t remember who it was (b3ta? popbitch? penny-arcade?), but I recently saw a comment by someone who’s been running a website since the turn of the millennium, and they said that fully 99% of the links they posted two decades ago were no longer valid.
To really put that into perspective, remember that for a site to get linked from a popular site like that, it usually had to be something of value: something with a lot of work put into it that people found interesting or useful.
It’s truly devastating how much of the old internet has died to corporations taking over.
The official Trixie USBs fit all 28 AMD64 DVDs on a 256 GiB USB stick.
https://www.linuxcollections.com/products/debian/debianusb.htm?id=51007
You’d probably want the 512 GiB one with all the sources for a real backup in this scenario.
What’s a way to create a local repo mirror?
Or, in this post-fact era, just generate a wiki with a hallucinating AI instead.
https://github.com/XanderStrike/endless-wiki
Honestly this project looks like a lot of fun.
I saw that Wikipedia was having funding problems, what happened to Debian?
They lie. Wikipedia has plenty of money. Do not give those parasites any more.
https://en.wikipedia.org/wiki/Wikimedia_Foundation#Spending_and_fundraising_practices
I have been archiving Linux builds for the last 20 years, so I can effectively install Linux on almost any hardware made since 1998-ish.
I have been archiving Docker images to my locally hosted GitLab server for the past 3-5 years (not sure when I started, tbh). I’ve got around 100 GB of images, ranging from core OS images to full app images like Plex, ffmpeg, etc.
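For anyone wanting to do similar, the mirroring step looks roughly like this. A sketch assuming the Docker SDK for Python; the registry host and image list are hypothetical, and a GitLab registry also needs a docker login first:

```python
# Sketch: pull public images and push them into a local registry,
# assuming the Docker SDK (pip install docker). Names are hypothetical.
import docker

REGISTRY = "gitlab.local:5050/archive"  # hypothetical local registry path
IMAGES = ["alpine:3.20", "linuxserver/ffmpeg:latest"]

client = docker.from_env()
for ref in IMAGES:
    name, _, tag = ref.partition(":")
    tag = tag or "latest"
    image = client.images.pull(name, tag=tag)
    target = f"{REGISTRY}/{name}"
    image.tag(target, tag=tag)           # retag under the local registry
    client.images.push(target, tag=tag)
    print(f"mirrored {ref} -> {target}:{tag}")
```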
I have also been archiving FOSS projects into my GitLab and using pipelines to ensure they remain up to date.
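The keep-up-to-date part is the kind of thing a scheduled pipeline job can run. A sketch using plain git mirror clones; the mirror path and repo list are examples:

```python
# Sketch: maintain bare mirrors of upstream repos, suitable for a cron or
# CI schedule. Mirror root and upstream list are examples.
import subprocess
from pathlib import Path

MIRROR_ROOT = Path("/srv/git-mirrors")
UPSTREAMS = ["https://github.com/curl/curl.git"]

MIRROR_ROOT.mkdir(parents=True, exist_ok=True)
for url in UPSTREAMS:
    dest = MIRROR_ROOT / url.rstrip("/").rsplit("/", 1)[-1]
    if dest.exists():
        # Refresh all refs and prune ones deleted upstream.
        subprocess.run(["git", "-C", str(dest), "remote", "update", "--prune"], check=True)
    else:
        subprocess.run(["git", "clone", "--mirror", url, str(dest)], check=True)
```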
The only thing I lack is packages from package managers like pip, bundler, npm, yum/dnf, and apt. There’s just so much to cache that it’s nigh impossible to get everything archived.
I have even set up my own local CDN for JS imports in HTML. I use rewrite rules in nginx to redirect them to my local sources.
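Roughly the shape of that setup, not an actual working config: the CDN hostname gets pointed at the local box via a DNS override, and nginx rewrites the CDN’s paths onto the archived copies. All names and paths here are made up:

```nginx
# Sketch only: hostname and paths are examples, not a real deployment.
server {
    listen 80;
    server_name cdn.example.com;  # hypothetical CDN host, overridden in local DNS

    root /srv/local-cdn;          # archived JS lives here

    # Map the CDN's versioned paths onto the local archive layout.
    rewrite ^/npm/(.+)$ /js/$1 last;

    try_files $uri =404;          # serve the local copy or fail loudly
}
```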
my goal is to be as self-sustaining on local hosting as possible.
Everyone should have this mindset regarding their data. I always say to my friends and family: “If you like it, download it.” The internet is always changing, and that piece of media you like can be moved, deleted, or blocked at any time.
The pornhub collapse should have taught the average person that.
You’re awesome. Keep up the good work.
respectable level of hoarding 🏅