This is a sub that aims at bringing data hoarders together to share their passion with like minded people.
The-Eye.eu: We now host the largest open repo of piracy metadata online today, with only more to come!
Update:
-
1TB+ in 4,000,000+ files.
Hey DH/OD folk, it's been a little while but for those of you not in our discord community here's a significant update you may have missed, while a lot has changed and we've had many new additions and other side projects to work on lately this is a pretty significant update, so I'll just get right into it.
What is Piracy Metadata?
This could get long and convoluted but simply put when you download pirated content from reputable scene groups you get a few extra files along with the movie, show, software or whatever it is you downloaded, the scene has strict rules and standards on how they release content requiring certain files to be packaged alongside them. For all releases the most common of these files being nfos which explained very basically is an information file that will give you details about who released it, their source, the specifications of the files, descriptions, etc. along side these depending on the type of release you'll also find sfv files, media samples, screenshots, playlist files (usually m3u) among other such things.
Speaking personally I used to be more heavily involved in the scene than I am today, pre my datahoarding days for the most part and it didn't until fairly recently occur to me that nobody is doing a great job of archiving and serving these files en masse back to the public, maybe for obvious reasons in some cases. There have been many sites over the years (nfohump, orlydb, crackwatch, srrdb, predb, nZEDb to name a few) that have logged scene releases, served metadata, etc. but over the last few years these sites have either gone the way of the dinosaur or become very restrictive when it comes to allowing the general public to download everything, so this is how the piracy metadata archive came about at the-eye.
What is the Artscene?
The above may sound boring to most but what draws a lot of people to these files isn't the info on pirated media but more so the art that often comes along with them, here are a few examples...
These are just some quick examples in png format for ease of viewing, originally these files were .nfo but due to the character encoding and the inability to (easily) display them properly on the web many sites convert(ed) their nfos to png, the-eye will serve a mix of nfo/txt/png where it makes sense or what is available, in the future we will be offering everything converted to png for those folk who just want to view the artwork simply, there are also nfo viewers that help in viewing the nfo files as originally intended.
Here are some resources and videos worth taking a look at that explain much more than I imagine you want to read from me.
-
The Art Of Warez, a recent 30 minute explanation and round up of the Artscene paired with some very prolific examples from many groups.
-
Jason Scott aka /u/textfiles BBS The Documentary Part 5 - Artscene, This is a great watch and interviews key players from the early days of the Artscene.
-
artscene.textfiles.com related files and art packs served in compressed formats.
-
Demoscene - Wiki The Demoscene is what came about when artists just wanted to display their art aside from the piracy.
The rabbit hole is deep here, this just skims the surface but makes for a good jump point as you dive in.
The Data!
As we've just launched this a few days ago we're currently only serving 13,971,540
items in 330GB
but I wanted to get this out sooner rather than later so as to attract possible data donors should someone have something I'm missing. I have a little over 5TB yet to be wrangled into some coherent order so I'll be adding content in some form or another over the next few months and directories are subject to change in this time. We will also be making this directory full-text searchable via our new search platform at searchin.the-eye.eu but that will come down the line given file count it's no small task to index everything and it will require some downtime to get out of the way faster, that will be announced and added here over the next few days.
So that's the data for now, depending on popularity we may launch this again sometime on a subdomain like we have for exodos.the-eye.eu as of late.
*video samples may or may not be removed in the ^future
Community
You can reach me here on reddit, in the r/DataHoarder IRC (GreenObsession) or on our discord server. Come chat to everyone, see our new content before anyone else and join other like minds.
I just want to note, that .nfos and related text files contain the whole history of computing piracy from day 0.
Groups mergers, applications, fights between groups, random comments, shutdowns, FBI problems, etc...
And all this info is spread in myriad of text files, for all platforms, I only got a PC since 30y ago, but look at the Amiga or C64 scenes, really really huge. Also if you jump in the disk zines bandwagon...
Fascinating stuff.
We never had nfo in the old days when we copied floppy disks. It was only when they started needing to crack them that nametags were added.
You are right, in old DOS times, all we got, with luck, was some random LHZ or Pkarc compressed file from some friend, or if you were lucky from someone working in a store, or in a University.
First text "nfo" files I remember in PC was from The humble guys, or some cracker's notes inside the compressed file.
My first contact with real scene was with ftps in my University years, it's fun because prior that I only got BBS contacts, so I have many demoscene files, but not "scene" stuff, proper releases and so...
Anyway, good old times.
Edit: I've been collecting PC cracking scene material from many years, unluckily many many releases are missing or maybe lost forever, if anyone has any file related to the Dos cracking scene, groups as UCF, PC, Core, etc... I am interested in the cracks releases from these groups, not apps or games "releases". Thanks to .nfos from these releases, I have complete releases lists, so we can check what it's missing :D
Look at http://scenelist.org. >2tb of this stuff :)
intros back then. with all the drama.
And that was the reason I had to suffer with this every year and as punishment in IS shop.
You got my vote
<3
What about your $$$? 🤔
We at The-Eye operate off of a strict donation only model. You can come join us on our discord server to learn more on the community tab on our site.
Oh, I am there. :D I was joking about authors vote (support) , but what's about his money (which are needed to operate) ? Excuse me my sense of humor, but servers require electricity, which costs a lot. Plus this awesome internet connection...
more than srrdb?
Yes, because we have srrdb too.
Including all srr/srs files?
Yes.
This is really really nice.
Whereabouts on the-eye can I find the srr stuff, or is that part of the unwrangled 5TB? (If searchin.the-eye is the sole search tool now, it seems dead atm :/ )
I use srr's extensively - metadata like file sizes and hashes are a great way to find things on the interwebs.
In any of your archiving endeavours, do you try to keep these things up-to-date?
That is amazing. Great job!
Comment deleted by user
i recommend https://infekt.ws/ for windows
With a bit of work, Vim can also view NFO files correctly...
https://www.reddit.com/r/vim/comments/aj9ejv/view_nfo_files_with_vim/
No one does it better than Notepad++
infekt does
Comment removed by moderator
Depending on your editor character support is usually the issue, if you compare what you see in your editor to what you see in an nfo specific app and don't see any differences then you're using a good editor, notepad++ on Windows or Geany on Linux display nfos quite accurately if not perfectly in a lot of cases.
However some nfos use obscure characters that either straight up aren't supported or render differently than intended, while you maybe happy with what you see in an editor it may not be what was originally intended.
Directly calling the material piracy is painting a huge target on your backs, are you sure your current colo provider won’t dump you if they get wind of this?
The only thing I see some concern with in this respect is the media samples, but also at the same time not so much. I'm not going to shy away from what it is, or why it exists. Calling the directory Piracy was done on purpose for this exact reaction. You, hook, line, sinker.
And what... is a pirate going to dmca his ansi, yeah okay.
Comment deleted by user
DMCA compliance, no ads or reuse of trademarks and zero profit.
A whole bunch of stuff on The Eye can be DMCA'd, right? All the audio books and Windows ISOs, for instance.
So essentially these rely on the IP owners not bothering to file a DMCA takedown, or them simply not being aware of the site. Which means that as more DMCAs are filed, you'll ultimately end up with a collection of data that is either not DMCA-able or is too old/obscure to bother DMCA-ing. Is this correct?
Comment deleted by user
Can you imagine though? That shit would be hilarious.
From what I can understand from his comments on posts, they have everything figured out so they won’t be shut down
You should post this to r/CrackWatch
Done.
Why does the video vault require a discord account? Yeah, I'm good. Discord doesn't respect their users.
We use Discord for auth, we're aware of the growing issues with Discord and we're looking at other options however replacing it is a lengthy and highly involved process. I personally didn't choose to operate on Discord but at the moment we have a lot of our operation tied to it.
Have you maybe heard of Matrix?
The only down side is having to host it yourself somewhere, while discord allows you to create your own server for free, but at least you know that Matrix won't steal you any information. And i'm sure a simple VPS will do the job.
Yeah, Matrix is in the top 3 options we're looking into and hosting ourselves is actually a selling point for us, retaining control over everything in house is what we're looking for. At our scale a small vps wouldn't be viable but we have no shortage of capable servers.
Why not just host your own IRC?
Comment deleted by user
Curious, any reason why most databases are open but the video vault is walled off? Due to the type of content?
Donor perk, most of the content is vhs recordings with commercials and the like.
Thats simply beautiful. Thanks. Gonna make a donation
Is there a preferenced way to donate ?
At this time of the month there's no preferred method as we're paid up into January and there's no immediate rush for donations so whatever works best for you, see the bottom of the post and thank you for the support.
I'm so confused and fascinated at the same time.
I learned so much today. Everything made so much more sense
I always just delete all that stuff and just keep the actual file I want.
:(
I have no need for the sample files, sfv files and everything else. The only thing I find remotely useful is the NFOs, those are usually overwritten though by Kodi, Plex or Jellyfin though.
The samples I understand but given the sub we're in to not care about sfv and the integrity of your files is going to rattle some cages.
Oh, that's what those things are for?
It's mostly media files I have and everything in my network is Linux, also I'm not paranoid about security. I believe NZBget and/or Sonarr/Radarr do integrity checking, so it may use them. I just never do it myself.
Wait this word is allowed here?
No absolutely not! To the Gallows with you for uttering the D word!
I know you have the storage but why don't you just keep the plaintext and convert them to PNG on demand?
Because sites that do that (crackwatch for example won't give you the .nfo and only give you png on demand, in browser, not easily scraped) are one of my motivations for this in the first place and also because we're an open directory so the point is to provide open access to all data at all times made as simple as possible to scrape, mirror, view.
Storage is cheaper than compute.
how do people hide their score?
This is a feature of reddit that moderators can use in their toolbox. TLDR; spam or brigading prevention.
More explanation(s) can be found in this r/askreddit thread!
Comment deleted by user
Uh the tldr is for the askreddit post.
Fascinating. I'd also want to mention the chiptune songs that came with all these keygens over the years - any chance of including those in this project?
Maybe at a much later date, first focusing primarily on the text based metadata.
Can't speak on when it was last updated but this site was really good
http://keygenmusic.net/
Goddamn thank you
Just imagine a 3.5 PB Plex Server hosted by u/-Archivist
I'm aware of a few 600T-1P plex servers, some operating for profit others offering slots free as long as you use them often. Not something I want to get into maintaining but looking in the right places and knowing the right people in the community you can secure yourself more media over plex than you're ever going to consume.
Are you talking PDH? Are there more of them?
PDH is on the small end of the scale, there are many more.
Ho Lee. Any hint on where to look for them?
I'm all ears! I have NO idea where to look
I didn't know there was a mirror of Beta Archive. Anyone know what happened to their source code section(s)?
We're not currently hosting the whole thing due to pushback and such issues from their staffers, it's a whole mess I'll deal with again when I have nothing better to do.
Fair enough... I haven't logged on to them for many years, just read that they had to drop source code because of legal issues... Not as though it was that useful, it was just interesting to learn from.
Ahh okay, I wasn't actually aware of their dropping source as I hadn't paid much attention to the project much more than Hey this should be available without the barrier to entry.
What source code sections?
They used to host source code from various releases that were hard to find - there were a few outdated leaks, but some interesting ones like the Microsoft Research kernel, singularity etc.
Amazing, thank you for doing this!
Holy shit dude, HappyHippo and KopyKatz. That brings back memories.
Why doesn't the-eye.eu default to https?
More than a few reasons we operate this way, the option is there obviously but we chose not to force it mostly due to some lowlevel tools built for older hardware that had issue. (convoluted, early win stuff)
Ah, I see. Thanks for the reply!
Noob question, is this safe to download without a VPN?
Yes, perfectly.
Read this.
Watch this.
STOP USING VPNS!!
I thank you kind sir.
But are you ok if people accessing and downloading with VPN from the-eye.eu? Just curious
Sure, we don't actively track where our traffic is coming from unless it's malicious.
Razor 911 courier (early 90s) checking in. Thanks for this!
Ohh hello there, nice to meet you. Would you happen to have any materials to hand worth adding to the archive? I've been slowly reaching out to groups to get nfos directly, but so far the paranoia is strong haha.
I have some very old dos archive cds in the corner. I’ll dig em up and see what I can find!
Lovely stuff, thanks man.
Hey did this site get nuked? Where can I get a mirror/alternative?
Any news?
no
This is very cool. You guys should run a patreon
Comment deleted by user
.eu
wasn't my choice but it was somewhat relevant at the time as we hosted on rented dedicated hardware in the EU but this is no longer the case, .eu only remains our primary domain due to it being linked to CloudFlare and is thus heavily cached. We do operate a few other tlds but don't advertise them due to the lack of cache slowing down user experience.Comment deleted by user
We follow DMCA and we're not hosted in the EU, GDPR will catch up but it's a shit show at the moment, if we're sent an official directive we would follow up. So the answer is both yes and no.
Damn I'm stupid, all that and i still don't understand what this is about.
Imagine if drug dealers drew art on their baggies.
Now imagine if that art developed into a subculture all its own.
Now imagine someone starts archiving those baggies to tell the story of the art, and sets up an art museum around them, but they don't deal any actual drugs in the process.
Hahah, I love this as many dealers do use custom baggies and there is a select group of users that do save them. Speaking for the most part about weed but if you think of lucydrop blot papers and sheets you have the same thing also.