This is by far the largest music metadata database that is publicly available. For comparison, we have 256 million tracks, while others have 50-150 million. Our data is well-annotated: MusicBrainz has 5 million unique ISRCs, while our database has 186 million.
Does this mean the MusicBrainz database will soon go from 5 million to 186 million tracks?
Asking the real questions here…
That’s exactly what I was wondering too.
Acquiring high quality music is already easy enough in most cases.
What I am interested in is the metadata. Accurate tagging of all my files is of high interest.
If I ran mb, I would be cautious importing the data directly. I’m sure Spotify would consider it trade information and go after anyone directly using it. However if a few million people added the tracks with individual edits then it probably won’t take too long.
I thought metadata couldn’t be copyrighted though?
I’d strongly suggest they take out all the cheap AI-generated music from this “back up” and save themselves some space.
I’m not sure how they would go about doing that at scale without also getting some false positives and removing human music too
You could cut off your search around the time AI tracks started to appear. Not sure when that was, maybe 2023. You’d miss a lot of recent stuff, but you’d filter out a lot of spam too
I see your point, but as you say, there would still be the tradeoff of missing more recent stuff. That might only involve missing a couple of years’ worth of stuff now, but AI isn’t going away any time soon, so it would mean that there’d be an increasing amount of human-made music not being archived. One of the things I like about Anna’s Archive is that they seem to look at this problem in a long-term, informational-infrastructure kind of way, so I imagine they wouldn’t be keen on stopping the archive at 2023.
It seems they’ve opted for a different tradeoff instead: lower popularity songs are archived at a lower bitrate, and even the higher popularity stuff has some compression. Some archives go for quality, and thus prioritise high quality FLACs, so Anna’s archive are aiming to fulfill a different niche. I can respect that.
Do you have any numbers on the AI share? I doubt it’s more than 2%, so I assume you are just virtue signalling on a completely unrelated topic here :-)
AI slop can be made and distributed in ginormous numbers. I wouldn’t be surprised if at least 3/4 of uploads from the past 2 years are AI.
See, 75% of output of 2 years vs 100 years of music production. Also popularity was factored in.
A bot could put 100 AI generated tracks on Spotify per hour. 50 bots doing the same is 120,000 tracks per day.
can you run me the numbers for 200 bots?
120,000 x 4 = 480,000
This is easy to do with a calculator.
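For anyone who wants to plug in other numbers, here is a quick sketch of the same arithmetic. The 100 tracks per hour per bot is the assumption from the comment above, not a measured figure, and `tracks_per_day` is just a name made up for this example.

```python
HOURS_PER_DAY = 24


def tracks_per_day(bots: int, tracks_per_bot_per_hour: int = 100) -> int:
    """Daily uploads if every bot runs around the clock at the assumed rate."""
    return bots * tracks_per_bot_per_hour * HOURS_PER_DAY


for bots in (50, 200):
    print(f"{bots} bots -> {tracks_per_day(bots):,} tracks/day")
# 50 bots -> 120,000 tracks/day
# 200 bots -> 480,000 tracks/day
```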
impressive!
The data they compiled is really cool.
If I’m reading the chart right, the genre with the most artists is opera.
Even if they didn’t have the music files, the analysis on the metadata is insane.
Publicly admitting they are the origin of the torrents is definitely an insane move. I don’t think they want Sony going after them, but also fuck Sony for locking art behind shitty contracts that force these kinds of projects to exist.
Let’s be honest: everybody is trying to go after Anna’s Archive. Every book publisher wants to get them, the US government too, and it really doesn’t matter if every music publisher wants them as well. I hope that they are based in a country where the western systems can’t get them.
I hope (also assume since it hasn’t been taken down yet) it’s more of a decentralised deal with servers in many places and backups in every nation under the sun
Yeah, it’s a wild move admitting that they are the source of pirated content for music here.
We don’t need Anna’s Archive to go under as a result of Sony going after them because of this…
The 3 major labels are equally predatory not only Sony
There’s definitely gonna be some crazy guy who will put this on their server and stream it to their phones lol
Hi it’s me
I stream mine through Plexamp. Up to almost 400k tracks.
Oh I’m thinking of it lol
Please do if you can and keep seeding it if possible.
If I had an extra 300 tb I’d do it.
Tagging /datahoarded
Just a random question. What would the cost be?
You can get refurbished hard drives for around $300 per 20 TB (quickly searched estimate). So 15 drives plus maybe another 5 for RAID redundancy sets you back $6k. A server to hold those drives is $1-2k (used). Add a UPS, internet connection and other bits’n’bobs and your total is very roughly around $8k (or €, as I threw the estimates into a pretty big ballpark).
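The same back-of-the-envelope estimate as a small script, in case anyone wants to swap in their own prices. Every figure is the rough guess from the comment above (300 TB of data, 20 TB drives at about $300 each, 5 spare drives, $1-2k for a used server). The UPS and other extras are left out because no price was given for them, and the names are made up for illustration.

```python
import math

ARCHIVE_TB = 300          # rough archive size mentioned in the thread
DRIVE_TB = 20             # capacity per refurbished drive (assumed)
DRIVE_PRICE = 300         # dollars per drive (quick-search estimate)
REDUNDANCY_DRIVES = 5     # extra drives for RAID redundancy (assumed)
SERVER_PRICE = (1000, 2000)  # used server, low and high guess

# Drives needed just to hold the data, rounded up to whole drives.
data_drives = math.ceil(ARCHIVE_TB / DRIVE_TB)
drive_cost = (data_drives + REDUNDANCY_DRIVES) * DRIVE_PRICE

low = drive_cost + SERVER_PRICE[0]
high = drive_cost + SERVER_PRICE[1]

print(f"{data_drives} data drives + {REDUNDANCY_DRIVES} spares = ${drive_cost:,}")
print(f"With server: ${low:,} to ${high:,} before UPS and other extras")
```

That lands at $7k to $8k before the unpriced extras, which is consistent with the "very roughly $8k" ballpark.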
Thanks. $10k for a jukebox with all the music in the world sounds like a cool luxury novelty.
My first thought as well
I guess I gotta donate more to anna
This is the one thing on Spotify I can’t get elsewhere. Would be nice to have a non transcode copy.
https://open.spotify.com/album/4emoC6C9fCDkWPdTuxN9an
…Like Cologne (Spotify Exclusive)
Queens of the Stone Age
2013 • 3 songs • 14 min 5 sec
Well, since this archive says it contains the original OGG at 160 kbps for all artists with a popularity >0, it’ll be in this collection. Your wait may be over soon.
sweet
Don’t have the space but love to see this. I hope people seed this for a long time.
Damn, boy! That’s a big ass music collection.
So the artists get paid even less than from Spotify?
It’s mostly Sony, UMG, and all the other leeches who would get paid less for their shareholders.
I don’t feel like editing the image, but imagine the guy with most of the cookies in this picture was UMG and the artists are the guy on the right.

Yes, sure, but if those don’t get paid, artists don’t get paid. And artists are not forced to pick a label, they are free to go solo, but they still prefer labels, so it’s not that black and white labels bad, artists good
You have no idea how hard it is to go solo, how the fuck would they get their songs out there? They would have to get really lucky on social media. How would they book concert venues? They would be stuck playing in shitty venues that pay peanuts.
Anyways, artists make money off of music purchases, concerts, and merch
So, there is value in labels after all?
They’re a necessary evil
I’m not sure how you think Spotify compensation works, but it is not a “one stream and you get paid”-deal, but rather a revenue share model where artists are compensated from a large pool by total streams. The main share of your Spotify monthly subscription that goes to compensating artists goes to Taylor Swift, Bad Bunny etc. Being a top listener to your favorite, but underground band contributes negligibly to what they actually get paid.
If you care about their compensation, buy the album as directly from them as possible, or buy merch/go to concerts, and recommend their music to other people so they might end up as paying customers. Subscribing to Spotify and thinking they get a fair deal out of that is not the way, and increasingly not the way (with their GenAI shenanigans).
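A toy illustration of the pooled model described above. All numbers are invented, and `pro_rata_payouts` is a hypothetical helper, not Spotify's actual formula. It only shows the one idea from the comment: the pool is split by share of total streams, so it doesn't matter whose subscription the money came from.

```python
def pro_rata_payouts(pool: float, streams: dict[str, int]) -> dict[str, float]:
    """Split a revenue pool between artists by their share of all streams."""
    total = sum(streams.values())
    return {artist: pool * count / total for artist, count in streams.items()}


# Hypothetical month: 1,000 in the pool, 10,000 streams overall.
# One dedicated fan supplies all 10 streams of the underground band.
streams = {"superstar": 9_990, "underground_band": 10}
payouts = pro_rata_payouts(1000.0, streams)

for artist, amount in payouts.items():
    print(f"{artist}: {amount:.2f}")
# superstar: 999.00
# underground_band: 1.00
```

In this made-up example the band gets 0.1% of the pool because it has 0.1% of the streams, no matter how much of that fan's own listening it represents.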
First, what I am using is beside the point, and I’m not using Spotify because of their payment method and their politics. And again, if albums are on streaming services, they are voluntarily there, are they not?
How voluntary is it when these platforms have a monopolistic grasp on how consumers access music these days? And the more people believe that the artists are actually fairly compensated from this model, the firmer this grasp becomes. What choice do they have of being there if they want to have any kind of reach?
A Spotify Premium subscription will cost someone 156€ a year. If that person instead spent that entire music budget on purchasing albums from select musicians according to the enjoyment they derive from their works, or on concert tickets or merch, and decided to pirate the rest of their music listening, what changes? The consumer is now left with actual, irrevocable access (legal and illegal) to the same music they had rented access to before, and has spent the same amount of money. Among the musicians, the ones who received the purchases are left with much more of that dedicated music spend, and the rest will have marginally less (their share, based on total streams, of the monthly subscription x12). Spotify and Taylor Swift receive marginally less money (but still more than the artists that person actually listens to), money which they should probably not have received to begin with.
Well, if you genuinely care about seeing artists get paid, the ones who need it most tend to make their content available for free already on Bandcamp or similar services, and have physical albums and merch you can buy.
Last night I spent $10 on 3 albums on Bandcamp. Those artists each made more on that single purchase than they would from thousands of streams.
Spotify making less (or more) money does not trickle down to artists on a per stream basis.
Don’t be a corporate bootlicker. Say it with me now: “If buying isn’t owning, piracy is not stealing.”
It is really refreshing how this thread spins into “we know what’s best for the artists, certainly not paying for listening to their streams, that’s exactly what they want”. If you don’t want to use Spotify, that’s fine, I don’t want to either because they are an awful company. But that doesn’t make you the person who creates the rules for artists, nor does it give you permission to listen to illegal content.
I don’t think it’s a huge leap to think artists would rather you be able to buy their music once and make a $ instead of streaming it from a service that pays them next to nothing.
What is stopping them? But it seems that general consensus here is that artists would like you to listen for free and here and there buy something from them.
Well, we are talking pennies here so… /s
It’s not just Spotify. If content is free for all, then who is buying?













