I believe we are all talking about the same thing, but maybe the way I’m expressing myself is hard to understand, or maybe you are looking at it differently, so I will try to explain the best I can and why this topic started and why it’s important for me to know what I want to know. I will keep it as simple as I can and if this is still “confusing”, I will just leave it at that, because there’s really no other way to explain it and we will all be wasting our time debating it.
Let’s use 3 files only on a 5GB disk with just those 3 files. So the disk is formatted as APFS.
File A - 1GB
File B - 1GB
File C - This is a copy of file A, so on an APFS disk, this won’t take extra space, correct?
So on disk, physically speaking, they’re only taking 2GB, even though, when I look at Finder, file C shows me 1GB so when I see them side by side I can see that all 3 have 1GB associated with it, but the disk is not using 3GB, it’s using 2GB. At least for now, because I haven’t made any changes to file C.
That to me, is my understanding of APFS. And if someone here still thinks I don’t understand what APFS does, then I guess the ones to blame are the people online who share this exact same information. But from what I read in many different places, this is universally accepted. An exact copy of a file (without any saved changes), will not take extra space (sure, it will maybe use a few KBs so the OS knows that there’s an extra file, but if the original file (File A) is 1GB, the copy will not be 1GB).
So to me, I call File C a “clone”. If that’s not the right term, I hope you can at least understand the concept and why I call it a clone. Call it whatever appropriate name is supposed to be called.
Now that we got that out of the way…
Why would it be important for me to know which files are “original” and “clones”?
To be clear: I call “original” when a file is unique, no copies. “Clone” would be either File A or File C, even if File C was created from File A. “Clone” as in “not-unique”.
Let’s say that along the way I have other files and I get to a point where my 5GB disk is full, meaning I have files that take 5GB as well.
Thing is, I forgot that File C (without any changes) is a copy (aka “clone”) of File A. It’s been a long time and I don’t track which files I duplicate or anything.
So I look at a list of files and I see File B (which is “original”, meaning it’s the only file for a particular content, it was never duplicated) and since it’s taking 1GB I decide to delete it, and now my disk will have 4GB of used space and 1GB free. Right?
Now I keep looking at the list and I see File C, also showing that’s taking 1GB (it’s not, because it’s an exact copy of File A, but I don’t know that, it’s been a while). So I go ahead and I delete the file and to my surprise, my disk is still using 4GB and still only 1GB free.
Can you understand why it would be important to know if a particular file will in fact create more free space?
Now, in this hyper simple example, I am just talking about deleting a file, which takes 1 second, but when I look at 100 music projects that can be modified (meaning, delete unnecessary files) to show 5MB instead of 1GB, the process of making 100 files go from 1GB to 5MB doesn’t take 1 second or even 100seconds. It takes 2 hours or more. Now if all of those 100 files are exact copies of an original file (the one they were duplicated from), it doesn’t really make a huge difference if those files show 1GB or 5MB, the same way that them showing 1GB doesn’t mean they are indeed using 1GB if they are an exact copy of another file that takes 1GB.
I understand that if I have 1 original file and 10 copies and I delete them all, that’s when the whole space gets freed. I get that. Again, I’m not an expert when it comes to APFS, but I understand the basic concept. If File A (1GB) is duplicated (File B), it will not take 2GB. When I make changes, maybe I will be using 1.02GB, because those changes are now saved, but everything else that’s common to those 2 files is only using 1GB, by both files. Then I make another change and now I’m maybe using 1.3GB, etc.
So, to finish this already lengthy post, I understand the basics of APFS (I’m aware that there’s a whole deeper level, but that’s not the point here).
My goal was to just look at a file and being able to know if that file is an original (unique) file, meaning no other copies were made, OR if the file was a copy of another file (or if it’s the original file where other(s) were duplicated from).
Again, if this isn’t clear enough for some of you guys, I’m sorry, I can’t make it more clear than this and I believe we will be debating infinitely and will get us nowhere.
I appreciate you all for trying to explain other things (that I think are not related to my original question), but I see that what I was trying to achieve is not possible. And that’s ok.
Thanks!