When Windows copies a file, does it ever copy bytes that are in the slack space?

Comments (31)

kantos says:

February 13, 2018 at 7:10 am

It would seem to me that the only way to get confidential information in the slack space in the first place would be to do a non-secure erase. While it would be nice if windows had such functionality available by default, it doesn’t. That said there are numerous utilities out there that DO have such a capability. That said you need to be aware of your medium before using them as they can cause unnecessary wear and tear.
1. Clockwork-Muse says:
  
  February 13, 2018 at 8:50 am
  
  ….except most cases of needing a secure erase (such as selling a drive you no longer need) are covered adequately by drive encryption. For everything else, a “secure erase” utility is going to bump up, hard, against the drive maintaining itself – defragging, move due to bad/expired sectors, etc – you’d have to regularly clean up the _entire_ drive for such scenarios. It’s just so much easier to 1) encrypt the drive, and 2) have the os properly handle “extra” bytes in these situations.
  1. MNGoldenEagle says:
    
    February 13, 2018 at 12:46 pm
    
    And that’s only considering magnetic hard drives. On an SSD it’s worse because of wear-leveling algorithms. Overwriting a file will almost never overwrite the same bytes on the drive, so you effectively have to secure-erase all of the unallocated space on the drive in order to “securely erase” the data. Which is also a great way to significantly reduce the lifespan of said drive.
2. mikeb says:
  
  February 13, 2018 at 10:47 am
  
  Another possible way to get information in slack space might be: 1) write a bunch of data to a file; 2) truncate the file to a shorter length (maybe with `SetEndOfFile()`).
  
  Note: I’m not saying that Windows systems do leave whatever information is in the cluster there when truncating – I don’t know the actual behavior. But it wouldn’t surprise me if the data got left in the cluster.
  
  Interestingly, the docs for SetEndOfFile() mention that there are three size-related attributes for a file stream:
  
  file size
  allocation size
  valid data size
  
  This article talks about when (file size < allocation size), but what about when (file size < valid data size)? The docs for `SetFileValidSize()` says:
  
  If SetFileValidData is used on a file, the potential performance gain is obtained by not filling the allocated clusters for the file with zeros. Therefore, reading from the file will return whatever the allocated clusters contain, potentially content from other users. This is not necessarily a security issue at this point, because the caller needs to have SE_MANAGE_VOLUME_NAME privilege for SetFileValidData to succeed, and all data on disk can be read by such users. However, this caller can inadvertently expose this data to other users that cannot acquire the SE_MANAGE_VOLUME_PRIVILEGE privilege if the following holds:
  
  If the file was not opened with a sharing mode that denies other readers, a nonprivileged user can open it and read the exposed data.
  If the system stops responding before the caller finishes writing up the ValidDataLength supplied in the call, then, on a reboot, such a nonprivileged user can open the file and read exposed content.
  
  If the caller of SetFileValidData opened the file with adequately restrictive access control, the previous conditions would not apply. However, for partially written files extended with SetFileValidData (that is, writing was not completed up to the ValidDataLength supplied in the call) there exists yet another potential privacy or security vulnerability. An administrator could copy the file to a target that is not properly controlled with restrictive ACL permissions, thus inadvertently exposing the extended area’s data to unauthorized reading.
  
  Sounds like a pretty unlikely scenario, but maybe one that the customer who originally asked the question might still want to be aware of.
3. Antonio Rodríguez says:
  
  February 13, 2018 at 4:59 pm
  
  I’ve never understood the need for secure erase in consumer computing. If your application depends on confidential data, then you should not sell the used hard drives or equipment. Rather, you should store them securely. And if you have to dispose them, destroy them before – and a simple hammer allows to render any hard drive nonoperational (SSDs are a horse of a different color). If you rely on confidential data, you should assume the (small) cost of not selling used hardware.
  
  Of course, data recovery companies are able to repair hard drives with damaged mechanics (hammer or else). But they are also able to recover many types of “secure erases”, except the most thoughtful variations. The ones that, by the way, take a loooong time to complete: calculate how much time it takes to write all over a 2 TB hard drive, and multiply that by 99. You’ll get that the average 2 TB hard drive take about 20 days to be securely erased (at a maintained speed of 120 MB/s, 24 hours a day).
  1. alegr1 says:
    
    February 15, 2018 at 8:28 am
    
    Information recorded on modern drives is unrecoverable after a single erase. There is no recoverable residual left.
Alan says:

February 13, 2018 at 7:38 am

My first thought on seeing the title was “Who cares?” Upon reading the context, yeah, that’s a good question, and I’m glad to know the answer.
1. Fleet Command says:
  
  February 13, 2018 at 11:15 pm
  
  Well, my first thought was “obviously not”! Has this person never copied data from disk with 8 kB cluster size to a disk with 4 kB cluster size?
creaothceann says:

February 13, 2018 at 8:23 am

IIRC when people were collecting the ROM contents of video game console cartridges like the SNES, they started to see source code in them. Games were sent to Nintendo on DOS-formatted floppy disks. One theory is that the ROM writing process simply copied whole sectors, including ‘erased’ data.

See the category “Games with uncompiled source code” on the website “The Cutting Room Floor”.
1. BOFH says:
  
  February 13, 2018 at 3:43 pm
  
  The original Commodore Amiga 1000 was accidentally shipped with source code fragments in the slack space of the Kickstart 1.0 diskette:
  http://www.pagetable.com/?p=34
2. ender9 says:
  
  February 15, 2018 at 1:40 am
  
  Microsoft’s own floppies contained things like e-mail fragments in slack space.
12BitSlab says:

February 13, 2018 at 8:24 am

Raymond, thanks! This is good to know!
Joshua says:

February 13, 2018 at 9:02 am

It turns out you can see into the slack space from usermode by playing games with SetFileValidData. See https://msdn.microsoft.com/en-us/library/windows/desktop/aa365544(v=vs.85).aspx
1. Tim says:
  
  February 13, 2018 at 12:41 pm
  
  I guess that’s technically correct, but the usermode process needs an assist from an improperly secured process with the correct permissions to call SetFileValidData. The issue there is in the hypothetical buggy application, not the filesystem.
2. Beldantazar says:
  
  February 13, 2018 at 12:53 pm
  
  Sure, but you have to be an administrator to do that, which means you could just do whatever else you wanted to get at the sensitive data anyway.
3. Joshua Bowman says:
  
  February 13, 2018 at 10:52 pm
  
  If you can use that function, you’re already way on the other side of that airtight hatchway, though. You might as well just read sectors until you find something interesting at that point.
  1. Joshua says:
    
    February 14, 2018 at 5:17 pm
    
    I wasn’t talking about security. Hint: user mode not unprivileged user.
    1. Joshua Bowman says:
      
      February 16, 2018 at 9:48 pm
      
      Then the confusing part of your claim is that this is some novel technique, compared to just raw reading all data via whatever sector API you prefer.
IanBoyd says:

February 13, 2018 at 10:35 am

This is probably one of the scenario’s where you could answer the question by asking, “What would happen if it did copy slack space?”

We know that when you create a file, the file contents are zero-initialized by the filing system. If you attempt to:

– create a file
– seek 100 MB forward
– write 4 KB
– seek to the beginning

You will find that your first 100 MB contain zeros. That’s because:

– the *valid* length of your file is only valid up until the last spot that you wrote
– and any place you didn’t write data is going to be zero

Attempting to read past the end of a file will result in EOF – no data.

If you’re an administrator, you can bypass the file-system’s zero-initialization by calling `SetFileValidData(handle, 100*1024*1024)`. This lets you read old data on the hard drive; which is why it’s limited to administrators. (Technically someone with the SE_MANAGE_VOLUME_NAME right). This feature can be used by SQL Server (Instant File Initialization) to grow a file instantly without having to wait for the file system to zero all the new pages.

If file copy *could* read slack space, it would mean:

– a file has a length beyond it’s end-of-file (which isn’t how it works)
– users can read slack space (which isn’t how it works)
1. Brian_EE says:
  
  February 13, 2018 at 11:26 am
  
  >This is probably one of the scenario’s where you could answer the question by asking, “What would happen if it did copy slack space?”
  
  It’s more like one of those scenario’s where the PHB wanted “something official” from Microsoft, so the minion had to ask the question he knew the answer to.
  1. Tim says:
    
    February 13, 2018 at 12:33 pm
    
    I don’t know. A good programmer probably has some intuition that it’s probably impossible to copy “slack space” in a userland application like Explorer, but there are a lot of underlying assumptions there. For example, that’s assuming there isn’t any kernel or filesystem special API for “fast copying” of files wherein the slack data isn’t abstracted away and perhaps would be copied in some situations.
  2. Harry Johnston says:
    
    February 13, 2018 at 1:40 pm
    
    I don’t think it’s that obvious that the copy is always done in user-mode. I mean, you could look at the File System Drivers documentation and note that there’s no IRP_MJ_COPY control code, but that’s hardly conclusive.
    1. IanBoyd says:
      
      February 14, 2018 at 10:19 am
      
      > I don’t think it’s that obvious that the copy is always done in user-mode.
      
      It’s certainly not done *in* user-mode; but it’s done *by* user-mode.
      
      And the rule for users in user-mode is that they can’t see slack space. The implementation, wherever it is done, will follow that rule.
alegr1 says:

February 13, 2018 at 11:31 am

Not so simple. When you read the last sector of a file opened with FILE_FLAG_NO_BUFFERING, the whole sector gets read to memory, even though the file may end in the middle of it. Same thing happens when you memory-map the file.
1. John Doe says:
  
  February 13, 2018 at 4:20 pm
  
  So, did you read the slack space?
2. Joshua Bowman says:
  
  February 14, 2018 at 12:04 am
  
  I’ve just tested sample code, and this is emphatically NOT TRUE. ReadFile will not read more than the actual size of the file into the buffer; if your buffer is initialized to 0xCC, it’ll still be mostly 0xCC if you read a 10-byte file, even if the actual slack space of the file is zeroes or some cryptographic key. The airtight hatchway is still sealed.
  
  You can test yourself:
  
  #include
  using namespace std;
  
  int main(int argc, char* argv[]) {
  char* buf;
  buf = (char*)_aligned_malloc(4096,4096);
  memset(buf, 0xcc, 4096 * sizeof(char));
  
  HANDLE hIFile;
  LPDWORD actualsize = 0;
  
  hIFile = CreateFileA((LPCSTR)argv[1], GENERIC_READ, FILE_SHARE_READ, NULL, OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL
  | FILE_FLAG_NO_BUFFERING
  , NULL);
  ReadFile(hIFile, buf, 4096, NULL, NULL);
  
  return 0;
  }
  
  Little programs, no error checking, etc, etc. Break on the return and examine the contents of buf.
cheong00 says:

February 13, 2018 at 6:18 pm

If I were asked such question, I’d probably answer with something like:

It’s much like when you have to copy data in an array to another location. There could be other potentially sensitive data at the allocated and not used block of memory assigned to the array*, but when you copy an array, you only loop copy the content of assigned part up to the length counter and don’t copy beyond the boundary, so unless you’re using block memory copy instruction / API that “ignores context of the array at all” the answer is “no data in the unassigned section will be copied in the copy operation”.

* assumes the programming language does not require zero out memory before giving to the code.
Neil says:

February 14, 2018 at 3:07 am

What scenarios would get bytes into the slack space? I suppose file truncation would do it, but are there others?
1. M Hotchin says:
  
  February 14, 2018 at 12:10 pm
  
  Sector re-use after a file is deleted. Most file systems do not overwrite a file’s contents on deletion.
Kirby FC says:

February 15, 2018 at 4:14 am

>>Antonio Rodríguez
>>Of course, data recovery companies are able to repair hard drives with damaged mechanics (hammer or else). But they are also able to recover many types of “secure erases”, except the most thoughtful variations.

This is a common misconception that hasn’t actually been true for many years. At one time, ~20 years ago, it was theorized that you could use Magnetic Force Microscopy or Scanning Tunneling Microscopy to image bits recorded on magnetic media and recover data data that had been over-written (see “Secure Deletion of Data from Magnetic and Solid-State Memory”, written by Peter Gutmann in 1996). However, there is no documented evidence that this has ever actually been done.

But, that’s now irrelevant. Because of increased data density on hard drive platters, any hard drive (not SSD) manufactured in the last 10+ years can be rendered “securely erased” with a single over-write of random bits. Of course, if you are really that paranoid about what’s on your old hard drives, then you are correct, you shouldn’t be selling or giving them away.
1. alegr1 says:
  
  February 15, 2018 at 8:37 am
  
  Many enterprise-grade drives have an “instant secure erase” feature, where an internal encryption key gets overwritten, instantly making the whole drive contents unrecoverable.

Comments are closed.

Date:	February 13, 2018 / year-entry #37
Tags:	tipssupport
Orig Link:	https://blogs.msdn.microsoft.com/oldnewthing/20180213-00/?p=98015
Comments:	31
Summary:	Keeping tabs on the slackers.