Why can’t I delete a file immediately after terminating the process that has the file open?

Date:March 29, 2012 / year-entry #78
Tags:code
Orig Link:https://blogs.msdn.microsoft.com/oldnewthing/20120329-00/?p=7973
Comments:    26
Summary:A customer discovered a bug where terminating a process did not close the handles that the process had open, resulting in their emergency cleanup code not working: TerminateProcess(processHandle, EXITCODE_TERMINATED); DeleteFile(someFile); Their workaround was to insert a call to Wait­For­Single­Object(process­Handle, 500) before deleting the file. The customer wanted to know whether they discovered a bug in...

A customer discovered a bug where terminating a process did not close the handles that the process had open, resulting in their emergency cleanup code not working:

TerminateProcess(processHandle, EXITCODE_TERMINATED);
DeleteFile(someFile);

Their workaround was to insert a call to Wait­For­Single­Object(process­Handle, 500) before deleting the file. The customer wanted to know whether they discovered a bug in Terminate­Process, and they were concerned that their workaround could add up to a half second to their cleanup code, during which the end user is sitting there waiting for everything to clean up.

As MSDN notes,

TerminateProcess initiates termination and returns immediately. This stops execution of all threads within the process and requests cancellation of all pending I/O. The terminated process cannot exit until all pending I/O has been completed or canceled.

(Emphasis mine.)

Termination is begun, but the function does not wait for termination to complete. Sometimes a thread gets stuck because a device driver has gotten wedged (or the driver doesn't support cancellation).

To know when the handles are closed, wait on the process handle, because the process handle is not signaled until process termination is complete. If you are concerned that this can take too long, you can do like the customer suggested and wait with a timeout. Of course, if the timeout expires, then you have to decide what to do next. You can't delete the file, since it's still open, but maybe you can log an error diagnostic and let the user know why things are taking so long to clean up, and maybe add the file to a list of files to clean up the next time the program starts up.


Comments (26)
  1. Ian says:

    This is amazingly relevant to something I'm working on right now. Thanks!

  2. DWalker says:

    Terminating the process that has a file open might require disk or network I/O, which takes a bit of time to run even if it's not stuck.  Waiting on the process handle sounds like the right thing to do.

  3. porter says:

    WaitForSingleObject(…INFINITE rather than 500 would be more appropriate as well.

  4. Joshua says:

    There's another way. It's actually possible to write a safe version of the Unlocker program that Raymond rightly condemned, so long as the target process is not stuck in loader lock or kernel.

    Using CreateRemoteThread, inject a thread that does this:

    1. Suspend all threads but itself, keeping a list of previous thread priorities

    2. Close the target handle.

    3. Open a file handle to NUL.

    4. While the handle just created is not the handle to close, push handle & duplicate.

    5. Close all pushed handles.

    6. Unsuspend all threads (use list saved in step 1).

  5. Killer{R} says:

    2Joshua No need for CreateRemoteThread in this case – you may do eveything this just by DuplicateHandle from remote to you (DUPLICATE_CLOSE_SOURCE – effective remote CloseHandle) and back. BUT its not safe – application logic may critically relay on the state of opened file (size, content, it can be also used as IPC with some other process etc).

    And dont forget about filemappings…

  6. Gabe says:

    Joshua: In this case, it can be much simpler:

    1. Suspend all threads.
    2. Use CreateRemoteThread to execute a routine that:

    3. calls CloseHandle on all open handles and then

    4. calls TerminateProcess.

    Of course, if they control the source code to the target process, they could simply mark the temporary files as "delete on close" in the first place. Or at least open the file with FILE_SHARE_DELETE so that the cleanup process can delete it.

  7. Joshua says:

    And dont forget about filemappings…

    I did not. Those can crash the process if you free them.

  8. Killer{R} says:

    2Joshua actually you may copy their contents to unnamed mappings and map new mappings exactly at same addresses as old ones. But of cause this will not be safe, but approx in the same 'degree of risk' as replacing file's handle.

  9. @Joshua says:

    In what way is your solution (closing the handle explicitly with a remote thread) faster than using TerminateProcess()? The CloseHandle operation will take the kernel the same time, no matter how it is invoked.

  10. Joshua says:

    It's not faster at all. I'd rather not terminate a process that won't let go of a file I need it to, especially if it's a critical system process. Let's just say that killing the user's AntiVirus (most common offender for me) is not polite.

    @Killer{R}: All cases I've seen of that were processes that had loaded DLLs that needed replacing. At that point, it's time to blast.

  11. jader3rd says:

    I think it'd be nice of the Windows File API's added overrides with timeout parameters. Have the timeouts be related to the handle being available. That way, in this case they wouldn't need to wait for the process to exit, they'd only need to wait for the file handle to become available. I would find it very useful.

    [Not sure what you mean by "wait for the file handle to become available." -Raymond]
  12. Joshua says:

    [Not sure what you mean by "wait for the file handle to become available." -Raymond]

    He wants the API to automatically wait a few seconds when encountering a locked file, like a semaphore.

  13. Killer{R} says:

    There always will be people that want to specify timeout and other options of such implicit wait that will effectively force MS to re-invent all that WaitFor* functions that you may use now.

  14. Roger says:

    Also don't forget the virus scanners and similar tag alongs that like to keep files open after they have been closed by the initial process.  The usual solution is to try several times in a loop with time delays.

    In one of my test scripts I do that once, and on failure rename the files and continue trying to delete them in the background.  Fortunately you can rename in use files.

  15. Gabe says:

    Roger: If you can rename a file, can't you just as easily mark it for deletion?

  16. GWO says:

    @Gabe But if you rename a file you can immediately start creating a new file with the same name, which is sometimes useful. (e.g. a buggy program crashes without cleaning up its log file properly, and you need to immediately restart a new instance, using the same log filename.)

  17. Freek says:

    Maybe to some degree related to the above question, but how about deleting a file and subsequently creating a file with the exact name?

    In our organization we're developing in .NET but we had to lookup the WIN32 documentation to find out that DeleteFile is not a "blocking" operation, but merely a request that will be handled later on. Is there a way to create a DeteleFileAndWaitForItToFinish function? Currently we're using a workaround that renames a file and then deleting it.

    Background info: We stumbled upon this issue while running unit tests. A lot of tests set up a test environment. When a test is run it removes it environment (deletes files and folders), then the next test might set up a similar environment (creating files).

  18. luke says:

    I never understood what EXITCODE_TERMINATED should be.

    Are there any rules or best practices, negative/positive, small/big?

  19. Christian says:

    And if all that is done what's described above, it could still be possible that the file still cannot be deleted!

    Chance is, the virus scanner just locks that file again (it's not so unlikely when it was just closed, as then the scanner will look into the new file to do its work). Sigh. Yes, it's not easy nowadays.

    So there's an additional look necessary that just ends at a "I give up" timeout or when the file really disappeared.

  20. Joshua says:

    @Freek: DeleteFile will finish before it returns unless some other handle is open on the file.

  21. Arlie says:

    Joshua, can you please never, ever write any code again?  It's just this sort of crap that makes software unreliable.

  22. jader3rd says:

    [Not sure what you mean by "wait for the file handle to become available." -Raymond]

    There's an exclusive lock on the file handle. So just like waiting for any other kind of lock, I'd like that built into the file system API's. When I call CreateFile, I'd like it to have a timeout parameter for waiting for the file handle to become available. Since I don't have this, I currently have plenty of code which spins in loops for a period of time, trying to get the file handle, and bailing out eventually. I'd like that logic built into the OS so I can stop writing polling code.

    [Okay, I see my confusion. You want to wait for the file to become available, not the handle. (Handles aren't "available"; they either exist or don't exist.) -Raymond]
  23. JM says:

    @Joshua: I never thought I'd see the words "safe" and "CreateRemoteThread" in the same post. I'll take TerminateProcess over CreateRemoteThread any time — less chance of something going wrong. TerminateProcess will at least *consistently* hose things…

    @Freek: consider the approach that the Visual Studio tests themselves use: create a new directory for every unit test run. This way, they don't overlap. Obviously this can use a considerable amount of disk space, but cleaning up old tests in the background is fairly easy (and doesn't require blocking waits). Also, you actually have the test results of previous runs for comparison, which can come in handy. As long as you've got the space, don't skimp on it. I'm assuming you can set up your code to use a particular directory — if not, you should either rewrite your code so it can (preferrably) or use a fixed drive letter and SUBST.

    Many people will tell you that tests that create files are a bad idea to begin with, because they're too slow and not as easily run as tests that don't do I/O. I don't think this needs to be a religious issue, but it is true that tests that don't need explicit I/O are preferable. Of course, if the I/O itself is what you're testing, it's a necessity.

  24. DWalker says:

    @Joshua:  "Suspending all threads except itself" is safe?  Really?  If "itself" hangs for any reason, or crashes, all threads are suspended.  Yuck.

  25. @Freek says:

    "to find out that DeleteFile is not a "blocking" operation, but merely a request that will be handled later on"

    Yes, it does not work like "unlink" in Unix (which never fails and is always synchronous if you can modify the directory at all), but if all your test cases have closed their file handles, the delete *will* happen synchronously. If you have a problem here then check your code for files that are still open.

  26. Joe says:

    This gets into one of my major pet peeves; developers who use sleeps instead of waiting on handles. File and thread handles go signaled when exited. Having threads polling a bool to indicate exit is another one that drives me crazy. (Then there are those who check for WAIT_ABANDONED on events or semaphores.)

    If you can open a file in read only mode, another trick is to open it with DELETE_ON_CLOSE flag.

Comments are closed.


*DISCLAIMER: I DO NOT OWN THIS CONTENT. If you are the owner and would like it removed, please contact me. The content herein is an archived reproduction of entries from Raymond Chen's "Old New Thing" Blog (most recent link is here). It may have slight formatting modifications for consistency and to improve readability.

WHY DID I DUPLICATE THIS CONTENT HERE? Let me first say this site has never had anything to sell and has never shown ads of any kind. I have nothing monetarily to gain by duplicating content here. Because I had made my own local copy of this content throughout the years, for ease of using tools like grep, I decided to put it online after I discovered some of the original content previously and publicly available, had disappeared approximately early to mid 2019. At the same time, I present the content in an easily accessible theme-agnostic way.

The information provided by Raymond's blog is, for all practical purposes, more authoritative on Windows Development than Microsoft's own MSDN documentation and should be considered supplemental reading to that documentation. The wealth of missing details provided by this blog that Microsoft could not or did not document about Windows over the years is vital enough, many would agree an online "backup" of these details is a necessary endeavor. Specifics include:

<-- Back to Old New Thing Archive Index