Mismatching scalar and vector new and delete

    push    esi
    mov     esi, [esp+8] ; howmany
    ; eax = howmany * sizeof(MyClass) + sizeof(size_t)
    lea     eax, [esi*4+4]
    push    eax
    call    operator new
    test    eax, eax
    pop     ecx
    je      fail
    push    edi
    push    OFFSET MyClass::MyClass
    push    esi
    lea     edi, [eax+4] ; edi = eax + sizeof(size_t)
    push    4 ; sizeof(MyClass)
    push    edi
    mov     [eax], esi ; howmany
    call    `vector constructor iterator'
    mov     eax, edi
    pop     edi
    jmp     done
fail:
    xor     eax, eax
done:
    pop     esi
    retd    4

MyClass* allocate_stuff(int howmany)
{
    void *p = operator new(
        howmany * sizeof(MyClass) + sizeof(size_t));
    if (p) {
        size_t* a = reinterpret_cast<size_t*>(p);
        *a++ = howmany; // store the hidden count, then step past it
        vector constructor iterator(a,
            sizeof(MyClass), howmany, &MyClass::MyClass);
        return reinterpret_cast<MyClass*>(a);
    }
    return NULL;
}

void MyClass::vector deleting destructor(int flags)
{
    if (flags & 2) { // if vector destruct
        size_t* a = reinterpret_cast<size_t*>(this) - 1;
        size_t howmany = *a;
        vector destructor iterator(this,
            sizeof(MyClass), howmany, MyClass::~MyClass);
        if (flags & 1) { // if delete too
            operator delete(a);
        }
    } else { // else scalar destruct
        this->~MyClass(); // destruct one
        if (flags & 1) { // if delete too
            operator delete(this);
        }
    }
}
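
The hidden-count layout in the pseudo-code above can be imitated by hand in standard C++. This is only a sketch under assumptions: `Widget`, `make_array`, `destroy_array`, and `demo` are hypothetical names, and the header is padded to `alignof(std::max_align_t)` instead of the `sizeof(size_t)` that the listing uses. Real compilers are free to lay things out differently.

```cpp
#include <cassert>
#include <cstddef>
#include <new>

// Hypothetical element type that counts live instances.
struct Widget {
    static int live;
    Widget() noexcept { ++live; }
    ~Widget() { --live; }
};
int Widget::live = 0;

// Header size, padded so the elements that follow stay suitably aligned.
// (Assumption: the listing in the article uses sizeof(size_t) here.)
constexpr std::size_t kHeader = alignof(std::max_align_t);
static_assert(kHeader >= sizeof(std::size_t), "header must hold the count");

// Imitates vector new: store the count, then construct each element.
Widget* make_array(std::size_t howmany) {
    void* raw = ::operator new(kHeader + howmany * sizeof(Widget), std::nothrow);
    if (!raw) return nullptr;
    ::new (raw) std::size_t(howmany);                    // hidden count
    Widget* first =
        reinterpret_cast<Widget*>(static_cast<char*>(raw) + kHeader);
    for (std::size_t i = 0; i < howmany; ++i)
        ::new (static_cast<void*>(first + i)) Widget();  // "constructor iterator"
    return first;
}

// Imitates vector delete: recover the count, destroy in reverse, free.
void destroy_array(Widget* first) {
    if (!first) return;
    char* raw = reinterpret_cast<char*>(first) - kHeader;
    std::size_t howmany = *reinterpret_cast<std::size_t*>(raw);
    for (std::size_t i = howmany; i > 0; --i)
        first[i - 1].~Widget();                          // "destructor iterator"
    ::operator delete(raw);                              // free the header, not `first`
}

// Usage sketch: three objects come alive and all three are destroyed.
void demo() {
    Widget* w = make_array(3);
    assert(w && Widget::live == 3);
    destroy_array(w);
    assert(Widget::live == 0);
}
```

Written out this way, the two failure modes are easy to see: freeing `first` instead of `raw` hands the allocator a pointer it never returned, and reading a count in front of a block that has no header yields garbage.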

Comments (16)

Johan Ericsson says:

February 3, 2004 at 7:58 am

That must be the first pseudo-code that I’ve seen that uses reinterpret_cast<>. Cool!

Of course, if you use new[] and then delete only a single destructor gets called. That is the destructor for the first element. However, I’m not sure what the outcome of deleting the first element is as compared to deleting the hidden count. Perhaps the count gets leaked as well. I don’t remember this use case causing any of the CRT leaking routines to report any memory leaking? So, I’m not sure about this part.

If you use new and then delete[], then you are really in trouble. You will first call a destructor an unspecified number of times, depending on the random value of the hidden count. Then you will try to delete the hidden count, which isn’t memory that you’ve allocated. Seems like you will crash.

These are some more reasons to use a vector<> to wrap the array pointer, instead of doing your own memory management.

I seem to recall that Andrei Alexandrescu wrote some interesting articles for CUJ (C/C++ Users Journal) where he was trying to create a super vector<>. He complained that it was obvious that the compiler was keeping track of the count, yet the vector also has to keep track of the count. This seems inefficient… It’s too bad that C++ doesn’t provide a Standard way of getting hold of that count.
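
As a rough illustration of the advice above to let a container own the array: in C++11 and later, both `std::vector` and `std::unique_ptr<Foo[]>` pick the right form of delete themselves, so the mismatch cannot be written by accident. The `Foo` type and the function names below are made up for this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical element type that counts live instances.
struct Foo {
    static int live;
    Foo() { ++live; }
    Foo(const Foo&) { ++live; }
    ~Foo() { --live; }
};
int Foo::live = 0;

// std::vector owns the elements and tracks the count itself.
std::size_t with_vector(std::size_t n) {
    std::vector<Foo> v(n);
    return v.size();
}   // every element is destroyed here

// unique_ptr<Foo[]> always releases with delete[], never scalar delete.
void with_unique_ptr(std::size_t n) {
    std::unique_ptr<Foo[]> p(new Foo[n]);
    assert(Foo::live == static_cast<int>(n));
}   // delete[] runs here
```

The vector also answers the "how many elements" question through `size()`, which is the count the comment wishes it could read back from the compiler.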
Andreas Magnusson says:

February 3, 2004 at 10:22 am

Wouldn’t it be possible to let scalar-new be identical to "new T[1]"? That way calling vector-delete on a scalar-new would work fine.

It would of course be a little bit more inefficient to delete single elements as well as use more memory. So maybe it’s a bad idea to make illegal programs work at the cost of legal ones…
runtime says:

February 3, 2004 at 10:37 am

All of these problems could be solved if new (silently) returned a 1-element array. Then calling delete or delete[] would do the same thing: check the p[-1] size count and then call the destructors. The debug version of delete could assert(p[-1] == 1), alerting the programmer if he called delete on an array when he should have called delete[].
Joe says:

February 3, 2004 at 11:17 am

The assert wouldn’t be accurate, as it is possible to new Foo[1] (and even new Foo[0] is legal!).

What will actually happen if you use scalar delete for array new is that it will destroy a single object, then pass the pointer to free() which will back up to get ITS own count but will instead get the object count. It will then free too little and, unless your malloc uses a hashtable of addresses to store size data, most likely corrupt the heap.

The requirement to pair new with delete and new[] with delete[] is dictated by the standard, and mismatching them is undefined behavior. As a developer, you should do the right thing always and not rely on the compiler to fix things for you. Always run your code under Valgrind or BoundsChecker or Purify to find these kinds of problems.
Raymond Chen says:

February 3, 2004 at 11:22 am

Johan, tomorrow's entry will discuss some of your points. In particular, the Bonus Exercise will explain why there is no "Give me the number of elements in this dynamically-allocated array" operator.
Joe says:

February 3, 2004 at 11:22 am

I should also point out that none of the suggestions will save you from:

Foo *p = new Foo[10];
delete p+1;

If the concept of array and scalar new and delete frightens you, use malloc and placement new.

Foo *p = (Foo *)malloc(sizeof(Foo));
new (p) Foo;
p->blah();
p->~Foo();
free(p);

Amaze your friends! Alienate your co-workers!
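
A slightly fuller sketch of the malloc-plus-placement-new sequence above, with the failure check the short version leaves out. The body of `Foo` and the function name `malloc_and_placement_new` are invented for illustration.

```cpp
#include <cassert>
#include <cstdlib>
#include <new>

// Hypothetical class standing in for the Foo of the comment.
struct Foo {
    int value;
    Foo() : value(42) {}
    int blah() const { return value; }
};

int malloc_and_placement_new() {
    void* raw = std::malloc(sizeof(Foo));
    if (!raw) return -1;                      // malloc reports failure with a null pointer
    Foo* p = new (raw) Foo;                   // construct in place; nothing is allocated
    assert(static_cast<void*>(p) == raw);     // scalar placement new adds no hidden header
    int result = p->blah();
    p->~Foo();                                // destroy by hand ...
    std::free(raw);                           // ... then hand the raw memory back
    return result;
}
```

Every step the compiler would normally pair up for you is now explicit, which is exactly why it is easy to get wrong in a different way (forgetting the destructor call, or calling `delete` on memory that came from `malloc`).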
Ben Wilhelm says:

February 3, 2004 at 12:03 pm

One of my favorite recurring "programmers who know just a little too much for their own good" stories:

"Oh, it’s easy to get the number of items in a C-style array! You just do ((int*)array)[-1]! It works every time! Why are you brandishing that axe?"

Quick, how many different ways can you find that would cause that to break? :P
Doug says:

February 3, 2004 at 12:46 pm

Yet another reason why C++ is a broken language.

Raymond, if you are looking for another topic, how about discussing the problems with MFC and different versions of the compiler. Like what happens when you build an OCX with MFC, then try to have it work in both IE5 and IE6, without corrupting memory. Just another version of DLL hell. Which is compounded by OCXs loading from the registry and not the path.

Or you can just rewrite the silly thing in ATL and be done with the problem.
Shane King says:

February 3, 2004 at 4:04 pm

If you don’t have any destructor, then the compiler doesn’t need to do any cleanup on delete, and can just free the memory. Therefore it can not bother allocating extra space for the size of the array.

In this case, the "give me the number of elements in the array" would have nothing to look at.

Additionally, if the compiler performs this optimisation, then a mismatched new[]/delete will work; right up until someone changes the code to have a destructor, then you’ll be wondering why your program crashes all the time from such a "trivial" change.
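
One way to observe whether a given compiler performs this optimisation is to intercept the array allocation and compare the requested size with `n * sizeof(T)`. The sketch below does that with a class-level `operator new[]`. All names here (`Tracked`, `Plain`, `Noisy`, `array_overhead`, `g_last_request`) are hypothetical, and the amount of overhead is implementation-defined, so no particular numbers are promised.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <new>
#include <type_traits>

// Records the byte count the compiler most recently asked for.
static std::size_t g_last_request = 0;

// Base class whose array allocator just notes the request size.
struct Tracked {
    static void* operator new[](std::size_t bytes) {
        g_last_request = bytes;
        if (void* p = std::malloc(bytes ? bytes : 1)) return p;
        throw std::bad_alloc();
    }
    static void operator delete[](void* p) noexcept { std::free(p); }
};

struct Plain : Tracked { int value; };              // trivial destructor
struct Noisy : Tracked { int value; ~Noisy() {} };  // user-provided destructor

static_assert(std::is_trivially_destructible<Plain>::value, "no cleanup needed");
static_assert(!std::is_trivially_destructible<Noisy>::value, "cleanup needed");

// Extra bytes requested beyond the elements themselves.
template <class T>
std::size_t array_overhead(std::size_t n) {
    T* p = new T[n];
    assert(g_last_request >= n * sizeof(T));
    std::size_t extra = g_last_request - n * sizeof(T);
    delete[] p;
    return extra;
}
```

Comparing `array_overhead<Plain>(10)` with `array_overhead<Noisy>(10)` shows whether adding a destructor changes the block layout on the compiler at hand, which is the "trivial" change the comment warns about.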
Centaur says:

February 4, 2004 at 2:07 am

> If you don’t have any destructor, then the
> compiler doesn’t need to do any cleanup on
> delete, and can just free the memory.

That is, if you don’t have any /user-declared/ destructor, and the /implicitly-declared/ destructor is /trivial/ [12.4.3].
James Curran says:

February 4, 2004 at 8:24 am

Shane,

How is it "bothering" to allocate extra space? Allocating 104 bytes is exactly the same work as allocating 100. You just have to add four to the number you pass to malloc().
Ben Hutchings says:

February 4, 2004 at 10:32 am

A question for Raymond: where’s the exception handling code for allocate_stuff? Given that this is x86 code, some of that must be generated inline. Did the compiler somehow know that MyClass’s constructor can’t throw?
Shane King says:

February 4, 2004 at 4:27 pm

I meant in the sense that if you don’t allocate the extra space, your program uses less memory, which is a good thing. Also, if you don’t do the work to store the size there, you also run marginally quicker.

It’s a pretty small optimisation really, but some compilers do it.

And yes, I realise that destructors don’t have to be declared to exist. I was speaking informally, rather than in C++ language specese.
Raymond Chen says:

February 4, 2004 at 7:04 pm

Doug: I don’t use MFC myself and have never learned it, so I can’t talk about it with any degree of confidence.

Ben: I compiled with non-throwing allocation, which is why you don’t see any throw-handling.
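
For comparison, standard C++ also offers a per-call way to get an allocation that reports failure with a null pointer instead of an exception. This sketch shows only that form and makes no claim about the compiler settings used for the listing in the article. `MyClass` here is a stand-in with a single `int` member, and `release_stuff` and `demo` are hypothetical helpers.

```cpp
#include <cassert>
#include <cstddef>
#include <new>

// Stand-in class; the constructor is declared noexcept.
struct MyClass {
    int value;
    MyClass() noexcept : value(0) {}
    ~MyClass() {}
};

// Yields a null pointer if the allocation fails; the allocator throws nothing.
MyClass* allocate_stuff(std::size_t howmany) {
    return new (std::nothrow) MyClass[howmany];
}

// Vector delete to match vector new; deleting a null pointer is a no-op.
void release_stuff(MyClass* p) {
    delete[] p;
}

// Usage sketch: the caller must check for null before touching the array.
void demo() {
    MyClass* p = allocate_stuff(8);
    if (p) {
        assert(p[7].value == 0);
    }
    release_stuff(p);
}
```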

Date:	February 3, 2004 / year-entry #47
Tags:	code
Orig Link:	https://blogs.msdn.microsoft.com/oldnewthing/20040203-00/?p=40763
Comments:	16
Summary:	In a previous entry I alluded to the problems that can occur if you mismatch scalar "new" with vector "delete[]" or vice versa. There is a nice description of C++ memory management in C++ Gotchas: Avoiding Common Problems in Coding and Design on www.informit.com, and I encourage you to read at least the section titled...