Broken Stack Guard Page

This week I updated my Broken Guard Page project, which includes a couple of demo programs. The last demo shows how a malicious program can cause a very obscure access violation in another process just by reading that process's memory. It is one example of how fragile modern software can be, and of how that fragility can affect software reliability.

Most modern programming languages have functions, and if you're a C/C++ programmer you probably already know the difference between a local variable and dynamically allocated (heap) memory.

A local variable is usually allocated on the thread's stack, while a dynamic allocation uses heap memory.
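To make the distinction concrete, here is a minimal sketch that does the same work with both kinds of storage. The names (kCount, fill_and_sum, sum_with_stack_buffer, sum_with_heap_buffer) are made up for illustration, and an optimizing compiler is free to place things differently than the comments suggest.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <numeric>

constexpr std::size_t kCount = 256;  // hypothetical element count for the demo

// Fills `values` with 0, 1, 2, ... and returns their sum.
int fill_and_sum(int* values, std::size_t count) {
    assert(values != nullptr);
    std::iota(values, values + count, 0);
    return std::accumulate(values, values + count, 0);
}

// Local array: the storage normally comes from the calling thread's stack
// and is released automatically when the function returns.
int sum_with_stack_buffer() {
    int values[kCount];
    return fill_and_sum(values, kCount);
}

// Dynamic allocation: the storage comes from the heap, and unique_ptr
// frees it when it goes out of scope.
int sum_with_heap_buffer() {
    std::unique_ptr<int[]> values(new int[kCount]);
    return fill_and_sum(values.get(), kCount);
}
```

Both functions return the same result; only where the 256 integers live differs.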

In terms of performance, the stack is usually faster. Not only are stack allocation and deallocation cheaper than their heap counterparts, but stack memory is also CPU-cache friendly: the data is more likely to already be in the CPU cache, which makes access much faster.

Then why don't we use stack-allocated variables whenever possible? It's answered here: Why is the use of alloca() not considered good practice? _alloca even accepts a size parameter at run time. (That's very convenient, and it can also easily be misused.)
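One common way to keep the convenience of a stack buffer without letting a run-time size grow unchecked is to cap the stack usage and fall back to the heap for anything larger. The sketch below shows that pattern. It is a swapped-in alternative to _alloca and not something the linked answer prescribes, and kMaxStackBytes and sum_bytes_via_scratch are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

constexpr std::size_t kMaxStackBytes = 512;  // hypothetical cap on stack usage

// Copies `size` bytes into scratch storage and returns the sum of the bytes.
// Small inputs use a fixed-size stack buffer; larger ones use the heap.
// `used_heap` reports which path was taken.
unsigned sum_bytes_via_scratch(const unsigned char* data, std::size_t size,
                               bool* used_heap) {
    assert(used_heap != nullptr);
    assert(data != nullptr || size == 0);

    unsigned char stack_scratch[kMaxStackBytes];  // bounded, known at compile time
    std::vector<unsigned char> heap_scratch;      // stays empty unless needed
    unsigned char* scratch = stack_scratch;

    *used_heap = size > kMaxStackBytes;
    if (*used_heap) {
        heap_scratch.resize(size);
        scratch = heap_scratch.data();
    }

    if (size != 0) {
        std::memcpy(scratch, data, size);
    }

    unsigned sum = 0;
    for (std::size_t i = 0; i < size; ++i) {
        sum += scratch[i];
    }
    return sum;
}
```

The stack cost is fixed at kMaxStackBytes regardless of what the caller passes, so a large or attacker-controlled size ends up on the heap instead of overflowing the stack.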

The default total thread stack size is 1 MiB in Visual C++. So if you try to allocate more than 1 MiB on the stack, it's guaranteed to cause a stack overflow. The stack is also used for other purposes, such as return addresses and function parameters. If you have many nested function calls, the stack space actually available can be much smaller.
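A rough back-of-the-envelope calculation shows how quickly nested calls eat into that 1 MiB. This is only a sketch: max_nested_calls is a hypothetical helper, and the frame size is whatever you estimate for your own function, not something the compiler reports here.

```cpp
#include <cassert>
#include <cstddef>

// 1 MiB, the Visual C++ default mentioned above.
constexpr std::size_t kDefaultStackBytes = 1024 * 1024;

// Upper bound on how many nested calls fit into the remaining stack,
// given an estimated per-call frame size in bytes.
std::size_t max_nested_calls(std::size_t stack_bytes,
                             std::size_t already_used_bytes,
                             std::size_t frame_bytes) {
    assert(frame_bytes > 0);
    if (already_used_bytes >= stack_bytes) {
        return 0;  // nothing left for further calls
    }
    return (stack_bytes - already_used_bytes) / frame_bytes;
}
```

For example, max_nested_calls(kDefaultStackBytes, 0, 4096) gives 256 for a function with a 4 KiB frame, and 240 if 64 KiB of the stack is already in use. Real frames also hold return addresses and saved registers, so the practical limit is lower still.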

Windows also expands the stack at run time to save physical memory. See Pushing the Limits of Windows: Processes and Threads for details. A thread normally doesn't use more than a couple of hundred KiB of stack, so if the OS committed the whole 1 MiB for every thread, it would waste memory. A system easily runs hundreds of threads in total. This was a big deal on 32-bit systems in particular; even on 64-bit systems the default total stack size is still 1 MiB, and that seems to work surprisingly well.
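The saving is easy to estimate with a little arithmetic. The sketch below compares committing the full reservation up front with committing only what each thread touches. StackCommit and estimate_stack_commit are hypothetical names, the thread count and per-thread usage are illustrative assumptions, and it ignores details such as guard pages and page-size rounding.

```cpp
#include <cassert>
#include <cstddef>

struct StackCommit {
    std::size_t eager_kib;      // every thread commits its full reservation
    std::size_t on_demand_kib;  // every thread commits only what it uses
};

// Estimates total committed stack memory, in KiB, under both strategies.
StackCommit estimate_stack_commit(std::size_t thread_count,
                                  std::size_t reserved_kib_per_thread,
                                  std::size_t used_kib_per_thread) {
    assert(used_kib_per_thread <= reserved_kib_per_thread);
    StackCommit result;
    result.eager_kib = thread_count * reserved_kib_per_thread;
    result.on_demand_kib = thread_count * used_kib_per_thread;
    return result;
}
```

With 300 threads, a 1024 KiB reservation, and an assumed 200 KiB actually used per thread, the eager strategy commits 307200 KiB (300 MiB) while on-demand growth commits 60000 KiB (under 60 MiB).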