Not sure it it is only 1 MB, but It is difficult to understand how 64-bit Windows can provide such a small stack. It appears Microsoft has no concept of large computing, which is what a 64-bit OS enables ?
Why is stack re-sizing so inaccessible ?
My own programming style includes, where possible using array syntax to replace the inner loops to identify AVX instructions are appropriate. It is an easy way to bias towards efficient computation, which is what array syntax does provide.