Some cache problem: about which part of memory should be cached when you solve a linear system so that the speed up benefit from the cache could actually be taken advantage of.
Sigiloso
I can't really post the actual question here since it's against the agreement. But you have to be very familiar with computer architecture and large scale concurrency. But I am from computer graphics background, so I totally missed the questions from that interviews