It does sound interesting.
A few points, off the top of my head:
How much will calling the DLL slow you down?
How about memory allocation (you're talking big arrays, right?)?
Not all computers have powerful graphics cards.
How does LV's sort primitive compare to qsort()?
Are other technologies going to help (cell processors and such)?
How stable is this code? What happens when Nvidia upgrade their cards?
The only way to know for sure is probably to run a benchmark of LV sorting vs. LV calling the DLL.
___________________
Try to take over the world!