Could you build something into the nvidia module that lets it temporarily release the card resources it holds? It could "suspend" the X session just long enough for the new module to get the card resources it needs, load and unload the new module, and then have the original module resume the session. All of this would happen without unloading the original nvidia.ko from the kernel.

I imagine the screen would go black for 1-2 seconds during this process.
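
To make the sequence concrete, here is a toy Python model of what I have in mind. It is only a sketch: `Card`, `suspended`, and `run_guest` are names I made up, nothing here corresponds to a real nvidia driver interface, and a real version would live in kernel code rather than Python.

```python
from contextlib import contextmanager


class Card:
    """Toy stand-in for the GPU; tracks which module currently owns it."""

    def __init__(self, owner):
        self.owner = owner
        self.log = []


@contextmanager
def suspended(card, resident):
    """Hypothetical suspend/resume hook offered by the resident module."""
    card.log.append(f"{resident}: suspend X session, release card")
    card.owner = None
    try:
        yield card
    finally:
        # The resident module takes the card back even if the guest failed.
        card.owner = resident
        card.log.append(f"{resident}: reclaim card, resume X session")


def run_guest(card, guest):
    """Load the new module, let it use the card, then unload it."""
    if card.owner is not None:
        raise RuntimeError("card is still held by " + card.owner)
    card.owner = guest
    card.log.append(f"{guest}: loaded, using card")
    card.owner = None
    card.log.append(f"{guest}: unloaded")


# The resident module ("nvidia") stays loaded the whole time.
card = Card(owner="nvidia")
with suspended(card, "nvidia"):
    run_guest(card, "new_module")

print("\n".join(card.log))
```

The point of the `try`/`finally` is that the original module always gets the card back and resumes the session, even if the new module fails while it has the card.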
