I will try to answer your questions as you stated them:
1. Correct. We felt that most users would call niDMM Read.vi more than once in their applications, and this behavior would make the second, third, ... readings faster.
2. Not exactly. The expense of the "first measurement" is incurred whenever you change the front-end configuration of the DMM, so the optimization is intended to benefit you when you need to take more than one measurement without changing your DMM configuration. For example, if your application changes the DMM mode from DC Volts to 2-wire resistance, there will a "first measurement" for each change.
If you want to "turn off the lights" after a call to niDMM Read.vi, call niDMM Reset.vi after niDMM Read.vi. This operation will stop the DMM. Also, you can use niDMM Read Multi Point.vi to read one sample - this will not leave the "lights on".
3. Actually, if you use niDMM Read Multi Point.vi to read more than one measurement, the driver will employ the same optimization mechanism, but the driver does not leave the DMM "running" after execution of niDMM Read Multi Point.vi as is the case with implementation of niDMM Read.vi
This behavior is not unusual for DMM instruments. Due to their multi-function nature, changing the front end configuration usually involves operations which could be time consuming (speaking in some relative terms), so once a DMM is programmed for some function and range, it could be left in a state where it continuously takes measurements such that subsequent requests for data are fulfilled faster.
Regrads,
Sead Suskic
National Instruments