| Where Type |
iotest read |
iotest write |
copy (cp) |
stat/sum (uals) |
|
pingo LNET 1.6.7 | +6% | +28% | +19% | -15%
| |
pingo LNET 1.8.1 | +3% | +30% | +26% | -8%
| |
pingo NFS | -71% | -39% | -61% | -16%
| |
ognip LNET 1.8.1 | -57% | -22% | -5% | -12%
| |
ognip NFS | -73% | -44% | -52% | -15%
|
Positive percentage is faster than native Lustre. Negative is slower. |
Observations:
- For stat and sum, NFS is slightly worse than LNET which is worse than native.
The 1.8.1 client may be better.
- For large file IO (read, write, copy) NFS is significantly worse than LNET, for 10GigE:
- LNET is 2.1 times faster than NFS for write
- LNET is 3.5 times faster than NFS for read
- LNET is 3.2 times faster than NFS for copy
- For native login nodes, read/write/copy effectively ran at CPU speed (CPU == elapsed).
- For esLogin nodes copy ran at CPU speed but read used ~60% CPU speed and write used ~75%.
- With 10GigE LNET on esLogin is faster than native login Lustre, especially for write.
This is unexpected (drop_caches performed) and not yet understood.
- The drop_caches typically took 2 seconds for native login (8gb),
20+ seconds for NFS (128gb), and 50+ seconds for LNET (128gb).
- The large IO typically used all available memory.
For HPC systems, this may not be optimal as IO buffers are not generally reusable for caching.
There may be tuning opportunities to enhance IO throughput and memory utilization on a busy system by
reducing IO caching.
|

|