Improve system memory hard limit for GPFS program startup to adapt to AI system

See this idea on ideas.ibm.com

We hit the following issue on Spectrum Scale 5.1.6.1 when GPFS is starting up.
2023-12-13_21:22:28.648+0800: [I] Verifying minimum system memory configurations.
2023-12-13_21:22:28.648+0800: [I] The system memory configuration is 2063930 MiB
2023-12-13_21:22:28.648+0800: [I] The daemon memory configuration hard floor is 1536 MiB
2023-12-13_21:22:28.649+0800: [I] Initializing the main process ...
2023-12-13_21:22:28.683+0800: [E] Failed to allocate 92274688 bytes in memory pool, err -1
2023-12-13_21:22:28.683+0800: [X] logAssertFailed: err == E_OK
2023-12-13_21:22:28.683+0800: [X] return code 12, reason code 0, log record tag 0
2023-12-13_21:22:28.909+0800: [X] logAssertFailed: !"clock_gettimeP is NULL"
The error messages indicates "Failed to allocate xxx bytes", but the system still have a lot of free memory(nearly 1500G+ free，total about 2015G ).

The issue turns out to be caused by the VMALLOC limit. There is a 1T limitation for the ADDR above VMALLOC_START. So when other kernel modules occupied huge memory, GPFS will fail to allocate new memory because the ADDR exceeds the 1T limit.
There are 888252 of below in /proc/vmallocinfo, each takes about 3M, that is 2.6T+ in total.
0xffffa4ee71c00000-0xffffa4ee71f01000 3149824 ttm_bo_kmap+0x233/0x2a0 [ttm] phys=0x0000000094005000 ioremap
0xffffa4ee72000000-0xffffa4ee72301000 3149824 ttm_bo_kmap+0x233/0x2a0 [ttm] phys=0x0000000094005000 ioremap
0xffffa4ee72800000-0xffffa4ee72b01000 3149824 ttm_bo_kmap+0x233/0x2a0 [ttm] phys=0x0000000094005000 ioremap
0xffffa4ee72c00000-0xffffa4ee72f01000 3149824 ttm_bo_kmap+0x233/0x2a0 [ttm] phys=0x0000000094005000 ioremap
0xffffa4ee73c00000-0xffffa4ee73f01000 3149824 ttm_bo_kmap+0x233/0x2a0 [ttm] phys=0x0000000094005000 ioremap
We have also tried the efix with 4T limit and still hit the issue on some nodes.
We cannot start up GPFS right now. Even we reboot the node, the issue may go away temporarily but it may come back any time.
We sincerely and eagerly hope this hard limit can be improved to fit AI system with large-memory. Thank you.

Idea priority

Urgent

Post comment

By clicking the "Post Comment" or "Submit Idea" button, you are agreeing to the IBM Ideas Portal Terms of Use.
Do not place IBM confidential, company confidential, or personal information into any field.

Shape the future of IBM!

Search existing ideas

Post your ideas

Specific links you will want to bookmark for future use

Improve system memory hard limit for GPFS program startup to adapt to AI system

Please enter your email address

RELATED IDEAS

Improve system memory hard limit for GPFS program startup to adapt to AI system