generated from sig_core/wiki-template
Compare commits
No commits in common. "main" and "meeting/2024-02-22" have entirely different histories.
main
...
meeting/20
2 changed files with 0 additions and 100 deletions
|
@ -1,66 +0,0 @@
|
||||||
# SIG/HPC Meeting 2024-03-07
|
|
||||||
|
|
||||||
## Attendees
|
|
||||||
|
|
||||||
* Forrest Burt
|
|
||||||
* Brian Phan
|
|
||||||
* Sherif Nagy
|
|
||||||
* Enrico Billi
|
|
||||||
* Neil Hanlon
|
|
||||||
* Jeremy Siadal
|
|
||||||
* Chris Stackpole
|
|
||||||
|
|
||||||
## Old Business
|
|
||||||
|
|
||||||
* Intel Driver -
|
|
||||||
* Sherif is working on this, has a prototype, needs DKMS
|
|
||||||
* Used `make spec` script in the branch to create spec, and import from there
|
|
||||||
* We think that upstream should adopt a different format/packaging methodology
|
|
||||||
* Perhaps [packit](https://packit.dev) could be helpful?
|
|
||||||
* What branch/version to use?
|
|
||||||
* rhel-specific branches say not to use them; use the 'backports' branches instead
|
|
||||||
* sherif appears to be in the right place
|
|
||||||
* Next steps:
|
|
||||||
* Neil to bring dkms from epel into projects
|
|
||||||
* Sherif to upload to public location for review and testing
|
|
||||||
* Jeremy to work on testing with some latest hardware
|
|
||||||
* AI SIG
|
|
||||||
* where will userspace tools live? HPC? AI? Both?
|
|
||||||
* Neil: it should be reasonable for us to have the ability to easily release a package in multiple SIGs
|
|
||||||
* NVidia GPU driver Testing -
|
|
||||||
* Did not get time to review [Chris's work](https://github.com/mghpcsim/gpu-testing/tree/master) - will try to review this cycle
|
|
||||||
* Kernel Cnode / MoS
|
|
||||||
* re-actioning - Jeremy to work on once he has some time
|
|
||||||
|
|
||||||
## New Business
|
|
||||||
|
|
||||||
* Testing Warewulf - Brian
|
|
||||||
* Current plan: put the tests upstream into Warewulf repo, Testing team can pull from / engage with upstream
|
|
||||||
* What precisely are we going to test?
|
|
||||||
* Functional/E2E tests -- provision a small cluster, etc (see last week's [discussions](https://sig-hpc.rocky.page/events/meeting-notes/2024-02-22/#discussions))
|
|
||||||
* Future work can include e.g. slurm
|
|
||||||
* Chris to check on status of slurm
|
|
||||||
* Packages to bring in
|
|
||||||
* [List](https://sig-hpc.rocky.page/packages/) on the wiki; needs updating (along with the rest of the wiki)
|
|
||||||
* if anyone wants to bring something in, has questions, etc. Please ask/get in touch!
|
|
||||||
* Neil to update the wiki
|
|
||||||
|
|
||||||
## Open Floor
|
|
||||||
|
|
||||||
* Vulnerability in [lustre](http://lists.lustre.org/pipermail/lustre-announce-lustre.org/2024/000270.html) - related to user namespaces
|
|
||||||
* Sherif was working on lustre-server, but it's a beast
|
|
||||||
* DDN already builds RPMS, but... is it worth it to rebuild vs just use upstream?
|
|
||||||
* Sherif: thinks it makes sense to rebuild against our specific user/kernel space
|
|
||||||
* there are lustre-server for 8, but not 9, it appears.. why?
|
|
||||||
* documentation supports this but again.. why?
|
|
||||||
* Sherif to look into why lustre-server exists for 8 but not 9
|
|
||||||
* Next meeting in two weeks on Thursday, March 1
|
|
||||||
|
|
||||||
## Action Items
|
|
||||||
|
|
||||||
* [ ] Chris to check on status of slurm
|
|
||||||
* [ ] Neil to update the wiki
|
|
||||||
* [ ] Sherif to look into why lustre-server exists for 8 but not 9
|
|
||||||
* [ ] Neil to bring dkms from epel into projects
|
|
||||||
* [ ] Sherif to upload to public location for review and testing
|
|
||||||
* [ ] Jeremy to work on testing with some latest hardware
|
|
|
@ -1,34 +0,0 @@
|
||||||
# SIG/HPC Meeting 2024-03-21
|
|
||||||
|
|
||||||
## Attendees
|
|
||||||
|
|
||||||
* Neil Hanlon
|
|
||||||
* Sherif Nagy
|
|
||||||
* Brian Phan
|
|
||||||
* Forrest Burt
|
|
||||||
|
|
||||||
## Follow Ups
|
|
||||||
|
|
||||||
* Intel GPU driver imported and built in SIG/Kernel 'kernel-drivers' repo.
|
|
||||||
* https://dl.rockylinux.org/stg/sig/9/kernel/x86_64/kernel-common/Packages/i/intel-i915-dkms-1.23.6.42.230425.56-1.x86_64.rpm
|
|
||||||
* Warewulf 4.5 released upstream
|
|
||||||
* Sherif looking into bringing update to SIG
|
|
||||||
* Running into issue on Rocky 9
|
|
||||||
* Testing - CIQ will be upstreaming a test suite
|
|
||||||
* Nvidia driver GPU benchmarking - re-action reviewing the work
|
|
||||||
* Did not get time to review [Chris's work](https://github.com/mghpcsim/gpu-testing/tree/master) - will try to review this cycle
|
|
||||||
* Lustre server
|
|
||||||
* re-actioning; Sherif has not looked into it yet
|
|
||||||
* Wiki Content - still need to populate this. Can people from the SIG help?
|
|
||||||
* Packages - have some 'easy' ones
|
|
||||||
|
|
||||||
## Open Floor
|
|
||||||
|
|
||||||
* n/a
|
|
||||||
|
|
||||||
## Action Items
|
|
||||||
|
|
||||||
* [ ] Neil to bring in dkms to kernel-drivers to SIG/Kernel
|
|
||||||
* [ ] See if Alan would be willing to work on this
|
|
||||||
* [ ] Neil to look into resourcing some people to work on this
|
|
||||||
* [ ] Neil to make tickets for all packages we are looking to bring in, rank priority and ease
|
|
Loading…
Reference in a new issue