Commit Graph

2 Commits

Author SHA1 Message Date
Georgi Gerganov
08e69c5008
cuda : adapt soft_max to F16 mask and pos 2024-03-28 19:40:11 +02:00
slaren
ae1f211ce2
cuda : refactor into multiple files (#6269) 2024-03-25 13:50:23 +01:00