readme : update hot topics

2025-01-12 19:50:17 +00:00 · 2024-02-21 15:39:54 +02:00 · 2024-02-21 15:39:54 +02:00 · c14f72db9c
commit c14f72db9c
parent cc6cac08e3
1 changed files with 2 additions and 7 deletions
--- a/README.md
+++ b/README.md
@ -10,13 +10,8 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
 ### Hot topics
- Remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD: https://github.com/ggerganov/llama.cpp/pull/5240
+- Support for Gemma models: https://github.com/ggerganov/llama.cpp/pull/5631
- Incoming backends: https://github.com/ggerganov/llama.cpp/discussions/5138
+- Non-linear quantization IQ4_NL: https://github.com/ggerganov/llama.cpp/pull/5590
  - [SYCL backend](README-sycl.md) is ready (1/28/2024), support Linux/Windows in Intel GPUs (iGPU, Arc/Flex/Max series)
 - New SOTA quantized models, including pure 2-bits: https://huggingface.co/ikawrakow
 - Collecting Apple Silicon performance stats:
  - M-series: https://github.com/ggerganov/llama.cpp/discussions/4167
  - A-series: https://github.com/ggerganov/llama.cpp/discussions/4508
 - Looking for contributions to improve and maintain the `server` example: https://github.com/ggerganov/llama.cpp/issues/4216
 ----