# llama.swiftui
Local inference with llama.cpp on an iPhone. This sample app can be used as a starting point for more advanced projects.
For usage instructions and performance stats, check the following discussion: https://github.com/ggerganov/llama.cpp/discussions/4508
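To give a feel for what the bridging layer does, here is a minimal sketch of loading a model through the llama.cpp C API from Swift. It assumes the `llama` module exposed by the adjacent `llama.cpp.swift` package; the function name and parameter values are illustrative assumptions, not the app's exact code.

```swift
import Foundation
import llama  // llama.cpp C API, exposed via the llama.cpp.swift package

// Minimal sketch (illustrative, not the app's exact code):
// load a GGUF model from disk and create an inference context.
func loadModel(at path: String) throws -> (model: OpaquePointer, context: OpaquePointer) {
    llama_backend_init()

    var modelParams = llama_model_default_params()
    modelParams.n_gpu_layers = 0  // CPU only; raise to offload layers to Metal on device

    guard let model = llama_load_model_from_file(path, modelParams) else {
        throw NSError(domain: "llama", code: 1,
                      userInfo: [NSLocalizedDescriptionKey: "failed to load model at \(path)"])
    }

    var ctxParams = llama_context_default_params()
    ctxParams.n_ctx = 2048  // context window size; an illustrative value

    guard let context = llama_new_context_with_model(model, ctxParams) else {
        llama_free_model(model)
        throw NSError(domain: "llama", code: 2,
                      userInfo: [NSLocalizedDescriptionKey: "failed to create context"])
    }
    return (model, context)
}
```

The app's actual wrapper code lives in the `llama.cpp.swift` directory of this example; see the discussion linked above for how the pieces fit together in practice.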
Video demonstration:
https://github.com/bachittle/llama.cpp/assets/39804642/e290827a-4edb-4093-9642-2a5e399ec545