# llama.cpp/.devops/llama-cpp-cuda.srpm.spec

# SRPM for building from source and packaging an RPM for RPM-based distros.
# https://docs.fedoraproject.org/en-US/quick-docs/creating-rpm-packages
# Built and maintained by John Boero - boeroboy@gmail.com
# In honor of Seth Vidal https://www.redhat.com/it/blog/thank-you-seth-vidal

# Notes for llama.cpp:
# 1. Tags are currently based on hash - which will not sort asciibetically.
#    We need to declare standard versioning if people want to sort latest releases.
# 2. Builds for CUDA/OpenCL support are separate, with different dependencies.
# 3. NVIDIA's developer repo must be enabled with nvcc, cublas, clblas, etc. installed.
#    Example: https://developer.download.nvidia.com/compute/cuda/repos/fedora37/x86_64/cuda-fedora37.repo
# 4. OpenCL/CLBLAST support simply requires the ICD loader and basic opencl libraries.
#    It is up to the user to install the correct vendor-specific support.
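# 5. A possible build sequence, shown only as a sketch. It assumes a Fedora host
#    with dnf-plugins-core (dnf 4 syntax) and rpmdevtools installed, and this
#    spec saved as llama-cpp-cuda.srpm.spec. These commands are illustrative
#    and are not part of the upstream notes.
#      sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/fedora37/x86_64/cuda-fedora37.repo
#      spectool -g -R llama-cpp-cuda.srpm.spec   # download Source0 into the SOURCES directory
#      rpmbuild -ba llama-cpp-cuda.srpm.spec     # build both the source and binary RPMs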

Name:           llama.cpp-cuda
Version:        %( date "+%%Y%%m%%d" )
Release:        1%{?dist}
Summary:        CUDA-accelerated inference of LLaMA models in C/C++
License:        MIT
Source0:        https://github.com/ggerganov/llama.cpp/archive/refs/heads/master.tar.gz
BuildRequires:  coreutils make gcc-c++ git cuda-toolkit
Requires:       cuda-toolkit
URL:            https://github.com/ggerganov/llama.cpp

%define debug_package %{nil}
%define source_date_epoch_from_changelog 0

%description
CUDA-accelerated inference for Meta's Llama 2 models using default options.

%prep
%setup -n llama.cpp-master

%build
make -j LLAMA_CUDA=1

%install
mkdir -p %{buildroot}%{_bindir}/
cp -p main %{buildroot}%{_bindir}/llamacppcuda
cp -p server %{buildroot}%{_bindir}/llamacppcudaserver
cp -p simple %{buildroot}%{_bindir}/llamacppcudasimple

mkdir -p %{buildroot}/usr/lib/systemd/system
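# Quote the heredoc delimiter so the build shell does not expand
# $LLAMA_ARGS and $MAINPID to empty strings when writing the unit file.
%{__cat} <<'EOF'  > %{buildroot}/usr/lib/systemd/system/llamacuda.service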
[Unit]
Description=Llama.cpp server, CUDA build.
After=syslog.target network.target local-fs.target remote-fs.target nss-lookup.target

[Service]
Type=simple
EnvironmentFile=/etc/sysconfig/llama
ExecStart=/usr/bin/llamacppcudaserver $LLAMA_ARGS
ExecReload=/bin/kill -s HUP $MAINPID
Restart=no

[Install]
WantedBy=default.target
EOF

mkdir -p %{buildroot}/etc/sysconfig
%{__cat} <<EOF  > %{buildroot}/etc/sysconfig/llama
LLAMA_ARGS="-m /opt/llama2/ggml-model-f32.bin"
EOF

%clean
rm -rf %{buildroot}
rm -rf %{_builddir}/*

%files
%{_bindir}/llamacppcuda
%{_bindir}/llamacppcudaserver
%{_bindir}/llamacppcudasimple
/usr/lib/systemd/system/llamacuda.service
%config /etc/sysconfig/llama

%pre

%post

%preun
%postun

%changelog