Use the Unified CLI
The skainet CLI auto-detects model architecture from GGUF metadata, so you don’t need to pick the right runner.
Tool Calling Demo
Interactive:
./gradlew :llm-apps:skainet-cli:run \
--args="-m model.gguf --demo"
Single-shot (for scripts/testing):
./gradlew :llm-apps:skainet-cli:run \
--args="-m model.gguf --demo 'What is 2 + 2?'"
All Options
skainet -m <model.gguf> [options] [prompt]
Options:
-m, --model Path to .gguf model (required)
-s, --steps Generation steps (default: 64)
-k, --temperature Sampling temperature (default: 0.8)
--chat Interactive chat mode
--agent Interactive agent with tool calling
--demo Tool calling demo (add prompt for single-shot)
--template=NAME Chat template override: llama3, chatml, qwen, gemma
--context=N Cap context length to N tokens
-h, --help Show help