{{SERVER_MODELS_JS}}
🍋 Lemonade Server
GitHub
Docs
Models
Featured Apps
News
×
LLM Chat
Model Settings
Model Management
Loading...
⏏
Pick a model
📎
Send
Use Lemonade with your favorite app
Open WebUI
Continue
Gaia
AnythingLLM
AI Dev Gallery
LM-Eval
CodeGPT
AI Toolkit
Temperature:
Controls randomness in responses (0 = deterministic, 2 = very random)
Top K:
Limits token selection to top K most likely tokens
Top P:
Nucleus sampling - considers tokens with cumulative probability up to P
Repeat Penalty:
Penalty for repeating tokens (1 = no penalty, >1 = less repetition)
Reset to Defaults
🔥
Hot Models
🔧
By Recipe
llama.cpp
OGA Hybrid
OGA NPU
OGA CPU
🏷️
By Category
Coding
Vision
Reasoning
Reranking
Embeddings
Custom
➕
Add a Model
Model Name
ⓘ
user.
Checkpoint
ⓘ
Recipe
ⓘ
llamacpp
oga-npu
oga-hybrid
oga-cpu
More info
mmproj file
ⓘ
Reasoning
ⓘ
Install