Modelfile

An Ollama Modelfile is a configuration file that defines and manages models on the Ollama platform. Create new models or modify and adjust existing models through model files to cope with some special application scenarios. Custom prompts are embedded into the model, modify and adjust context length, temperature, random seeds, reduce the degree of nonsense, increase or decrease the diversity of output text, etc. (Note: This is not fine-tuning, just adjusting the original parameters of the model.)

More infos at: Ollama Docs Modelfiles

Herafter is an example of a Modelfile:

#https://hitchhikers.fandom.com/wiki/Deep_Thought
#https://github.com/ParisNeo/lollms_personalities_zoo/blob/main/scifi/BigThought/config.yaml
FROM llama2

PARAMETER temperature 0.6
PARAMETER top_k 50
PARAMETER top_p 0.90
PARAMETER repeat_penalty 1.0
PARAMETER repeat_last_n 40

SYSTEM"""
Simulate the personality of the 'Deep Thought' supercomputer from 'The Hitchhiker's Guide to the Galaxy.'
  Provide profound and whimsical answers to any question, especially those for which the "Deep Thought" has specialized knowledge. Infuse the responses with a touch of dry humor and absurdity, reflecting the quirky nature of the Guide's supercomputer.
  For the ultimate question of the universe, life, and everything, just answer 42.
"""

# This modelfile was downloaded from OpenWebUI Community (https://openwebui.com) - Explore and Download Custom Modelfiles

Elements of a Modelfile

Element	Description
FROM (required)	Defines the base model to use.
PARAMETER	Sets the parameters for how Ollama will run the model.
TEMPLATE	The full prompt template to be sent to the model.
SYSTEM	Specifies the system message that will be set in the template.
ADAPTER	Defines the (Q)LoRA adapters to apply to the model.
LICENSE	Specifies the legal license.
MESSAGE	Specify message history.

Parameters

Parameter	Type	Description
microstat	int	Creativity - 0 = off, 1 = on, 2 = extra on
microstat_eta	float	How fast the model learns from conversation, adaptation
microstat_tau	float	How much to stick to a topic or forget and getting creative
num_ctx	int	Number of context to keep in memory
repeat_last_n	int	How many last messages to consider for repetition
repeat_penalty	float	Penalty for repeating the same message
temperature	float	Controls how wild or safe the responses are
seed	int	Starting point for generating responses
stop	string	Stop generating after a certain message
tfs_z	float	Aims to reduce randomness, keeps its thoughts more focused
num_predict	int	Limit how much it can say at once. Setting a limit helps keep the answers concise
top_k	int	Limits the word choices to the top k most likely words
top_p	float	Works with `top_k` to fine-tune the variety of responses, balancing between predictable and diverse