Model Card for Model ID

This is a LoRA for gemma-3-270m-it, enabling a simplified thinking process with rudimentary tool calling for the calculator tool, although it seems it can't handle that much at once. Probably the smallest thinking LLM to date?

Parameters to use:

repetition_penalty=1.0,
temperature=0.1,
top_k=32,
top_p=0.95,
min_p=0.0,

Model Details

Model Description

This model can perform some simple thinking in <|thinking_start|><|thinking_end|> tags. Example: Question: What do you think about Poland? What is their relation to the Czech Republic?

==================== THINKING ====================

Here is the thinking process:
- Poland is located in Europe, bordering Czech Republic.
- It is a continent with a history of conflict and political tension.
- The relationship between Poland and Czech Republic is complex and often contentious.
- Poland has been a major European power for centuries, while Czech Republic has been a major European power for centuries.
- The relationship is characterized by a balance of power and mutual interest.
<|thinking_end|>

==================== RESPONSE ====================

Poland is situated in Europe, bordered by Czech Republic, and has been a significant European power for centuries. Its history of conflict and political tensions make it one of the most politically unstable regions on Earth.
<|response_end|>

Here is an small harness: https://gist.github.com/Firstbober/4920022ced5d83484cf82b944f8d259f

  • Developed by: firstbober

Framework versions

  • PEFT 0.19.1
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for firstbober/gemma-3-270M-it-smol-thinker

Adapter
(57)
this model