rleo committed
Commit 0c8df8e · verified · 1 Parent(s): 8d92ee2

Upload 2 files

Add finetuned version of gemma

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ function-gemma-finetuned-tool-call.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,97 @@
- ---
- license: apache-2.0
- ---
+ ---
+ language:
+ - en
+ - fr
+ license: apache-2.0
+ base_model: unsloth/functiongemma-270m-it
+ library_name: gguf
+ pipeline_tag: text-generation
+ tags:
+ - gguf
+ - llama.cpp
+ - function-calling
+ - tool-calling
+ - gemma
+ - bilingual
+ - en
+ - fr
+ - finetuned
+ model_name: function-gemma-finetuned-tool-call
+ ---
+
+ # function-gemma-finetuned-tool-call
+
+ Fine-tuned Function-Gemma 270M model for bilingual (English/French) tool-calling.
+
+ ## Files
+
+ - `function-gemma-finetuned-tool-call.gguf` (F16 merged GGUF)
+
+ ## Base Model
+
+ - `unsloth/functiongemma-270m-it`
+
+ ## Training Summary
+
+ - Method: SFT + LoRA, then merged into full weights
+ - Dataset: custom bilingual EN/FR tool-calling set (`dataset_80tools_en_fr.json`)
+ - Target behavior: structured function/tool calls with argument extraction and no-tool abstention when appropriate
+
+ ## Local Evaluation (checkpoint benchmark)
+
+ From `outputs/eval_checkpoint_report.json`:
+
+ - Total cases: 16
+ - Pass rate: 0.8125
+ - Decision accuracy: 0.8125
+ - Tool name accuracy: 0.8125
+ - Argument presence accuracy: 1.0
+ - Tool-call recall: 1.0
+ - No-tool precision: 0.5
52
+ ## Usage (llama.cpp)
53
+
54
+ ```bash
55
+ llama.cpp/build/bin/llama-cli \
56
+ --model function-gemma-finetuned-tool-call.gguf \
57
+ --ctx-size 32768 \
58
+ --n-gpu-layers 99 \
59
+ --seed 3407 \
60
+ --top-k 64 \
61
+ --top-p 0.95 \
62
+ --temp 1.0 \
63
+ --jinja
64
+ ```
65
+
66
+ For one-shot test:
67
+
68
+ ```bash
69
+ llama.cpp/build/bin/llama-cli \
70
+ --model function-gemma-finetuned-tool-call.gguf \
71
+ --ctx-size 32768 \
72
+ --n-gpu-layers 99 \
73
+ --seed 3407 \
74
+ --top-k 64 \
75
+ --top-p 0.95 \
76
+ --temp 1.0 \
77
+ --jinja \
78
+ --single-turn \
79
+ --simple-io \
80
+ --prompt "What is the weather in Paris?"
81
+ ```
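
For scripted checks, the one-shot command can be wrapped in a small helper. This is a minimal sketch: the flags are copied from the README's one-shot command, while the function names, the default paths, and the timeout are illustrative assumptions. It also assumes the installed llama.cpp build accepts exactly these flags.

```python
import subprocess

# Paths as written in the README; adjust them for your checkout (assumption).
LLAMA_CLI = "llama.cpp/build/bin/llama-cli"
MODEL = "function-gemma-finetuned-tool-call.gguf"


def build_one_shot_args(prompt, binary=LLAMA_CLI, model=MODEL):
    """Return the argv list mirroring the README's one-shot command."""
    return [
        binary,
        "--model", model,
        "--ctx-size", "32768",
        "--n-gpu-layers", "99",
        "--seed", "3407",
        "--top-k", "64",
        "--top-p", "0.95",
        "--temp", "1.0",
        "--jinja",
        "--single-turn",
        "--simple-io",
        "--prompt", prompt,
    ]


def run_one_shot(prompt, timeout_s=120):
    """Run llama-cli once and return its stdout.

    Raises FileNotFoundError if the binary is missing and
    subprocess.CalledProcessError if it exits with a non-zero status.
    """
    result = subprocess.run(
        build_one_shot_args(prompt),
        capture_output=True,
        text=True,
        check=True,
        timeout=timeout_s,
    )
    return result.stdout
```

Passing the arguments as a list avoids shell quoting problems when prompts contain quotes or accented characters, which matters for the French test cases.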
+
+ ## Prompt / Output Format
+
+ This model was fine-tuned for Function-Gemma style tool tags (e.g. `<start_function_call>...`).
+ When used with `--jinja`, llama.cpp applies the chat template stored in GGUF metadata.
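
A caller usually needs to separate tool calls from plain-text answers in the raw output. The sketch below does that under stated assumptions: only the opening tag `<start_function_call>` appears in the README, the closing tag `<end_function_call>` is assumed, and the text between the tags is treated as an opaque payload because its exact syntax is not specified here. The function names are illustrative.

```python
import re

START_TAG = "<start_function_call>"  # taken from the README
END_TAG = "<end_function_call>"      # assumption: not shown in the README

# Non-greedy match so several calls in one response are captured separately.
_CALL_RE = re.compile(
    re.escape(START_TAG) + r"(.*?)" + re.escape(END_TAG),
    re.DOTALL,
)


def extract_call_payloads(text):
    """Return the raw payload of every tagged function call in the text."""
    return [payload.strip() for payload in _CALL_RE.findall(text)]


def is_abstention(text):
    """True when the output contains no function-call opening tag."""
    return START_TAG not in text
```

Keeping the payload opaque means the helper does not depend on the argument encoding; a second parsing step can be added once the template's exact format has been confirmed from the GGUF metadata.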
+
+ ## Limitations
+
+ - Small model (270M): can still over-call tools on ambiguous prompts that need no tool.
+ - Best results require detailed tool schemas in the prompt and clear user intent.
+
+ ## Intended Use
+
+ - Lightweight local assistant prototypes
+ - Tool-routing and structured argument extraction tasks
+ - EN/FR bilingual demos and experimentation
function-gemma-finetuned-tool-call.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:00b4dd85c54a3f1bbbdc7643efb14fe0b9f1f4f8e02db151c9aa182852a62268
+ size 542847360
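
The three lines above are a Git LFS pointer: the `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. A downloaded GGUF can be checked against them before loading. The following is a minimal sketch, with illustrative helper names and an assumed local path in the usage note.

```python
import hashlib
import os

# Pointer contents as committed in this diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:00b4dd85c54a3f1bbbdc7643efb14fe0b9f1f4f8e02db151c9aa182852a62268
size 542847360
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of an LFS pointer into algo, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check a local file's size and hash against a parsed pointer."""
    # The size comparison is cheap, so do it before hashing the whole file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so a large file never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Assuming the model was downloaded to the current directory, `file_matches_pointer("function-gemma-finetuned-tool-call.gguf", pointer)` should return `True` for an intact file. A `False` result often means that only the pointer file itself was fetched rather than the model weights.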