ANE Extraction Model — Qwen3.5 2B (Merged)

Fine-tuned Qwen3.5-2B for structured product information extraction from German marketplace listings (Kleinanzeigen, eBay, etc.).

Model Description

This model extracts structured JSON from raw listing titles and descriptions:

{
  "productName": "Apple iPhone 14 Pro 128GB Space Black",
  "brand": "Apple",
  "model": "iPhone 14 Pro",
  "condition": "neuwertig",
  "category": "elektronik",
  "attributes": {"storage": "128GB", "color": "Space Black"}
}

Training Details

Base model: unsloth/Qwen3.5-2B
Method: bf16 LoRA with unsloth (QLoRA not recommended for Qwen3.5)
LoRA rank: 16
LoRA alpha: 32
Training samples: 4002
Eval samples: 445
Final loss: 0.7330062726579339
Epochs: 3

Usage with vLLM

from vllm import LLM, SamplingParams

llm = LLM(model="ekwav/ane-extraction-qwen3-1.7b", dtype="float16")
params = SamplingParams(temperature=0.0, max_tokens=256, stop=["<|im_end|>"])

prompt = '''<|im_start|>system
You are a JSON extractor for marketplace listings. Output ONLY a valid JSON object.<|im_end|>
<|im_start|>user
T:iPhone 14 Pro 128GB
D:Neuwertig, immer mit Hülle benutzt
P:749EUR /no_think<|im_end|>
<|im_start|>assistant
'''

output = llm.generate([prompt], params)
print(output[0].outputs[0].text)

Intended Use

Part of the ANE (ane.deals) product extraction pipeline. Designed to replace larger models for cost-efficient inference at scale.

Downloads last month: 15

Safetensors

Model size

2B params

Tensor type

F32

BF16

Model tree for Ekwav/ane-extraction-qwen3.5-2b

Base model

Qwen/Qwen3.5-2B-Base

Finetuned

Qwen/Qwen3.5-2B

Finetuned

unsloth/Qwen3.5-2B

Adapter

(12)

this model