ANE Extraction Model — Qwen3.5 2B (Merged)

Fine-tuned Qwen3.5-2B for structured product information extraction from German marketplace listings (Kleinanzeigen, eBay, etc.).

Model Description

This model extracts structured JSON from raw listing titles and descriptions:

{
  "productName": "Apple iPhone 14 Pro 128GB Space Black",
  "brand": "Apple",
  "model": "iPhone 14 Pro",
  "condition": "neuwertig",
  "category": "elektronik",
  "attributes": {"storage": "128GB", "color": "Space Black"}
}

Training Details

  • Base model: unsloth/Qwen3.5-2B
  • Method: bf16 LoRA with unsloth (QLoRA not recommended for Qwen3.5)
  • LoRA rank: 16
  • LoRA alpha: 32
  • Training samples: 4002
  • Eval samples: 445
  • Final loss: 0.7330062726579339
  • Epochs: 3

Usage with vLLM

from vllm import LLM, SamplingParams

llm = LLM(model="ekwav/ane-extraction-qwen3-1.7b", dtype="float16")
params = SamplingParams(temperature=0.0, max_tokens=256, stop=["<|im_end|>"])

prompt = '''<|im_start|>system
You are a JSON extractor for marketplace listings. Output ONLY a valid JSON object.<|im_end|>
<|im_start|>user
T:iPhone 14 Pro 128GB
D:Neuwertig, immer mit Hülle benutzt
P:749EUR /no_think<|im_end|>
<|im_start|>assistant
'''

output = llm.generate([prompt], params)
print(output[0].outputs[0].text)

Intended Use

Part of the ANE (ane.deals) product extraction pipeline. Designed to replace larger models for cost-efficient inference at scale.

Downloads last month
15
Safetensors
Model size
2B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Ekwav/ane-extraction-qwen3.5-2b

Finetuned
Qwen/Qwen3.5-2B
Adapter
(12)
this model