| license: mit | |
| pipeline_tag: text-generation | |
| tags: [ONNX, ONNXRuntime, phi3, nlp, conversational, custom_code] | |
| inference: false | |
| Based on https://huggingface.co/microsoft/Phi-3-mini-instruct | |
| Convert ONNX model by using https://github.com/microsoft/onnxruntime-genai | |
| Using command: python -m onnxruntime_genai.models.builder -m microsoft/Phi-3-mini-instruct -o Phi-3-mini-instruct-onnx -e webgpu -c cache-dir -p int4 --extra_options int4_block_size=32 int4_accuracy_level=4 |