badaoui HF Staff commited on
Commit
40e288f
·
verified ·
1 Parent(s): 0a67737

Add link to Neuron-optimized version

Browse files

🤖 Neuron Export Bot: Adding link to Neuron-optimized version.

A Neuron-optimized version of this model has been created at [badaoui/microsoft-Phi-3-mini-4k-instruct-neuron](https://huggingface.co/badaoui/microsoft-Phi-3-mini-4k-instruct-neuron).

The optimized version provides improved performance on AWS Inferentia/Trainium instances with pre-compiled artifacts.

Generated by: [badaoui](https://huggingface.co/badaoui)
Generated using: [Optimum Neuron Compiler Space](https://huggingface.co/spaces/optimum/neuron-export)

Files changed (1) hide show
  1. README.md +13 -1
README.md CHANGED
@@ -317,4 +317,16 @@ The model is licensed under the [MIT license](https://huggingface.co/microsoft/P
317
 
318
  ## Trademarks
319
 
320
- This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow [Microsoft’s Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks). Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.
 
 
 
 
 
 
 
 
 
 
 
 
 
317
 
318
  ## Trademarks
319
 
320
+ This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow [Microsoft’s Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks). Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.
321
+
322
+ ---
323
+ ## 🚀 AWS Neuron Optimized Version Available
324
+
325
+ A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances:
326
+
327
+ **[badaoui/microsoft-Phi-3-mini-4k-instruct-neuron](https://huggingface.co/badaoui/microsoft-Phi-3-mini-4k-instruct-neuron)**
328
+
329
+ The Neuron-optimized version provides:
330
+ - Pre-compiled artifacts for faster loading
331
+ - Optimized performance on AWS Neuron devices
332
+ - Same model capabilities with improved inference speed