Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

anthonym21
/
Eve-2-MoE-272M

Text Generation
PyTorch
Safetensors
English
eve-moe
Mixture of Experts
deepseek
nvidia-h200
fineweb-edu
nano-lm
edge-ai
rope
custom_code
Model card Files Files and versions
xet
Community
Eve-2-MoE-272M
2.28 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 19 commits
anthonym21's picture
anthonym21
Add tokenizer_config.json from IT repo
12c3a39 verified about 2 months ago
  • .gitattributes
    1.52 kB
    initial commit 2 months ago
  • README.md
    5.92 kB
    Update README.md 2 months ago
  • config.json
    636 Bytes
    Update config.json with auto_map and HF fields 2 months ago
  • configuration_eve.py
    2.78 kB
    Add HuggingFace transformers integration (AutoModelForCausalLM support) 2 months ago
  • generate.py
    3.48 kB
    Fix generate.py: correct repo name and add HF usage example 2 months ago
  • generation_config.json
    164 Bytes
    Add generation_config.json for default generation settings 2 months ago
  • model.safetensors
    1.19 GB
    xet
    Upload model.safetensors with huggingface_hub 2 months ago
  • modeling_eve.py
    18.2 kB
    Rewrite modeling_eve.py with HF-compatible EveMoEForCausalLM 2 months ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    What is a pickle import?

    1.09 GB
    xet
    Update weights: 10B token continue-pretraining on finepdfs eng_Latn (val ppl 36.8 → 32.3) 2 months ago
  • requirements.txt
    53 Bytes
    Upload folder using huggingface_hub 2 months ago
  • tokenizer.json
    3.56 MB
    Add tokenizer.json from IT repo about 2 months ago
  • tokenizer_config.json
    297 Bytes
    Add tokenizer_config.json from IT repo about 2 months ago
  • train.py
    17.2 kB
    Upload folder using huggingface_hub 2 months ago