New research model out! I uploaded a new Branchy model based on Phi-2 for faster inference using Early Exit. Check it out: valcore/Branchy-Phi-2. I also uploaded a Hugging Face Space to try it out: valcore/Branchy-phi-2; unfortunately, inference is very slow on the free tier. Let me know what you think about it!
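For readers unfamiliar with Early Exit: the idea is to attach prediction heads to intermediate layers and stop computing as soon as one of them is confident enough. Below is a minimal, hedged sketch with tiny stand-in layers; the confidence threshold, shapes, and head design are illustrative assumptions, not Branchy-Phi-2's actual implementation.

```python
import torch
import torch.nn as nn

# Toy early-exit network: one exit head per layer. All sizes are
# illustrative assumptions, not the real Branchy-Phi-2 architecture.
torch.manual_seed(0)
layers = nn.ModuleList([nn.Linear(16, 16) for _ in range(4)])
heads = nn.ModuleList([nn.Linear(16, 10) for _ in range(4)])

def early_exit_forward(x, threshold=0.9):
    """Run layer by layer; return as soon as an exit head is confident."""
    for layer, head in zip(layers, heads):
        x = torch.relu(layer(x))
        probs = torch.softmax(head(x), dim=-1)
        conf, pred = probs.max(dim=-1)
        if conf.item() >= threshold:  # confident enough: skip remaining layers
            return pred, conf
    return pred, conf  # fall back to the deepest head

pred, conf = early_exit_forward(torch.randn(1, 16))
```

The speed-up comes from easy inputs exiting at shallow layers while hard inputs still traverse the full network.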
Explaining a new state-of-the-art monocular depth estimation model: Depth Anything ✨ 🧶 Before we begin: Depth Anything was recently integrated into 🤗 transformers, and you can use it with three lines of code! ✨
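A minimal sketch of those three lines using the transformers depth-estimation pipeline. The checkpoint id and example image URL below are assumptions for illustration; check the Hub for the checkpoint you want.

```python
from transformers import pipeline
from PIL import Image
import requests

# Assumed checkpoint id; larger variants also exist on the Hub.
pipe = pipeline("depth-estimation", model="LiheYoung/depth-anything-small-hf")

# Any RGB image works; this COCO sample is just an example.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

result = pipe(image)
depth = result["depth"]  # PIL image holding the predicted depth map
```

The pipeline handles preprocessing and post-processing, so the depth map comes back at the input image's resolution.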
The model's success hinges on unlocking the use of unlabeled datasets, although the authors' initial attempt at self-training failed. What the authors did:
⏰ Train a teacher model on the labeled dataset
⏰ Guide the student with the teacher, while also training on unlabeled images pseudo-labeled by the teacher
However, this naive setup was the cause of the failure: since both architectures were similar, their outputs were nearly identical, so the pseudo-labels taught the student nothing new. The authors therefore added a harder optimization target: the unlabeled images are perturbed with color jittering, Gaussian blurring, and spatial distortions, forcing the student to learn more invariant representations from them. The architecture consists of a DINOv2 encoder to extract features, followed by a DPT decoder. First, they train the teacher model on labeled images; then they jointly train the student model, adding in the dataset pseudo-labeled by the ViT-L teacher. Thanks to this, Depth Anything performs very well! I have also benchmarked the inference duration of the model against different models here. I also ran torch.compile benchmarks across them and got nice speed-ups 🚀 https://huggingface2.notion.site/DPT-Benchmarks-1e516b0ba193460e865c47b3a5681efb?pvs=4
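The teacher-student scheme above can be sketched in a few lines. This is a toy illustration with tiny stand-in convolutions; the perturbation, loss, and shapes are assumptions for clarity, not the paper's exact training recipe.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-ins for the DINOv2-encoder + DPT-decoder models (same architecture
# for teacher and student, as in the paper's setup).
teacher = nn.Conv2d(3, 1, 3, padding=1)
student = nn.Conv2d(3, 1, 3, padding=1)

def perturb(x):
    # Crude stand-in for color jitter / Gaussian blur / spatial distortion.
    return x + 0.1 * torch.randn_like(x)

unlabeled = torch.rand(4, 3, 32, 32)

# Teacher pseudo-labels the CLEAN unlabeled images (no gradients needed).
with torch.no_grad():
    pseudo_depth = teacher(unlabeled)

# Student must match those labels from the PERTURBED view: a harder target
# that pushes it toward perturbation-invariant representations.
pred = student(perturb(unlabeled))
loss = nn.functional.l1_loss(pred, pseudo_depth)
loss.backward()
```

The key point is the asymmetry: the teacher sees clean images while the student sees distorted ones, which is what broke the "identical outputs" failure mode of the first self-training attempt.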