dataset
Will the dataset or a smaller version of the dataset ever be available in the future?
Upload the dataset somewhere as well.
yes please
Without the dataset, it is difficult to accept or evaluate the claims made. Often, these datasets will have bugs, for example, tools being called with incorrect arguments and so on.
Hi @smirki ,
The model looks really good - nice work!
As a researcher, I’m quite interested in this direction, especially work that focuses on high-quality datasets and small-to-medium sized models. Really appreciate you open-sourcing the weights - that’s very valuable for the community.
Just a small suggestion: if possible, it would be great to also share a portion of the dataset (even a modest subset) along with a report. I think that would further enhance the impact and usability of this work.
Thanks again for your contribution!
Best regard
Yes, access to this data would be invaluable to the open source community. Great job on the model!
I would like to see the dataset please :-)
Hi @smirki ,
The model looks really good - nice work!
As a researcher, I’m quite interested in this direction, especially work that focuses on high-quality datasets and small-to-medium sized models. Really appreciate you open-sourcing the weights - that’s very valuable for the community.
Just a small suggestion: if possible, it would be great to also share a portion of the dataset (even a modest subset) along with a report. I think that would further enhance the impact and usability of this work.
Thanks again for your contribution!
Best regard
UPDATE: It seems the dataset is released in latest version - OmniCoder-2-9B
