Blog-explorers

#18 opened 21 days ago by

posted an update 18 days ago

Post

2742

By trying to disprove the Omega H2 battery I have discovered;
* Each topology formed by the H2 battery is deviant, none have a uniformly shared substrate of behavior. They are each uniquely independent per training set all with perfect recon.
* Image recon can be tracked and mapped, yielding a consistently mapped and response 16.77m vocabulary potential. In the current spectrum testing at around 5 million unicode bytes.
* The model scale shows patch size is related to how much data you want the model to represent within the model itself, and this has yet to see a capacity to this day. The MSE recons and yields - and the more data fed, the more they yield.
* The scaling principle shows that the model indefinitely scales upward and each level of the model can be iteratively captured upward to form deviant and uniformly consistent repeatable pathways of implicit codewise response, not just arbitrary bitwise recall. Meaningful implicit learned utility.
* Image recon patch size should match the slice of image you want to represent, as it uses patch smoothing per patch internally from identity.
* byte trigrams are channel-agnostic, they do not require a channel count just a formula for recall at nGram recall 99.6% for byte-by-byte representations. With those comes an adjacently capable codebook.
* sentencepiece preliminary tests show validity and reconstruction just like the byte trigrams, using the new byte trigram this would be arbitrarily convenient to recon a codebook for the structure.
* binary trees learn a uniformly potent and powerful gating mechanism that required further exploration, each of them produces direct responsive independent capacity and the responses are controllable.
* ternary experiments show the models are directly responsive to -1, 0, +1 behavior, so the quantization is very much a valid potential.
* preliminary tests with the H2O1 series of batteries show the models are responding similar to natural universal elements in the universe itself

9 replies

spanofzero

posted an update 19 days ago

Post

Productivity PSA
save tome learning and use an easy Ai of all ais and consider a friend James Murdza worldly hospitable attitude whos YT series will catch you up fast,
Today is no different as hes introduced me
to BackGrounder.dev
https://youtu.be/KFu0GTrV31g?si=jdM7DY9q49EM5FYA
Sandbox built in Multi Gfree chat code creat makes sense
and saves money can BYOapi just great for quick sandbox dev checks or just safety wise NO Regrets Code Insider Knowledge

ZennyKenny

in blog-explorers/README 20 days ago

🚩 Report: Spam

#19 opened 20 days ago by

ccocks-deca

in blog-explorers/README 20 days ago

Future of Agentic Models

#18 opened 21 days ago by

ccocks-deca

in blog-explorers/README 20 days ago

🚩 Report: Spam

#19 opened 20 days ago by

ccocks-deca

Future of Agentic Models

#18 opened 21 days ago by

in blog-explorers/README 20 days ago

Future of Agentic Models

#18 opened 21 days ago by

apehex

in blog-explorers/README 20 days ago

Future of Agentic Models

#18 opened 21 days ago by

RiverRider

in blog-explorers/README 20 days ago

Future of Agentic Models

#18 opened 21 days ago by

Yann-CV

posted an update 21 days ago

Post

486

🚀 Introducing Goldener: The Python Data Orchestrator for more efficient ML

Machine Learning workflows often rely on randomness: selecting/splitting data for training, batching it variably, and monitoring real-world performance.

Nowadays, foundation models give access to the semantics of data. Goldener leverages this semantic to make the entire ML lifecycle more efficient!

🔗 Check it out: https://github.com/goldener-data/goldener
🔨 Give it a try: pip install goldener

posted an update 23 days ago

Post

189

Today, I'll be determining the codebook capacity and utility potential for the larger batteries; Fresnel, Johanna, Grandmaster, Freckles, and Johanna-F variants, which should give a good indication of which models are capable of handling codebooks and which are more errant. The earlier all use SVD while the later do not. The differences are noted per and the behavior divergent.

I anticipate the D=16 will be more errant, and the final-state variants of those could very well be much more difficult or costly to inference as their axis bends are likely considerably harder to track. However, I'm confident that enough bounces will give the yield required so I'll set up some high-yield noise barrages to determine how much of them we can in fact extract from Johanna, and then set up similar barrages for images to map the internals of Fresnel and Grandmaster.

Grandmaster will be tricky, as it was an experimental Johanna-256 finetuned series meant to map sigma noised image inputs to recreate Fresnel behavioral output. Noised image goes in -> Fresnel-grade replication comes out in high res.

This allowed preliminary Dall-E Mini-esque VAE generation and will be explored further for the stereoscopic translation subsystem, to allow image generation in the unique format of diffusion that I was working out. I anticipate this system to be more than capable at making monstrosities, so I won't be posting TOO MANY prelims on this one, but the high-capacity potential of these noise makers are meaningfully powerful. Getting uniform codebooks in-place for these models will allow full transformer mapping downstream instead of just guess working the MSE piecemeal, which the earlier versions and variants were doing.

I'm straying from the CLS specifically for this series because CLS creates adjudicated pools of bias orbiting the INCORRECT orbiter some SVAE. The orbital target IS the soft-hand accumulated bias with the sphere-norm, so having a competitor isn't going to be a good option.

7 replies