Yatharth Sharma

YaTharThShaRma999

YaTharThShaRma999's activity

replied to nroggendorff's post about 23 hours ago
view reply

Best message I have seen, I am literally tearing up.

reacted to Duskfallcrew's post with 🔥 3 days ago
I've just started porting over the articles that mattered most to me from Civitai.
Look, I'm not going to sit here and whine, complain and moan entirely. They know why I've left, and they're going to thrive without me.
I'm a mere speck compared to their future, and that's amazing.
But the journey continues. I've posted my Design 101 for AI, the first one up (I BELIEVE it's the first one), as it delves back into how Arts and Crafts connect to AI.
I'm still looking for a model hub in future for the insane 800+ models I'd published, considering that's half of what I've got sitting in my repos on HF.
view reply

Yeah, Kokoro uses phonemes instead of direct text. That's why it's very good quality at just 82M params and can pronounce words better than other massive TTS models (even better than Llasa 8B).

The only problem is emotion, which Llasa 8B is much better at.

reacted to hexgrad's post with 🔥 4 days ago
Wanted: Peak Data. I'm collecting audio data to train another TTS model:
+ AVM data: ChatGPT Advanced Voice Mode audio & text from source
+ Professional audio: Permissive (CC0, Apache, MIT, CC-BY)

This audio should *impress* most native speakers, not just barely pass their audio Turing tests. Professional-caliber means S or A-tier, not your average bloke off the street. Traditional TTS may not make the cut. Absolutely no low-fi microphone recordings like Common Voice.

The bar is much higher than last time, so there are no timelines yet and I expect it may take longer to collect such mythical data. Raising the bar means evicting quite a bit of old data, and voice/language availability may decrease. The theme is *quality* over quantity. I would rather have 1 hour of A/S-tier than 100 hours of mid data.

I have nothing to offer but the north star of a future Apache 2.0 TTS model, so prefer data that you *already have* and costs you *nothing extra* to send. Additionally, *all* the new data may be used to construct public, Apache 2.0 voicepacks, and if that arrangement doesn't work for you, no need to send any audio.

Last time I asked for horses; now I'm asking for unicorns. As of writing this post, I've currently got a few English & Chinese unicorns, but there is plenty of room in the stable. Find me over on Discord at rzvzn: https://discord.gg/QuGxSWBfQy
reacted to retronic's post with 😎 4 days ago
Colox is out, may be bugs!

Colox is out and ready on HF, it might have bugs though as it is not tested yet. You can try for yourself now! :)
reacted to retronic's post with 🔥 5 days ago
Colox, a reasoning AI model. I am currently working on a model smarter than GPT o1 that thinks before it speaks. It is coming tomorrow in the afternoon.
reacted to ZhengPeng7's post with 🔥👍 6 days ago
We just released BiRefNet_HR (ZhengPeng7/BiRefNet_HR) for general use on higher-resolution images. It was trained on 2048x2048 images. If your images are mostly larger than 1024x1024, use BiRefNet_HR for better results! Thanks to @Freepik for the kind support of H200s for this huge training.

HF Model: ZhengPeng7/BiRefNet_HR.
HF Demo: ZhengPeng7/BiRefNet_demo, where you need to choose General-HR and set high resolution.
PyTorch weights & ONNX: in Google Drive and the GitHub release.

Here is a comparison between the results of the original one and the new HR one on HR inputs:

And, the performance of this new HR one and the previous one trained in 1024x1024 on val set:
reacted to victor's post with ❤️ 7 days ago
Hey everyone, we've given the https://hf.co/spaces page a fresh update!

Smart Search: Now just type what you want to do, like "make a viral meme" or "generate music", and our search gets it.

New Categories: Check out the cool new filter bar with icons to help you pick a category fast.

Redesigned Space Cards: Reworked a bit to really show off the app descriptions, so you know what each Space does at a glance.

Random Prompt: Need ideas? Hit the dice button for a burst of inspiration.

We'd love to hear what you think: drop us some feedback plz!
reacted to Tonic's post with 🔥 8 days ago
🙋🏻‍♂️ hey there folks,

Goedel's Theorem Prover is now being demoed on Hugging Face: Tonic/Math

give it a try!
reacted to rubenroy's post with 🔥 10 days ago