313 344 577

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

liked a model about 21 hours ago

Alpha-VLLM/Lumina-Video-f24R960

replied to nroggendorff's post about 23 hours ago

Dearest None-yet Team, I couldn't help but notice that our productivity has room for improvement. To address this, we will be engaging in a company-wide morale-building activity designed to boost teamwork, enthusiasm, and *most importantly* results. I know you're all as excited as I am for this fun and absolutely required initiative. Participation is not just encouraged, it's mandatory. Think of it as a team-bonding experience you never signed up for but will absolutely tolerate. More details to follow, but for now, mark your calendars and prepare for an engaging experience that will definitely make us all better, stronger, and more synchronized, or at least give us something to talk about later. Looking forward to seeing you all there! Best, Me

updated a model about 23 hours ago

YaTharThShaRma999/ToucanTTS

View all activity

Organizations

None yet

YaTharThShaRma999's activity

liked a model about 21 hours ago

Alpha-VLLM/Lumina-Video-f24R960

Text-to-Video • Updated about 18 hours ago • 20

replied to nroggendorff's post about 23 hours ago

Best message I have seen, I am literally tearing up.

updated a model about 23 hours ago

YaTharThShaRma999/ToucanTTS

Updated about 23 hours ago

reacted to Duskfallcrew's post with 🔥 3 days ago

Post

2990

Just been starting to port my articles over that mattered most to me from Civitai.
Look, i'm not going to sit here and whine, complain and moan entirely - they know why i've left, they're going to thrive without me.
I'm a mere spec compared to their future, and that's amazing.
But the journey continues, i've posted my Design 101 for Ai - the first one up -- i BELEIVE it's the first one, as it delves back to how Arts and Crafts connect to AI.
I'm still looking for a model hub in future for my insane 800+ models i'd published - considering that that's half of what i've got sitting in my repos on HF.

commented on The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... 3 days ago

Yeah Kokoro uses phonemes instead of direct text, that’s why it’s very good quality at just 82m params and can pronounce words better then other massive tts models(even better then Llasa 8b).

Only problem is emotion, which Llasa 8b is much better at doing.

reacted to hexgrad's post with 🔥 4 days ago

Post

4200

Wanted: Peak Data. I'm collecting audio data to train another TTS model:
+ AVM data: ChatGPT Advanced Voice Mode audio & text from source
+ Professional audio: Permissive (CC0, Apache, MIT, CC-BY)

This audio should *impress* most native speakers, not just barely pass their audio Turing tests. Professional-caliber means S or A-tier, not your average bloke off the street. Traditional TTS may not make the cut. Absolutely no low-fi microphone recordings like Common Voice.

The bar is much higher than last time, so there are no timelines yet and I expect it may take longer to collect such mythical data. Raising the bar means evicting quite a bit of old data, and voice/language availability may decrease. The theme is *quality* over quantity. I would rather have 1 hour of A/S-tier than 100 hours of mid data.

I have nothing to offer but the north star of a future Apache 2.0 TTS model, so prefer data that you *already have* and costs you *nothing extra* to send. Additionally, *all* the new data may be used to construct public, Apache 2.0 voicepacks, and if that arrangement doesn't work for you, no need to send any audio.

Last time I asked for horses; now I'm asking for unicorns. As of writing this post, I've currently got a few English & Chinese unicorns, but there is plenty of room in the stable. Find me over on Discord at rzvzn: https://discord.gg/QuGxSWBfQy

reacted to retronic's post with 😎 4 days ago

Post

1915

Colox is out, may be bugs!

Colox is out and ready on HF, it might have bugs though as it is not tested yet. You can try for yourself now! :)

reacted to retronic's post with 🔥 5 days ago

Post

4257

Colox, a reasoning AI model. I am currently working on a model smarter than GPT o1 that thinks before it speaks. It is coming tomorrow in the afternoon.

7 replies

published a model 5 days ago

YaTharThShaRma999/sd3_5_M_absynth

Updated 5 days ago

reacted to ZhengPeng7's post with 🔥👍 6 days ago

Post

2104

We just released the [BiRefNet_HR]( ZhengPeng7/BiRefNet_HR) for general use on higher resolution images, which was trained with images in 2048x2048. If your images are mostly larger than 1024x1024, use BiRefNet_HR for better results! Thanks to @Freepik for the kind support of H200s for this huge training.

HF Model: ZhengPeng7/BiRefNet_HR.
HF Demo: ZhengPeng7/BiRefNet_demo, where you need to choose General-HR and set high resolution.
PyTorch weights & ONNX: in Google Drive and the GitHub release.

Here is a comparison between the results of the original one and the new HR one on HR inputs:

And, the performance of this new HR one and the previous one trained in 1024x1024 on val set:

liked a model 6 days ago

m-a-p/YuE-s2-1B-general

Text Generation • Updated 12 days ago • 26.9k • 39

liked a model 7 days ago

HKUSTAudio/YuE-s1-7B-anneal-en-icl

Text-to-Audio • Updated 13 days ago • 35 • 13

New activity in stabilityai/stable-diffusion-3.5-medium 7 days ago

Huge memory consumption with SD3.5-medium

#18 opened 3 months ago by

oddball516

reacted to victor's post with ❤️ 7 days ago

Post

3736

Hey everyone, we've given https://hf.co/spaces page a fresh update!

Smart Search: Now just type what you want to do—like "make a viral meme" or "generate music"—and our search gets it.

New Categories: Check out the cool new filter bar with icons to help you pick a category fast.

Redesigned Space Cards: Reworked a bit to really show off the app descriptions, so you know what each Space does at a glance.

Random Prompt: Need ideas? Hit the dice button for a burst of inspiration.

We’d love to hear what you think—drop us some feedback plz!