Project Page: https://jixiaozhong.github.io/Sonic/
ComfyUI: https://github.com/smthemex/ComfyUI_Sonic
Kadir Nar PRO
kadirnar
AI & ML interests
AI Research Engineer ๐ค Building Omni & TTS Models
Recent Activity
Organizations
kadirnar's activity
![](https://cdn-avatars.huggingface.co/v1/production/uploads/619f7ba90df8731e0d8b6c54/L0O4z0klhnyjw6eCjrNln.png)
replied to
their
post
2 days ago
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis
Paper
โข
2501.04561
โข
Published
โข
16
โข
4
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation
Paper
โข
2502.03930
โข
Published
โข
1
Update README.md
#2 opened 5 days ago
by
MateoSP
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/mkX765eOqCw4wFPtXKRDK.png)