552 886 3130

Victor Mustar PRO

victor

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a model about 7 hours ago

tomg-group-umd/huginn-0125

liked a Space about 8 hours ago

FunAudioLLM/InspireMusic

liked a model about 17 hours ago

ibm-granite/granite-vision-3.1-2b-preview

View all activity

Organizations

victor's activity

liked a model about 7 hours ago

tomg-group-umd/huginn-0125

Text Generation • Updated 1 day ago • 4.84k • 61

liked a Space about 8 hours ago

InspireMusic

🎶

Music Generation - text to music, music continuation.

liked a model about 17 hours ago

ibm-granite/granite-vision-3.1-2b-preview

Image-Text-to-Text • Updated 3 days ago • 3.12k • 42

liked a Space about 17 hours ago

Granite Vision 3.1 2B

👀

Chat with images and text

New activity in shb777/Granite-Vision-3.1-2B about 17 hours ago

ZeroGPU?

#1 opened 1 day ago by

victor

liked a Space about 17 hours ago

Mixture Of Diffusers SDXL Tiling

🚀

Mixture of Diffusers implementation for XL Stable Diffusion

liked a model about 22 hours ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated about 21 hours ago • 874 • 468

upvoted a paper 1 day ago

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published 8 days ago • 9

liked 2 Spaces 1 day ago

201

Open Deep-Research

🏆

OpenAI's Deep Research, but open

Lumina Image 2.0

🖼

Generate images from text prompts

published a Space 1 day ago

Granite Vision 3.1 2B

👀

Chat with an image-aware assistant

updated a Space 1 day ago

Granite Vision 3.1 2B

👀

Chat with an image-aware assistant

liked 4 Spaces 1 day ago

LightDiffusion-Next

🚀

Generate images from text prompts

SmartFlow DailyPaper

📈

Generate text based on prompts

open deep-research

🏆

OpenAI's Deep Research, but open

Qwen2.5 Coder Artifacts

🐢

Generate code snippets with user prompts

reacted to merve's post with 🚀 1 day ago

Post

2408

Interesting releases in open AI this week, let's recap 🤠 merve/feb-7-releases-67a5f7d7f172d8bfe0dd66f4

🤖 Robotics
> Pi0, first open-source foundation vision-language action model was released in Le Robot (Apache 2.0)

💬 LLMs
> Groundbreaking: s1 is simpler approach to test-time scaling, the release comes with small s1K dataset of 1k question-reasoning trace pairs (from Gemini-Thinking Exp) they fine-tune Qwen2.5-32B-Instruct to get s1-32B, outperforming o1-preview on math 🤯 s1-32B and s1K is out!
> Adyen released DABstep, a new benchmark along with it's leaderboard demo for agents doing data analysis
> Krutrim released Krutrim-2 instruct, new 12B model based on NeMo12B trained and aligned on Indic languages, a new multilingual sentence embedding model (based on STSB-XLM-R), and a translation model for Indic languages

👀 Multimodal
> PKU released Align-DS-V, a model aligned using their new technique called LLF for all modalities (image-text-audio), along with the dataset Align Anything
> OLA-7B is a new any-to-any model by Tencent that can take text, image, video, audio data with context window of 32k tokens and output text and speech in English and Chinese
> Krutrim released Chitrarth, a new vision language model for Indic languages and English

🖼️ Vision
> BiRefNet_HR is a new higher resolution BiRefNet for background removal

🗣️ Audio
> kyutai released Hibiki, it's a real-time speech-to-speech translation model 🤯 it's available for French-English translation
> Krutrim released Dhwani, a new STT model for Indic languages
> They also release a new dataset for STT-TTS

🖼️ Image Generation
> Lumina released Lumina-Image-2.0, a 2B parameter-flow based DiT for text to image generation
> Tencent released Hunyuan3D-2, a 3D asset generation model based on DiT and Hunyuan3D-Paint
> boreal-hl-v1 is a new boring photorealistic image generation LoRA based on Hunyuan

reacted to prithivMLmods's post with 🤗 1 day ago

Post

3573

QwQ Edge Gets a Small Update..! 💬
try now: prithivMLmods/QwQ-Edge

🚀Now, you can use the following commands for different tasks:

🖼️ @image 'prompt...' → Generates an image
🔉@tts1 'prompt...' → Generates speech in a female voice
🔉 @tts2 'prompt...' → Generates speech in a male voice
🅰️@text 'prompt...' → Enables textual conversation (If not specified, text-to-text generation is the default mode)

💬Multimodality Support : prithivMLmods/Qwen2-VL-OCR-2B-Instruct
💬For text generation, the FastThink-0.5B model ensures quick and efficient responses, prithivMLmods/FastThink-0.5B-Tiny
💬Image Generation: sdxl lightning model, SG161222/RealVisXL_V4.0_Lightning

Github: https://github.com/PRITHIVSAKTHIUR/QwQ-Edge

graph TD
    A[User Interface] --> B[Chat Logic]
    B --> C{Command Type}
    C -->|Text| D[FastThink-0.5B]
    C -->|Image| E[Qwen2-VL-OCR-2B]
    C -->|@image| F[Stable Diffusion XL]
    C -->|@tts| G[Edge TTS]
    D --> H[Response]
    E --> H
    F --> H
    G --> H

liked a model 1 day ago

kudzueye/boreal-hl-v1

Text-to-Video • Updated about 23 hours ago • 87

liked a model 3 days ago

mlx-community/DeepSeek-R1-Distill-Qwen-32B-MLX-8Bit

Updated 22 days ago • 232k • 7