Pedro Cuenca's picture

Pedro Cuenca

pcuenq

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face's profile picture Google's profile picture Sentence Transformers's profile picture šŸ§ØDiffusers's profile picture PyTorch Image Models's profile picture Flax Community's profile picture Hugging Face Internal Testing Organization's profile picture DALLE mini's profile picture ControlNet 1.1 Preview's profile picture I Hackathon Somos NLP: PLN en EspaƱol's profile picture SomosNLP's profile picture Huggingface.js's profile picture HuggingFaceM4's profile picture Apple's profile picture (De)fusing's profile picture Open-Source AI Meetup's profile picture Huggingface Projects's profile picture CompVis's profile picture CompVis Community's profile picture Diffusers Pipelines Library for Stable Diffusion's profile picture Core ML Projects's profile picture LocalCodeLLMs's profile picture Code Llama's profile picture UniverseTBD's profile picture Hands-On Generative AI with Transformers and Diffusion Models's profile picture Diffusers Demo at ICCV 2023's profile picture Hugging Face TB Research's profile picture Core ML Files's profile picture huggingPartyParis's profile picture adept-hf-collab's profile picture Enterprise Explorers's profile picture Latent Consistency's profile picture TTS Eval (OLD)'s profile picture ggml.ai's profile picture kotol's profile picture LocalLLaMA's profile picture gg-hf's profile picture Mistral AI EAP's profile picture Llzama's profile picture MLX Community's profile picture Hugging Face Assignments's profile picture IBM Granite's profile picture On-device Squad's profile picture TTS AGI's profile picture Social Post Explorers's profile picture Apple CoreNet Models 's profile picture hsramall's profile picture diffusers-internal-dev's profile picture gg-tt's profile picture Hugging Face Discord Community's profile picture LLHF's profile picture SLLHF's profile picture lbhf's profile picture Hugging Quants's profile picture Meta Llama's profile picture kmhf's profile picture nltpt's profile picture s0409's profile picture Mt Metrics's profile picture nltpt-q's profile picture dummyosan's profile picture Test Org's profile picture metavision's profile picture mv's profile picture Bert ... but new's profile picture qrias's profile picture open/ acc's profile picture wut?'s profile picture DDUF's profile picture None yet's profile picture Hugging Face Agents Course's profile picture TFLite Community's profile picture s0225's profile picture

pcuenq's activity

New activity in agents-course/First_agent_template about 2 hours ago

Update app.py

1
#2 opened about 3 hours ago by
rainwaters11
New activity in agents-course/unit1-certification-app about 4 hours ago

Yay!

1
#6 opened about 5 hours ago by
jesusvilela
upvoted an article about 4 hours ago
view article
Article

Object Detection Leaderboard

ā€¢ 9
reacted to merve's post with šŸ”„šŸš€ about 20 hours ago
view post
Post
2381
Interesting releases in open AI this week, let's recap šŸ¤  merve/feb-7-releases-67a5f7d7f172d8bfe0dd66f4

šŸ¤– Robotics
> Pi0, first open-source foundation vision-language action model was released in Le Robot (Apache 2.0)

šŸ’¬ LLMs
> Groundbreaking: s1 is simpler approach to test-time scaling, the release comes with small s1K dataset of 1k question-reasoning trace pairs (from Gemini-Thinking Exp) they fine-tune Qwen2.5-32B-Instruct to get s1-32B, outperforming o1-preview on math šŸ¤Æ s1-32B and s1K is out!
> Adyen released DABstep, a new benchmark along with it's leaderboard demo for agents doing data analysis
> Krutrim released Krutrim-2 instruct, new 12B model based on NeMo12B trained and aligned on Indic languages, a new multilingual sentence embedding model (based on STSB-XLM-R), and a translation model for Indic languages

šŸ‘€ Multimodal
> PKU released Align-DS-V, a model aligned using their new technique called LLF for all modalities (image-text-audio), along with the dataset Align Anything
> OLA-7B is a new any-to-any model by Tencent that can take text, image, video, audio data with context window of 32k tokens and output text and speech in English and Chinese
> Krutrim released Chitrarth, a new vision language model for Indic languages and English

šŸ–¼ļø Vision
> BiRefNet_HR is a new higher resolution BiRefNet for background removal

šŸ—£ļø Audio
> kyutai released Hibiki, it's a real-time speech-to-speech translation model šŸ¤Æ it's available for French-English translation
> Krutrim released Dhwani, a new STT model for Indic languages
> They also release a new dataset for STT-TTS

šŸ–¼ļø Image Generation
> Lumina released Lumina-Image-2.0, a 2B parameter-flow based DiT for text to image generation
> Tencent released Hunyuan3D-2, a 3D asset generation model based on DiT and Hunyuan3D-Paint
> boreal-hl-v1 is a new boring photorealistic image generation LoRA based on Hunyuan
New activity in agents-course/notebooks about 22 hours ago

Update README.md

#2 opened about 22 hours ago by
pcuenq
New activity in agents-course/notebooks about 22 hours ago

Upload dummy_agent_library.ipynb

#1 opened about 22 hours ago by
pcuenq
New activity in apple/coreml-mobileclip 2 days ago
New activity in huggingface-projects/repo_duplicator 4 days ago

Doesn't work

1
#18 opened 4 days ago by
fullsoftwares
New activity in apple/DepthPro-hf 4 days ago