Discover spaces with zero GPU usage on 🤗 Hugging Face Spaces.
generate a video from an image with a text prompt
|
r3gm
🐌
2338
High-quality voice cloning TTS for 600+ languages
|
k2-fsa
🌍
677
FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
|
prithivMLmods
🌖
989
generate a video from an image with a text prompt
|
r3gm
🐌
789
|
mrfakename
🖼️
2996
Describe any song — AI writes & produces it
|
victor
🎧
85
Qwen image edit with 🔞 loras
|
Onise
👀
71
|
multimodalart
🎥
2358
One-click model liberation + chat playground
|
pliny-the-prompter
💥
327
text to video, image to video, video extend
|
FrameAI4687
🏆
931
A Multi-Modal World Model for Reconstruction
|
prithivMLmods
🤗
32
High-fidelity 3D Generation from images
|
microsoft
🏢
1445
|
black-forest-labs
💻
866
Face Swap app using Flux.2 Klein 9B LoRA
|
linoyts
🦀
313
|
rahul7star
📚
51
Generate animatable and articulated 3D assets from images
|
VAST-AI
🚀
35
Demo of the Collection of Qwen Image Edit LoRAs
|
prithivMLmods
🎃
1311
|
black-forest-labs
💻
772
Demo of the Collection of Qwen Image Edit LoRAs
|
aet256
⚡
25
ZeroGPU Nano-TTS voice clone demo
|
OpenMOSS-Team
📈
52
|
akhaliq
🏆
30
|
nvidia
🔊
27
OpenAI Privacy Filter ZeroGPU demo
|
openai
🛡️
14
|
not-lain
🌘w🌖
2815
generate a video from an image with a text prompt
|
zerogpu-aoti
🎥💨
2935
Fast 4 step inference with Qwen Image Edit 2509
|
linoyts
🎬
2192
|
NucleusAI
🏃
25
Edit Videos with Wan 2.2
|
alexnasa
🔥
268
Adult AI images,anima & comic,non-realistic
|
alexander00001
🚀
59
Fast high quality video with audio generation with FA3
|
alexnasa
🔥
428
Fast high quality video with audio generation with FA3
|
Imosu
🔥
60
|
nvidia
📄
31
Text-to-3D and Image-to-3D Generation
|
tencent
🌍
3256
Scalable and Versatile 3D Generation from images
|
trellis-community
🏢
590
|
Tongyi-MAI
🏃
1795
Edit any pose with Qwen Edit 2511 Any Pose LoRA
|
linoyts
🕺
307
generate a video from an image with a text prompt
|
kulkas2pintu
🐌
26
expand videos to any aspect ratio with LTX 2.3
|
linoyts
🌠
13
|
fancyfeast
🖼️💬
1014
generate a video from an image with a text prompt
|
dream2589632147
🎥
1356
|
Qwen
🎙️
1890
Music Generation Foundation Model v1.5
|
ACE-Step
🎵
521
Face Swap app using Flux.2 Klein 9B LoRA
|
laruss5
🦀
12
Portrait animation & lipsync with LTX 2.3
|
linoyts
🕺
132
High-resolution Multiview 3D Generation with PBR
|
Stable-X
🖥️
32
|
limuloo1999
🖼
38
|
hkchengrex
🔊
948
Chatterbox TTS supporting 23 languages
|
ResembleAI
🌎
390
Qwen-Image-2509-MultipleAngles
|
tori29umai
👀💨
672
|
nvidia
🎵
174
|
Qwen
🏆
388
|
Qwen
👀
362
Turn any image into a DLSS 5 meme (using FLUX.2-klein-9b-kv)
|
victor
🎮
378
|
hysts
🐱
33
Scaling Promptable 3D Detection in the Wild
|
allenai
🚀
21
|
hf-audio
🤫
830
|
black-forest-labs
🖥️
9437
Upgraded to v1.0!
|
hexgrad
❤️
3296
Try out DeepSeek-OCR-2 on your PDFs or images
|
merterbak
🚀
466
|
Qwen
🚀
505
Demo of a Collection of FLUX.2-Klein Model LoRAs
|
prithivMLmods
🥚
103
Multi-view image to 3D generation
|
opsiclear-admin
🧊
10
Zero GPU Text-to-Speech using Fish Audio S2 Pro
|
artificialguybr
🐟
148
|
24yearsold
🔍
95
|
pcuenq
📦
6
Use OAI's Privacy Filter to redact PII info from any image
|
ysharma
🦀
5
|
InstantX
😻
3594
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
mrfakename
🗣️
2850
User Friendly Image & Video Upscaler!
|
Nick088
🔥📹
144
|
Fabrice-TIERCELIN
💻
249
Voice conversion framework based on VITS
|
r3gm
⚡
276
|
depth-anything
🌖
666
|
yanze
🤗
2079
StyleTTS2 trained on Ukrainian multispeaker dataset
|
patriotyk
🔈
154
Remove/Change background of video.
|
innova-ai
📽️
613
Audio Conditioned LipSync with Latent Diffusion Models
|
fffiloni
👄
591
|
IndexTeam
🏢
789
|
lehehroi
🖼
38
Qwen-Image-2509-CharacterSheet
|
tori29umai
👀💨
240
Demo of the Collection of Qwen Image Edit LoRAs
|
prithivMLmods
⚡
755
|
depth-anything
🏢
412
Text-to-3D and Image-to-3D Generation
|
tencent
💃
273
|
Soul-AILab
🎤
152
Demo of the Collection of Qwen Image Edit LoRAs
|
cruisewagner2220
⚡
5
Demo of the Collection of Qwen Image Edit LoRAs
|
Codergeek911
🎃
4
|
nielsr
🖼️
4
Describe what you want, AI writes the FFMPEG command
|
huggingface-projects
🏞
653
|
skytnt
🎼🎶
578
|
LiheYoung
🌖
560
Generate highly aesthetic images
|
playgroundai
🌍
1136
|
UNESCO
🌐
187
Apply the motion of a video on a portrait
|
innoai
🤪
52
Clarity AI Upscaler Reproduction
|
finegrain
🖼️🪄
2099
|
gokaygokay
😻
1379
AI Clothes Changer Online
|
jallenjia
👚👚👚
333
|
Plachta
🎤🔄
483
Easily expand image boundaries
|
fffiloni
🔅
2521
|
jasperai
🔎
1683
|
fancyfeast
👁
1726
|
multimodalart
👈🖼️👉
771
|
mehdikhejel
⚡
29
Speedy and Accurate Image to 3D Generator
|
frogleo
📦🔥
188
|
multimodalart
💻
632
Image and video tasks with moondream3.
|
merve
🏢
41
DiT360: A High-Fidelity Panoramic Image Generation Framework
|
Insta360-Research
🖼
63
Text-Guided Object Manipulation Using Qwen Image Edit + LoRA
|
prithivMLmods
🔥
181
|
Tongyi-MAI
🏃
158
|
signsur4739379373
🌍
365
Multimodal OCR model for complex document understanding.
|
prithivMLmods
📄
95
|
linoyts
🎥
102
|
black-forest-labs
🎨
170
VOID: Video Object and Interaction Deletion
|
sam-motamed
👀
77
TTS demo for Irodori-TTS-500M-v2
|
Aratako
🐠
22
|
Jackrong
🧠
23
Official chatbot demo for Tencent HY-Embodied-0.5
|
tencent
🤖
3
|
OpenMOSS-Team
🐢
6
dummy opf placeholder
|
ysharma
📚
3
|
scy639
🌍
3
|
artificialguybr
🖼️
3
QR Code AI Art Generator Blend QR codes with AI Art
|
huggingface-projects
📱🔲
1982
|
diffusers
🔥
504
Get a music sample inspired by the mood of an image
|
fffiloni
🎺
569
Easily remove your videos background!
|
amirgame197
🎞️
353
|
ZhengPeng7
👁
308
|
TencentARC
📷✏️
1209
|
fancyfeast
💬
1426
remove background from any image
|
briaai
🐢
923
|
Yuanshi
🌍
942
|
moonshotai
🤔
199
Generate realistic dialogue from a script, using Dia!
|
nari-labs
👯
1774
|
Qwen
🖼️
905
Granite 4.0 1B Speech recognition and translation demo
|
ibm-granite
🎧
27
Generate Hollywood Style Actors on your Local Machine
|
alexnasa
🎥
258
Single Image 3D Face Reconstruction with Gaussian Splatting
|
wlyu
🎭
39
Unified foundation model for promptable segmentation
|
prithivMLmods
🏀
48
Stunning images using stable diffusion.
|
tuan2308
🚀
4
|
mcp-tools
🏃
29
|
jinguotianxin
🏃
20
|
nvidia
🐠
23
LongCat-Video-Avatar
|
cpuai
🌖
25
Convert any static image into a 3D Gaussian Splat scene
|
gagndeep
🔪
99
Fast 4 step inference of Qwen Image Edit 2511
|
linoyts
🏆💨
255
Feed-forward Multi-view Inverse Rendering in seconds)
|
maddog241
📸
7
|
Qwen
🎙️
132
FireRed-Image-Edit-1.0
|
FireRedTeam
🌍
212
Demo of the Collection of Qwen Image Edit LoRAs
|
hitoshifreak
🎃
4
Demo of a Collection of FLUX.2-Klein Model LoRAs
|
Onise
🥚
11
FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
|
Chipsdfgds
🌖
2
streaming demo
|
Soul-AILab
🐨
8
Zero-shot TTS model for Wolof
|
soynade-research
🍿
6
|
Jackrong
🧠
13
|
Overworld
🎮
16
Move & resize objects with Flux.2 [klein] LoRA
|
linoyts
🎁
14
VoiceDesign demo for Irodori-TTS-500M-v2-VoiceDesign
|
Aratako
🎛️
11
Object Multiplex
|
prithivMLmods
⚡
13
|
stevengrove
🚀
13
Gradio demo for ToriiGate-0.5 model
|
Minthy
🏢
7
|
Amirox
🏆
2
|
rishiraj
👀
2
Paste PII, share redacted view using OAI Privacy Filter
|
ysharma
📚
2
|
cella110n
🏷️
3
Instantly inpaint pitches in any MIDI with masked encoder
|
projectlosangeles
🖌️
2
|
akhaliq
🏢
2
|
🧩
2
|
🧊
2
|
rizavelioglu
📚
12
|
Realcat
🤗
180
Complete list of past Daily Papers
|
hysts
📊
287
|
r3gm
🚀
483
|
artificialguybr
👀
167
Magnify subject details and enhance image quality
|
fffiloni
✨
243
|
anton-bushuiev
🔬
6
|
tomg-group-umd
👀
73
High-quality virtual try-on ~ Your cyber fitting room
|
levihsu
🥼👖👗
1122
Fast, efficient, & multilingual text-to-speech
|
mrfakename
🗣️
475
Turns your image into matching sound effects
|
Bils
🎶
21
MegaTTS 3 but with voice cloning!
|
mrfakename
🎤
280
|
ehristoforu
🔥
1792
Generate unique drums track for any MIDI
|
asigalov61
🎼🎶
11
The most opinionated, anime-themed SDXL model
|
Asahina2K
🌍
1397
MagicTime: Time-lapse Video Generation Models as Metamorphic
|
BestWishYsh
🚀
118
|
tonyassi
🗣️
2633
Text to Audio (Sound SFX) Generator
|
declare-lab
🚀
326
Juggernaut X V10, a powerful text2image model.
|
Walmart-the-bag
🏃
93
Vocal and background audio separator
|
r3gm
🏃
376
Create a 3D model from an image in 10 seconds!
|
SIGMitch
📚
13
Instant AI-powered Anime transformations.
|
broyang
👀
78
270+ Impressive LoRAs for Flux.1
|
prithivMLmods
🥳
1229
|
Stable-X
🏵️
83
|
gokaygokay
🏃
259
|
Stable-X
🏵️
91
|
gokaygokay
😻
616
|
gokaygokay
🚀
989
|
sarulab-speech
🌖
13
Ikea could never
|
broyang
🏠
70
|
JacobLinCool
🎤
81
|
mimbres
🎸
102
|
multimodalart
🏆
1558
|
MaziyarPanahi
🔥
88
|
OzzyGT
👀
358
|
fancyfeast
⚡
330
VideoLLaMA2-AV
|
lixin4ever
🚀
17
|
omni-research
💬
35
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
Gregniuki
🗣️
23
Generate image variations
|
black-forest-labs
🖼️
181
SText to Audio(Sound SFX) Generator
|
fantaxy
🐠
328
|
amaai-lab
📉
29
FitDiT is a high-fidelity virtual try-on model.
|
BoyuanJiang
🦀
280
|
LittleFrog
🏢
255
|
depth-anything
👀
224
Use AI to Change Clothing
|
frogleo
👕👔👚
120
Gradio demo for MatAnyone 1 & 2
|
PeiqingYang
🤡
231
A demo of HVI-CIDNet
|
Fediory
📚
29
VGGT (CVPR 2025)
|
🏆
473
|
hayas
⚡
3
INF5
|
ai4bharat
🌍
42
Canary 1B Flash demo
|
nvidia
🐤
44
|
CompVis
🎥
66
This space offers an easy-to-use interface for voice cloning
|
hasanbasbunar
🚀
81
|
VAST-AI
⚡
18
morpheus tts - uncensored
|
MrDragonFox
👀
51
AI Clothes Changer Online
|
Dreamspire
👚
1
|
aizip-dev
🤼
32
ultra-fast video model, LTX 0.9.8 13B distilled
|
Lightricks
🎥
1495
|
Menyu
🖼
244
Minimum working OpenAI Whisper pipeline
|
UNSAFESUPERINTELLIGENCE
🗣️
4
Extraction & Reconstruction for Efficient Speech Separation
|
fffiloni
✂️
117
Expressive Zeroshot TTS
|
ResembleAI
🍿
1729
Kontext image editing on FLUX[dev]
|
black-forest-labs
⚡
1571
Speech Enhancemet using Mamba (SEMamba)
|
rc19477
📉
11
Monocular metric-scale geometry estimation
|
Ruicheng
🚀
70
MOSS-TTSD: Text to Spoken Dialogue Generation
|
OpenMOSS-Team
🔥
49
Explore object detection, visual grounding, keypoint Detecti
|
sergiopaniego
🦀
115
Speech restoration demo of Sidon.
|
sarulab-speech
🐋
29
|
MohamedRashad
🏢
157
Generate podcast and tiktok style video avatars
|
alexnasa
🐨
283
|
amaai-lab
🎧
30
Nova Furry XL | Illustrious v12 to v17
|
IbarakiDouji
🐾
85
|
lapa-llm
💬
46
inapint with Qwen Image Edit for super precise edits
|
linoyts
✒️
108
High-fidelity 3D Geometry Generation from multi-view images
|
Stable-X
🖥️
174
|
🐠
122
State-of-the-art music analysis with multi-scale datasets
|
ASLP-lab
🎵
22
|
neuphonic
☁️
317
|
multimodalart
😼
164
OCR and redact PDF docs or images with VLMs
|
seanpedrickcase
⚡
1
generate a video from an image with a text prompt
|
signsur4739379373
🎥💨
9
Qwen Image Editing Fusion Collection LoRA Demo
|
prithivMLmods
⭐
100
Streaming conversational audio in realtime
|
nari-labs
💨
75
Fast, multi-speaker TTS (44.1kHz) with voice cloning
|
jordand
🐢
116
All the powerful features of the SAM 3 model!
|
hasanbasbunar
🔥
12
|
2vXpSwA7
👉🪵🧶
9
|
azhan77168
💻
2
Fast Editing with Robust Consistency
|
prithivMLmods
🤗
17
Apply the lighting from one image to another image
|
multimodalart
💡
59
TTS demo for T5Gemma-TTS model
|
Aratako
🚀
43
Lora1.py is base and 4 bit
|
rahul7star
📊
6
|
FunAudioLLM
🥳
50
|
Insta360-Research
🌍
15
AI Clothes Changer Online
|
Gopalag
👚
1
Play Tic Tac Toe against a small RL tuned model
|
anakin87
⭕
3
Demo for VieNeu-TTS-0.3B
|
pnnbao-ump
🦜
14
Camera Control Dolly [Distilled]
|
prithivMLmods
📹
59
Generate animated 3D meshes from video
|
🎬
99
|
multimodalart
💻
97
Image-to-3D Generation
|
joaojack
👻
1
|
ahuggingface01
⚡
1
|
bfshi
👀
20
|
lightonai
🐨
112
|
2vXpSwA7
🧪
1
|
hf-applications
🌘w🌖
2
Chatterbox Saudi Arabic TTS Demo
|
omarelshehy
🌎
26
Multi-task image generator with dynamic, chainable workflows
|
RioShiina
🖼
7
|
multimodalart
🏆
51
LongCat-Video-Avatar
|
vvmarchuk
🌖
2
|
multimodalart
👀
90
Background removal with Fibo-Edit-RMBG
|
briaai
🎨
26
TTS demo for MioTTS-0.1B
|
Aratako
📈
33
Generate speech with voice cloning, now in four languages!
|
neuphonic
🌍
39
A gradio platform for demonstrating MOSS-VoiceGenerator
|
OpenMOSS-Team
🏆
7
VideoFlexTok: flexible-length coarse-to-fine video tokenizer
|
EPFL-VILAB
🎞️
2
dummy
|
ysharma
📊
18
TTS demo for Irodori-TTS-500M
|
Aratako
🐠
13
Official Demo Page of OmniLottie.
|
OmniLottie
🌍
83
|
huggingface-projects
🚀
19
Turn any image into a DLSS 5 like image (Lightricks variant)
|
silveroxides
🎮
16
State-of-the-art OCR with 90+ language support
|
victor
📄
11
|
RayZhao
🤖
3
|
ysharma
🌘w🌖
1
|
sarulab-speech
🔥
8
generate a video from an image with a text prompt
|
r3gm
🐌
6
|
dericky286
🖼️
24
Neural activity prediction from multimodal input
|
beta3
🔥
8
4x SDXL Anime Models - Text to Image and Image to Image
|
harumaa
🎨
8
FurryToonMix | Illustrious v1 to v2
|
IbarakiDouji
🐾
22
|
UII-AI
🏥
4
Image manipulation with Kontext adapters.[demo]
|
Velvessence
👽
1
generate a video from an image with a text prompt
|
r3gm
🐌
7
gemma-4-31B-it + gemma-4-31B-it-Claude-Opus-Distill
|
FINAL-Bench
👀
15
HeartMuLa music generation demo for Hugging Face ZeroGPU
|
HeartMuLa
🐨
4
0.1B multilingual TTS with voice cloning
|
victor
🎙️
6
MOSS-VL: Toward Advanced Video Understanding
|
OpenMOSS-Team
🌱
1
Fast high quality video with audio generation
|
alexnasa
🔥
1
|
hysts-samples
🐢
1
|
baohuynhbk14
⚡
2
Generate and texture unique chords progressions
|
projectlosangeles
🎵
3
generate a video from an image with a text prompt
|
parcelbazaar19
🐌
2
|
athrael-soju
🐉
1
|
uva-cv-lab
🎥
1
|
ZoyncTech
🌍
1
35B MoE that reasons like Claude Opus 4.7
|
lordx64
🧠
1
Compare image style similarity with MegaStyle-Encoder
|
olfronar
🎨
1
Agent example
|
sunfixer
💬
1
Generate unique similar compositions to any MIDI
|
projectlosangeles
📚
1
generate a video from an image with a text prompt
|
Nihlus1984
🐌
1
|
uva-cv-lab
🌐
1
|
clem
💬
1
generate a video from an image with a text prompt
|
BIGJUTT
🐌
1
OpenAI Privacy Filter ZeroGPU demo
|
merve
🛡️
1
PII em PT-BR — openai/privacy-filter fine-tune
|
arthrod
🛡️
1
|
🧬
1
|
artificialguybr
🧠
1
|
artificialguybr
🧠
1
TEQUMSA Sovereign Multimodal Orchestrator - Constitutional A
|
Mbanksbey
💻
1
Image generation with waiNSFWIllustrious_v12-15 + gallery
|
Nech-C
😻
383
Omni-Blink LoRA Demo
|
obsxrver
💦
61
HTR (Handwritten Text Recognition) demo application
|
Riksarkivet
🏢
55
Generate stunning high quality illusion artwork
|
AP123
👁
5398
|
John6666
📦🏃
276
|
black-forest-labs
🏎️💨
5060
|
hysts
⚡
225
|
VAST-AI
🔮
875
|
Oysiyl
🌍
87
Fast 8 step inference of Qwen Image Edit
|
multimodalart
✒️💨
489
|
anaszil
🌍
5
|
anycoderapps
🐢
215
|
anycoderapps
🐨
185
|
linoyts
💡
73
|
huggingface-projects
🚀
27
Smart Compressors for Long Video Understanding
|
Vision-CAIR-Admin
🏃
5
|
silveroxides
🐢
5
|
iAkashPaul
📈
2
WAI NSFW illustrious SDXL | v11 to v16
|
IbarakiDouji
🪷
136
|
IllyaS08
🌍
29
Based 'Z-IMAGE TURBO'
|
properfool
📈
4
WAI NSFW illustrious SDXL | v11 to v16
|
EXT-the-great
🪷
2
Same prompt + varied ratios = stunning unique images!
|
alexander00001
🖼
52
|
Fighterdan
🌍
28
|
IllyaS08
🌍
123
|
laruss5
🌍
14
|
savvyaiagency
🦀
2
|
harumaa
✏️
9
|
akhaliq
⚡
1345
|
akhaliq
🚀
767
|
hysts
📊
15
|
hysts
😻
15
emotion recognition
|
hysts
🔥
25
head pose estimation
|
hysts
📚
10
|
hysts
📈
14
|
taesiri
💯
26
Text-to-Video
|
zai-org
🎥
461
|
Norod78
🧝♀️
4
|
Norod78
🟡
22
|
hysts
🏢
9
|
aipicasso
🆒
111
|
pszemraj
🔥
50
|
taesiri
🦀
40
|
AttendAndExcite
💻
95
|
hysts
🐠
15
|
aipicasso
😊
105
|
huggingface-projects
➕
182
image captioning, VQA
|
hysts
🌖
161
|
multimodalart
🦀
174
|
jslin09
🌍
3
|
ZhengPeng7
📚
4
|
tomaarsen
📚
14
|
ericup
⚡
3
text-to-3D & image-to-3D
|
hysts
🧢
557
Semantic search through 110M academic publications
|
colonelwatch
📝
22
|
alibidaran
🏢
1
text-to-video
|
hysts
🌖
173
|
huggingface-projects
🏆
482
|
huggingface-projects
🦙
490
A chat demonstration of M-LChat!
|
Artples
🚀
9
|
hysts
🌍
385
|
fffiloni
👁
601
|
Linaqruf
🌍
484
|
versae
🤫
1
|
sky24h
🤖
38
|
MohamedRashad
📊
28
|
artificialguybr
⚡
125
|
sanchit-gandhi
🔥
293
Project Los Angeles signature music transformer
|
asigalov61
🎼🎶
20
|
hysts
📉
15
|
hysts
🔥
13
|
1aurent
🔟
0
|
hysts
🐨
102
|
artificialguybr
🔥
94
|
hysts
⚡
15
|
hysts
⚡
21
VQA
|
hysts
⚡
31
LLM, chatbot
|
hysts
📉
28
LiDAR generative model
|
kazuto1011
🚗
0
|
hayas
⚡
12
|
Zitang
🦀
13
|
multimodalart
📺
2010
|
ccareaga
💻
26
Get a LLM Assistant personality idea from an image
|
fffiloni
🤖
82
|
julien-c
🐢
19
|
playgroundai
🌍
398
|
fffiloni
🐠
65
Train LoRAs with Ease
|
multimodalart
🧞
382
|
prs-eth
🏵️
490
|
hayas
🐢
3
|
multimodalart
🧑🏿🧑🏽🦱
1003
|
cbensimon
🌍
1
Draw/upload image and search among WikiART using SigLIP
|
merve
🐠
73
A demo of OpenDalle V1.1 on a ZERO GPU.
|
mrfakename
🖼️
416
|
CharlieAmalet
🏃
0
OpenAI-ийн анх гаргасан GPT-2 модел-ийн Монгол хувилбар
|
Dorjzodovsuren
🤖
0
Comparing powerful multilingual zero-shot image clf models
|
merve
🏢
13
|
HuggingFaceM4
⚡
921
|
TencentARC
📷
1940
|
vikhyatk
🌔
421
Analyze context usage in LM generations with model internals
|
gsarti
🐑 🐑
18
|
JohanDL
👁
75
|
ybelkada
🌖
11
|
ddosxd
🌍
36
State-of-the-art open-vocabulary image segmentation ⚡️
|
merve
😻
110
|
tsujuifu
👩🎨
332
Formal reasoning model that can reason and prove theorems
|
Tonic
🌕💉👨🏻🔬
40
Realtime Image/Video Gen AI Arena
|
TIGER-Lab
📈
297
|
multimodalart
👁
1681
|
tonyassi
💃🖋️
72
|
fffiloni
👚
10
|
mlabonne
👑
27
|
MohamedRashad
✍
29
Edit audios with text prompts
|
hilamanor
🎧
327
Real-Time Image Generation with SDXL Lightning
|
radames
⚡️⚡️⚡️⚡️
619
|
xianbao
📉
10
|
SakanaAI
🐠
63
Ultra-fast Whisper Turbo inference ⚡
|
mrfakename
⚡
52
Demo for OpenF5-TTS
|
mrfakename
🔥
28
Demo for DMOSpeech 2
|
mrfakename
💻
18
Robust, duration-controllable voice-cloning TTS
|
mrfakename
💫
6
Dia - 1.6B Text-to-Dialogue Model
|
mrfakename
🚀
59
|
KBlueLeaf
🌍
101
|
atalaydenknalbant
🫶
13
|
HuggingFaceM4
🐨
169
Text-to-Image
|
artificialguybr
🐢
111
Image to Video Synthesis
|
TIGER-Lab
🎥
35
|
merve
🔥
151
|
artificialguybr
😻
25
|
artificialguybr
⚡
31
|
reddit-tools-HF
🦀
7
|
hayas
⚡
15
|
FoivosPar
🔥
170
Long-form Musicgen
|
ylacombe
🎷
22
High-fidelity Virtual Try-on
|
yisol
👕👔👚
2093
Co-Speech 3D Gesture Generation (CVPR 2024)
|
H-Liu1997
🐑
71
|
Naozumi0512
🐬
10
|
hon9kon9ize
🥸
12
Video Editing
|
TIGER-Lab
🎥
74
Generate synthetic dataset files (JSON Lines)
|
lhoestq
🎰
66
|
artificialguybr
😻
16
|
artificialguybr
📊
64
|
artificialguybr
🔥
23
|
prs-eth
🏵️
25
|
speakleash
🦅
50
Large and fast music transformer for pitches inpainting
|
asigalov61
🖌️🎶
17
Solo Piano Audio to MIDI Transcription
|
asigalov61
🦀
64
|
fahdmirzac
🐠
1
|
MykolaL
🏆
117
|
mfidabel
🐨
1
|
Boboiazumi
🏆
5
A binary Search with Scalar Rescoring through legal codes
|
louisbrulenaudet
📖
8
Virtually try on clothing with stable diffusion
|
tonyassi
🧍🏼♀️👗
147
Style-Preserving Text-to-Image Generation
|
InstantX
👁
452
High-fidelity Text-To-Speech
|
parler-tts
🥖
847
Stable Diffusion Finetuned Version
|
Nick088
🚀🎨👨💼
183
|
lucianosb
🐬🩷
1
|
multimodalart
💻
286
|
contextcite
📚
27
|
Nekochu
😸🖌️
6
|
PixArt-alpha
👁
243
|
karths
🏆
0
Multimodal Language Model
|
TIGER-Lab
👁
26
|
artificialguybr
⚡
15
|
artificialguybr
👁
75
|
artificialguybr
🐨
164
|
lang-uk
🌍
2
|
thepatch
🏢
11
|
tokenid
🏆
9
Latest text-generation model by META - Meta Llama3 8b.
|
ysharma
🏃
400
|
KBlueLeaf
🐢
32
Future-oriented Anime model
|
aipicasso
😁
38
|
XAI
🤖
7
Translate between 418 languages.
|
darylalim
🌍
14
4k Image from text in 5 second
|
KingNish
🔥
465
|
thepatch
📈
6
High-fidelity Text-To-Speech
|
sanchit-gandhi
📝
31
|
yuntian-deng
🚀
6
|
Norod78
🍎
60
Meta Llama3 8b with Llava Multimodal capabilities
|
MaziyarPanahi
🔥
90
A Llama3 8B model finetuned by Wang and Zheng
|
llamafactory
🦙
31
|
JackAILab
🔥
58
|
JackAILab
🏆
1
|
YupengZhou
👁
610
|
mii-llm
💻
2
|
yanze
⚡
536
|
alfredplpl
🖼
14
|
LittleFrog
🐸
20
|
lllyasviel
📈
1366
Creative Upscaler High-Res Image Generation HiDiffusion SDXL
|
radames
🔍🕵️
410
AI Assistant
|
Dorjzodovsuren
🔥
3
|
shi-labs
🐐
82
|
parler-tts
⚡
92
Text-to-Image
|
artificialguybr
🚀
92
Enhance photo of a document with selected approaches!
|
qubvel-hf
📚
62
Fixed fork of the original audio sr!
|
Nick088
🔊⏫
82
Lightweight open vision-language model
|
taufiqdp
🚀
2
Medical Chatbot
|
ruslanmv
🔍🕵️
14
|
Qinghew
🖼
50
English <=> Japanese Translation trained on CC corpus
|
Mitsua
📚
5
High quality image generation in 3 second
|
KingNish
⚡
318
Microsoft Phi-3 Vision 128k with Multimodal capabilities
|
MaziyarPanahi
🔥
47
Chat with Cognitive Computation models 🐬
|
QuixiAI
🐬
156
|
ysharma
😻
219
|
huggingchat
📈
32
|
wyysf
🚀
190
Feature Matching with Foundation Model Guidance
|
qubvel-hf
🦀
15
|
llamafactory
💬
8
Steering AI Text Generation
|
janraasch
❤️
4
|
vilarin
👀
25
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
|
prithivMLmods
🍍
405
|
huggingchat
📈
7
|
Doubiiu
😻
1051
|
clement-pages
🐰
21
|
jameslahm
🚀
10
Flux.1-lumiere
|
vilarin
🐠
47
|
cbhhhcb
🐢
2
DeepCaption / SkyCaptioner / SpaceThinker / Core / SpaceOm
|
prithivMLmods
🔍
109
|
andrewkatumba
🦖🦉
7
|
gvecchio
🧱
22
|
llamafactory
💬
18
|
ameerazam08
💻
163
|
artificialguybr
🔥
465
Generate anime/illustration images with pony diffusion v6.
|
Sergidev
🌇
42
|
pOpsPaper
💻
36
|
rphrp1985
💬
3
|
czl
🖼
0
|
AIGC-Audio
🐠
14
|
atlury
🔥
7
Stable Diffusion 3 Medium with SuperPrompt-v1 Enhancement!
|
Nick088
📷🖼️
43
|
alfredplpl
👁
29
|
AIML-TUDA
👁
6
|
davanstrien
🐦⬛
73
Experimental workflow for story driven games and porting
|
KwabsHug
🐢
16
|
Yiwen-ntu
🌖
167
Create a 3D model from an image in 10 seconds!
|
02alexander
📚
4
|
yuntian-deng
📈
31
|
qubvel-hf
🌖
13
Efficient Image/Video K-Sort Arena
|
ksort
📈
48
|
tori29umai
🚀
307
|
gokaygokay
📉
840
4M: Massively Multimodal Masked Modeling
|
EPFL-VILAB
⚡
203
|
alakxender
🐢
1
Dhivehi ASR, Convert spoken Thaana to written text
|
alakxender
🏆
3
|
BenBranyon
💬
0
|
rifatramadhani
🐢
1
|
Demondad
👁
0
Play with & compare Stable Diffusion Models
|
Nick088
🏆🖼️
20
|
Plachta
🎙️💾🔄🗣️
24
Fast multi-prompt generation with Stable Diffusion 3
|
ironjr
🧠🎨3️
16
|
kevinwang676
🐨
1
|
panyanyany
🐢
0
Document Retrieval
|
manu
🏃
127
Demo for MiniCPM-o 2.6 to answer questions about images
|
sitammeur
🐢
56
Convert audio to subtitles
|
reedmayhew
💻
11
Chatbot
|
huggingface-projects
😻
101
image based search engine for pokemon
|
not-lain
🌘w🌖
13
|
Felix92
🔥
22
|
SkalskiP
🎬
72
|
rifatramadhani
📚
0
Video captioning/tracking
|
merve
🌖
98
MP-SENet is a speech enhancement model.
|
JacobLinCool
🔊
19
|
arad1367
💻
11
|
gokaygokay
🐨
132
|
jasperai
⚡
156
|
artificialguybr
⚡
367
|
Flux9665
🦜
5
|
votepurchase
🖼
3
|
votepurchase
🖼
12
|
votepurchase
🖼
11
|
votepurchase
🖼
9
|
votepurchase
🖼
7
|
votepurchase
🖼
20
Demo of a collection of Qwen3-VL models
|
prithivMLmods
🔥
205
|
votepurchase
🖼
1
|
votepurchase
🖼
11
|
votepurchase
🖼
5
|
gokaygokay
💻
406
|
gokaygokay
⚡
184
|
MohamedRashad
🎨
171
coreOCR / Camel-Doc-OCR / docscopeOCR / MonkeyOCR
|
prithivMLmods
🥪
222
|
SakanaAI
🐠
46
|
SakanaAI
🐠
18
|
KBlueLeaf
👀
112
|
dawood
🌖
1
|
gokaygokay
⚡
244
|
OzzyGT
🏃
60
Working on ZeroGPU. Fixed Up, Really Kicks The Llama's #$%
|
kgout
🔥
12
|
rifatramadhani
🏢
0
|
gokaygokay
😻
154
|
nsfwalex
📊
66
|
multimodalart
🏢
9
Visualize EmbeddingGemma 300M embeddings in interactive 3D
|
Sergidev
🌌
0
Chat with Mistral
|
vilarin
🌖
116
|
naver
📉
55
|
gokaygokay
🏢
200
|
sky24h
⚡
4
|
rinna
🐼
9
|
gokaygokay
🦀
82
Audio-Driven Portrait Animations
|
fffiloni
🐨
158
|
cdnuts
🚀
12
Latest text-generation model by META - Meta Llama3.1 8b
|
ysharma
🏆
27
|
gokaygokay
⚡
80
|
jiuface
🏆
6
High-fidelity Virtual Try-on
|
jjlealse
👕👔👚
5
Apply the motion of a video on a portrait
|
SahaniJi
🤪
0
Video-to-Audio Generation with Hidden Alignment
|
fffiloni
🎧
25
Chatbot
|
tangzhy
😻
5
Text-to-Video
|
maxin-cn
🤗
13
Multimodal Image-to-Video
|
maxin-cn
🎥
204
Chatbot
|
huggingface-projects
😻
87
|
fffiloni
🏆
56
Aesthetically Controllable Text-Driven Stylization w/o Train
|
fffiloni
🎨
185
|
sky24h
💄
21
|
bunarivenna
🔥
17
|
SkalskiP
🔥
142
|
fffiloni
🔥
75
|
Norod78
📊
1
High-fidelity Text-To-Speech
|
freddyaboulton
🌈
1
|
tori29umai
😻
15
2-3 second videos from text.
|
ZENLLC
⚡
11
|
Nick088
⚡
68
Give your space a voice! (Demo)
|
Akjava
🐠
2
|
multimodalart
🌀
544
|
Yiwen-ntu
🚀
68
|
wondervictor
🚀
74
|
Freak-ppa
🏃
9
|
1aurent
👁
20
FLUX Dev - Controlnet Canny
|
DamarJati
🧋
232
|
Yehor
🎙️
2
Huggingface Gradio Demo (Llava Next Image Chatbot)
|
dkondic
🖼️
0
|
wjbmattingly
🚀
4
|
Ravenok
⚡
1
High level library for batched embeddings generation
|
louisbrulenaudet
🦝
2
|
reedmayhew
🏆
25
Clarity AI Upscaler Reproduction
|
jiuface
🖼️🪄
14
|
lj1995
🤗
232
flux.1-dev / flux.1-krea-dev
|
prithivMLmods
🥖
224
|
xinglilu
👅
2
|
orionweller
⚡
0
FLUX.1 RealismLora
|
DamarJati
🎀
1325
|
archit11
📊
8
Chat with Qwen2
|
ehristoforu
🌖
1
|
nyanko7
📚
33
|
rayli
👁
39
|
Freak-ppa
🌖
1
Model Internals to generate RAG citations
|
gsarti
🌴
7
Demo for MiniCPM-o 2.6 to answer questions about videos
|
sitammeur
🦀
9
|
jiuface
😷
2
|
fantaxy
🐠
17
Easily remove your videos background!
|
fantaxy
🎞️
96
|
brandonsmart
⛰️
75
|
MaziyarPanahi
🔥
222
|
sindhuhegde
🏆
4
High-performance Image-to-3D generation model
|
yanranxiaoxi
⚡
1
|
anas-gouda
🔥
0
|
mlabonne
👥
7
Z.I.T. w/LoRAs by actsoonr & many others
|
AlekseyCalvin
🔜
7
|
fantaxy
🔆
203
|
modelscope
⚡
14
|
John6666
🏆😻
153
Text-to-Video
|
zai-org
🎥
1035
Human parsing model by Meta Reality Labs
|
fashn-ai
👅
67
|
Shivam098
📈
17
Clarity AI Upscaler Reproduction
|
ZENLLC
🖼️🪄
9
|
Awell00
⚡
12
|
artificialguybr
🐨
12
|
huzey
✂️
9
|
ByteDance
🖼
760
|
Svngoku
🏆
7
Create HD cutouts from any image with just a prompt
|
finegrain
✂️
516
Chat with an Italian Small Model
|
anakin87
💬🇮🇹
4
|
fffiloni
🐨
66
|
Freak-ppa
🔥
0
|
xianbao
📈
7
|
wjbmattingly
📈
28
|
GanymedeNil
🔥
266
|
jiuface
🖼
7
|
alvarobartt
🖼
23
|
AUEB-NLP
👀
4
Japanese Image Tagging trained on licensed data
|
Mitsua
🔖
6
Clarity AI Upscaler Reproduction
|
victorestrada
🖼️🪄
5
|
maxiw
📉
110
|
yuanwenyue
🏃
24
|
randomtable
🖼
5
|
fantaxy
🏆
38
Realvisxl V5
|
seawolf2357
⚡
142
FLUX.1-Dev Text to Image with LoRA
|
ovi054
💻
67
|
gokaygokay
👀
77
|
veichta
🚀
0
|
kaerez
📈
3
|
Potre1qw
📈
9
|
Potre1qw
🏃
3
|
addsw11
🌖
3
|
jiuface
🖼
11
|
bdsqlsz
🖼
2
|
maxiw
📊
46
|
rolpotamias
🚀
15
|
OzzyGT
🏃
324
|
jotase
😷
13
|
fantaxy
🤗
94
Flux Animations(GIF) Generaion
|
fantaxy
📊
58
|
xtreme86
💬
0
|
OpenSound
🟣
275
|
KBlueLeaf
📉
83
|
multimodalart
🖼
147
Ultra-high resolution image synthesis
|
roubaofeipi
😻
239
|
Abhaykoul
🌍
20
|
panelforge
🖼
8
|
GonzaloMG
⚡
97
|
GonzaloMG
⚡
31
|
GonzaloMG
⚡
12
|
OpenSound
🟣
52
|
multimodalart
🔅
63
Chatbot
|
huggingface-projects
😻
127
|
davanstrien
🔍➡️📝
73
|
BK-Lee
👻
7
|
alfredplpl
📖
7
|
huggingface-projects
🚀
390
|
davehusk
💬
3
|
Abhaykoul
💻
32
|
devngho
🇰🇷📚
0
|
rphrp1985
🚀
2
Detect, segment, classify objects in images and videos
|
atalaydenknalbant
👁
119
|
mikeee
🚀
0
|
hf-audio
🤯
1015
Realtime implementation of Whisper large turbo
|
KingNish
🤯
346
Chatbot
|
huggingface-projects
😻
22
A gradio demo for Posterior-Mean Rectified Flow (PMRF)
|
ohayonguy
🖼️
313
Translator
|
barbaroo
📊
2
|
A19grey
🚀
51
|
multimodalart
🧪
517
|
Phips
🔥
288
|
ahmed-masry
🏆
4
|
Vision-CAIR-Admin
🌖
88
Co-Speech Gesture Video Generation (ICLR 2025 Oral)
|
H-Liu1997
🐠
348
|
Pyramid-Flow
⚱️
669
|
ameerazam08
🏆
258
milchchan.com
|
milchchan
🌟
0
Chat demo for EMBER
|
nicpopovic
😻
1
|
pablovela5620
📈
25
Smart Dog Breed Detection, Comparison, and Matching Tool
|
DawnC
🐾🐾
101
|
MeissonFlow
🚀
51
Robotics Language-Gesture Video Generation
|
HikariDawn
👁
12
Moroccan Arabic Chatbot
|
MBZUAI-Paris
⛰️
11
Live Interactive demo for EMOVA with Qwen-2.5 backbone
|
Emova-ollm
🔥
7
Image generator/identifier/reposer
|
Shitao
🖼
702
use the ESM3 model to predict protein structures
|
MISATO-dataset
🧬🪬
7
|
ChemFM
🏆
3
|
taufiqdp
🖼
10
A unified multimodal understanding and generation model.
|
deepseek-ai
🌍
161
100+ Impressive LoRA's For Flux.1
|
prithivMLmods
🔥
585
|
arad1367
🔥🔥🔥
11
|
🎵
161
|
EvanTHU
🏃
24
|
vivjay30
😃
10
8B parameter transformer model distilled from the FLUX.1-dev
|
TheAwakenOne
🎨
50
Fotorestauration, DE Beschreibung, Lokal Tutorial folgt.
|
Sebastiankay
🖼️
15
Pixel restauration und upskaling
|
Sebastiankay
🐑
16
|
AlekseyCalvin
🔜
15
MoGe live demo
|
Ruicheng
🏆
74
|
Akjava
⚡
30
|
surokpro2
🚀
8
Fast multi-instrumental music transformer
|
asigalov61
🦖
103
Framer: Interactive Frame Interpolation
|
wwen1997
🏃
371
Chinese (Taiwan) Automatic Speech Recognition.
|
JacobLinCool
🐠
0
3D Generation from text prompts
|
gokaygokay
🏢
101
|
rombodawg
📚
9
IDM VTON BASE
|
nami0342
👕👔👚
1
Prompt with Images in flux[dev]
|
InstantX
🖼
170
Convert documents to Markdown or JSON with metadata
|
yasserrmd
🐢
12
Intelligent system for Multimodal Affective States Analysis
|
DmitryRyumin
😀😲😐😥🥴😱😡
3
Document Image Enhancement for Degraded Archival Documents
|
renyi-ai
✨
0
Spanish finetune for the original F5 model.
|
jpgallegoar
🗣️
669
Blind Image Restoration with Instant Generative Reference
|
fffiloni
🦀
369
Expressive Portrait Animation w/ Hierarchical Motion Attent°
|
fffiloni
🤪
221
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
|
redradios
🗣️
26
High-quality virtual try-on ~ Your cyber fitting room
|
thincamel
🥼👖👗
0
|
blanchon
🎨
12
StableNormal Turbo Beta
|
Stable-X
🚀
49
|
la44578
🎤
0
|
panelforge
🖼
12
Using colpali and milvus for multimodal search
|
saumitras
😻
8
|
gizemsarsinlar
🔥
6
Huggingface space for JanusFlow-1.3B
|
deepseek-ai
🏃
219
NailongKiller
|
Hakureirm
🏢
1
|
Menyu
🖼
97
|
shuttleai
🖼
31
8B instruct model from OpenCoder family.
|
OpenCoder-LLM
💬
39
|
patriotyk
💻
52
Separate sounds from audio mixtures using text prompts
|
OpenSound
🎯
22
Create 3D mesh by chatting.
|
Zhengyi
👀
148
|
mosiofather
🍏
1
|
yucornetto
🏆
2
Prompt with Images in flux[dev]
|
InstantX
🖼
74
|
dreroc
🚀
2
Create cinematic images in widescreen aspect ratios.
|
takarajordan
🎥
241
|
oddadmix
💬
12
|
fffiloni
🖼️
200
|
Gopalag
👚
10
|
prs-eth
🏵️
31
|
TanelAlumae
🤯
1
|
black-forest-labs
🖌️
317
Depth Control for FLUX
|
black-forest-labs
🩻
92
|
fancyfeast
🚀
73
Identity-Preserving Text-to-Video Generation
|
BestWishYsh
🔥
75
A Training-free Unified Model for Few-shot VAD
|
FantasticGNU
📈
14
Bienvenue sur **Wolof-ASR**, une application de reconnaissan
|
dofbi
📚
0
|
cbensimon
📊
1
Extract garment images from everyday images!
|
rizavelioglu
🔥
63
Authoring Animation-Ready 3D Characters with One Click
|
jasongzy
💃
97
|
Potre1qw
📉
0
|
addsw11
📉
0
|
Potre1qw
🖼️
2
|
Menyu
🖼
22
Nvidia Sana
|
gen6scp
🖼
38
Video Depth without Video Models
|
prs-eth
🛹🛹🛹
57
Generate embeddings of images
|
rrg92
🖼️
0
PHOTOREALISTIC HUMAN RECONSTRUCTION w/ CROSS-SCALE DIFF
|
fffiloni
🏃
190
Scalable and Versatile 3D Generation from images
|
microsoft
🏢
4776
|
longlian
⚡
3
Optical illusions and style transfer with FLUX
|
multimodalart
🚀
893
|
rphrp1985
🤫
0
Generate anime-style multi-view images from texts
|
huanngzh
👁
106
Generate embeddings of images
|
rrg92
🖼️
1
A demo of Indic Parler-TTS
|
ai4bharat
👀
191
Belarusian TTS
|
archivartaunik
👁
16
|
StanfordAIMI
🌖
0
Use title and abstract to predict future academic impact
|
ssocean
💻
36
Create thematic summaries for open text data with LLMs
|
seanpedrickcase
📚
0
|
TencentARC
📸
40
|
rhfeiyang
🚀
9
Latest future oriented generative model
|
aipicasso
😆
14
Diffusion Model Compression
|
zhangyang-0123
🖼
12
|
redradios
🍏
23
Paligemma2 Detection with Supervision
|
onuralpszr
😻
17
Memory-Guided Diffusion for Expressive Talking Video Gen
|
fffiloni
👁
50
PaliGemma2 LoRA finetuned on VQAv2
|
tjw
🐨
1
|
evalstate
🏎️💨
44
Generate soundfonts with latent flow matching
|
erl-j
🦦
17
|
franciszzj
👗🤗🧜
613
Gradio demo for FlowEdit: Inversion-Free Text-Based Editing.
|
fallenshock
📚
79
Generate a video based on a text prompt using Mochi
|
ruslanmv
🐨
25
Detect budgerigar gender based on cere color
|
atalaydenknalbant
🦜
16
|
sysf
🚀
3
Fast Inversion of Rectified Flow for Image Semantic Editing.
|
MagicBag
🔥
88
Image Super-resolution via Diffusion Inversion
|
OAOA
🌍
459
Easily expand image boundaries
|
jallenjia
🔅
29
Chat with Pixtral 12B using Mistral Inference
|
rphrp1985
👀
2
MT between 15 Formosan Languages and Chinese
|
FormosanBank
💬
3
|
jallenjia
👈🖼️👉
54
Text-to-Image Diffusion Model trained on licensed/pd data
|
Mitsua
🚀
23
|
sergiopaniego
🔥
0
Attention Tracker for Prompt Injection Detection
|
pinyuchen
🏢
6
|
dofbi
🐠
4
|
lsxi77777
📈
24
Demo of the Transformers implementation of ColPali
|
yonigozlan
📚
4
|
prs-eth
🏵️
24
Generate high-fidelity 3D models from a single image.
|
broyang
🧊
26
Animation Sketches sequence Colorization
|
fffiloni
🧑🎨
75
|
victorestrada
🌘w🌖
0
|
NightRaven109
🏢
14
Chat with an Italian Small Model
|
anakin87
💎🤏🇮🇹
3
Lightning fast 5-sec inpainting and outpainting, uncensored.
|
LPX55
⚡
19
|
HuiZhang0812
🌖
8
|
MegaTronX
😈
5
|
panelforge
🖼
6
|
panelforge
🖼
6
|
FrancescoLR
🧠
5
|
Profakerr
🔥
37
|
JackAILab
🖼
10
|
hayas
⚡
1
A simple app for doing HTR with various models.
|
wjbmattingly
🔥
55
Gaze Target Estimation
|
fffiloni
👀
30
3D generation from sketchs with TRELLIS & sdxl
|
linoyts
🖌️🏢
205
Lumian-VLR / VisionThink / MiniCPM-V / Typhoon-OCR / olmOCR
|
prithivMLmods
🥠
57
Write musical scores with LLaMA
|
dx2102
🦀
10
Synthpose Markerless MoCap VitPose
|
stanfordmimi
📊
9
|
davidserra9
🚀
11
Chat with IBM Granite 3.1 8b Instruct
|
ibm-granite
📝
20
Dense Grounded Understanding of Images and Videos
|
fffiloni
🐨
53
Unified Framework for Generalized Video Face Restoration
|
fffiloni
✨
187
|
baohuynhbk14
🥶❄️🥶
11
|
khang119966
🥶❄️🥶
6
|
khang119966
🥶❄️🥶
22
Creator Friendly Text-to-Video
|
aidealab
🎞
15
|
deepseek-ai
🌍
602
neutral sd gradio dev space
|
willsh1997
👁
0
9B Italian strong model 💪
|
anakin87
💎💪🇮🇹
0
|
fancyfeast
💬
56
Good Arabic Model
|
Navid-AI
💬
16
REALVISXL V5.0 test G
|
1inkusFace
💻
1
|
MaxNoichl
😻
2
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
WatchOutForMike
🏜️
0
MidJour | A RealVisXL_Turbo | IRL HI-Res Images Gen
|
WatchOutForMike
🏜️
5
|
rafaaa2105
🌖
6
|
SkeletonDiffusion
💻
0
|
1aurent
✨
0
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
srinivasbilla
🔥
314
|
WatchOutForMike
🏎️💨
0
|
1aurent
🕯️
2
[ 250+ Impressive LoRA For Flux ]
|
WatchOutForMike
🥳
0
Make Custom Voices With KokoroTTS
|
FiditeNemini
⚡
3
Belarusian TTS Demo + stress
|
archivartaunik
🌍
7
|
shuttleai
🖼
83
|
veltre
🦕🦖
3
A Diffusion Model for Video Inpainting
|
fffiloni
✨
80
Guided melody accompaniment generation with transformers
|
asigalov61
🎺🥁
5
Frontier Foundation Models for Video Understanding
|
lixin4ever
💬
84
|
obukhovai
🏵️
8
|
veltre
🦕🦖
3
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
|
SunderAli17
🔥
15
[ 250+ Impressive LoRA For Flux ]
|
WatchOutForMike
🥳
1
(ICLR 2025) https://github.com/qq456cvb/3DCorrEnhance
|
qq456cvb
💻
1
The Ultimate Anime-themed SDXL model
|
Asahina2K
🌍
496
Deepseek AI's Janus-Pro-7B: Generate image from text
|
Bils
🚀
18
OpenSource Music Generator
|
innova-ai
👩🎤
73
A unified multimodal understanding and generation model.
|
BasqueLabs
🌍
1
A test for darija TTS model
|
medmac01
🌍
22
German zero Shot voice cloning with llasa 1b finetuned
|
SebastianBodza
🔥
11
Generate compressed images given different input conditions
|
DDCM
📖
10
|
gokaygokay
🔥
248
Demo of LPOSS and LPOSS+
|
stojnvla
👀
2
Generate Anime Art with Anime AI Generator
|
frogleo
🖼
37
Audio Gen, Audio Style Transfer and Audio InPainting
|
fffiloni
😻
46
|
Aatricks
🚀
12
|
qubvel-hf
⚡
5
Create a 1M faces 3D colored model from an image!
|
hysts-duplicates
⚡
32
|
paulpanwang
🚀
3
A unified multimodal understanding and generation model.
|
LLMhacker
🌍
0
|
haowu11
🖼
3
|
Steveeeeeeen
🍕
8
Demos for some my finetunes
|
shb777
⚡
32
High-Fidelity Simultaneous Speech-To-Speech Translation
|
fffiloni
👄
42
Protecting Protein Generative Models with Watermark.
|
Zaixi
🚀
4
Video Generator ⚡ from stories
|
ruslanmv
🚀
8
Mixture of Diffusers implementation for XL Stable Diffusion
|
elismasilva
🚀
18
|
ElectricAlexis
📊
73
Generation of Images with Prompt Engineering and Lora
|
ruslanmv
💬
10
Generate a video from any number of images
|
kiwhansong
✨
24
Towards Unified Music Emotion Recognition across Dimensional
|
amaai-lab
📊
18
|
Steveeeeeeen
🌍
413
|
phatdo
🔥
3
Transform Your Images into Mesmerizing Hexagon Grids and 3D!
|
Surn
🐝
5
|
hysts
⚡
23
|
hysts
⚡
1
Transform Your Images into Mesmerizing Hexagon Grids
|
Surn
🌖
4
|
hayas
⚡
2
test
|
hujiecpp
👁
16
|
hayas
🔥
7
|
ejschwartz
🐨
0
MVP demo of multilingual LLM performance eval space
|
willsh1997
📊
0
FlexTok flexible sequence length autoencoding demo
|
EPFL-VILAB
🖼
17
Quick Implementation of the MOSAIC scoring system
|
BaggerOfWords
👀
0
|
ejschwartz
🚀
0
Simple_Demo
|
Gainward777
🖼
0
Fast Text 2 Video Generator
|
llmlocal
⚡
0
|
ahmed-masry
💻
3
|
Locutusque
🕺
10
Huggingface demo of TrajectoryCrafter
|
Doubiiu
🌖
56
Try AuraFlow-v0.3 to generate images
|
merterbak
🖼️
3
|
Menyu
🖼
20
|
Gemini899
🖼
4
|
ameerazam08
👀
107
Anime Line Extractor
|
aidenpan
⚡
6
Compare latest VAE's
|
rizavelioglu
👀
68
Flexible Photo Recrafting While Preserving Your Identity
|
ByteDance
📸
1097
RAG example using Granite [vision, embedding, instruct]
|
ibm-granite
🚀
39
Large Language Diffusion Models
|
multimodalart
🚀
182
Prompt with Images in flux[dev]
|
Hatman
🖼
7
|
xingyang1
💻
181
Mixture of Diffusers and ControlNet Tile Upscaler for SDXL
|
elismasilva
🚀
27
Chat with Microsoft's phi-4 or phi-4-mini models
|
merterbak
🐨
14
Classify unstructured comment text.
|
ZennyKenny
🐈⬛ 🐈⬛ 🐈⬛
4
|
innoai
🔥
5
RAG Chatbot with Llama3.1-8B for PDFs
|
merterbak
📖
14
|
Menyu
🖼
6
Blazingly Fast and Embarrassingly Simple Song Generation
|
ASLP-lab
🎶
687
Let's do indoor relighting!
|
xyxingx
💡
1
Arabic OCR Models Demo
|
oddadmix
🐢
66
Demo for Multimodal-SAE
|
lmms-lab
💬
9
Tuning-free subject-driven generation
|
primecai
🦀
191
Identify spoken Arabic dialects from short speech samples
|
badrex
🍉
1
Predict Brain Age from a T1w MRI
|
FrancescoLR
📚
3
SD3.5 in 8-steps with TensorArt TurboX
|
multimodalart
🏃
134
Lightning fast guided upscaling with FLUX.MF + More
|
LPX55
⚡
33
|
tight-inversion
🐨
66
Playground for NuExtract-v1.5
|
dwb2023
👀
0
|
huggingface-projects
🔥
163
Fast image relighting using Latent Bridge Matching
|
jasperai
✨
419
German zero Shot voice cloning with llasa 1b finetuned
|
SebastianBodza
🔥
1
D-FINE Inference Example
|
developer0hye
👀
3
|
jameslahm
🚀
68
Image to Compositional 3D Scene Generation
|
VAST-AI
📚
226
State-of-the-art Indic language translation by AI4Bharat
|
ai4bharat
🌏
42
Demo for Amodal3R reconstruction
|
Sm0kyWu
🖼
27
Blazingly Fast and Embarrassingly Simple Song Generation
|
dskill
🎶
2
|
nightey3s
🚫
3
|
ovi054
😻
5
Chat with multimodal gemma-3-12b-it or gemma-3-4b-it models
|
merterbak
💎
13
Streamlining photo editing by watching dynamic videos
|
HadiZayer
🪄
1
|
hayas
⚡
2
|
docling-project
🦆📄
260
Scale-wise Distillation of SD3.5-Large
|
dbaranchuk
⚡
33
Large Animatable Human Model
|
3DAIGC
⚡
369
Classical Japanese Chatbot
|
SakanaAI
🐻
20
Instantly turn lamps on in your images
|
finegrain
💡
51
Try Orpheus TTS here
|
MohamedRashad
🚀
247
Audio-driven Talking Portrait
|
ChaolongYang
⚡
10
Dereflection Any Image
|
sjtu-deepvision
🌖
5
|
khang119966
🥶❄️🥶
5
A simple chat interface to see the reasoning process
|
Metal3d
🏢
1
|
Yuanshi
🖼
7
interactive demo for cube 3d model
|
Roblox
🌍
173
LiDAR generative model
|
kazuto1011
🚗
0
|
abreza
🏢
6
SalamandraTA-7B demo for translation.
|
BSC-LT
🦀
6
Demo for CFG-Zero*
|
weepiess2383
🐠
30
|
hugsanaa
🏢
0
Model for predicting micro-millisecond motions in proteins
|
gelnesr
🧬
10
Force any model to think like a reasoning models
|
Metal3d
🦀
1
Advance Blur anonymizes your images with "Vance Blurring."
|
model2
🥸
12
SwissGerman echtzeit Transkription
|
ErdVier
🤯
0
|
fffiloni
😛
154
New FaceID model trained on an open dataset
|
multimodalart
😻
26
Estimating 6DoF object pose in the wild!
|
LittleFrog
📦
9
|
Menyu
🖼
14
|
jzq11111
🚀
24
Imitate 真空 (@vericava)'s posts interactively
|
vericava
⚡
0
High-fidelity 3D Geometry Generation from single view image
|
Stable-X
🏢
699
|
ttoosi
🧠
2
|
wjbmattingly
😻
0
|
Yiming-M
💻
1
|
WillHeld
🚢
4
severely limited context window proof of concept
|
willsh1997
💬
1
Universal Anomaly Segmentation
|
csgaobb
📉
21
Caption Images and Export Each File to Text
|
vinhtruong3
🌍
4
A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning
|
yeliudev
💡
37
|
zerchen
🐠
3
|
Matoshi
🎬
5
Generate 3D texture from image
|
VAST-AI
🔮
125
Open-Domain 4D Avatarization
|
KumaPower
🎨
16
Compare responses from selected huggingface models
|
nuojohnchen
⚖️
2
Generate 3D texture from texts
|
VAST-AI
👁
103
Assistente que ajuda a encontrar script SQL
|
rrg92
🖥️
6
|
ai-conferences
⚡
4
|
wusize
👁
6
|
TencentARC
🦀
73
Generate customized images using text and multiple images
|
bytedance-research
⚡️
780
|
abreza
🚀
6
A Deep Learning Method for Protein Allergenicity Prediction
|
sfaezella
🔥
0
Custom DiT models trained on Wikiart dataset.
|
kaupane
🔥
3
Demo fo Dream 7B, an open diffusion large language model
|
multimodalart
📈
60
|
innoai
👈🖼️👉
2
A series of multilingual medical models
|
nuojohnchen
🩺
1
An open-world instance segmentation model
|
allencbzhang
🚀
7
generates linkedin posts from freetext entries
|
willsh1997
🏆
1
be polite and rude to llama
|
willsh1997
🐨
0
|
yslan
🎮🌍
16
compare different llama versions for knowledge cutoff
|
willsh1997
🏆
0
|
Yuanshi
🎨
368
kNN-TTS Demo
|
karlhajal
🚀
3
|
hannahcyberey
🦙
8
Ultra fast high quality image generation
|
eienmojiki
👁
1
|
InstantX
🐢
520
|
VAST-AI
🔮
93
Accurately classify any MIDI by top music genre
|
asigalov61
🏃
2
Decoupled Diffusion Transformer
|
MCG-NJU
😻
5
image_generator
|
nsfwalex
👀
7
|
khang119966
🥶❄️🥶
16
image and question as input answer as output
|
Monimoy
🦀
0
|
ar0551
🐨
0
Official demo for the COP-GEN-Beta model
|
mikonvergence
🌍
9
|
VAST-AI
👁
67
Demo of Facebook's MMS Text-to-Speech Model
|
suayptalha
🗣
6
|
Zeyue7
👀
168
Thaana text-to-image, ocr
|
alakxender
📝
1
|
Bofeee5675
💬
2
Mapping the world of LLMs using biology inspired tools.
|
nyax
🔥
5
Chat with Microsoft's 1.58bit Bitnet model!
|
suayptalha
👾
64
Automatic Speech Recognition for Blended Arabic and English
|
elgeish
🌍
4
|
dreroc
🐢
1
ASM for short
|
arabago96
🏢
1
|
Gyaneshere
👯
2
T5 finetunes for multiple dhivehi tasks.
|
alakxender
🌖
0
Object Detection & Scene Understanding for Images and Video
|
DawnC
🛰️
59
|
Dorjzodovsuren
🏢
0
cellpose is a generalist algorithm for cellular segmentation
|
mouseland
🔬
67
High-fidelity Virtual Try-on
|
ronniechoyy
👕👔👚
9
|
lucalp
⚡
9
A Step Towards Music Generation Foundation Model
|
ACE-Step
😻
659
Image generation in the style of nataliKav
|
ZennyKenny
🔥
5
Universal Image Editing is worth a single LoRA
|
RiverZ
🖼
662
|
developer0hye
🏢
8
Human-like English rewriting.
|
JuliaP-0419
💬
0
Scalable and Versatile 3D Generation from text prompt
|
dkatz2391
🏢
7
Conversational speech generation
|
evalstate
🌱
1
Object Detection on Images and Video
|
ustc-community
⚡
91
|
vzhizhi6611
🚀
0
|
kevinwang676
🏆
0
|
Gregniuki
🇵🇱↔️🇬🇧
0
create a professional headshot from your images
|
theoracle
🔥
0
fast video generation from images & text
|
linoyts
📹⚡️
325
hierarchical viruses
|
pswap
👀
0
|
developer0hye
📉
6
|
hyz317
🏆
66
Detect harms and risks with Granite Guardian 3.3 8B
|
ibm-granite
📝
12
|
smartfeed
👀
2
A Unified Framework for Image Customization
|
ByteDance
🐨
598
Fast 807M 4k solo Piano transformer trained on 1.14M+ MIDIs
|
asigalov61
🎹
7
Testing Mistral Small quantized with Intel's AutoRound
|
Didier
😻
1
|
yfdeng
🪄
33
The Ultimate Anime-themed SDXL model
|
lehehroi
🌍
8
|
FiditeNemini
🖼️💬
10
Scalable and Versatile 3D Generation from images
|
dkatz2391
🚀
8
|
Andres77872
👁
9
.
|
VisoLearn
🐨
3
SOTA real-time object detection model
|
GF-John
🔥
1
ByteDance Seed's coding focused Seed-Coder-8B-Instruct model
|
merterbak
🚀
6
|
tr74
🕵️
5
|
jieliu
⚡
21
Universal Image Editing is worth a single LoRA
|
jallenjia
🖼
1
Generating 3D printed layered models from an input image
|
hvoss-techfak
🏢
14
Estimate piano difficulty from audio
|
mtg-upf
🎹
2
Visual Question Answering - Autonomous Driving - SmolVLM2
|
sergiopaniego
🌖
4
|
bobber
🖼️💬
24
A Unified Framework for Image Customization
|
dreroc
🐨
0
|
multimodalart
🎥💨
1606
Stable Diffusion with ControlNet Canny Edge Detection
|
yingzhac
🎨
2
|
tianbaoxiexxx
🎯
57
A demo of our gen2seg SD and MAE-H models.
|
reachomk
🚀
6
|
rahul7star
🎥💨
49
Chat with MedGemma 4B, a medical variant of Gemma 3
|
warshanks
🩻
35
try and get llama to talk about milk
|
willsh1997
🐨
0
llm calculator with llama backend
|
willsh1997
🐨
0
|
yukieos
🏆
0
The official demo of MangaLMM
|
yuki-imajuku
📚
12
Ask question to model, let it think and describe semantics
|
gijs
🐨
0
The demo for pixel reasoner
|
TIGER-Lab
⚡
58
Test-Time Adaptation for Multimodal Navigation and Search
|
derektan95
🦁
8
|
hysts-mcp
🖥️
6
|
hysts-mcp
🏎️💨
2
inference for moondream2 point API
|
GF-John
🌕🌖🌗
0
|
votepurchase
🖼
16
|
Gemini899
🔅
18
|
FiditeNemini
🔥
7
UniVG-R1 demo
|
SuleBai
🌖
12
State-of-the-art target speech extractor
|
OpenSound
🎯
113
Demo for BAGEL
|
ByteDance-Seed
🚀
216
Upgraded to v1.0!
|
alexl1973
❤️
0
|
hysts-mcp
❤️
8
|
Smiley0707
🎥💨
0
|
GreenGoat
✨
2
Extreme Super-Resolution via Scale Autoregression
|
alexnasa
🚀
322
|
yahyarahhawi
🚀
1
Scalable and Versatile 3D Generation from images
|
hysts-mcp
🏢
17
sam2 images and video inference on ZeroGPU
|
GF-John
📚
1
|
prs-eth
🎨
34
|
jallenjia
🏎️💨
0
Stylized TTS – design voice, accent, and emotion your way
|
OpenSound
🧢
100
OmniConsistency_X
|
vzhizhi6611
💻
0
GSASR(2d gaussian for arbitrary-scale super-resolution)
|
mutou0308
🌖
8
|
Menyu
🖼
8
Semantic correspondence demo for DIY-SC.
|
odunkel
🌖
2
NotebookLM conversational speech model
|
fluxions
🏢
185
A simple kokoro TTS MCP server
|
marcodsn
🔉
0
Easily expand image boundaries
|
Ishankagg
🔅
4
|
bartduis
✨
5
|
WonwoongCho
💡
3
|
panelforge
🖼
12
Convert Free Text Into Json Using AI
|
Tonic
🔬📅📊
6
stock prediction with Amazon/Chronos
|
Tonic
🚀
75
|
ai-conferences
⚡
6
Expressive Zeroshot TTS
|
SebastianBodza
📢
27
|
infinity1096
🦀
12
|
freddyaboulton
💻
2
Generate physically plausible 3D model from single image.
|
HorizonRobotics
🖼️
40
Create 3D models from text descriptions
|
HorizonRobotics
📝
12
Generate visually rich textures for 3D mesh.
|
HorizonRobotics
🎨
12
[2026] SOTA 8k music transformer trained on 2.31M+ HQ MIDIs
|
asigalov61
🎺
31
SeedVR2-3B Image & Video API Demo
|
ByteDance-Seed
🎥
158
Chat with Lingshu 7B, a multimodal medical model
|
warshanks
🩻
10
Image to text, alto- or page-xml
|
Gabriel
🔥
1
Demo for Nanonets-OCR
|
MohamedRashad
👁
81
Expressive Zeroshot TTS
|
freddyaboulton
🍿
0
huggingface space for DRA-Ctrl.
|
Kunbyte
😻
50
|
loocorez
💬
0
Part-level image-to-3D generation.
|
cpuai
🪴
12
Egyptian Arabic Chatbot
|
MBZUAI-Paris
🏞️
5
Particle-Grid Neural Dynamics
|
kaifz
🖼
0
|
multimodalart
👀
30
Expressive Zeroshot TTS
|
SebastianBodza
📢
9
fast rtc video streaming debug
|
multimodalart
🚀
2
Real-time video generation
|
multimodalart
🎥
322
GPU-Accelerated OCR
|
CultriX
📚
2
FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
|
prithivMLmods
💻
142
compare different models and their moral compass
|
willsh1997
🏆
0
OmniGen2: Unified Image Understanding and Generation.
|
OmniGen2
👀
428
Memory-Efficient Optical Flow — ICCV 2025 SOTA.
|
egorchistov
🎞️
7
LightGlue demo
|
ETH-CVG
↔️
62
|
i0switch
📸
2
ultra-fast video model, LTX 0.9.7 13B distilled
|
azhan77168
🎥
8
Official Space for SpatialTrackerV2
|
Yuxihenry
⚡️
103
Un traductor de náhuatl a español
|
Thermostatic
🔥
2
|
huggingface-projects
⚡
142
SeqTex generates texture based on textual conditions
|
VAST-AI
🗺️
19
|
MoraxCheng
🧬
18
|
hysts
😻
9
A large orientation-aligned 3D generative model.
|
Louischong
🪄
3
THUDM/GLM-4.1V-9B-Thinking Demo
|
zai-org
🐢
64
Humanize any music score with Orpheus Music Transformer
|
projectlosangeles
🎺
3
Flux.1 Kontext Dev 8-step w/Adapters by SilverAgePoets.com
|
AlekseyCalvin
🌍🧞♀️
18
ROSE: Remove Objects with Side Effects in Videos
|
Kunbyte
🚀
11
Generate fragrance, notes idea from an image
|
fffiloni
🌸
4
|
XXXXRT
👀
26
|
oimoyu
🖼
12
Generate personalized avatars from Chinese portraits
|
VincentGOURBIN
🎭
0
|
matybohacek
📉
1
|
matybohacek
🚀
0
Use prompt to edit your Image with ease
|
frogleo
⚡
26
Demo for multimodal understanding and generation
|
AIDC-AI
🎨
173
|
silveroxides
🔥
34
|
aharley
⚡
35
SceneDINO (ICCV 2025)
|
jev-aleks
🦕
6
Demo for multimodal understanding and generation
|
evalstate
🎨
0
Automatic Speech Recognition for Kinyarwanda
|
badrex
🥭
1
|
jixin0101
🪄
158
|
Abhaykoul
🏆
9
DivEye: AI-Generated Text Detector
|
pinyuchen
🐨
9
Unified MLLM with Text-Aligned Representations
|
ByteDance-Seed
🚀
48
|
HelpingAI
🏆
7
|
omnipart
📚
99
Kontext image editing on FLUX[dev]
|
jallenjia
⚡
0
Demo space for Mistral latest speech models
|
MohamedRashad
🗣️
52
Qwen Image LoRA's
|
prithivMLmods
⛵
54
|
pointcept-bot
🎶
19
|
ethiotech4848
📉
2
Convert Khmer text to natural-sounding speech using advanced
|
mrrtmob
🎤
12
|
whatdoesrealitymean
😻
0
Chat with MedGemma 27B, a medical variant of Gemma 3
|
warshanks
🩻
26
Optical illusions and style transfer with FLUX
|
Dreamspire
🚀
1
Translate your text, audio, image from English to Ukrainian
|
Yehor
🚀
5
Wan2.1-T2V-14B + Fast 4-step with NAG + Automatic Audio
|
rishi2025
🔊
9
Kontext image editing on FLUX
|
Yuanshi
⚡
23
Demo of our Llama data model generator model
|
uc-ctds
📝
4
MVP demo of multilingual LLM performance eval space
|
QUT-GenAILab
📊
0
try and get llama to talk about milk
|
QUT-GenAILab
🐨
1
llm calculator with llama backend
|
QUT-GenAILab
🐨
0
generates linkedin posts from freetext entries
|
QUT-GenAILab
🏆
0
severely limited context window proof of concept
|
QUT-GenAILab
💬
0
compare different models and their moral compass
|
QUT-GenAILab
🏆
0
compare different llama versions for knowledge cutoff
|
QUT-GenAILab
🏆
0
neutral sd gradio dev space
|
QUT-GenAILab
👁
0
be polite and rude to llama
|
QUT-GenAILab
🐨
0
3D Mesh Generation via Compositional Latent Diffusion
|
paulpanwang
🧩
5
|
Krokodilpirat
👀
0
Kontext image editing on FLUX[dev]
|
zerogpu-aoti
⚡
24
Watch two icons have a conversation
|
tonyassi
🤖
3
Detect objects in images and videos
|
atalaydenknalbant
👁
42
|
ZiruiWu
✏️
3
|
ai-conferences
⚡
1
You Only Need a Denoiser (https://arxiv.org/abs/2506.03645)
|
hansen97
🏢
1
|
onuralpszr
⚡
0
|
scheitelpunk
🚀
4
Translate your text, audio, image from Ukrainian to English
|
Yehor
🚀
4
|
ovi054
😻
2
|
lch01
🔥
16
|
FIT-Check
🐈
1
OmniGen2: Unified Image Understanding and Generation.
|
azhan77168
👀
4
Multiple sampling rate MOS prediction with SFI conv
|
sarulab-speech
🐢
6
AI Clothes Changer Online
|
arfashion
👚
0
OmniSVG-Demo-Page
|
OmniSVG
📈
116
|
ZiruiWu
💻
2
Higgs Audio Demo
|
smola
🎤
399
A powerful multilingual translation language model
|
ByteDance-Seed
💻
30
Kontext image editing on FLUX[dev]
|
dreroc
⚡
0
|
yiqi0914
🖼
8
The official demo of MangaLMM
|
alfredplpl
📚
3
Demo for ChatTS
|
xiezhe22
💬
6
Transcribe, translate, chat, with any audio source 🎙📂🌐🎬
|
Loren
⚡
5
Sketch to realistic face
|
NikhilJoson
🎨
3
|
azhan77168
🪶
5
[ICCV25 Oral] Diving into the Fusion of Monocular Priors for
|
AdamYao
😻
2
|
linoyts
🎥💨🖋️
93
Morph any lyrics into a similar variation
|
projectlosangeles
🦀
9
OWLv2 zero-shot detection with visual prompt
|
vvmnnnkv
👀
1
|
noblebarkrr
🚀
3
private-nsfw-image-gen, for gay-twink
|
alexander00001
🖼
18
A Live Log component for Gradio Interface
|
elismasilva
🚀
5
use app_fast.py for fast api works wel and app_t2v is 14B
|
rahul7star
🐨
22
|
black-forest-labs
📚
370
The crowd counting model ZIP-B
|
Yiming-M
🔢
2
OmniSVG-Demo-Page
|
multimodalart
📈
39
AI meeting analysis with Voxtral transcription & summary
|
VincentGOURBIN
🎙️
3
|
upprize
🏥
0
|
shriarul5273
🎾
1