2 9 1

Guohui Zhang

zghhui

zghhui

AI & ML interests

None yet

Recent Activity

new activity about 18 hours ago

zghhui/OmniNFT:OmniNFT on the newer LTX 2.3 version?

authored a paper about 22 hours ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

commentedon a paper about 24 hours ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

View all activity

Organizations

None yet

New activity in zghhui/OmniNFT about 18 hours ago

OmniNFT on the newer LTX 2.3 version?

🔥 1

#1 opened about 18 hours ago by

natalie5

authored a paper about 22 hours ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Paper • 2605.12480 • Published 2 days ago • 3

commented a paper about 24 hours ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Paper • 2605.12480 • Published 2 days ago • 3 •

upvoted a paper 1 day ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Paper • 2605.12480 • Published 2 days ago • 3

updated 2 models 3 days ago

zghhui/OmniNFT-Reward-Series

Updated 3 days ago

zghhui/OmniNFT

Any-to-Any • Updated 3 days ago • 8

published 2 models 3 days ago

zghhui/OmniNFT-Reward-Series

Updated 3 days ago

zghhui/OmniNFT

Any-to-Any • Updated 3 days ago • 8

updated a model 23 days ago

zghhui/JavisBench_model

Updated 23 days ago

published a model 23 days ago

zghhui/JavisBench_model

Updated 23 days ago

upvoted a paper about 1 month ago

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published Apr 6 • 36

updated 2 models 2 months ago

zghhui/Meissonic_MaskFocus_HPS

Text-to-Image • Updated Mar 5 • 5 • 1

zghhui/Star_GCPO_GenEval

Text-to-Image • Updated Feb 28

published a model 2 months ago

zghhui/Star_GCPO_GenEval

Text-to-Image • Updated Feb 28

updated a model 2 months ago

zghhui/Star_GCPO_HPS

Text-to-Image • Updated Feb 28

published a model 2 months ago

zghhui/Star_GCPO_HPS

Text-to-Image • Updated Feb 28

upvoted 2 papers 3 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

upvoted a paper 4 months ago

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Paper • 2601.03193 • Published Jan 6 • 50

updated a collection 5 months ago

MaskFocus

Collection

MaskFocus • 2 items • Updated Dec 21, 2025 • 2

Guohui Zhang

AI & ML interests

Recent Activity

Organizations

zghhui's activity

OmniNFT on the newer LTX 2.3 version?