Kelvin PRO

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

upvoted a paper 15 days ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

upvoted an article about 1 month ago

TRL v1.0: Post-Training Library Built to Move with the Field

View all activity

Organizations

upvoted a paper 3 days ago

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published 9 days ago • 31

upvoted a paper 15 days ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 255

upvoted an article about 1 month ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

Mar 31

•

upvoted 4 papers about 1 month ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 131

FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow

Paper • 2603.19598 • Published Mar 20 • 32

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Paper • 2603.18002 • Published Mar 18 • 13

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published Feb 26 • 201

upvoted 2 papers about 2 months ago

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published Mar 9 • 39

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Paper • 2512.15713 • Published Dec 17, 2025 • 18

upvoted a collection 3 months ago

Multimodal LLM

Collection

370 items • Updated Feb 7 • 47

upvoted 3 papers 3 months ago

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Paper • 2602.06422 • Published Feb 6 • 47

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

Agent-as-a-Judge

Paper • 2601.05111 • Published Jan 8 • 20

upvoted a paper 4 months ago

Toward Global Large Language Models in Medicine

Paper • 2601.02186 • Published Jan 5 • 6

upvoted 2 papers 5 months ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 64

upvoted a paper 6 months ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 74

upvoted a collection 6 months ago

E2D2

Collection

https://m-arriola.com/e2d2/ • 5 items • Updated 20 days ago • 4

upvoted an article 7 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

Jun 12, 2025

•

164

upvoted a paper 7 months ago

HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions

Paper • 2506.19639 • Published Jun 24, 2025 • 2

Kelvin PRO

AI & ML interests

Recent Activity

Organizations

kh's activity

TRL v1.0: Post-Training Library Built to Move with the Field

Learn the Hugging Face Kernel Hub in 5 Minutes