CXR-LLaVA: Multimodal Large Language Model for Interpreting Chest X-ray Images
Paper: arXiv 2310.18341
How to use ECOFRI/CXR-LLAVA-v2 with Transformers:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("feature-extraction", model="ECOFRI/CXR-LLAVA-v2", trust_remote_code=True)
```

```python
# Load the model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("ECOFRI/CXR-LLAVA-v2", trust_remote_code=True, dtype="auto")
```

GitHub repository: https://github.com/ECOFRI/CXR_LLaVA
CXR-LLaVA is an open-source multimodal large language model designed to generate radiologic reports from chest X-ray images.
| Version | Input CXR resolution | Channels | Vision Encoder | Base LLM | Weight |
|---|---|---|---|---|---|
| v1.0 | 512x512 | RGB | RN50 | LLAMA2-13B-CHAT | Deprecated |
| v2.0 (Latest) | 512x512 | Grayscale | ViT-L/16 | LLAMA2-7B-CHAT | Link |
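Per the table above, v2.0 expects a single-channel (grayscale) 512x512 input, unlike v1.0's RGB input. A minimal preprocessing sketch using Pillow is shown below; the choice of bilinear resampling is an assumption, since the model's bundled image processor (loaded via `trust_remote_code`) may use a different filter or normalization.

```python
from PIL import Image
import numpy as np

def preprocess_cxr(img: Image.Image, size: int = 512) -> Image.Image:
    """Convert a chest X-ray to the grayscale 512x512 format v2.0 expects.

    Bilinear resampling is an assumption; the model's own image
    processor may resize and normalize differently.
    """
    img = img.convert("L")  # "L" = single-channel grayscale
    return img.resize((size, size), Image.BILINEAR)

# Example: synthesize a dummy 1024x800 image and preprocess it
dummy = Image.fromarray(np.random.randint(0, 256, (800, 1024), dtype=np.uint8))
out = preprocess_cxr(dummy)
print(out.size, out.mode)  # (512, 512) L
```

The resulting `PIL.Image` can then be handed to the model's remote-code inference interface documented in the GitHub repository linked above.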