A set of models from my experiments with Reinforcement Learning from Human Feedback
Samir R.
sr5434
AI & ML interests
NLP
Recent Activity
updated a model about 11 hours ago
sr5434/model-tempfiles
updated a model about 1 month ago
sr5434/skin-cancer-classifier
updated a model about 1 month ago
sr5434/DeepSeek-OCR-2-patched
Organizations
None yet