- UniSD: Towards a Unified Self-Distillation Framework for Large Language Models (arXiv:2605.06597, published 7 days ago, 12 upvotes)
- Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages (arXiv:2406.12739, published Jun 18, 2024, 2 upvotes)
- Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision (arXiv:2604.12002, published about 1 month ago, 11 upvotes)
- Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing (arXiv:2604.02288, published Apr 2, 33 upvotes)
- Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models (arXiv:2601.18734, published Jan 26, 7 upvotes)
- Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? (arXiv:2603.24472, published Mar 25, 55 upvotes)
- DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models (arXiv:2603.26164, published Mar 27, 364 upvotes)
- Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction (arXiv:2508.03613, published Aug 5, 2025, 16 upvotes)
- DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition (arXiv:2504.21801, published Apr 30, 2025, 5 upvotes)