36 3 2

Aamer Mihaysi

O96a

https://www.mehaisi.com/

AI & ML interests

Ethical AI, NLP & Cognitive architectures

Recent Activity

updated a Space about 13 hours ago

O96a/lope-reasoning-demo

published a Space about 13 hours ago

O96a/lope-reasoning-demo

updated a Space 3 days ago

O96a/aris-adversarial-demo

View all activity

Organizations

O96a 's Spaces 45

LoPE Demo - Prompt Perturbation for Reasoning Exploration

🧠

Compare baseline and perturbed reasoning for tasks

Generate Sudanese Arabic poetry from any topic

Sudanese Poetry Experiment

🚀

Generate Sudanese Arabic poems on any topic

LenVM Token-Level Length Control Demo

📏

Lost-in-Thought Benchmark

🧠

Run a benchmark to see how reasoning steps affect retrieval accuracy

Sudanese Dialect Mt Stress

🏃

Master Key Capability Demo

🔑

Show expected accuracy boost for a math problem via steering

AutoResearchBench Explorer

🔬

AutoResearchBench Explorer

🔬

OneManCompany Talent Market Explorer

🚀

OneManCompany Talent Market Explorer

🚀

Agentic World Model Explorer

🚀

Explore world model levels, laws, and rollouts interactively

COSPLAY Skill Bank Demo

🚀

Generate baseline vs skill‑augmented LLM answer

COSPLAY Skill Bank Demo

🚀

COMPASS-Inspired Semantic Sampling for Sudanese Arabic Dialect Understanding

🎯

Number Periodicity Demo

📊

Number Periodicity Demo

📊

Number Representation Periodicity Visualizer

📊

CoT Spatial Reasoning Degradation

🧠

Show how step-by-step prompts affect visual puzzle answers

CoT Spatial Reasoning Degradation

📉

Generate spatial puzzles and compare direct vs CoT reasoning

Weak Supervision Reasoning Explorer

🔬

Explore reasoning performance under weak supervision

Aamer Mihaysi

AI & ML interests

Recent Activity

Organizations

O96a 's Spaces 45 Sort: Recently updated

LoPE Demo - Prompt Perturbation for Reasoning Exploration

ARIS Adversarial Review Demo

Hierarchical Tree RAG Demo

Step-level Cascade for Efficient Agents

Sudanese Poetry Experiment

Sudanese Poetry Experiment

LenVM Token-Level Length Control Demo

Lost-in-Thought Benchmark

Sudanese Dialect Mt Stress

Master Key Capability Demo

AutoResearchBench Explorer

AutoResearchBench Explorer

OneManCompany Talent Market Explorer

OneManCompany Talent Market Explorer

Agentic World Model Explorer

COSPLAY Skill Bank Demo

COSPLAY Skill Bank Demo

COMPASS-Inspired Semantic Sampling for Sudanese Arabic Dialect Understanding

Number Periodicity Demo

Number Periodicity Demo

Number Representation Periodicity Visualizer

CoT Spatial Reasoning Degradation

CoT Spatial Reasoning Degradation

Weak Supervision Reasoning Explorer

O96a 's Spaces 45