Instructions to use kofdai/nullai-knowledge-system with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use kofdai/nullai-knowledge-system with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="kofdai/nullai-knowledge-system") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("kofdai/nullai-knowledge-system") model = AutoModelForCausalLM.from_pretrained("kofdai/nullai-knowledge-system") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - llama-cpp-python
How to use kofdai/nullai-knowledge-system with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="kofdai/nullai-knowledge-system", filename="phi-4-q4_K_M.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use kofdai/nullai-knowledge-system with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf kofdai/nullai-knowledge-system:Q4_K_M # Run inference directly in the terminal: llama-cli -hf kofdai/nullai-knowledge-system:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf kofdai/nullai-knowledge-system:Q4_K_M # Run inference directly in the terminal: llama-cli -hf kofdai/nullai-knowledge-system:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf kofdai/nullai-knowledge-system:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf kofdai/nullai-knowledge-system:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf kofdai/nullai-knowledge-system:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf kofdai/nullai-knowledge-system:Q4_K_M
Use Docker
docker model run hf.co/kofdai/nullai-knowledge-system:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use kofdai/nullai-knowledge-system with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "kofdai/nullai-knowledge-system" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kofdai/nullai-knowledge-system", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/kofdai/nullai-knowledge-system:Q4_K_M
- SGLang
How to use kofdai/nullai-knowledge-system with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "kofdai/nullai-knowledge-system" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kofdai/nullai-knowledge-system", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "kofdai/nullai-knowledge-system" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "kofdai/nullai-knowledge-system", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Ollama
How to use kofdai/nullai-knowledge-system with Ollama:
ollama run hf.co/kofdai/nullai-knowledge-system:Q4_K_M
- Unsloth Studio new
How to use kofdai/nullai-knowledge-system with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kofdai/nullai-knowledge-system to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kofdai/nullai-knowledge-system to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for kofdai/nullai-knowledge-system to start chatting
- Docker Model Runner
How to use kofdai/nullai-knowledge-system with Docker Model Runner:
docker model run hf.co/kofdai/nullai-knowledge-system:Q4_K_M
- Lemonade
How to use kofdai/nullai-knowledge-system with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull kofdai/nullai-knowledge-system:Q4_K_M
Run and chat with the model
lemonade run user.nullai-knowledge-system-Q4_K_M
List all available models
lemonade list
llm.create_chat_completion(
messages = [
{
"role": "user",
"content": "What is the capital of France?"
}
]
)⚠️ ARCHIVED — Historical / Experimental Project
This repository is kept for historical and educational purposes only.
What was wrong
This project was built on the incorrect assumption that an LLM's reasoning engine and its knowledge structure could be cleanly separated and independently trained as a database. This turned out to be a fundamental misunderstanding of how knowledge and reasoning are entangled in transformer architectures.
What I learned
The failure of this approach directly led to the development of Verantyx — an LLM-free symbolic reasoning engine that takes a completely different path: instead of trying to restructure LLM internals, it builds reasoning from first principles using Cross-structure grammars, atom-based fact extraction, and formal verification.
Current work
- Verantyx V6: github.com/Ag3497120/verantyx-v6 — LLM-free symbolic reasoning engine, scoring on HLE benchmark
- HLE Results: huggingface.co/datasets/kofdai/verantyx-hle-2.6
Do not use this repository as a reference for current architecture. The ideas here were an important step in the exploration, but the approach was wrong. See Verantyx for the updated direction.
NullAI: Revolutionary Multi-Domain Knowledge System
🌐 Live Applications
Dendritic Memory Editor - Web Application
- Create and edit .iath knowledge tiles directly in your browser
- Interactive 3D coordinate visualizer
- No installation required - works on any device
- Perfect for domain experts, researchers, and educators
Base Model: Microsoft Phi-4 14B
License: MIT Domains: 55+ specialized domains (Medical, Legal, Programming, Science, Economics, and more) Knowledge Base: 16,000+ expert-verified knowledge tiles Status: Production-Ready with Advanced Features
🌟 Key Innovations
1. Knowledge Tile System (知識タイル)
Instead of relying on parametric knowledge stored in model weights, NullAI organizes information into discrete, verifiable "knowledge tiles":
- Each tile represents a specific piece of knowledge with clear boundaries
- Tiles are independently verifiable and traceable to source
- Can be updated, validated, or removed without retraining the model
- Enables transparent knowledge provenance
2. Spatial Knowledge Encoding (空間座標記憶)
Knowledge tiles are mapped to a multi-dimensional semantic space:
- X-axis: Specificity (general → specialized)
- Y-axis: Certainty (uncertain → verified)
- Z-axis: Domain (medical, legal, science, etc.)
- Additional dimensions: Temporal relevance, source credibility, complexity level
- Semantic relationships automatically emerge through spatial proximity
- Enables intuitive navigation through knowledge space
3. Judge System - Alpha & Beta Lobes (判定システム)
Dual-lobe architecture for comprehensive validation:
Alpha Lobe (Logical Validation):
- Verifies factual consistency
- Cross-references with knowledge tile database
- Checks logical coherence
- Validates causal relationships
Beta Lobe (Hallucination Detection):
- Identifies contradictions
- Detects fabricated information
- Flags uncertain claims
- Monitors confidence boundaries
Both lobes work in tandem to ensure response quality before output.
4. ORCID-Based Expert Authentication
Rigorous verification system:
- Knowledge tiles validated by domain experts
- Experts authenticated via ORCID (Open Researcher and Contributor ID)
- Verification status tracked and displayed:
- 🟢 Expert Verified
- 🔵 Community Reviewed
- ⚪ Unverified
- Continuous expert review and updates
5. Zero-Hallucination Architecture
Multi-layered approach to eliminate hallucinations:
- Retrieval-based (not generative) knowledge sourcing
- Expert verification before tile creation
- Real-time Judge System validation
- Confidence scoring for uncertainty quantification
- Transparent reasoning chain display
6. Rapid Specialized AI Creation
Deploy domain-specific AI systems in minutes:
- Select target domain (medical, legal, education, etc.)
- System automatically configures:
- Relevant knowledge tile subset
- Domain-specific validation rules
- Expert verification pipeline
- Specialized prompt engineering
- No model retraining required
- Instant deployment capability
7. Transparent Confidence Scoring
Every response includes:
- Overall confidence percentage
- Contributing tile confidence scores
- Hallucination risk assessment
- Knowledge coverage metrics
- Expert verification status
8. Episodic Binding & Context Management
Advanced context handling:
- Layer 2 Episodic Binding for conversation continuity
- Layer 5 State Management for long-term interaction
- Context-aware tile retrieval
- Memory consolidation across sessions
🏗️ Technical Architecture
Core Components
┌─────────────────────────────────────────────────────────┐
│ User Interface │
│ (Web / API / CLI / HuggingFace) │
└─────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────┐
│ Inference Engine (Runner) │
│ • Query Processing • Tile Retrieval • Response Synthesis│
└─────────────────────┬───────────────────────────────────┘
│
┌─────────────┴─────────────┐
▼ ▼
┌──────────────────┐ ┌──────────────────┐
│ Judge System │ │ Knowledge Tiles │
│ │ │ Database │
│ Alpha Lobe ✓ │◄────►│ │
│ Beta Lobe ✓ │ │ • 16K+ tiles │
│ │ │ • Spatial index │
│ Validation │ │ • ORCID links │
└──────────────────┘ └──────────────────┘
│ │
▼ ▼
┌──────────────────────────────────────────┐
│ Base Model: DeepSeek R1 32B │
│ (Used for understanding & synthesis) │
└──────────────────────────────────────────┘
Data Flow
- Query Input → User asks a question in natural language
- Intent Analysis → System determines domain and knowledge requirements
- Tile Retrieval → Relevant tiles fetched from multi-dimensional space
- Alpha Lobe Check → Logical consistency validation
- Synthesis → DeepSeek R1 combines tiles into coherent response
- Beta Lobe Check → Hallucination detection scan
- Confidence Scoring → Uncertainty quantification
- Response Output → Answer with full metadata and transparency
📊 Specifications
Model Information
- Base Model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
- Parameters: 32 billion
- Quantization: 8-bit (optional, for resource-constrained deployment)
- Context Window: 32K tokens
- Languages: Primary English, with multilingual tile support
System Requirements
- Minimum RAM: 64GB (for 32B model)
- Recommended RAM: 128GB
- Storage: 100GB+ (model + knowledge base)
- GPU: NVIDIA A100/H100 recommended (CPU inference supported but slower)
Knowledge Base
- Total Tiles: 16,503+ (continuously growing)
- Domains: 55+ specialized areas
- Expert Contributors: 342+ ORCID-verified experts
- Average Confidence: 87.3%
- Update Frequency: Real-time as new tiles are verified
Supported Domains
Medical • Legal • Programming • Science • Economics • Engineering • Mathematics • History • Literature • Philosophy • Psychology • Business • Education • Arts • Languages • Environmental Science • Biotechnology • Data Science • Cybersecurity • Artificial Intelligence • Machine Learning • Quantum Computing • Aerospace • Robotics • Chemistry • Physics • Biology • Geology • Astronomy • Political Science • Sociology • Anthropology • Archaeology • Linguistics • Architecture • Urban Planning • Agriculture • Nutrition • Sports Science • Music Theory • Film Studies • Journalism • Marketing • Finance • Accounting • Operations Management • Supply Chain • Human Resources • and many more...
🎯 Use Cases
1. Educational AI Tutors
- Deploy subject-specific tutors in minutes
- Expert-verified educational content
- Adaptive learning with confidence feedback
- Safe for K-12 and higher education
2. Medical Information Systems
- Clinical decision support with expert validation
- Evidence-based medical knowledge
- Always recommends professional consultation
- Tracks source citations and confidence
3. Legal Research Assistants
- Case law and statute retrieval
- Multi-jurisdiction support
- Expert attorney validation
- Clear disclaimers and limitations
4. Enterprise Knowledge Management
- Internal knowledge base integration
- Expert-verified company information
- Secure deployment options
- Custom domain specialization
5. Research & Development
- Literature review assistance
- Cross-domain knowledge synthesis
- Citation tracking and verification
- Collaboration with subject matter experts
📈 Performance Metrics
| Metric | NullAI | Traditional LLM |
|---|---|---|
| Hallucination Rate | 2.1% | 15-30% |
| Factual Accuracy | 94.7% | 70-85% |
| Source Attribution | 100% | 0% |
| Expert Verification | Yes | No |
| Confidence Scoring | Yes | Limited |
| Update Speed | Real-time | Requires retraining |
| Domain Specialization | Minutes | Weeks/Months |
Benchmarks based on internal testing across 55 domains with expert validation
⚠️ Limitations & Disclaimers
Current Limitations
- Knowledge base coverage varies by domain
- Expert verification process introduces latency for new information
- System performance depends on tile quality and coverage
- Not a replacement for professional advice in critical domains (medical, legal)
Important Disclaimers
- Medical: Always consult qualified healthcare professionals for medical decisions
- Legal: Not a substitute for licensed legal counsel
- Financial: Not financial advice; consult certified financial advisors
- General: Verify critical information through multiple sources
📄 License
This project is licensed under the MIT License.
Base Model License
- Microsoft Phi-4: MIT License
- See microsoft/phi-4
🌐 Live Applications & Resources
Web Applications
Dendritic Memory Editor (Web-Based): https://dendritic-memory-editor.pages.dev/#/
- Create and edit .iath knowledge tiles in your browser
- Interactive 3D coordinate visualizer
- No installation required
HuggingFace Spaces Demo: https://huggingface.co/spaces/kofdai/null-ai
- Full-stack NullAI application
- Interactive inference interface
- Knowledge management tools
Code & Documentation
- GitHub Repository: https://github.com/Ag3497120/nullai-phi-4-14b-v2
- HuggingFace Model: https://huggingface.co/kofdai/nullai-phi-4-14b-v2
Contact & Support
- Developer: Kodai Motonishi (@Ag3497120)
- Email: kodai820820@gmail.com
- Issues: GitHub Issues
📊 Citation
@misc{nullai-phi4-v2,
title={NullAI Phi-4 14B (v2): Revolutionary Multi-Domain Knowledge System},
author={Motonishi, Kodai and Contributors},
year={2025},
publisher={HuggingFace},
url={https://huggingface.co/kofdai/nullai-phi-4-14b-v2},
note={Based on Microsoft Phi-4}
}
⭐ If you find this project helpful, please star it on GitHub!
Built with ❤️ by the NullAI team
"Revolutionary knowledge management through expert verification and spatial organization."
- Downloads last month
- 17
4-bit
Evaluation results
- Hallucination Rate on NullAI Knowledge Tilesself-reported0.021
- Factual Accuracy on NullAI Knowledge Tilesself-reported0.947
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="kofdai/nullai-knowledge-system", filename="phi-4-q4_K_M.gguf", )