ARA

ARA Cognitives

The collaboration workspace for building the Adaptive Reasoning Architecture β€” connect, share ideas, and track progress together.

πŸ’¬
Group Chat
Real-time chat with the whole team
πŸ“‹
Notice Board
Post announcements & updates
πŸ“Ž
File Sharing
Docs, images, and links
πŸ‘₯
Member Directory
Find collaborators by skill
Know More

By joining, you agree to collaborate respectfully. This is a private workspace for ARA contributors.

?
✏
Welcome to ARA Hub πŸ‘‹
Looks like you're new here! Set up your profile so teammates can find and connect with you.
You can update any of this at any time.
Your identity
Max 200 characters β€” shows on your member card
Skills & interests
Avatar colour
ARA
ARA Cognitive
v0.1 Β· Shadow Demo
JM
John M.
ML Engineer
Workspace
🏠 Welcome
⬑ Dashboard
πŸ’¬ Chat
πŸ“‹ Notice Board
πŸ‘₯ Members
πŸ”— Repository
πŸ“š Project Notes
Resources
πŸ™ GitHub
πŸ—„ Supabase DB
πŸšͺ Sign out
Dashboard
Good to have you back
ARA
Cognitive Hub
Shadow Demo Β· Active
Welcome to the
Adaptive Reasoning Architecture
The ARA is a fundamentally new approach to neural intelligence β€” a model that separates what it knows from how it thinks. Its frozen 40% reasoning core encodes timeless logical primitives; its plastic 60% workspace adapts in real time to each problem. This is the collaboration hub for the team building it.
10M
Prototype Params
40/60
Core / Workspace Split
8
Transformer Layers
25+
Reasoning Primitives
775M
Effective Param Equiv.
🧬 What is the Adaptive Reasoning Architecture?

Most neural networks are monolithic: every weight participates in both storing knowledge and executing reasoning. The ARA breaks this coupling. It divides the model into two distinct, mathematically defined zones that play complementary roles during inference.

The 40% Reasoning Core (ΞΈ_R) is frozen after pre-training. It encodes the atomic operations of mathematical thought β€” operations like distribute, substitute, differentiate, factor, apply the chain rule β€” as structured patterns in its weight matrices. These never change. They are the bedrock of logical procedure, independent of any particular problem.

The 60% Plastic Workspace (ΞΈ_P) is the dynamic half. During each problem-solving session it updates its own weights in real time, guided by a coherence loss that measures whether the current partial solution is internally consistent and progressing toward the goal. This is inference-time learning β€” something no conventional network does.

βš™οΈ How It Works: Two Phases
Phase 1 β€” Pre-Training
Building the Reasoning Core
The 40% reasoning core (ΞΈ_R) is trained on structured solution traces. The 60% workspace is frozen as a random reservoir, providing a rich non-linear feature space. The core learns to read and write into this space.
Phase 2 β€” Inference
The Workspace Takes Over
At deployment ΞΈ_R freezes permanently. ΞΈ_P becomes trainable. Each micro-iteration: the frozen core proposes a reasoning step, checks coherence, and if consistent β€” updates the plastic weights. The model literally rewrites itself while solving.
Phase 3 β€” Scaling
Council & Distributed Intelligence
Beyond the 10M prototype, multiple ARA instances form a Council, debating via weighted graph consensus. Byzantine fault tolerance is built in. Diverse nodes outperform identical copies.
πŸ“ The Mathematics, in Plain Terms

Every claim in the ARA is backed by a derivation from first principles.

Masked Gradient Update
Ξ”W = βˆ’Ξ· Β· M_plastic βŠ™ βˆ‡_W L
Only the 60% plastic neurons receive gradient updates.
Coherence Loss
L_coh = βˆ’ΞΌβ‚Β·log p(sβ‚œ|sβ‚œβ‚‹β‚,ΞΈ) βˆ’ ΞΌβ‚‚Β·Prog(t)
Consistency check + progress toward the goal state.
Adaptive Resistance (EWC)
L_inf = L_coh + (Ξ»_R/2)Β·Ξ£ Rα΅’Β·(ΞΈα΅’βˆ’ΞΈα΅’β‚€)Β²
Fisher-weighted springs resist overwriting useful neurons.
Scaling Identity
N_eff = N Β· T^Ξ² Β· (1βˆ’f_facts)⁻¹
10M model β†’ ~775M parameter-equivalents at T=1000.
πŸ”¬ The Shadow Demo β€” Current Focus
Architecture
d_model=128, 8 layers, d_ff=512
Vocab: 8,000 symbolic tokens
Split
204 frozen / 307 plastic neurons per FFN
Per-neuron, interleaved across all 8 layers
Memory footprint
~120 MB total
Runs on any modern GPU or Apple Silicon
Benchmark targets
β‰₯75% GSM8K Β· β‰₯40% MATH
Within 5–7% of 1B-parameter baseline
Why does a 10M model compete with a 1B model? Standard transformers use 50–70% of their parameters to store factual knowledge. The ARA stores nothing β€” it retrieves facts from an external library. All 10M parameters go to reasoning. Combine that with 1,000 micro-iterations of self-modifying inference, and the effective capacity reaches ~775M parameter-equivalents.
🧠 The Three-Tier Memory Hierarchy

The plastic workspace self-organises over time into three stability tiers, driven entirely by the resistance values R_i that each neuron accumulates through use:

R_i < 0.2 Β· FREE POOL
Working Memory
Updated freely each iteration. Problem-specific scratch space. Fast, volatile, context-specific.
0.2 ≀ R_i < 0.7 Β· ADAPTIVE
Procedural Skills
Protected within a problem type. Cross-problem heuristics that proved useful.
R_i β‰₯ 0.7 Β· CONSOLIDATED
Experiential Memory
Rarely overwritten. Accumulated mathematical intuition from hundreds of solved problems.
🧭 Navigate the Hub
πŸ’¬
Group Chat
A single shared channel for the entire team. Discuss architecture decisions, share files, and coordinate sprints in real time.
Open Chat β†’
πŸ“‹
Notice Board
Post announcements, updates, questions, and ideas. Urgent notices and recent posts appear on the Dashboard.
Open Notices β†’
πŸ‘₯
Member Directory
Find collaborators by name, role, or skill. View profiles, see who's online, and send direct messages.
View Members β†’
πŸ”—
Repository
Access the codebase, architecture spec, and contribution guide. Clone, branch, and push your work.
Open Repository β†’
πŸš€ Getting Started
1
Say Hello
Introduce yourself in the group chat and mention your background and interests
2
Read the Notices
Check pinned announcements for current sprint priorities and open tasks
3
Clone the Repo
Follow the contribution guide in Repository to set up the codebase locally
4
Pick an Issue
Grab an open GitHub issue and start contributing β€” all skill levels welcome
β€”
Active Members
β€”
Messages Today
β€”
Files Shared
10M
Prototype Target
Shadow Demo
πŸ“‹ Recent & Pinned Notices
⚑ Recent Activity
πŸ’¬ Group Chat
All members Β· Everyone's here
πŸ“ Post a Notice
🧬 ARA β€” Adaptive Reasoning Architecture
Open repository for the Shadow Demo prototype and full architecture specification. A 10M-parameter model designed to rival 1B-parameter transformers on mathematical reasoning.
⭐ 142 stars
πŸ”€ 23 forks
πŸ› 8 open issues
πŸ‘ 310 watching
πŸ™ Go to Project Repository on GitHub β†’ or browse the full README below
πŸ“„ README.md
adaptive-reasoning-architecture

Adaptive Reasoning Architecture (ARA)

A neural architecture that separates how a model thinks from what it knows. The frozen 40% reasoning core (ΞΈ_R) encodes timeless logical primitives. The plastic 60% workspace (ΞΈ_P) adapts in real time to each problem via inference-time weight updates β€” something no conventional transformer does.

The result: a 10M-parameter model with ~775M effective parameter-equivalents on reasoning tasks, achieved by combining iterative inference, knowledge decoupling, and adaptive resistance.

Architecture at a Glance

d_model    = 128       # Model dimension
n_layers   = 8         # Transformer depth
d_ff       = 512       # FFN hidden dim (4 Γ— d_model)
n_heads    = 4         # Attention heads
vocab_size = 8000      # Symbolic token vocabulary

# The 40/60 split (per-neuron, across all FFN layers)
frozen_neurons  = 204  # floor(0.40 Γ— 512) per FFN layer β†’ reasoning core
plastic_neurons = 307  # ceiling(0.60 Γ— 512) per FFN layer β†’ workspace

total_params       = ~10,000,000
reasoning_core     = ~4,000,000  (ΞΈ_R β€” frozen at inference)
plastic_workspace  = ~6,000,000  (ΞΈ_P β€” updated per micro-iteration)

Quick Start

git clone https://github.com/ara-collective/adaptive-reasoning-architecture.git
cd adaptive-reasoning-architecture
pip install -r requirements.txt

# Phase 1: train the reasoning core
python shadow_demo/train.py --config configs/base_10m.yaml --phase 1

# Phase 2: run iterative inference on a problem
python shadow_demo/infer.py --problem "Solve: 3xΒ² + 5x - 2 = 0" --max_iters 1000
πŸ“‹ Contribution Guide β€” How to Clone, Branch & Push
1
Fork the Repository
Go to the GitHub repo and click the Fork button (top right).
2
Clone Your Fork Locally
git clone https://github.com/your-username/adaptive-reasoning-architecture.git cd adaptive-reasoning-architecture
3
Add the Upstream Remote
git remote add upstream https://github.com/ara-collective/adaptive-reasoning-architecture.git git remote -v
4
Create a Feature Branch
git checkout -b feature/my-cool-change
5
Make Changes & Commit
git add . git commit -m "feat: add masked gradient update for frozen neurons"
6
Pull Latest Changes & Push
git pull upstream main git push origin feature/my-cool-change
7
Open a Pull Request
Go to your fork on GitHub, find your branch, and click Compare & pull request.
πŸ’‘ Pro tip: Keep your fork in sync with git pull upstream main. Reach out in the group chat if you need help!
πŸ“š Project Notes & Resources
Upload documents, books, papers, and resources for the project
✏️ Add a Resource Link or Note
Your Profile
JM
John M.
john@example.com
Member