neuralmind

Brain-Like Learning Guide (v0.3.2+)

How NeuralMind learns from your query patterns to improve retrieval relevance

Overview

NeuralMind's learning system automatically improves as you use it. The more you query, the smarter it gets.

The Learning Cycle

┌──────────────────────────────────────────────────────────┐
│ 1. Query Your Code                                       │
│    neuralmind query . "How does auth work?"              │
│    ↓ Events logged to .neuralmind/memory/                │
├──────────────────────────────────────────────────────────┤
│ 2. Collect Patterns                                      │
│    After 5-10 queries, run:                              │
│    neuralmind learn .                                    │
│    ↓ Analyzes which modules appear together              │
├──────────────────────────────────────────────────────────┤
│ 3. Automatic Improvement                                 │
│    Next queries automatically boost related modules      │
│    ↓ Better results in fewer tokens                      │
├──────────────────────────────────────────────────────────┤
│ 4. Continuous Learning                                   │
│    Each new query adds to the pattern                    │
│    Run neuralmind learn . weekly for updates             │
└──────────────────────────────────────────────────────────┘

Step-by-Step Workflow

Step 1: Enable Memory Logging

First-time only. When you run your first query, you'll be prompted:

$ neuralmind query . "How does authentication work?"

Enable local NeuralMind memory logging to improve retrieval over time? [y/N]: y

✅ Consent saved to ~/.neuralmind/memory_consent.json

To disable: Set NEURALMIND_MEMORY=0 or NEURALMIND_LEARNING=0

Step 2: Use NeuralMind Naturally

Just query as usual. Events are logged automatically.

# Daily usage - these all get logged
neuralmind query . "How does authentication work?"
neuralmind query . "What are the API endpoints?"
neuralmind query . "How is data validated?"
neuralmind query . "Where's the database logic?"
neuralmind query . "What's the error handling?"

Each query appends an event line to the project's .neuralmind/memory/query_events.jsonl log, recording which modules appeared in the results.
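Because query_events.jsonl is a JSON Lines file (one JSON object per line), you can check how many events have accumulated before running neuralmind learn . The exact event schema is not specified here, so this sketch only counts well-formed lines; the path follows the File Locations section:

```python
import json
from pathlib import Path

def count_query_events(project_root: str) -> int:
    """Count logged query events in a project's JSONL event log."""
    log = Path(project_root) / ".neuralmind" / "memory" / "query_events.jsonl"
    if not log.exists():
        return 0
    count = 0
    for line in log.read_text().splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            json.loads(line)  # each non-empty line should be one JSON event
            count += 1
        except json.JSONDecodeError:
            continue  # skip malformed lines rather than fail
    return count
```

If the count is well below 5, it is too early for neuralmind learn . to find meaningful patterns.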

Step 3: Analyze Patterns

After collecting 5-10 queries, analyze the patterns:

$ neuralmind learn .

Analyzing 8 query events...
✓ Learned 12 cooccurrence patterns
✓ Patterns saved to .neuralmind/learned_patterns.json
✓ Next query will apply learned patterns for improved retrieval

Top cooccurrence patterns:
  community_0|community_1: 5 times (100% - auth + validation)
  community_1|community_2: 4 times (validation + middleware)
  community_0|community_2: 3 times (auth + middleware)

Step 4: Automatic Improvements

On your next query, learned patterns are applied automatically:

$ neuralmind query . "How does auth work?"

## Search Results

1. **validate_user** (score: 0.85 +0.25 boost)  ← Boosted due to cooccurrence!
   Type: function
   File: auth.py

2. **check_permissions** (score: 0.78 +0.18 boost)  ← Also boosted!
   Type: function
   File: middleware.py

3. **authenticate** (score: 0.91)
   Type: function
   File: auth.py

What happened: validate_user and check_permissions cooccur with the auth module in your past queries, so their scores were boosted and they can outrank results with higher raw semantic scores.

Step 5: Continuous Improvement

Run neuralmind learn . weekly or after major development:

# Weekly learning update (crontab entry: runs every Monday at 09:00)
0 9 * * 1 neuralmind learn /path/to/project

Each run re-analyzes the full event log and rewrites learned_patterns.json with refreshed cooccurrence counts.

Understanding Patterns

What Gets Learned

The system tracks module cooccurrence — which code parts appear together in successful queries.

{
  "cooccurrence": {
    "community_0|community_1": 5,
    "community_0|community_2": 3,
    "community_1|community_2": 4
  },
  "module_frequency": {
    "community_0": 8,
    "community_1": 12,
    "community_2": 7
  }
}

Example: If users ask about authentication, validation modules usually appear in L2 context (frequency: 5). The system learns this relationship.
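Given a learned_patterns.json like the one above, you can read off which modules cooccur most strongly with a given module. A minimal sketch (top_partners is a hypothetical helper for inspection, not part of the CLI; the a|b key format follows the JSON above):

```python
def top_partners(patterns: dict, module: str) -> list[tuple[str, int]]:
    """Return modules that cooccur with `module`, strongest first."""
    partners = []
    for key, count in patterns["cooccurrence"].items():
        a, b = key.split("|")
        if module == a:
            partners.append((b, count))
        elif module == b:
            partners.append((a, count))
    return sorted(partners, key=lambda p: p[1], reverse=True)

# The example pattern file from above
patterns = {
    "cooccurrence": {
        "community_0|community_1": 5,
        "community_0|community_2": 3,
        "community_1|community_2": 4,
    },
    "module_frequency": {"community_0": 8, "community_1": 12, "community_2": 7},
}
print(top_partners(patterns, "community_0"))  # → [('community_1', 5), ('community_2', 3)]
```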

How Reranking Uses Patterns

When you query:

  1. L2 identifies context modules — which communities match your question
  2. L3 searches normally — semantic search finds candidates
  3. Reranker boosts results — modules cooccurring with the L2 context get up to a 30% score boost
  4. Final ranking — better matches rise to the top

Boost formula:

combined_score = semantic_score Γ— (1.0 + 0.3 Γ— cooccurrence_strength)

Where cooccurrence_strength is 0-1 (normalized to max pattern).
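The formula can be sketched directly in Python, using the example counts from the pattern file above. Here cooccurrence_strength is a pair's count divided by the largest count on file; combined_score is the name from the formula, everything else is illustrative:

```python
def combined_score(semantic_score: float, pair_count: int, max_count: int,
                   weight: float = 0.3) -> float:
    """Apply the cooccurrence boost: score × (1 + weight × strength)."""
    strength = pair_count / max_count if max_count else 0.0  # normalized to 0-1
    return semantic_score * (1.0 + weight * strength)

# community_0|community_1 is the strongest pattern (5/5 = 1.0 strength)
print(combined_score(0.85, 5, 5))  # ≈ 1.105
# community_0|community_2 is weaker (3/5 = 0.6 strength)
print(combined_score(0.78, 3, 5))  # ≈ 0.92
```

With no learned pattern for a pair (count 0), the semantic score passes through unchanged, so learning can only reorder results upward, never penalize them.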

Pattern Examples

Example 1: Authentication System

Queries ask about: auth, validation, permissions
System learns: These modules appear together
Effect: "How does auth work?" automatically includes validation
Token savings: -20% (fewer irrelevant results)

Example 2: API Layer

Queries ask about: API endpoints, routes, handlers
System learns: These modules always appear together
Effect: "What are the endpoints?" automatically includes handler details
Token savings: -15% (complete context on the first pass)

Example 3: Data Pipeline

Queries ask about: database, models, migrations
System learns: These concepts are linked
Effect: "How's the data stored?" includes migration history
Token savings: -25% (better context relevance)

Privacy & Data

✅ 100% Local — All learning happens on your machine
✅ No Telemetry — Nothing sent to servers
✅ User Control — One-time consent, can disable anytime
✅ Persistent — Patterns stay in your .neuralmind/ directory

File Locations

~/.neuralmind/
├── memory_consent.json              # Consent flag (once per user)
└── memory/
    └── query_events.jsonl           # Global event log

project/.neuralmind/
├── memory/
│   └── query_events.jsonl           # Project-specific events
└── learned_patterns.json            # Learned patterns (created by neuralmind learn)

Environment Variables

# Disable memory logging
export NEURALMIND_MEMORY=0

# Disable learning
export NEURALMIND_LEARNING=0

# Use both to disable completely
export NEURALMIND_MEMORY=0 NEURALMIND_LEARNING=0
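A script that wraps NeuralMind can honor the same convention. This is an illustration of the documented semantics (a flag set to 0 disables the feature; unset means enabled), not NeuralMind's internal code:

```python
import os

def feature_enabled(var: str) -> bool:
    """A flag is disabled only when explicitly set to '0'; unset means enabled."""
    return os.environ.get(var, "1") != "0"

memory_on = feature_enabled("NEURALMIND_MEMORY")      # event logging
learning_on = feature_enabled("NEURALMIND_LEARNING")  # pattern application
```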

Troubleshooting

“No query events found”

Problem: You run neuralmind learn . but see “No query events found”

Solution:

  1. Have you run at least one query? neuralmind query . "test"
  2. Did you consent to memory logging? The prompt appears on your first query.
  3. Check that the memory file exists: ls -la project/.neuralmind/memory/query_events.jsonl
  4. Check that NEURALMIND_MEMORY is not set to 0: echo $NEURALMIND_MEMORY

“Learned 0 patterns”

Problem: You see “Learned 0 cooccurrence patterns”

Solution:

  1. You may need 5+ queries for meaningful patterns
  2. Your queries might be too different (no overlapping modules)
  3. Try a few more queries, then run learn again

“Patterns not being applied”

Problem: You see no boost scores in search results

Solution:

  1. Run neuralmind query . "test" again (must be AFTER learning)
  2. Check file exists: ls -la project/.neuralmind/learned_patterns.json
  3. Check that learning isn't disabled: echo $NEURALMIND_LEARNING
  4. Try with a fresh query (not the exact same as before)

Best Practices

1. Natural Querying

✅ DO: Ask questions naturally as they come up
neuralmind query . "How does user login work?"

❌ DON'T: Artificially create queries just for learning

2. Regular Learning

✅ DO: Run learn after several days/weeks of usage
neuralmind learn . # Weekly is ideal

❌ DON'T: Rely on very fresh patterns (need 5+ queries)

3. Meaningful Questions

✅ DO: Ask varied questions about your codebase
- "How does auth work?"
- "What are the API routes?"
- "How is data validated?"

❌ DON'T: Ask the exact same question repeatedly

4. Monitoring Patterns

✅ DO: Check top patterns to understand your code structure
neuralmind learn . | grep "cooccurrence"

❌ DON'T: Manually edit learned_patterns.json (it's auto-generated)

Performance Impact

Learning adds negligible overhead: logging appends one event line per query, and applying patterns is a small lookup at rerank time. The total cost is insignificant compared to the semantic search itself.

What's Coming (v0.4+)

🔜 Feedback Signals — Explicit ratings improve pattern accuracy
🔜 Conversation Memory — Context awareness across multiple queries
🔜 Predictive Loading — Anticipate needs based on current file
🔜 Team Learning — Share patterns across team members

See Also