QA

No it won't Perception Matrix Scaling:

Current: 3×3 = 9 cells = 2^9 = 512 possible states
6×6:     = 36 cells = 2^36 = 68 billion possible states  
10×10:   = 100 cells = 2^100 = 1.3 × 10^30 possible states

### Information Density Comparison:

3×3 Perception: 9 features
6×6 Perception: 36 features (4× more)
10×10 Perception: 100 features (11× more)

Solution 1 (Memory): 21 features (2.3× more)
Solution 2 (Multi-Modal): 37 features (4× more)

📈 Expected Accuracy Results

Current (3×3): 50-51% accuracy
├── 6×6 Perception: 55-60% accuracy (limited improvement)
├── 10×10 Perception: 60-65% accuracy (moderate improvement)
├── Solution 1 (Memory): 70-80% accuracy (significant improvement)
└── Solution 2 (Multi-Modal): 95%+ accuracy (optimal performance)

Why Larger Perception Has Diminishing Returns:

Sparse Data Problem: Most cells in larger matrices are empty
Training Difficulty: Need exponentially more data for larger inputs
Computational Cost: Much higher memory and processing requirements
Diminishing Information: Additional cells provide less marginal value

The problem isn’t that the robot needs to see more - it’s that it needs to process information better. A 3×3 perception with sophisticated multi-modal processing is far superior to a 10×10 perception with simple feedforward processing.

Bottom Line: Focus on intelligent information processing (Solutions 1 & 2) rather than brute force information gathering (larger perception matrices).

How robot position gets updated after every move?

Is environment wall information is require? Or robot perceving wall?

the robot must perceive walls (map boundaries) inside its 3×3 patch, otherwise it won’t know it’s at the edge and might try to walk out of bounds.

What are current limitations or shortcomings of the training data

Current implementation

The NN doesn’t know WHERE it is or WHERE it’s going!
It only knows WHAT IT SEES and WHAT IT DID

Current limitations

No Failure Examples ⇒ Only optimal paths Missing: What NOT to do
Examples of dead ends
Examples of obstacle collisions
Examples of inefficient moves ❌ Robot’s current position (x, y) ❌ Goal position ❌ Distance to goal ❌ Direction to goal ❌ How far robot has traveled ❌ Global map information

2. I tried different experiments by changing parameters from this guide 6. Hyperparameter Tuning Guide , but not much improvement in accuracy (stuck at 80%), I would like to know the basics or understand why hyper parameter tuning have limitations compared to input feature or model architecture?

Your 76.7% accuracy with hyperparameter tuning hitting a wall isn’t a failure—it’s proof you’ve reached the information ceiling!

Spend 80% of effort on features/data, 15% on architecture, 5% on hyperparameters. You'll get much better results!

📊 The ML Improvement Hierarchy

Impact on Accuracy (Most → Least):

1. DATA & FEATURES (70-80% of improvement)
   ↓
2. MODEL ARCHITECTURE (15-25% of improvement)
   ↓
3. HYPERPARAMETERS (5-10% of improvement)

Adding features (9→21): 50% → 76.7% (+26.7%!) ✅ HUGE
Tuning hyperparameters: 76.7% → ~77-78% (+0-1%) ⚠️ TINY

🧠 Why This Happens: Information Theory

A model can ONLY learn patterns that exist in the input features, regardless of how well you tune it.

What your robot KNOWS (21 features):

3×3 local perception (where obstacles are nearby)
Last 3 actions (where it’s been) What your robot DOESN’T KNOW:
❌ Goal location
❌ Global environment layout
❌ Optimal path direction
❌ Distance to goal

Maximum Possible Accuracy ≤ Information Ceiling Your case:

Current: 76.7%
Ceiling with 21 features: ~77-80%
Hyperparameters can only get you: +0.3% to +3.3% You’re at 96% of theoretical maximum already!

What is information ceiling? how to calculate it?

🔬 Why Hyperparameters Have Limited Impact

They Don’t Add Information Hyperparameters control HOW the model learns, not WHAT it can learn.

Your 23.3% error (100% - 76.7%) comes from:
├─ 15-17%: Missing Information (goal, global view)
│           → Fix with better features
│
├─ 3-5%:   Architecture Limitations
│           → Fix with better model design
│
└─ 4%:     Overfitting/Variance
            → Fix with hyperparameters (what you tried!)
            You can only improve the last 4% with hyperparameters!

Distance based algorithem

How distance value improveing NN accuracy?

Why 5x5 perception matrix is giving better results than distance based data 5x5 perception results

Why data is still not boundary aware?

🔍 Training Analysis of Experiment 3: Training Accuracy: 85.6331 (8563.3%) Validation Accuracy: 79.5111 (7951.1%) Overfitting: 6.1220 (612.2%)

5×5 Perception (Distance): 0.2 0.2 0.0 0.0 0.0 0.0 0.2 0.0 0.0 0.0 0.2 0.4 0.2 0.0 0.0 0.2 0.2 0.0 0.0 0.0 0.0 0.2 0.0 0.0 0.0

If 4 output nurons represent just one action, then how robot reaching its final goal? if there are multiple actions or position it have to move. Is neural network running n times in order to reach that goal

How the robot reaches the final goal?

The NN is called repeatedly at every timestep.
Each time:
1. Robot crops new 3×3 patch around itself.
2. Compute new goal_delta = (goal – current_pos).
3. Feed (patch, goal_delta) into the NN.
4. NN outputs action → robot moves one step.
Loop continues until robot reaches the goal (or fails/time limit). So yes ✅ — the NN runs n times (one per step), not once for the whole path.

🔹 Analogy Think of the NN as the robot’s “brain” at each step.

It doesn’t precompute the whole path like A*.
Instead, it reacts step by step, choosing the best move given local view + goal direction.
Over multiple iterations, the sequence of actions forms the path to the goal.

Path: Start(0,0) → (0,1) → (0,2) → (1,2) → ... → Goal(9,9)

NN Call #1 at (0,0): Perception + History → Action: DOWN
NN Call #2 at (0,1): NEW Perception + NEW History → Action: DOWN  
NN Call #3 at (0,2): NEW Perception + NEW History → Action: RIGHT
...
NN Call #18 at (8,9): NEW Perception + NEW History → Action: DOWN

Total: 18 NN executions for an 18-step path

🤖🧠 Deep mind AI blog series

Explorer

QA

Why Larger Perception Has Diminishing Returns:

How robot position gets updated after every move?

Is environment wall information is require? Or robot perceving wall?

What are current limitations or shortcomings of the training data

2. I tried different experiments by changing parameters from this guide 6. Hyperparameter Tuning Guide , but not much improvement in accuracy (stuck at 80%), I would like to know the basics or understand why hyper parameter tuning have limitations compared to input feature or model architecture?

📊 The ML Improvement Hierarchy

🧠 Why This Happens: Information Theory

Distance based algorithem

If 4 output nurons represent just one action, then how robot reaching its final goal? if there are multiple actions or position it have to move. Is neural network running n times in order to reach that goal

Graph View

Table of Contents

Backlinks

🤖🧠 Deep mind AI blog series

Explorer

QA

1. Would increasing perception matrix to 6x6 or 10x10 will improve training accuracy or implementing solution 1 and solution 2 (Multi modal)? why?

Why Larger Perception Has Diminishing Returns:

How robot position gets updated after every move?

Is environment wall information is require? Or robot perceving wall?

What are current limitations or shortcomings of the training data

2. I tried different experiments by changing parameters from this guide 6. Hyperparameter Tuning Guide , but not much improvement in accuracy (stuck at 80%), I would like to know the basics or understand why hyper parameter tuning have limitations compared to input feature or model architecture?

📊 The ML Improvement Hierarchy

🧠 Why This Happens: Information Theory

Distance based algorithem

If 4 output nurons represent just one action, then how robot reaching its final goal? if there are multiple actions or position it have to move. Is neural network running n times in order to reach that goal

Graph View

Table of Contents

Backlinks