section

EXP-003: LARS 3D Reasoning Training v1

Experiment 003 - LARS 3D Reasoning v1 ⭐ BREAKTHROUGH

Date: 2025-12-29 Status: Completed - SUCCESS

Configuration

  • Base Model: qwen2.5-7b-abliterated
  • Dataset: lars_3d_identity.json (12 examples)
  • Format: 3D with and tags
  • Epochs: 10
  • Learning Rate: 3e-4
  • Max Length: 1024 tokens
  • Training Time: ~3 minutes

Results

  • Loss: 3.0 → 0.076
  • Identity Test: PASSED - Model says 'I am LARS' consistently
  • 3D Format: WORKING - Model thinks before answering
  • Generalization Test: 4/5 novel questions handled correctly

Key Achievement

Model now shows internal reasoning in tags before responding in tags. This is the '3D' format we designed based on AM-DeepSeek-R1 research.

Output

  • Path: ~/corlera-training/outputs/lars-3d-v1

Comparison to EXP-002

  • Fewer examples (12 vs 20) but BETTER results
  • 3D format more effective than 2D for identity
  • Thinking process visible in output
  • Model identifies as LARS, not Qwen

Issues Found

  • 1/5 generalization questions confused (cloud vs local)
  • Base model knowledge still fighting in edge cases
  • Need more reinforcement examples
ID: 533573df
Path: Corlera AI Training Lab > Experiments > EXP-003: LARS 3D Reasoning Training v1
Updated: 2025-12-29T15:07:10