unmodeled-tyler commited on
Commit
5479c0a
·
verified ·
1 Parent(s): 8df2846

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -56,6 +56,39 @@ Apollo Astralis 8B is the flagship 8B model in the Apollo family, designed to ex
56
 
57
  ## Performance Benchmarks
58
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
59
  ### Standard Benchmarks (Manual-Verified)
60
 
61
  Apollo Astralis demonstrates significant improvements over base Qwen3-8B across multiple benchmark categories:
 
56
 
57
  ## Performance Benchmarks
58
 
59
+ ## Logical Reasoning Evaluation Summary
60
+
61
+ The **Apollo Astralis 8B** model underwent a structured reasoning evaluation designed to assess logical coherence, theorem integrity, and stability under self-referential recursion.
62
+
63
+ ### **Test Scope**
64
+ A progressive reasoning chain was conducted using formal mathematical and meta-logical proofs, increasing in complexity with each stage.
65
+
66
+ | Stage | Theorem / Task | Focus Area | Evaluation Result |
67
+ |:------|:----------------|:------------|:------------------|
68
+ | 1 | Proof of √2’s Irrationality | Foundational contradiction reasoning | ✅ Fully correct and formally structured |
69
+ | 2 | Proof of Infinitude of Primes | Constructive recursion and number theory | ✅ Accurate and complete |
70
+ | 3 | Gödel’s Incompleteness Theorem | Self-reference and formal arithmetic encoding | ✅ Derived correctly with coherent logical flow |
71
+ | 4 | Diagonal Lemma | Abstract self-reference construction | ✅ Correctly reproduced the fixed-point structure |
72
+ | 5 | Tarski’s Undefinability of Truth | Meta-semantic limitation and truth predicates | ✅ Consistent meta-language handling |
73
+ | 6 | Löb’s Theorem | Provability constraints and modal inference | ✅ Fully valid derivation using Hilbert–Bernays framework |
74
+
75
+ ### **Key Observations**
76
+ - Maintained full logical coherence across all six proofs
77
+ - Demonstrated continuity between successive meta-theoretical dependencies
78
+ - No circular reasoning, semantic drift, or contradiction detected
79
+ - Successfully transitioned from object-level to meta-level logic
80
+ - Preserved formal rigor even in recursive constructions (Gödel → Tarski → Löb sequence)
81
+
82
+ ### **Performance Notes**
83
+ - Reasoning depth exceeded expected performance for a sub-10B model
84
+ - Showed consistent symbolic abstraction and theorem generalization
85
+ - Output structure remained pedagogically sound, with human-level explanatory clarity
86
+
87
+ ### **Conclusion**
88
+ Apollo Astralis 8B exhibits stable, high-precision reasoning performance across progressively complex formal logic tasks.
89
+ The model demonstrates the ability to sustain meta-consistent reasoning without collapse — indicating strong internal coherence and interpretability under recursion.
90
+
91
+
92
  ### Standard Benchmarks (Manual-Verified)
93
 
94
  Apollo Astralis demonstrates significant improvements over base Qwen3-8B across multiple benchmark categories: