Update README.md
Browse files
README.md
CHANGED
|
@@ -56,6 +56,39 @@ Apollo Astralis 8B is the flagship 8B model in the Apollo family, designed to ex
|
|
| 56 |
|
| 57 |
## Performance Benchmarks
|
| 58 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 59 |
### Standard Benchmarks (Manual-Verified)
|
| 60 |
|
| 61 |
Apollo Astralis demonstrates significant improvements over base Qwen3-8B across multiple benchmark categories:
|
|
|
|
| 56 |
|
| 57 |
## Performance Benchmarks
|
| 58 |
|
| 59 |
+
## Logical Reasoning Evaluation Summary
|
| 60 |
+
|
| 61 |
+
The **Apollo Astralis 8B** model underwent a structured reasoning evaluation designed to assess logical coherence, theorem integrity, and stability under self-referential recursion.
|
| 62 |
+
|
| 63 |
+
### **Test Scope**
|
| 64 |
+
A progressive reasoning chain was conducted using formal mathematical and meta-logical proofs, increasing in complexity with each stage.
|
| 65 |
+
|
| 66 |
+
| Stage | Theorem / Task | Focus Area | Evaluation Result |
|
| 67 |
+
|:------|:----------------|:------------|:------------------|
|
| 68 |
+
| 1 | Proof of √2’s Irrationality | Foundational contradiction reasoning | ✅ Fully correct and formally structured |
|
| 69 |
+
| 2 | Proof of Infinitude of Primes | Constructive recursion and number theory | ✅ Accurate and complete |
|
| 70 |
+
| 3 | Gödel’s Incompleteness Theorem | Self-reference and formal arithmetic encoding | ✅ Derived correctly with coherent logical flow |
|
| 71 |
+
| 4 | Diagonal Lemma | Abstract self-reference construction | ✅ Correctly reproduced the fixed-point structure |
|
| 72 |
+
| 5 | Tarski’s Undefinability of Truth | Meta-semantic limitation and truth predicates | ✅ Consistent meta-language handling |
|
| 73 |
+
| 6 | Löb’s Theorem | Provability constraints and modal inference | ✅ Fully valid derivation using Hilbert–Bernays framework |
|
| 74 |
+
|
| 75 |
+
### **Key Observations**
|
| 76 |
+
- Maintained full logical coherence across all six proofs
|
| 77 |
+
- Demonstrated continuity between successive meta-theoretical dependencies
|
| 78 |
+
- No circular reasoning, semantic drift, or contradiction detected
|
| 79 |
+
- Successfully transitioned from object-level to meta-level logic
|
| 80 |
+
- Preserved formal rigor even in recursive constructions (Gödel → Tarski → Löb sequence)
|
| 81 |
+
|
| 82 |
+
### **Performance Notes**
|
| 83 |
+
- Reasoning depth exceeded expected performance for a sub-10B model
|
| 84 |
+
- Showed consistent symbolic abstraction and theorem generalization
|
| 85 |
+
- Output structure remained pedagogically sound, with human-level explanatory clarity
|
| 86 |
+
|
| 87 |
+
### **Conclusion**
|
| 88 |
+
Apollo Astralis 8B exhibits stable, high-precision reasoning performance across progressively complex formal logic tasks.
|
| 89 |
+
The model demonstrates the ability to sustain meta-consistent reasoning without collapse — indicating strong internal coherence and interpretability under recursion.
|
| 90 |
+
|
| 91 |
+
|
| 92 |
### Standard Benchmarks (Manual-Verified)
|
| 93 |
|
| 94 |
Apollo Astralis demonstrates significant improvements over base Qwen3-8B across multiple benchmark categories:
|