Update README.md
README.md
CHANGED
@@ -52,7 +52,9 @@ The primary hurdle in modern AI applications is the computational cost of deep t
 
 ## The Retrieval Architecture: Entropy and Exponential Search (O(log N))
 
-We open-source the accompanying RAG system that maximizes the potential of this lightweight model.
+We open-source the accompanying RAG system that maximizes the potential of this lightweight model.
+We already made millions from the SaaS around this idea. So we are open sourcing it.
+It bypasses the limitations of fixed-size chunking and linear searching with two core innovations:
 The architecture uses the cohesion of the chunk to replace the function of the attention mechanism.
 
 PLEASE CITE THIS REPOSITORY IF YOU ARE BUILDING ON TOP OF THE IDEA.
@@ -110,4 +112,14 @@ This combination is ideal for any application requiring high throughput and low
 | **Embedding Dimension** | 4096 |
 | **Model File Size** | 592 MB |
 | **Quantization** | int8 |
-| **Inference Pipeline** | Mean Pooling + L2 Normalization |
+| **Inference Pipeline** | Mean Pooling + L2 Normalization |
+
+```bibtex
+@misc{EH-RAG_2025,
+title={Entropy-Harmonic RAG: Achieving Logarithmic Retrieval Complexity and Extreme Efficiency via Transformer Distillation},
+author={Anonymous},
+year={2025},
+howpublished={Hugging Face},
+doi={10.57967/hf/6962}
+}
+```
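
The **Inference Pipeline** row in the diff above names mean pooling followed by L2 normalization. A minimal sketch of that post-processing step, assuming the model yields one vector per token plus an attention mask marking real tokens versus padding. The function names `mean_pool`, `l2_normalize`, and `embed`, and the toy 3-dimensional vectors, are hypothetical and not taken from the repository, whose actual embeddings have 4096 dimensions:

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask entry is 1; padding is ignored."""
    dim = len(token_embeddings[0])
    summed = [0.0] * dim
    count = 0
    for vec, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vec):
                summed[i] += value
    if count == 0:
        raise ValueError("attention_mask selects no tokens")
    return [s / count for s in summed]


def l2_normalize(vec, eps=1e-12):
    """Scale the vector to unit Euclidean length (eps guards against zero norm)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / max(norm, eps) for v in vec]


def embed(token_embeddings, attention_mask):
    """Hypothetical pipeline: mean pooling, then L2 normalization."""
    return l2_normalize(mean_pool(token_embeddings, attention_mask))


# Toy input: two real tokens and one padding token (mask 0).
tokens = [[3.0, 0.0, 0.0], [0.0, 4.0, 0.0], [9.0, 9.0, 9.0]]
mask = [1, 1, 0]
sentence_vec = embed(tokens, mask)
```

Because the output has unit length, the cosine similarity between two such vectors equals their dot product, which is one common reason to normalize embeddings before retrieval.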