Update README - Run 20251012_235237
README.md
CHANGED
|
@@ -21,7 +21,7 @@ model-index:
|
|
| 21 |
type: imagenet-1k
|
| 22 |
metrics:
|
| 23 |
- type: accuracy
|
| 24 |
-
value:
|
| 25 |
---
|
| 26 |
|
| 27 |
# David: Multi-Scale Feature Classifier
|
|
@@ -49,19 +49,18 @@ exist simultaneously in the same shared space with the correct checks and spacin
|
|
| 49 |
- **Epochs**: 10
|
| 50 |
- **Batch Size**: 1024
|
| 51 |
- **Learning Rate**: 0.01
|
| 52 |
-
- **Rose Loss Weight**: 0.
|
| 53 |
- **Cayley Loss**: False
|
| 54 |
|
| 55 |
## Performance
|
| 56 |
|
| 57 |
### Best Results
|
| 58 |
-
- **Validation Accuracy**:
|
| 59 |
-
- **Best Epoch**:
|
| 60 |
-
- **Final Train Accuracy**:
|
| 61 |
|
| 62 |
### Per-Scale Performance
|
| 63 |
-
- **Scale 256**:
|
| 64 |
-
- **Scale 512**: 66.44%
|
| 65 |
|
| 66 |
|
| 67 |
## Usage
|
|
@@ -79,18 +78,18 @@ AbstractPhil/david-shared-space/
|
|
| 79 |
├── best_model.json # Latest best model info
|
| 80 |
├── weights/
|
| 81 |
│   └── david_small_fast/
|
| 82 |
-
│       └──
|
| 83 |
│           ├── MODEL_SUMMARY.txt # 🎯 Human-readable performance summary
|
| 84 |
│           ├── training_history.json # 📈 Epoch-by-epoch training curve
|
| 85 |
-
│           ├──
|
| 86 |
-
│           ├──
|
| 87 |
│           ├── final_model.safetensors
|
| 88 |
│           ├── checkpoint_epoch_X_accYY.YY.safetensors
|
| 89 |
│           ├── david_config.json
|
| 90 |
│           └── train_config.json
|
| 91 |
└── runs/
|
| 92 |
    └── david_small_fast/
|
| 93 |
-
        └──
|
| 94 |
            └── events.out.tfevents.* # TensorBoard logs
|
| 95 |
```
|
| 96 |
|
|
@@ -104,8 +103,8 @@ from huggingface_hub import hf_hub_download
|
|
| 104 |
|
| 105 |
# Specify model variant and run
|
| 106 |
model_name = "david_small_fast"
|
| 107 |
-
run_id = "
|
| 108 |
-
accuracy = "
|
| 109 |
|
| 110 |
# Download config
|
| 111 |
config_path = hf_hub_download(
|
|
@@ -181,14 +180,14 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
|
|
| 181 |
|
| 182 |
### Loss Components
|
| 183 |
- **Cross-Entropy**: Standard classification loss
|
| 184 |
-
- **Rose Loss**: Pentachora role-weighted margin loss (weight: 0.
|
| 185 |
- **Cayley Loss**: Geometric regularization (disabled)
|
| 186 |
|
| 187 |
### Optimization
|
| 188 |
- **Optimizer**: AdamW
|
| 189 |
- **Weight Decay**: 1e-05
|
| 190 |
- **Scheduler**: cosine_restarts
|
| 191 |
-
- **Gradient Clip**:
|
| 192 |
- **Mixed Precision**: False
|
| 193 |
|
| 194 |
## Citation
|
|
@@ -199,7 +198,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
|
|
| 199 |
author = {AbstractPhil},
|
| 200 |
year = {2025},
|
| 201 |
url = {https://huggingface.co/AbstractPhil/david-shared-space},
|
| 202 |
-
note = {Run ID:
|
| 203 |
}
|
| 204 |
```
|
| 205 |
|
|
@@ -214,4 +213,4 @@ Special thanks to Claude (Anthropic) for debugging assistance.
|
|
| 214 |
|
| 215 |
---
|
| 216 |
|
| 217 |
-
*Generated on 2025-10-12 23:
|
|
|
|
| 21 |
type: imagenet-1k
|
| 22 |
metrics:
|
| 23 |
- type: accuracy
|
| 24 |
+
value: 61.64
|
| 25 |
---
|
| 26 |
|
| 27 |
# David: Multi-Scale Feature Classifier
|
|
|
|
| 49 |
- **Epochs**: 10
|
| 50 |
- **Batch Size**: 1024
|
| 51 |
- **Learning Rate**: 0.01
|
| 52 |
+
- **Rose Loss Weight**: 0.1 → 0.8
|
| 53 |
- **Cayley Loss**: False
|
| 54 |
|
| 55 |
## Performance
|
| 56 |
|
| 57 |
### Best Results
|
| 58 |
+
- **Validation Accuracy**: 61.64%
|
| 59 |
+
- **Best Epoch**: 0
|
| 60 |
+
- **Final Train Accuracy**: 54.39%
|
| 61 |
|
| 62 |
### Per-Scale Performance
|
| 63 |
+
- **Scale 256**: 61.64%
|
|
|
|
| 64 |
|
| 65 |
|
| 66 |
## Usage
|
|
|
|
| 78 |
├── best_model.json # Latest best model info
|
| 79 |
├── weights/
|
| 80 |
│   └── david_small_fast/
|
| 81 |
+
│       └── 20251012_235237/
|
| 82 |
│           ├── MODEL_SUMMARY.txt # 🎯 Human-readable performance summary
|
| 83 |
│           ├── training_history.json # 📈 Epoch-by-epoch training curve
|
| 84 |
+
│           ├── best_model_acc61.64.safetensors # ⭐ Accuracy in filename!
|
| 85 |
+
│           ├── best_model_acc61.64_metadata.json
|
| 86 |
│           ├── final_model.safetensors
|
| 87 |
│           ├── checkpoint_epoch_X_accYY.YY.safetensors
|
| 88 |
│           ├── david_config.json
|
| 89 |
│           └── train_config.json
|
| 90 |
└── runs/
|
| 91 |
    └── david_small_fast/
|
| 92 |
+
        └── 20251012_235237/
|
| 93 |
            └── events.out.tfevents.* # TensorBoard logs
|
| 94 |
```
|
| 95 |
|
|
|
|
| 103 |
|
| 104 |
# Specify model variant and run
|
| 105 |
model_name = "david_small_fast"
|
| 106 |
+
run_id = "20251012_235237"
|
| 107 |
+
accuracy = "61.64" # From MODELS_INDEX.json
|
| 108 |
|
| 109 |
# Download config
|
| 110 |
config_path = hf_hub_download(
|
|
|
|
| 180 |
|
| 181 |
### Loss Components
|
| 182 |
- **Cross-Entropy**: Standard classification loss
|
| 183 |
+
- **Rose Loss**: Pentachora role-weighted margin loss (weight: 0.1→0.8)
|
| 184 |
- **Cayley Loss**: Geometric regularization (disabled)
|
| 185 |
|
| 186 |
### Optimization
|
| 187 |
- **Optimizer**: AdamW
|
| 188 |
- **Weight Decay**: 1e-05
|
| 189 |
- **Scheduler**: cosine_restarts
|
| 190 |
+
- **Gradient Clip**: 15.0
|
| 191 |
- **Mixed Precision**: False
|
| 192 |
|
| 193 |
## Citation
|
|
|
|
| 198 |
author = {AbstractPhil},
|
| 199 |
year = {2025},
|
| 200 |
url = {https://huggingface.co/AbstractPhil/david-shared-space},
|
| 201 |
+
note = {Run ID: 20251012_235237}
|
| 202 |
}
|
| 203 |
```
|
| 204 |
|
|
|
|
| 213 |
|
| 214 |
---
|
| 215 |
|
| 216 |
+
*Generated on 2025-10-12 23:57:17*
|
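
Review note: the updated usage snippet sets `model_name`, `run_id`, and `accuracy`, and the updated repository tree shows where those values appear in file paths. The sketch below derives the repo-relative paths from those three values. The `RunFiles` class and its property names are hypothetical helpers, not part of the repository, and the path layout is inferred only from the tree in this diff, so treat it as an assumption rather than the author's loading code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunFiles:
    """Repo-relative paths for one training run (hypothetical helper)."""

    model_name: str
    run_id: str
    accuracy: str  # kept as a string so "61.64" is never reformatted

    @property
    def run_dir(self) -> str:
        # Layout assumed from the README tree: weights/<model>/<run_id>/
        return f"weights/{self.model_name}/{self.run_id}"

    @property
    def config(self) -> str:
        return f"{self.run_dir}/david_config.json"

    @property
    def best_weights(self) -> str:
        # The accuracy is embedded in the checkpoint filename.
        return f"{self.run_dir}/best_model_acc{self.accuracy}.safetensors"

    @property
    def best_metadata(self) -> str:
        return f"{self.run_dir}/best_model_acc{self.accuracy}_metadata.json"


files = RunFiles("david_small_fast", "20251012_235237", "61.64")
print(files.config)
print(files.best_weights)
```

Each resulting string could then be passed as the `filename` argument of `hf_hub_download`, with `AbstractPhil/david-shared-space` as the `repo_id`, provided the published layout matches the tree shown in the diff.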