Create README.md

Browse files

Files changed (1) hide show

README.md +80 -0

README.md ADDED Viewed

	@@ -0,0 +1,80 @@

+---
+language: en
+license: mit
+library_name: transformers
+tags:
+  - climate-change
+  - domain-adaptation
+  - masked-language-modeling
+  - scientific-nlp
+  - transformer
+  - BERT
+  - ClimateBERT
+metrics:
+  - f1
+model-index:
+  - name: SciClimateBERT
+    results:
+      - task:
+          type: text-classification
+          name: Climate NLP Tasks (ClimaBench)
+        dataset:
+          name: ClimaBench
+          type: benchmark
+        metrics:
+          - type: f1
+            name: Macro F1 (avg)
+            value: 57.83
+---
+# SciClimateBERT 🌎🔬
+**SciClimateBERT** is a domain-adapted version of **ClimateBERT**, further pretrained on peer-reviewed scientific papers focused on climate change. While ClimateBERT is tuned for general climate-related text, SciClimateBERT narrows the focus to high-quality academic content, improving performance in scientific NLP applications.
+## 🔍 Overview
+- **Base Model**: ClimateBERT (RoBERTa-based architecture)
+- **Pretraining Method**: Continued pretraining (domain adaptation) with Masked Language Modeling (MLM)
+- **Corpus**: Scientific climate change literature from top-tier journals
+- **Tokenizer**: ClimateBERT tokenizer (unchanged)
+- **Language**: English
+- **Domain**: Scientific climate change research
+## 📊 Performance
+Evaluated on **ClimaBench**, a benchmark suite for climate-focused NLP tasks:
+| Metric         | Value        |
+|----------------|--------------|
+| Macro F1 (avg) | 57.83|
+| Tasks won      | 0/7  |
+| Avg. Std Dev   | 0.01747|
+While based on ClimateBERT, this model focuses on structured scientific input, making it ideal for downstream applications in climate science and research automation.
+## 🧪 Intended Uses
+**Use for:**
+- Scientific climate change text classification and extraction
+- NLP-powered climate science discovery tools
+- Knowledge base and graph construction in climate policy and research domains
+**Not suitable for:**
+- Non-scientific general-purpose text
+- Multilingual applications
+## ⚠️ Limitations
+- May reflect scientific publication biases
+## 🧾 Citation
+If you use this model, please cite:
+```bibtex
+@article{poleksic_etal_2025,
+  title={Climate Research Domain BERTs: Pretraining, Adaptation, and Evaluation},
+  author={Poleksić, Andrija  and
+      Martinčić-Ipšić, Sanda},
+  journal={None},
+  year={2025}
+}