zjunlp
/

knowlm-13b-ie

@@ -124,75 +124,7 @@ Here [schema](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC
-# 4.Datasets
-| Name                   | Download                                                     | Quantity | Description                                                  |
-| ---------------------- | ------------------------------------------------------------ | -------- | ------------------------------------------------------------ |
-| InstructIE          | [Google drive](https://drive.google.com/file/d/1raf0h98x3GgIhaDyNn1dLle9_HvwD6wT/view?usp=sharing) <br/> [Baidu Netdisk](https://pan.baidu.com/s/1-u8bD85H1Otbzk-gjLxaFw?pwd=c1i6)  | 20w+  | InstrumentIE dataset (bilingual in Chinese and English) |
-The `InstructIE` dataset contains two core files: `InstructIE-zh.json` and `InstructIE-en.json`. Both files cover a range of fields that provide detailed descriptions of different aspects of the dataset:
-- `'id'`: A unique identifier for each data entry, ensuring the independence and traceability of the data items.
-- `'cate'`: The text's subject category, which provides a high-level categorical label for the content (there are 12 categories in total).
--'text ': The text to be extracted.
-- `'relation'`: Represent **relationship triples**, respectively. These fields allow users to freely construct instructions and expected outputs for information extraction.
-<details>
-  <summary><b>Explanation of each field</b></summary>
-| Field       | Description                                                      |
-| ----------- | ---------------------------------------------------------------- |
-| id          | The unique identifier for each data point.                       |
-| cate        | The category of the text's subject, with a total of 12 different thematic categories. |
-| input       | The input text for the model, with the goal of extracting all the involved relationship triples. |
-| instruction | Instructions guiding the model to perform information extraction tasks. |
-| output      | The expected output result of the model.                         |
-| relation    | Describes the relationship triples contained in the text, i.e., the connections between entities (head, relation, tail). |
-</details>
-<details>
-  <summary><b>Example of data</b></summary>
-    ```json
-    {
-        "id": "6e4f87f7f92b1b9bd5cb3d2c3f2cbbc364caaed30940a1f8b7b48b04e64ec403",
-        "cate": "Person",
-        "input": "Dionisio Pérez Gutiérrez  (born 1872 in Grazalema (Cádiz) - died 23 February 1935 in Madrid) was a Spanish writer, journalist, and gastronome. He has been called \"one of Spain's most authoritative food writers\" and was an early adopter of the term Hispanidad.\nHis pen name, \"Post-Thebussem\", was chosen as a show of support for Mariano Pardo de Figueroa, who went by the handle \"Dr. Thebussem\".",
-        "entity": [
-            {"entity": "Dionisio Pérez Gutiérrez", "entity_type": "human"},
-            {"entity": "Post-Thebussem", "entity_type": "human"},
-            {"entity": "Grazalema", "entity_type": "geographic_region"},
-            {"entity": "Cádiz", "entity_type": "geographic_region"},
-            {"entity": "Madrid", "entity_type": "geographic_region"},
-            {"entity": "gastronome", "entity_type": "event"},
-            {"entity": "Spain", "entity_type": "geographic_region"},
-            {"entity": "Hispanidad", "entity_type": "architectural_structure"},
-            {"entity": "Mariano Pardo de Figueroa", "entity_type": "human"},
-            {"entity": "23 February 1935", "entity_type": "time"}
-        ],
-        "relation": [
-            {"head": "Dionisio Pérez Gutiérrez", "relation": "country of citizenship", "tail": "Spain"},
-            {"head": "Dionisio Pérez Gutiérrez", "relation": "place of birth", "tail":"Grazalema"},
-            {"head": "Dionisio Pérez Gutiérrez", "relation": "place of death", "tail": "Madrid"},
-            {"head": "Mariano Pardo de Figueroa", "relation": "country of citizenship", "tail": "Spain"},
-            {"head": "Dionisio Pérez Gutiérrez", "relation": "alternative name", "tail": "Post-Thebussem"},
-            {"head": "Dionisio Pérez Gutiérrez", "relation": "date of death", "tail": "23 February 1935"}
-        ]
-    }
-    ```
-</details>
-# 5.Convert script
 **Training Data Transformation**
@@ -306,7 +238,7 @@ After data conversion, you will obtain structured data containing the `input` te
-# 6.Usage
 We provide a script, [inference.py](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/src/inference.py), for direct inference using the `zjunlp/knowlm-13b-ie model`. Please refer to the [README.md](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/README.md) for environment configuration and other details.
 ```bash
@@ -322,7 +254,7 @@ If GPU memory is not enough, you can use `--bits 8` or `--bits 4`.
-# 7.Evaluate
 We provide a script at [evaluate.py](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/kg2instruction/evaluate.py) to convert the string output of the model into a list and calculate F1

+# 4.Convert script
 **Training Data Transformation**
+# 5.Usage
 We provide a script, [inference.py](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/src/inference.py), for direct inference using the `zjunlp/knowlm-13b-ie model`. Please refer to the [README.md](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/README.md) for environment configuration and other details.
 ```bash
+# 6.Evaluate
 We provide a script at [evaluate.py](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/kg2instruction/evaluate.py) to convert the string output of the model into a list and calculate F1