lucyknada commited on
Commit
e9e912f
·
verified ·
1 Parent(s): f65d36c

Upload ./README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - SWE-bench/SWE-smith
5
+ language:
6
+ - en
7
+ base_model:
8
+ - Qwen/Qwen2.5-Coder-32B-Instruct
9
+ pipeline_tag: text-generation
10
+ tags:
11
+ - agent
12
+ - software engineering
13
+ ---
14
+ ### exl2 quant (measurement.json in main branch)
15
+ ---
16
+ ### check revisions for quants
17
+ ---
18
+
19
+
20
+ <div align="center">
21
+ <img src="https://swe-agent.com/latest/assets/swe-agent.svg" alt="Logo" width="200">
22
+ <h1 align="center">SWE-agent LM</h1>
23
+ </div>
24
+
25
+ <p align="center">
26
+ <a href="https://github.com/SWE-bench/SWE-smith">Code</a>
27
+
28
+ <a href="https://arxiv.org/abs/2504.21798">Paper</a>
29
+
30
+ <a href="https://swesmith.com/">Site</a>
31
+ </p>
32
+
33
+ SWE-agent-LM-32B is a Language Model for Software Engineering trained using the [SWE-smith](https://github.com/SWE-bench/SWE-smith) toolkit.
34
+ We introduce this model as part of our work: [SWE-smith: Scaling Data for Software Engineering Agents](https://swesmith.com).
35
+
36
+ SWE-agent-LM-32B is 100% open source.
37
+ Training this model was simple - we fine-tuned Qwen 2.5 Coder Instruct on 5k trajectories generated by SWE-agent + Claude 3.7 Sonnet.
38
+ The dataset can be found [here](https://huggingface.co/datasets/SWE-bench/SWE-smith-trajs-250429).
39
+
40
+ SWE-agent-LM-32B is compatible with [SWE-agent](https://github.com/SWE-agent/SWE-agent).
41
+ Running this model locally only takes a few steps!
42
+ Check [here]() for more instructions on how to do so.
43
+
44
+ If you found this work exciting and want to push SWE-agents further, please feel free to connect with us (the [SWE-bench team](https://swe-bench.github.io/)) more!