We release 67,000+ trajectories from 3,800 resolved issues across 1,800+ Python repos, about 3x more successful trajectories and 1.5x more repos than our previous dataset. Trajectories are long: 64 turns on average, up to 100 turns, with context lengths up to 131k tokens.
> RFT on this data improves SWE-bench Verified Pass@1: Qwen3-30B-Instruct 25.7% → 50.3%, Qwen3-235B-Instruct 46.2% → 61.7%. We also see strong gains on SWE-rebench (September).
> We also ran extensive evals: OpenHands with 100- and 500-turn limits, comparing models under both, on SWE-bench Verified and several months of SWE-rebench.
> We also validate the tests written by the models: how often a generated test is correct, and how often the final patch passes the tests written alongside it. This yields a pool of tests for verifiers and automatic graders.
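As a rough illustration of the kind of check this implies (not our actual pipeline), here is a minimal sketch of a fail-to-pass check for a model-written test: the test should fail on the unpatched checkout and pass once the model's final patch is applied. The `run_pytest` and `apply_patch` helpers, the plain-git checkout, and pytest as the test runner are all assumptions for the sketch; in practice each check would run in a fresh, isolated environment.

```python
import subprocess
from pathlib import Path


def run_pytest(repo: Path, test_path: str) -> bool:
    """Run one test file/node with pytest and report whether it passed."""
    proc = subprocess.run(
        ["python", "-m", "pytest", "-q", test_path],
        cwd=repo, capture_output=True, text=True,
    )
    return proc.returncode == 0


def apply_patch(repo: Path, patch_file: Path) -> bool:
    """Apply the model's final patch (a unified diff) to the checkout."""
    proc = subprocess.run(
        ["git", "apply", str(patch_file)],
        cwd=repo, capture_output=True, text=True,
    )
    return proc.returncode == 0


def validate_generated_test(repo: Path, test_path: str, patch_file: Path) -> bool:
    """Count a model-written test as correct only if it is fail-to-pass:
    it fails before the final patch is applied and passes afterwards.
    Note: this mutates the checkout, so each call assumes a fresh clone."""
    fails_before = not run_pytest(repo, test_path)
    if not apply_patch(repo, patch_file):
        return False
    passes_after = run_pytest(repo, test_path)
    return fails_before and passes_after
```

Tests that clear this kind of check are what make up the pool usable by verifiers and automatic graders.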