Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

OpenDataArena

community
https://opendataarena.github.io
OpenDataArena
Activity Feed

AI & ML interests

Data-centric AI, LLM, MLLM

Recent Activity

QizhiPei  authored a paper 1 day ago
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch
QizhiPei  authored a paper 1 day ago
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility
QizhiPei  authored a paper 1 day ago
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
View all activity

Papers

Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training

View all Papers

caimengzhang's profile pictureLijun Wu's profile pictureHonglin Lin's profile pictureZhanping Zhong's profile pictureZhuoshi Pan's profile pictureXiaoranShang's profile pictureQizhiPei's profile picturegaoxin's profile pictureMengyuan Sun's profile pictureZheng Liu's profile pictureYU LI's profile picturexiaoyang wang's profile pictureDream's profile pictureChuxue Cao's profile picture
OpenDataArena 's Papers 4
Submitted by
YU LI
16

Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

OpenDataArena OpenDataArena
12 2
Submitted by
Lijun Wu
13

Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training

OpenDataArena OpenDataArena
2
9

Closing the Data Loop: Using OpenDataArena to Engineer Superior Training Datasets

OpenDataArena OpenDataArena
Submitted by
Lijun Wu
47

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

OpenDataArena OpenDataArena
140 7
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs