BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

craffel authored a paper 8 days ago

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

christopher new activity 21 days ago

bigscience/mt0-large:why mt0-large is 1.3B while mt5-large is 780M?

christopher new activity 21 days ago

bigscience/bloom-560m:Geração de Texto

View all activity

authored a paper 8 days ago

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

Paper • 2606.11409 • Published 15 days ago • 9

authored 2 papers 12 days ago

Multilingual Refusal Alignment for Safer Large Language Models

Paper • 2606.07535 • Published Apr 24

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

Paper • 2606.12250 • Published 14 days ago

in bigscience/mt0-large 21 days ago

why mt0-large is 1.3B while mt5-large is 780M?

#6 opened almost 2 years ago by

in bigscience/bloom-560m 21 days ago

Geração de Texto

#63 opened 7 months ago by

alcidesmoreira1963

Adding Evaluation Results

#61 opened over 2 years ago by

leaderboard-pr-bot

in bigscience/T0 21 days ago

Hosted inference API: 500 Internal Server Error returned

#4 opened over 3 years ago by

in bigscience/bloom-1b1 21 days ago

Adding Evaluation Results

#41 opened over 2 years ago by

leaderboard-pr-bot

Adding Evaluation Results

#42 opened about 2 years ago by

leaderboard-pr-bot

Add evaluation results on the mathemakitten--winobias_antistereotype_test config and test split of mathemakitten/winobias_antistereotype_test

#32 opened over 3 years ago by

System Requirements

#38 opened over 3 years ago by

Request: DOI

#43 opened over 1 year ago by

authored a paper 22 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

Paper • 2603.06148 • Published Mar 6 • 2

authored a paper 23 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published 26 days ago • 28

submitted a paper to Daily Papers 23 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published 26 days ago • 28

authored 2 papers 2 months ago

Scaling Low-Resource MT via Synthetic Data Generation with LLMs

Paper • 2505.14423 • Published May 20, 2025 • 2

Open Machine Translation for Esperanto

Paper • 2603.29345 • Published Mar 31

authored a paper 3 months ago

Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning

Paper • 2601.18722 • Published Jan 26

RTT1

authored a paper 3 months ago

EvoClaw: Evaluating AI Agents on Continuous Software Evolution

Paper • 2603.13428 • Published Mar 13 • 21

authored a paper 3 months ago

Agentic Uncertainty Reveals Agentic Overconfidence

Paper • 2602.06948 • Published Feb 6