German Wikipedia LMs

non-profit

AI & ML interests

language modeling

Recent Activity

stefan-it authored a paper about 1 month ago
SindBERT, the Sailor: Charting the Seas of Turkish NLP
stefan-it authored a paper about 2 months ago
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models
stefan-it authored a paper 3 months ago
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Team members: Stefan Schweter

gwlms's datasets (8)

gwlms/dewiki-20230701-flair-corpus

Updated Jun 10, 2024 • 45.6M rows • 218 downloads

gwlms/validation

Updated Jan 5, 2024 • 15.6k rows • 23 downloads

gwlms/biofid

Updated Aug 23, 2023 • 10 downloads

gwlms/germeval2018

Updated Jul 26, 2023 • 25 downloads

gwlms/dewiki-20230701-chunks

Updated Jul 19, 2023 • 680 downloads

gwlms/dewiki-20230701-tfrecords-dupe5

Updated Jul 19, 2023 • 155 downloads

gwlms/dewiki-20230701-nltk-corpus

Updated Jul 19, 2023 • 61.6M rows • 49 downloads

gwlms/dewiki-20230701

Updated Jul 19, 2023 • 2.73M rows • 42 downloads