Joe Tyler's picture

16 6

Joe Tyler

JoeTyler

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

SonarSource/SonarSweep-java-gpt-oss-20b

View all activity

Organizations

upvoted 5 papers 3 months ago

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Paper • 2509.06949 • Published Sep 8, 2025 • 55

Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning

Paper • 2509.06461 • Published Sep 8, 2025 • 19

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 32

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 101

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 143

upvoted 4 papers 10 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 252

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3, 2025 • 68

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 19

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

upvoted 3 papers 11 months ago

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17, 2025 • 46

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16, 2025 • 30

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24, 2025 • 58

upvoted a paper 12 months ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2, 2025 • 18

upvoted a paper about 1 year ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 87

upvoted 2 papers over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24, 2024 • 17