Interesting new techniques - a pijou Collection

updated 3 days ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69
Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 86
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 56
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29, 2024 • 34
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115
Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 45
Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 49
A3: Android Agent Arena for Mobile GUI Agents

Paper • 2501.01149 • Published Jan 2, 2025 • 22
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6, 2025 • 35
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published Mar 7, 2025 • 46
Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published Mar 21, 2025 • 36
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 121
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published Apr 24, 2025 • 41
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83
Reward Reasoning Model

Paper • 2505.14674 • Published May 20, 2025 • 37
Using Reinforcement Learning to Train Large Language Models to Explain Human Decisions

Paper • 2505.11614 • Published May 16, 2025
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21, 2025 • 56
Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models

Paper • 2504.05258 • Published Apr 7, 2025 • 1
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Paper • 2506.00411 • Published May 31, 2025 • 32
Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5, 2025 • 27
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

Paper • 2506.05629 • Published Jun 5, 2025 • 37
HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

Paper • 2506.20452 • Published Jun 25, 2025 • 18
Lizard: An Efficient Linearization Framework for Large Language Models

Paper • 2507.09025 • Published Jul 11, 2025 • 19
Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

Paper • 2509.18824 • Published Sep 23, 2025 • 23
colbert-ir/colbertv2.0

0.1B • Updated Apr 5, 2024 • 18M • 357
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published Jan 29, 2026 • 42
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Paper • 2603.12793 • Published Mar 13, 2026 • 38
Latent Reasoning with Normalizing Flows

Paper • 2606.06447 • Published 4 days ago • 7