Jemin Lee's picture

2 4 1

Jemin Lee

leejaymin

·

AI & ML interests

None yet

Organizations

authored a paper 7 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3 • 39

authored 4 papers about 1 year ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

Mixed Non-linear Quantization for Vision Transformers

Paper • 2407.18437 • Published Jul 26, 2024

Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems

Paper • 2303.12557 • Published Mar 22, 2023

Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment

Paper • 2202.05048 • Published Feb 10, 2022