Instructions to use ishanmakkar/visual-grounding-adapter with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use ishanmakkar/visual-grounding-adapter with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-VL-7B-Instruct") model = PeftModel.from_pretrained(base_model, "ishanmakkar/visual-grounding-adapter") - Notebooks
- Google Colab
- Kaggle
Visual Grounding Adapter
Fine-tuned adapter for Qwen2-VL-7B for visual grounding tasks.
Usage
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from peft import PeftModel
import torch
# Load base
model = Qwen2VLForConditionalGeneration.from_pretrained(
"Qwen/Qwen2-VL-7B-Instruct",
device_map="auto",
torch_dtype=torch.float16
)
# Load adapter
model = PeftModel.from_pretrained(model, "YOUR_USERNAME/visual-grounding-adapter")
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")
Training
- Dataset: Custom diagrams with bounding boxes
- LoRA rank: 8-16
- Epochs: 2-3
- Hardware: Google Colab T4
- Downloads last month
- 4
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support