AI & ML interests
Enterprise-grade AI models
Recent Activity
Papers
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Articles
IBM Granite
Granite is a family of open, enterprise-grade AI models that are performant, efficient, and trustworthy.
- π€ Granite SLMs - Our latest language models, faster, leaner, and built for agentic workloads.
- ποΈβπ¨οΈ Granite Vision - VLM with a special emphasis on document-related tasks.
- π₯ Granite Docling - Document models for enterprise document workflows.
- π£οΈ Granite Speech - Models for automatic speech recognition and spoken language understanding.
- π Granite Embedding - High-quality embedding models for RAG and semantic search.
- π¦Ί Granite Guardian - Safety and content moderation models for responsible AI deployments.
- π Granite Time Series - Models purpose-built for enterprise time series data.
- π§© Granite Libraries - Libraries of adapters that supercharge a wide range of capabilities.
Resources
- π Docs: ibm.com/granite/docs/models/granite
- π§ͺ Playground: ibm.com/granite/playground
- π GitHub: github.com/ibm-granite
-
ibm-granite/granite-4.1-30b
Text Generation β’ 29B β’ Updated β’ 22k β’ 110 -
ibm-granite/granite-4.1-8b
Text Generation β’ 9B β’ Updated β’ 44.7k β’ 172 -
ibm-granite/granite-4.1-3b
Text Generation β’ 3B β’ Updated β’ 20.9k β’ 63 -
ibm-granite/granite-4.1-30b-base
Text Generation β’ 29B β’ Updated β’ 2.85k β’ 24
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 423k β’ 1.18k -
ibm-granite/granite-docling-258M-mlx
Image-Text-to-Text β’ 0.3B β’ Updated β’ 3.66k β’ 94 -
granite-docling-258M demo
π276Extract and convert document content from images
-
Granite Docling 258M WebGPU
π£158Convert document images to editable HTML
-
ibm-granite/granite-4.1-30b
Text Generation β’ 29B β’ Updated β’ 22k β’ 110 -
ibm-granite/granite-4.1-8b
Text Generation β’ 9B β’ Updated β’ 44.7k β’ 172 -
ibm-granite/granite-4.1-3b
Text Generation β’ 3B β’ Updated β’ 20.9k β’ 63 -
ibm-granite/granite-4.1-30b-base
Text Generation β’ 29B β’ Updated β’ 2.85k β’ 24
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 423k β’ 1.18k -
ibm-granite/granite-docling-258M-mlx
Image-Text-to-Text β’ 0.3B β’ Updated β’ 3.66k β’ 94 -
granite-docling-258M demo
π276Extract and convert document content from images
-
Granite Docling 258M WebGPU
π£158Convert document images to editable HTML
spaces 11
Multimodal RAG with Granite Vision
RAG example using Granite [vision, embedding, instruct]
Granite Embedding R2 Models Demo
Rank passages by relevance to a query using embeddings
Granite 4.0 1B Speech
Granite 4.0 1B Speech recognition and translation demo
Granite Speech WebGPU
Transcribe and translate audio to text directly in your browser
Granite Vision Document Intelligence
Document intelligence with Granite-Vision-4.1-4B