Datalab’s Post

Launch Week — Day 2: We made Chandra faster (again) 🚀

Today we're introducing Chandra Small, our new latency-optimized OCR model, available now via the Datalab API. Chandra Small is 2–3x faster than the standard Chandra model with minimal performance degradation. We trained it with quantization-aware training (QAT), making it quantization-friendly and enabling even lower latency in production.

A few highlights:
⭐ 2–3x faster inference
⭐ ~30% latency reduction from reduced token usage
⭐ 2–4 pages/sec on an H100
⭐ Maintains strong performance on benchmarks like olmOCR

You can try Chandra Small today by using Fast mode in the API. Stay tuned for tomorrow's launch, and in the meantime, check out the links below!
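For anyone curious what switching on Fast mode might look like, here is a minimal sketch of a request that selects it. The endpoint URL, auth header, and the "mode" field are assumptions for illustration only; the official Datalab API docs are the source of truth for the real parameter names.

```python
# Illustrative sketch only: the endpoint URL, auth header, and request fields
# below are assumptions, not the documented Datalab API. Check the official
# API reference for the real interface.
import requests

API_KEY = "your-api-key"  # placeholder credential

with open("page.png", "rb") as f:
    resp = requests.post(
        "https://api.datalab.to/ocr",        # assumed endpoint
        headers={"X-Api-Key": API_KEY},      # assumed auth header
        files={"file": ("page.png", f, "image/png")},
        data={"mode": "fast"},               # assumed flag routing to Chandra Small
    )

resp.raise_for_status()
print(resp.json())  # OCR output; exact format depends on the API
```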

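Quantization-aware training, mentioned above, simulates low-precision arithmetic during training so the model keeps its accuracy once it is actually quantized. The toy PyTorch snippet below is a sketch of the general technique on a stand-in model, not Chandra Small's training code.

```python
# Toy illustration of quantization-aware training (QAT) in PyTorch, not the
# actual Chandra Small setup. Fake-quant observers are inserted so the network
# trains against simulated int8 precision.
import torch
import torch.nn as nn
from torch.ao.quantization import (
    QuantStub, DeQuantStub, get_default_qat_qconfig, prepare_qat, convert,
)

class TinyOCRBackbone(nn.Module):  # stand-in model for illustration
    def __init__(self):
        super().__init__()
        self.quant = QuantStub()       # tensors enter the quantized region here
        self.conv = nn.Conv2d(3, 16, 3, padding=1)
        self.relu = nn.ReLU()
        self.dequant = DeQuantStub()   # back to float at the output

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))

model = TinyOCRBackbone().train()
model.qconfig = get_default_qat_qconfig("fbgemm")
prepare_qat(model, inplace=True)       # insert fake-quant / observer modules

# ... run the usual training loop here; weights adapt to quantization noise ...

int8_model = convert(model.eval())     # materialize the actual int8 model
```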