Speaking in an interview with the Financial Times, Suleyman talked about "professional-grade AGI" and how Microsoft expects it to capture a large share of the enterprise market.
OCR isn’t a large language model. That’s why, with poor-quality scans or damaged text, you sometimes get garbled nonsense from it. It’s not determining the statistically most likely next word; it’s matching input to possible individual characters.
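To make the per-character point concrete, here's a toy sketch in Python. It is not how any real OCR engine is implemented; the 3x5 bitmaps, the `TEMPLATES` table, and the function names are all made up for illustration. Each glyph is matched against templates on its own, so a damaged "O" can come out as "C" even when the surrounding word makes "O" the obvious reading.

```python
# Toy per-character template matcher (illustrative only, not a real OCR engine).
# Each glyph is a 3x5 bitmap; '#' is ink, '.' is background.
TEMPLATES = {
    "C": ("###", "#..", "#..", "#..", "###"),
    "O": ("###", "#.#", "#.#", "#.#", "###"),
    "L": ("#..", "#..", "#..", "#..", "###"),
    "D": ("##.", "#.#", "#.#", "#.#", "##."),
}

def distance(a, b):
    """Count the pixels where two bitmaps differ (Hamming distance)."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def recognize_glyph(glyph):
    """Pick the template closest to this glyph, ignoring all context."""
    return min(TEMPLATES, key=lambda ch: distance(glyph, TEMPLATES[ch]))

def recognize(glyphs):
    """Classify each glyph independently and join the results."""
    return "".join(recognize_glyph(g) for g in glyphs)

clean = [TEMPLATES[ch] for ch in "COLD"]

# Simulate scan damage: the "O" loses two pixels on its right edge.
damaged_o = ("###", "#.#", "#..", "#..", "###")
damaged = [TEMPLATES["C"], damaged_o, TEMPLATES["L"], TEMPLATES["D"]]

print(recognize(clean))    # COLD
print(recognize(damaged))  # CCLD
```

The matcher has no idea that "COLD" is a word and "CCLD" isn't, which is exactly the kind of nonsense output you see on bad scans.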
I mean using LLMs for OCR, like Gemini 3 Flash or Kimi K2.5.