For startups — especially those operating in the social sector — success depends on breaking large problems into smaller, solvable pieces.
Scaling solutions to more than 20-30 Indian languages will demand heavy contextualisation, requiring startups to gather datasets, clean them, and organise them for usability, he said.
Government programmes such as the IndiaAI Mission, he suggested, should prioritise multilingual challenges in rural India and deploy credible solutions at scale.
Large schemes like IndiaAI should also focus on extracting and digitising available information nationwide, especially in rural regions.
“One crucial step is to cast the net wide, both geographically and across time. Don’t just look at today — go back, maybe 1,000 years. Look at everything written on paper, digitise it, make it machine-readable. Then you’re strategically placing yourself to extract every piece of information this country has ever produced, long before 1947,” Sivasubramanian said, adding that much of this data is still inaccessible to LLMs.