Is India Leveraging AI to Enhance Official Data Collection?
Synopsis
Key Takeaways
Washington, Feb 17 (NationPress) India is embracing artificial intelligence, expanded labour surveys, and tax data integration to modernise its extensive statistical system, according to its chief statistician, as the need for quicker and more detailed economic data intensifies.
Saurabh Garg, secretary of the Ministry of Statistics and Programme Implementation, discussed in a recent IMF podcast how the nation still employs door-to-door surveys but has transitioned to tablet-based data collection, integrating new digital tools to enhance efficiency and precision.
“We still go door-to-door, to households and businesses, but data collection is now tablet-based,” Garg stated to the IMF. “On the tablet itself, we are integrating AI chatbots; if the surveyor has a question, the chatbot provides instant responses.”
All field data is uploaded directly to the e-SIGMA portal, facilitating streamlined processing and validation. The ministry has also revamped its website, launched a mobile application, and expanded infographics and direct download options. Training videos now assist researchers and policymakers in accessing unit-level data.
As more private and high-frequency indicators emerge, Garg acknowledged the expanding ecosystem of alternative data. “Alternative data and high-frequency indicators are becoming increasingly available and offer supplementary insights into the socioeconomic landscape from different angles,” he remarked.
India is investigating how “e-commerce, scanner, mobile, satellite, and other alternative data can be integrated into official statistics.” Nonetheless, Garg emphasized that official figures “remain as pertinent as ever” because they are “grounded in stringent scientific methods and standards that ensure international comparability.”
To harmonise datasets across ministries, the government has established a registry of government datasets and developed a national metadata framework. It has aligned classifications with international standards and standardised unique identifiers for organisations and geographic locations. “Each agency must apply its own identifier; this guarantees that two datasets can be interpreted collectively,” he stated. A reconciliation method has also been introduced to resolve discrepancies among administrative datasets.
India is preparing to implement revisions under the System of National Accounts 2025 as it updates GDP metrics. “We’re integrating the changes mandated by SNA 2025 into our rebased methodology and new guidelines,” Garg mentioned, with updated guidelines anticipated within the next couple of years.
He described India’s production-side data as “incredibly reliable” and noted that the availability of goods and services tax data will refine expenditure estimates. “A robust production-side dataset will be complemented by an equally robust consumption-side dataset,” he affirmed, adding that this “will guarantee that the overall accounts are more precise.”
Regarding employment, Garg indicated that since January 2025, the ministry has published a monthly labour force report encompassing both urban and rural sectors. “We have nearly doubled the sample size. This has enhanced data granularity,” he shared. New inquiries on employment status, education, training, and skill levels have also been incorporated.