IIT Bombay unveils new model to read satellite images using natural language prompts
Press Trust of India | September 3, 2025 | 07:37 PM IST | 2 mins read
IIT Bombay's Adaptive Modality-guided Visual Grounding (AMVG) model bridges the gap between how humans prompt and how machines analyse satellite or remote sensing imagery.
MUMBAI: The Indian Institute of Technology Bombay (IIT Bombay) has developed a model, Adaptive Modality-guided Visual Grounding (AMVG), which enables machines to interpret satellite and remote sensing images using natural, often ambiguous, human language prompts. AMVG bridges the gap between how humans prompt and how machines analyse satellite or remote sensing imagery, said an IIT Bombay study.
"Remote sensing images are rich in detail but extremely challenging to interpret automatically. While visual grounding has progressed significantly, current models fail to transfer effectively to remote sensing scenarios, especially when the commands are ambiguous or context-dependent," explained Shabnam Choudhury, the study's lead author and a PhD researcher at IIT Bombay.
The volume of remote sensing data grows with every passing year. These images, captured from far above the Earth by satellites, drones and aircraft, are cluttered with tiny objects, atmospheric noise and scale variations. In them, a building might appear like a runway, and a runway like a river.
IIT Bombay's study in ISPRS journal
The IIT Bombay study, published in the ISPRS Journal of Photogrammetry and Remote Sensing, demonstrates how AMVG acts like a sophisticated translation system, interpreting prompts in everyday human language and identifying objects reliably. Most models employ a two-step method for visual grounding: first they propose regions, then they rank them. AMVG, by contrast, uses four key innovations: a Multi-modal Deformable Attention layer, a Multi-stage Tokenised Encoder (MTE), a Multi-modal Conditional Decoder, and an Attention Alignment Loss (AAL).
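For readers who want a concrete picture of the conventional two-step method mentioned above, here is a toy sketch in Python. It is not AMVG and not code from the study: the names `Region`, `propose_regions` and `rank_regions`, the hand-tagged "image" and the word-overlap score are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Region:
    box: tuple       # (x_min, y_min, x_max, y_max) in pixels
    tags: frozenset  # toy stand-in for the visual features of the region


def propose_regions(tagged_image):
    """Step 1: propose candidate regions (here, read from a pre-tagged toy image)."""
    return [Region(box, frozenset(tags)) for box, tags in tagged_image]


def rank_regions(regions, prompt):
    """Step 2: rank candidates by how many prompt words match each region's tags."""
    words = set(prompt.lower().split())
    return sorted(regions, key=lambda r: len(words & r.tags), reverse=True)


# Hypothetical scene with three candidate objects.
toy_image = [
    ((10, 10, 60, 20), {"long", "grey", "runway"}),
    ((80, 40, 120, 90), {"square", "grey", "building"}),
    ((0, 100, 200, 110), {"long", "blue", "river"}),
]

ranked = rank_regions(propose_regions(toy_image), "the long grey runway")
best = ranked[0]
print(best.box)
```

In this toy scene the runway wins because it shares the most words with the prompt. A vaguer prompt such as "the long one" would tie the runway and the river, which hints at why ambiguous commands are hard for a propose-then-rank pipeline.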
Think of AAL like a coach, Choudhury said, teaching the model where to look; if the model's "attention" drifts too far, it gently nudges it back. This is not just technological progress: its real-world implications range from disaster response and military surveillance to urban planning and agricultural productivity, she said. "One of the most exciting applications for us is disaster response," Choudhury stated.
The researchers have open-sourced the entire model, making AMVG's complete implementation publicly available on GitHub. "Open-sourcing AMVG was a deliberate choice, and a deeply personal one too. We believe that real scientific impact happens when your work doesn't just sit behind a paywall.
By publishing our framework end-to-end, we're hoping to encourage transparency, reproducibility, and rapid iteration in remote sensing-visual grounding research," she said. However, AMVG still depends on the availability of high-quality, annotated datasets. "Its performance may vary across sensors or regions it hasn't seen before. Although it's more efficient than previous models, deploying it in real-time or on edge devices needs further optimisation," Choudhury added.