Electoral Roll Digitization

AI-powered extraction of voter data from Indian electoral roll PDFs — Murshidabad District, West Bengal
Why this exists: In 2025–26, the Election Commission of India launched a Special Intensive Revision (SIR) of electoral rolls in West Bengal and Bihar — an exercise the ECI itself called unprecedented. In West Bengal alone, the final roll left 60 lakh voters marked "Under Adjudication", their right to vote uncertain. Yet the Commission publishes this data only as scanned PDF images — no searchable database, no structured data, no public dashboard. This project uses AI vision models to extract every voter record from these PDFs, making SIR's impact measurable and open to public scrutiny for the first time.

Murshidabad District — Overview

Note: One PDF for AC 72 (Bahrampur) is missing (likely corrupt), so 297 of 298 booths were processed. See processing summary for details.
Constituencies analysed: 2
Total voters extracted: 477,091
Under adjudication: 118,891
Adjudication rate: 24.9%
PDF files processed: 544

Constituencies

56 — Samserganj

Assembly Constituency No. 56 · 247 PDF files
Voters: 235,747
Adjudicated: 107,751
Adjudication rate: 45.7%

72 — Bahrampur

Assembly Constituency No. 72 · 297 of 298 PDF files (1 missing/corrupt)
Voters: 241,499
Adjudicated: 11,319
Adjudication rate: 4.7%
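
The adjudication rates on this page can be reproduced from the counts shown. The sketch below assumes the rate is simply adjudicated voters divided by voters extracted, expressed as a percentage and rounded to one decimal place; the page does not state the formula, but the published figures are consistent with it. The function name `adjudication_rate` and the dictionary keys are hypothetical, chosen only for illustration.

```python
def adjudication_rate(adjudicated: int, voters: int) -> float:
    """Return adjudicated voters as a percentage of voters extracted."""
    if voters <= 0:
        raise ValueError("voters must be positive")
    return 100.0 * adjudicated / voters


# Counts copied from this page as (adjudicated, voters).
# The keys are illustrative labels, not identifiers used by the project.
published = {
    "56 Samserganj": (107_751, 235_747),
    "72 Bahrampur": (11_319, 241_499),
    "District overview": (118_891, 477_091),
}

# Assumption: rates are rounded to one decimal place, as displayed on the page.
rates = {
    name: round(adjudication_rate(adjudicated, voters), 1)
    for name, (adjudicated, voters) in published.items()
}

for name, rate in rates.items():
    print(f"{name}: {rate}%")
```

Run as written, this prints 45.7% for Samserganj, 4.7% for Bahrampur, and 24.9% for the district overview, matching the figures shown above.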

Analysis & Reports