
Samava
Mobilising the potential of rural Indians

SamaVA
A digital community of rural Indians across 20 states, that connects them to short-term digital opportunities.

In 2021, recognising the significant increase of internet users across the country, Samava launched a digital community to connect thousands of people in rural India, tap into their pools of potential and bring them to the global stage.
In the past year, Samava has run operations in 81 districts across 21 states and UTs of India - in Assam, Andhra Pradesh, Bihar, Chattisgarh, Gujarat, Haryana and Delhi NCR, Karnataka, Kerala, Madhya Pradesh, Maharashtra, Odisha, Punjab and Chandigarh, Rajasthan, Tamil Nadu, Telangana, Uttar Pradesh, Uttarakhand, and West Bengal.
By 2023, we are working to have operations running in every Indian states.

25,000 CONTRIBUTORS


The Samava Network
85 PARTNERS
84 DISTRICTS ACROSS 20 STATES
ANDHRA PRADESH
Krishna
Anantpur
Visakhapatnam
ASSAM
Dibrugarh
Urban and Rural Kamrup
Barpeta
Lakhimpur
BIHAR
Saran
East Champaran
Deoria
Darbhanga
Gaya
Lakhisarai
Kishanganj
Madhepura
Saharsa
Bhagalpur
CHATTISGARH
Bilaspur
Raigarh
Kabirdham
Srugja
GUJARAT
Ahmedabad
HARYANA AND DELHI-NCR
Gurugram
Delhi
KARNATAKA
Dakshin Kannada
Gulbarga
Dharwad
Bellary
Mysore
Mangalore
KERALA
Kasaragod
Kochi
Thiruvananthapuram
MADHYA PRADESH
Nagod
Jhansi
Sultanpur
Rewa

MAHARASHTRA
Sindhudurg
Nashik
Pune
Nagpur
Mumbai
ODISHA
Koraput
Mayurbhanj
Sambalpur
Puri
PUNJAB
Amritsar
Fatehgarh sahib
RAJASTHAN
Nagaur
Jodhpur
Jaisalmer
Jaipur
Ajmer
TAMIL NADU
Cuddalore
Theni
Chennai
Tiruneveli
TELANGANA
Karimnagar
Nalgonda
UTTAR PRADESH
Varanasi
Etah
Hamirpur
Muzaffarnagar
Kannauj
Mathura
Bundelkhand
UTTRAKHAND
Tehri Garhwal
WEST BENGAL
Dakshin Dinajpur
Bansberia
Malda
Mednipur
Jalpaigudi
Purulia
Kolkata
South 24 Parganas
Dimond Harbor
Sagar Island
Birbhum
Hooghly
Howrah
Murshidabad
Nadia
Paschin Burdawan
Purba Burdawan
Sodpur
Canning


How we Do It
NGO Partners
We work with Community Partners who work as our direct point of contacts in each of the districts. So far, we have formed a network of 83 grassroot level Non-profits from each of the communities we are working in.
On-Ground Mobilisers
Calcutta Foundation employs on-average 5 on-ground mobilisers from the community, who work closely with our team to lead local communications, support participants and ensure high quality work
Tech and Research Partners
We partner with Tech organisations like Microsoft, Navana Tech and Karya Inc. to that require different kinds of work and provide platforms for the work to be fulfilled and Research organisations like J-Pal South Asia, the Massachusetts Institute for Technology, and IIT-Madras to work with our communities.


The Samava Process
Samava Head-Office
A well-trained team with 2+ years of implementation and data-collection experience to manage, track and maintain high quality & swift timelines
-Training
-Monitoring
-Quality Control
-Timelines
Data Collection
On-ground Mobilizers
Contributors from across demographic profiles are recruited to contribute to diverse data sets. They are trained by the head office remotely and work directly with our head office & mobilizers through the process
Lead by our NGO rep, moblisers work with the head office to train, communicate with and track participants work
Validation
Language Consultants
All collected data goes through rigorous validation by highly trained validators from our network. This validation ensures quality of sound, accuracy, audibility and weed out disturbances
A team of 5-8 part-time experts in each language to ensure quality and authentic collection






Voice Tasks
Simple tasks to collect speech data that requires participants to read and record sentences in their local languages, allowing them to make on average of INR 750 for an hour of work.
This work has been given to people in Hindi, Bengali, Odiya, Marathi, Malayalam, Chattisgarhi, Maithili, Magahi, Bhojpuri, Kannada, Malayalam, Telugu, Tamil, Awadhi, Braj, Bagheli, Bundeli, Marwari, Assamese, Punjabi, and Gujarati.
Text Collection
Sentence Corpus Collection in local languages by employing experts across domains like healthcare, agriculture, entertainment, and Finance. Our team works to train participants in typing their local language, and using google forms to collect sentences that are converted into corpuses for speech collection.
Image Corpus and Labelling
Collecting different images and labelling parts of images different text, like Bengali letters from hoardings and billboards, text book pages, and newspaper articles.
Translation
Translation of sentence corpuses across languages, with specific interest in dialectal differences within languages to contextualise and increase relevance of sentences corpuses for our target populations. We have mobilised a team of translators for each of the 22 constitutionally recognized languages in India.
The Work we Give


Our Experience
300 HOURS
2,25,000 sentences
Conversational Data collected, 100 hours transcribed
Translated from English to 11 Indian Languages
16,500 HOURS
We have collected over 16,000 hours of read speech data across 14 Indian Languages.
3,10,000 Images
crowdsourced to build diverse data-sets.
75,000 SENTENCES
1,20,000 IMAGES
Annotated and Labelled (license plate recoginition, annotations of road conditions)
Domain-specific sentences collected across Indian Languages
Contact Us
77A Block E New Alipore, Kolkata 700053
West Bengal, India
+91 8232098708
Office Hours
Monday to Friday
9:00 am to 8:00 pm
Saturday
9:00 am to 2:00 pm
Closed on Sundays

SamaVA Consultancy
© 2023