I4DI - Institute for Development Impact

AI & NLP Fellowship: Data Engineering for Social Impact

To apply for this job to your existing account or an account for free.
Last update: Apr 21, 2025 Last update: Apr 21, 2025

Details

Deadline: Apr 30, 2025 Deadline for applications has passed
Location: Home Based
Job type:Internship / Volunteer
Languages:
English
English
Work experience:Min 1 year
Date posted: Apr 21, 2025

Attachments

No documents to display

Description

DECipher is an AI-powered platform developed by the Institute for Development Impact (I4DI) to help global development professionals access and interpret decades of USAID-funded learning. It draws from one of the largest public document archives in international development, transforming raw PDFs into structured insights using modern machine learning techniques.

At its core, DECipher is a public infrastructure project. It connects natural language processing with real-world policy and program decisions. The work is technical, but the impact is human. It supports smarter, more accountable development efforts worldwide.

We are offering a volunteer summer fellowship for individuals who want to gain real experience working with applied AI systems. Fellows will help us prepare a large, high-value dataset for fine-tuning domain-specific language models.

This is not a theoretical exercise. You will be working directly with tens of thousands of documents, contributing to the quality and integrity of training data that powers an open-access AI tool for public benefit. While unpaid, this role offers serious technical learning and the chance to be part of something that is both ambitious and grounded.

Fellowship Details

  • Volunteer position (unpaid)
  • Fully remote
  • Summer 2025
  • 8 to 12 week commitment
  • 30 to 40 hours per week, flexible scheduling

All the Responsibilities We’ll Trust You With
What You Will Work On

  • Process and clean large volumes of unstructured PDF documents
  • Develop and manage text extraction workflows using Python and NLP tools
  • Review document structure and metadata for consistency and quality
  • Label and classify documents to support supervised and semi-supervised learning
  • Support QA and data validation steps critical for model fine-tuning
  • Work with experienced engineers and researchers on a functioning AI pipeline

What You Will Learn

  • How to build structured datasets for training large language models
  • Techniques in OCR, document parsing, tokenization, and quality assurance
  • How NLP systems are adapted to real-world, domain-specific use cases
  • What it takes to make AI systems both reliable and accountable

What You’ll Need to Succeed

  • A background in Python and an interest in NLP, machine learning, or data engineering—ideal for current students, recent graduates, or early-career professionals
  • Ability to navigate complex document sets, legacy formats, and detailed data processing workflows
  • Motivation to contribute to mission-driven tech and open-access knowledge
  • A desire for meaningful, hands-on work over credentials alone—you’re here to learn and contribute

What We Will Offer You

  • Applied experience with large-scale data preparation
  • A practical, portfolio-worthy contribution to an operational AI system
  • Mentorship from a team experienced in responsible AI and development practice
  • Flexible hours and remote collaboration
  • Possibility for extended work or future opportunities based on performance
  • Selection Practices


Application Review

  • First Round Interview
  • Work Exercise & Second Round Interview
  • Final Interview
  • Reference Check

We seek diverse, curious, and proactive individuals who excel at problem-solving and communication. Our ideal candidates are humble, collaborative, and committed to making a positive impact. Our selection process ensures a great fit for both you and I4DI.

About Company

The Institute for Development Impact (I4DI) is a think-and-do tank dedicated to advancing evidence-based solutions for lasting social, environmental, and economic impact. We apply data science, technology, design thinking, and hands-on implementation expertise to achieve real results and returns on investment. I4DI aims to disrupt conventional approaches to development work by focusing on practical and sustainable solutions that move the needle.

As a woman-owned small business based in Washington, D.C., we operate a profit-for-purpose model, reinvesting in research and innovation to push the boundaries of what’s possible for people, communities, and the planet.

Examples of clients I4DI works with:

●     Development Finance Corporation
●     Millennium Challenge Corporation
●     Gates Foundation
●     World Bank
●     Save the Children
●     Planet Partnerships
●     Starbucks
●     Mars Inc.
●     Deloitte

Examples themes of I4DI projects:
●     Responsible supply chains
●     Nature based solutions
●     Climate smart agriculture
●     Farmer livelihoods and living incomes
●     Micro, small, and medium enterprise development
●     Human rights and ethical labor systems
●     Impact and performance evaluation
●     Public-private partnerships
●     Climate finance

We embody a “start-up mentality” by operating a flat structure free of unnecessary hierarchy, encouraging agility and innovation in our approach to work, and supporting staff to stretch and grow within their roles. If you’re seeking a mission-driven work environment that values intellectual curiosity and focuses on solving humanity-level challenges, I4DI might be a great place for you. We’re always looking for exceptional talent and we look forward to meeting you.

Apply here.

Similar Jobs
By Locations
Organization:
Job type:
Contract, 12 months +
Experience:
Min 5 years
By Sectors
Job type:
Contract, 12 months +
Experience:
Min 2 years