About The Archive
A comprehensive, searchable archive of publicly released documents related to the Jeffrey Epstein case. All documents are sourced from official government releases and court records.
What is The Epstein Archive?
The Epstein Archive is a searchable database of publicly released documents from the Department of Justice, FBI, House Oversight Committee, and federal court proceedings. The archive contains over 40,000 documents including emails, court filings, depositions, FBI reports, flight logs, and photographic evidence.
All documents in this archive are public records that have been officially released by government agencies or through court proceedings. The archive does not contain any private, leaked, or illegally obtained materials.
The purpose of this archive is to provide researchers, journalists, and the public with easy access to these public records in a searchable, organized format. The archive uses AI-powered search and entity extraction to help users discover connections and navigate the large volume of documents.
Archive Statistics
Data Sources
DOJ Data Sets 1-8
14,614 documentsOfficial releases from the Department of Justice including court filings, emails, and evidence exhibits.
House Oversight Committee
25,320 documentsDocuments released by the U.S. House Committee on Oversight and Accountability.
FBI Vault
22 documentsFBI records released under the Freedom of Information Act.
Maxwell Proffer
3 documentsInterview transcripts and proffer documents from the Ghislaine Maxwell case.
Flight Logs
1,260 eventsFlight records documenting travel on private aircraft, extracted and integrated into the timeline.
Little Black Book
1,462 entitiesContact information and entity data extracted from publicly released address book records.
Features
Semantic Search
AI-powered search that understands meaning, not just keywords. Find relevant documents even with different terminology.
Graph Visualization
Interactive network graph showing relationships between people, organizations, and locations mentioned in documents.
Timeline View
Chronological reconstruction of events from 1990-2024, with linked source documents for verification.
Entity Extraction
AI-extracted entities (people, organizations, locations) with cross-document linking and relationship mapping.
Full-Text OCR
All documents processed with OCR for full-text searchability, including scanned PDFs and handwritten notes.
Document Viewer
In-browser document viewer with page navigation, zoom, and highlighted search results.
Important Disclaimer
This archive is provided for research and educational purposes only. All documents are public records that have been officially released by government agencies or through court proceedings.
The presence of any individual, organization, or entity in these documents does not imply any wrongdoing, guilt, or criminal activity. Many individuals appear in these records as witnesses, employees, associates, or in other non-incriminating contexts.
This archive takes no position on the guilt or innocence of any individual. Users are encouraged to review documents in full context and draw their own conclusions based on verified information.
Some documents may contain sensitive content including references to abuse, exploitation, and other disturbing material. User discretion is advised.
Technology
Infrastructure
- Next.js 15 with React 19
- PostgreSQL with pgvector
- Cloudflare R2 storage
- Vercel deployment
AI Processing
- OpenAI embeddings for semantic search
- Gemini Vision for document analysis
- GPT-4 for entity extraction
- Hybrid keyword + vector search
Our Principles
Verified Sources
All documents traced to official DOJ, FBI, and court releases
Full Transparency
Original documents preserved with clear attribution
Neutral Presentation
No editorial commentary or presumption of guilt
Start Exploring
Search documents, explore entity relationships, and navigate the timeline.
