Master index for family history research, genetic ancestry, and digitized archives.
Ancestry Research System
| Layer |
File |
Description |
| Family Tree |
Family_Tree |
Complete merged tree (all lines) |
| Research Log |
Research_Log |
Chronological record of research actions and findings |
| Open Questions |
Open_Questions |
Research gaps ranked by priority |
| Data Inventory |
Data_Inventory |
All source files tagged by type and reliability |
| Timeline |
Timeline |
Every dated event, ordered chronologically |
Regions (Geographic Deep Dives)
| Region |
Families |
Genetic Signal |
| REGION_NAME |
[SURNAME-1], [SURNAME-2] |
[percentage]% [category] |
Surnames (Origin Research)
| Surname |
Origin |
Notes |
| SURNAME |
[Country] |
[Brief note on origin] |
Source Collections
| Collection |
Files |
Size |
Source |
Status |
| [Collection Name] |
[count] |
[size] |
~/Files/Genealogy/[path]/ |
[OCR status] |
File Locations
- Originals (scans):
~/Files/Genealogy/
- Vault (notes, transcriptions): this folder
- Genomics: (path to DNA data if applicable)
Family Lines
[Maternal/Paternal Line Name]
- Surname/ — Brief description of this family line
[Other Line Name]
Person Files
[Surname] Family
- Person_Name — Brief bio (b. YYYY, d. YYYY. Occupation. Location.)
Processing Pipeline
OCR toolchain: Tesseract + ocrmypdf + ImageMagick + Claude multimodal
Document Classification
| Category |
Count |
Description |
OCR Method |
| photo_only |
— |
Portraits, group photos, buildings |
Catalog only |
| printed_text |
— |
Certificates, newspaper clippings, diplomas |
ocrmypdf (Tesseract) |
| handwritten |
— |
Record books, letters, funeral notes |
Claude multimodal |
| mixed |
— |
Postcards (front/back), annotated docs |
Layered approach |